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Individual Differences in Coalitional Behavior 


Ahmed Sakr Ashour * 
Department of Management, Sir George Williams University, 
Montreal, Canada 


Individual differences in coalitional behavior were investigated in an experi- 
mental setting. Variables representing the choice of partner and bargaining on 
payoff were correlated with one another and with some personality variables. 
The findings indicate that partner power preference is significantly related to 
initial bargaining demands and bargaining endurance. The bargaining variables 
of initial demands, final demands, concessions, and endurance were found to 
be significantly related to one another. The findings indicate that the person- 
ality variable of external control is significantly related to partner power 
preference and bargaining concessions. The results suggest that differences in 
coalitional behavior that occur independently of situational variation have a 
significant systematic component. The findings indicate that an analysis of 
these differences could enhance prediction of coalitional behavior. Prediction 


Research on coalitional behavior is fairly 
recent. An examination of coalition studies 
and the theoretical formulations underlying 
them reveals a predominant interest in two 
dependent variables: (a) coalition formations 
and (6) division of payoff among coalition 
partners. Situational variables such as initial 
power distribution and nature of the con- 
flict are selected to explain the first de- 
pendent variable, and a search for a universal 


. social norm characterizes explanations of the 


second. With the exception of studies on sex 
differences (Bond & Vinacke, 1961; Uesugi 
& Vinacke, 1963; Vinacke, 1959), the be- 
havioral differences observed under the same 


` experimental conditions are usually treated as 
„error variability and interpreted in terms of 
l imperfection of experimental procedures and 


manipulations. This interpretation was im- 
plicit in Gamson’s (1964) review of coalition 
theories and research. Thus, studies examin- 
ing individual differences in coalitional be- 
havior have been extremely rare. 

The lack of focus on individual differences 
in coalitional behavior is partly due to the 
assumption of rationality underlying coalition 
theories and research. This assumption has 
excluded by definition the effect of personal- 


1 Requests for reprints should be sent to Ahmed 
Sakr Ashour, who is now at the Department of 
Business Administration, Faculty of Commerce, 
University of Alexandria, Shatby, Alexandria, Egypt. 


linkages between variables are proposed. 


ity on conflict and coalitional behavior, by 
excluding the possibility that the same situa- 
tion could generate different behaviors from 
different individuals depending on their per- 
sonality and emotional makeup. Thus, if the 
assumption of rationality were relaxed, coali- 
tion research could be directed to examine 
and explain the actual, though nonrational, 
coalitional behavior (Mazur, 1968). 

Consistent with the lack of interest in be- 
havioral differences is the failure of coalition 
research to examine the relationships between 
behavioral variables. Thus, variables of coali- 
tion formation and of bargaining on payoff 
are not usually correlated with one another. 
Rather, they are examined in relation to the 
independent variables of power, nature of 
conflict situation, and the like. In fact, most 
studies focus on gross outcome variables such 
as coalitions formed and final divisions of 
payoff, rather than focusing on intermediate 
behavioral variables such as choice of part- 
ner, bargaining offers, bargaining duration, 
and the like. 

The failure of coalition theories and re- 
search to focus on behavioral differences is 
probably responsible for the sizable unex- 
plained variability in the dependent variables 
that consistently emerges in coalition re- 
search. It is believed that the understanding 
and the explanation of coalitional behavior 
could be enhanced through a closer examina- 
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tion of the smaller variables of coalitional 
behavior and their linkages and also by in- 
corporating personality characteristics as in- 
dependent variables. 

The present study was conducted to fill 
part of the abovementioned gap and to dem- 
onstrate that the proposed approach is feasible 
and fruitful. The situation selected for the 
study was the triadic type most commonly 
investigated in coalition research, It was in- 
corporated in a theoretical formulation sug- 
gested by Caplow (1956, 1959, 1968) and is 
characterized by unequal power distribution 
among the triad members so that A > B > 
C, A < B +C. Coalitional behavior was rep- 
resented by five variables characterizing coali- 
tion formation and bargaining on payoff 
processes. 

Since power is considered to be a critical 
variable influencing coalition formation, it was 
selected to represent the behavioral variable 
characterizing that process. However, unlike 
other coalition studies, the power variable 
was used as a dependent variable. Coalition 
power, along with a random factor, was made 
to determine coalition success, hence creating 
a risk condition of that success. At the be- 
ginning of every trial, each triad member was 
to select his coalition partner. Thus, his 
choice reflected a power preference. The 
power preference characterizing that choice 
was selected as a dependent variable. 

The bargaining process over payoff division 
which took place after coalition success was 
decomposed into four dependent variables: 
(a) initial bargaining demands, which repre- 
Sent the expressed aspiration of the indi- 
vidual at the start of the bargaining; (b) 
final bargaining demands, which represent the 
expressed expectation at the end of the bar- 
gaining; (c) bargaining concessions, which 
represent the individual’s tendency to lower 
his demands at the end of the bargaining 
compared to his initial demands; and (d) 
bargaining endurance, which represents the 
individual’s tendency to lengthen the bargain- 
ing process by failing to meet the demands 
of the other partner over the bargaining ses- 
Slons. 

Four independent personality variables were 
included in the present study. They were se- 


lected on the basis of relevance to conflict and 
coalitional behavior. They were (a) concilia- 
tion, (6) risk avoidance, (c) external con- 
trol, and (d) suspiciousness, The four varia- 


bles along with their scales were selected from 
a personality/attitude schedule developed re- 
cently to evaluate the effect of individual 


differences in gaming and bargaining experi- 
ments (Shure & Meeker, 1967). 

Since none of the theoretical formulations 
on coalitional behavior suggest particular ex- 
pectations concerning the relationships be- 
tween the variables included, the present 
study was seeking the formulation of hy- 
potheses rather than the testing of particular 
ones. 


METHOD 
Subjects 
Subjects were 60 male students enroll d in junior 
and senior business courses at, California State 


University, Fresno. 
to groups of three. 


They were assigned randomly 


Procedure 


There were 20 three-person groups. Subjects were 
to play a game of dice to win a prize. The game 
was played under the supervision of a fourth person 
who was to act as an administrator. In each game 
subjects could form a partnership with one of thi 
other two members of the group. In each group, the 
three subjects were assigned to three positions repre- 
senting powers so that A >B > C, A< B+C. The 
positions were identified by nonsense syllables Sy, 
Fy, and Ny to reduce the effect of the alphabetical 
order of A, B, and C. The Sy, Fy, and Ny positions 
were given power weights of four, three, and two, 
respectively. There were 15 trials for every group 
with the first three trials to be played for practice. 
Subjects were rotated randomly among the po 
tions, with the provision that each subject w: 
assigned five times to each of the three positions over 
the 15 trials. 

Each subject was given a set of 15 envelopes 
numbered consecutively to represent trial numbers, 
Tnside each envelope there was a sheet indicating 
the assigned position with its respective weight. Or 
the sheet, every subject was to indicate his che? F 
of partner. On the same sheet a number of bargain- 
ing options were provided for the partners to decide 
on the division of the coalition payoff if the coalition 
won. 

At the start of every trial the subject was to in- 
dicate his choice of partner by writing his syllable: 
Sy, Fy, or Ny. The subject was instructed to conceal 
his position and his choice of partner until a coalit 
was formed. The two members who chose each ot} 
were considered a coalition, and they pooled their 
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power weights together against the third member. If 
there were no mutual choices, the trial was ended and 
no prize distributed. ` 

Once a coalition was formed, the dice game was 
played. The administrator threw a pair of dice five 
times for the single player and another five times for 
the coalition. The values of the five throws for the 
single player were added and multiplied by his 
weight, The values of the five throws for the coalition 
were added and multiplied by the total weight of the 
coalition partners. As such, a coalition success was 
partially dependent upon the joint powers (weights) 
` of the partners and partially upon the random value 
of the dice throws. The total amount of the prize for 
every trial was 36¢. When the single player won, he 
got the full amount of the prize, When the coalition 
won, the partners had to decide upon the division of 
the prize by bargaining. 

The bargaining on payoff division was to be done 
through written communication supervised by the 
administrator. On the trial sheet a set of options of 
prize division were provided. They were (a) divide 
equally, (b) pay initial resources first (weight mul- 
tiplied by 4¢) then divide the rest equally, and (c) 
divide according to weight. The three options were 
included to represent the three most commonly used 
divisions characterizing empirical findings as well as 
theoretical hypotheses in coalition research (Gamson, 
1961; Kelley & Arrowood, 1960; Rapoport, 1970; 
Wahba, 1971). 

The coalition partners were allowed 10 bargaining 
trials to agree on a division option. For every bar- 
gaining trial each partner was to indicate his choice 
of option on the trial sheet, which in turn was trans- 
mitted by the administrator to the other partner and 
vice versa. If the partners agreed on one of the op- 
tions, the monetary prize was distributed accordingly 
and given to them immediately. If they did not 
agree during the 10 bargaining trials, then they lost 
and the third player received his initial resources 
(weight multiplied by 4¢). 


Measurements 


The power weight of the partner chosen was aver- 
aged over the 12 trials (excluding the three practice 
trials) and was used as the measure of the partner 
power preference variable. 

To measure initial bargaining demands and final 
bargaining demands, scores of 1, 2, or 3 were assigned 
to a subject when bargaining took place (when the 
coalition won), depending cn the relative magnitude 
of gain inherent in the option he selected. These 
scores were averaged for the first bargaining trials 
and also for the last bargaining trials to represent 
initial demands and final demands, respectively. The 
measure of bargaining concessions was obtained by 
subtracting the score of final bargaining demands 
from the score of initial bargaining demands. The 
sure of bargaining endurance was the average 


mea: 
number of bargaining trials used through the bar- 
gaining. 


The four personality variables were measured using 
four of the six scales constituting the personality /atti- 


tude schedule developed by Shure and Meeker (1967). 
The scales are (a) conciliation versus belligerence in 
interpersonal relations, (b) risk avoidance, (c) ex- 
ternal versus internal control, and (d) suspiciousness 
versus trust. The schedule was administered to sub= 
jects one month prior to their participation in the 
experiment. The scales included in that schedule repre- 
sent facter scales and their corresponding scoring keys 
developed through factor-analytic procedure. A cor- 
rected split-half reliability study on a sample (n= 
544) reported by the authors indicated that the cceffi- 
cients were .80, .81, .56, and .72 for personality vari- 
ables a, b, c, and d, respectively. 


Analysis 


A simple product-moment correlation coefficient 
was used to intercorrelate the scores on the five be- 
havioral variables and to correlate them with each of 
the four personality scores. 


RESULTS 


The results summarized in Table 1 repre- 
sent the correlations between the five be- 
havioral variables. The partner power prefer- 
ence variable, representing the behavioral 
choice of coalition formation, had significant 
negative correlations with initial bargaining 
demands (r= .28) and with bargaining en- 
durance (r = .45). However, it had no sig- 
nificant correlation with the other two bar- 
gaining variables. 

Table 1 also shows that initial bargaining 
demands had significant positive correlation 
with final bargaining demands (r = .41 ), with 
bargaining concessions (r= .49), and with 
bargaining endurance (r = .40). Final bar- 
gaining demands had a significant negative 
correlation with bargaining concessions (7 = 
.59), However, the correlations between bar- 
gaining endurance and each of the final bar- 
gaining demands and bargaining cone 
were not signifitant. 


TABLE 1 
BEHAVIORAL VARIABLES 


Bargain- | Bargains 
i ing 
endur- 
ance 


Initial 
bargain- 
ing 
demands 


Variable 


ing 
demands 


sions 


Partner power 
preference 
Initial ba 
demands | 
Final bargaining 
demands 
Concession 


—.05 -.22 — 45 


pane aie | aoe | Aow 
| —.59 | 413 

24 
| | 


*p <.05. 
*+p < 01. 
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TABLE 2 
CORRELATIONS BETWEEN PERSONALITY AND BEHAVIORAL \ 


Behavioral variable 


Personality variable 
Partner power 


| Initial bargaining 


Final bargaining 


preference demands demands 
Conciliation —.07 | 04 | Al 
Risk avoidance 10 —.03 —.14 
External control —.30* 16 | —.22 
Suspiciousness = hh —.02 —.03 
*~ <.05. 
on <0. 


Table 2 shows the correlations obtained be- 
tween the four personality variables and the 
five behavioral variables. External control had 
a significant negative correlation with power 
preference (r = .30) and a significant positive 
correlation with bargaining concessions (r = 
.35). However, it had no significant correla- 
tion with initial bargaining demands nor with 
final þargaining demands. The correlations be- 
tween the other personality variables and the 
five behavioral variables were not significant 


DISCUSSION 


The purpose of this study was twofold: (a) 
to examine the intercorrelations between vari- 
ables of coalitional behavior and identify sys- 
tematic differences in behavioral patterns and 
(6) to examine the relationships between the 
behavioral variables and some personality 
variables. 

The results of intercorrelating the behavioral 
variables showed that the variable of partner 
power preference had a significant negative 
correlation with initial bargaining demands 
and with bargaining endurance. The differ- 
ences in partner power preference were mea- 
sured independently of situational variation, 
The bargaining correlates of these differences 
Suggest that individual variations in partner 
choice have a significant systematic component. 

The results suggest that an individual who 
prefers a less powerful partner tends to have 
higher bargaining demands and to exhibit 
greater endurance in the bargaining than an 
individual who prefers a more powerful 
partner. It seems that the higher bar- 
gaining demands of the individual who pre- 
fers a weak partner represent the compensa- 
tion he feels he deserves for the risk of pre- 
ferring a coalition with a lower probability 


of success. This ex 
dividual to endur 
On the other hand 
gaining stand of the 
strong partner repr: 
has to pay for p 
higher probability 
interpretation is b: 


Bargaining 
ns | endurance 


n motivates the inai 

in the bargaining, 
relatively easy bars 
idividual who prefers a 
} he feels he 
ition with 

cess. This 

ndition of 


coalition success u 
procedure of the ¢ 
ability of coalitior cess 
tionally with coal power. Therefore, a 
triad member who ; ed a less powerful 
partner was in effect expressing a greater pref- 


periment, The 
vilowed the prob- 
to vary propor- 


erence for a coalition with a lower probability 
of success than a member who preferred a 
more powerful partner 

The results were partially in agreement 


with a hypothesis by Gamson (1961) suggest- 
ing that coalition partners would demand a 
share of coalition payoff proportional to their 
contribution to coalition resources. The results 
indicate that a member who preferred a less 
powerful partner, and hence, increased his 
own relative contribut 


ilition power, 
tended to demand a sm: of the payam 
off than a member preferred a more 


powerful partner. However, it is important to 
note that Gamson’s hypothesis considered the 
differences in the contribution to 
coalition resources as a product of situational 
variation (such as position power and power 
distribution). Thus, unlike Gamson’s theory, 
the premises and the procedure of this study 
considered the choice of the partner and, con- 
sequently, the individual’s relative contribu- 
tion to coalition resources to reflect differences 
in choice behaviors that occur independently 

of situational variation, s 


relative 
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` An examination of the correlations between 
the bargaining variables indicates that an in- 
dividual who starts with high bargaining de- 
mands tends to end up making more conces- 
sions than an individual who starts with low 
bargaining demands (7 = .49). However, start- 
ing with high bargaining demands tends to 
provide the individual with leeway to make 
concessions without having relatively low de- 
mands at the end of the bargaining (the cor- 
relation between initial demands and final de- 
mands = .41). This seems to agree with the 


„ commonly observed strategy in negotiation 


that considers conceding more than necessary 
prior to bargaining to be unwise, since one 
expects to compromise. The principle implicit 
in this strategy might be labeled the principle 
of compromise expectancy. Implicit also in 
this strategy is the use of initial bargaining 
demands as a means to manipulate the nego- 
tiation to maintain relatively high demands at 
the end. A Machiavellian tendency seems to 


__ characterize this strategy. 


The results further indicate that an indi- 
vidual who makes higher initial demands 
tends also to perpetuate the bargaining longer 
than an individual who makes lower initial 
demands (r = .40). Thus, knowledge about 
the level of initial bargaining demands per- 
mits the prediction of later bargaining be- 
havior such as concessions, final demands, and 
bargaining endurance. 

It seems, therefore, that the behavioral dif- 
ferences in the choice of partner and in bar- 
gaining include a significant systematic com- 
ponent. The findings suggest that these dif- 
ferences could be predicted if the relevant 
independent personality and other individual 
characteristics variables were identified. 


The results of correlating personality vari- 
ables with behavioral variables indicate that 
the variable of external control could be used 
to predict partner power preference (r= 
—.30) and bargaining concessions (r = .35), 
Thus, an individual who believes that events 
are controlled by external forces (fate, chance) 
tends to prefer a less powerful partner and to 
make more bargaining concessions than an in- 
dividual who believes that he can exercise 
some control over the events around him, The 
latter individual tends to select a more power- 
ful partner, perhaps as a means of increasing 
the probability of coalition success through 
his own act of selecting the partner. He also 
tends to make fewer concessions by keeping 
his final demands in line with his initial ex- 
pectation (the correlation between external 
control and bargaining concessions =.35), 
The relationship between external control and 
behavioral variables is supported by findings 
obtained by Teger (1968) in which this per- 
sonality variable was found to be related to 
behavior in an allocation game. 

Although the personality variables of con- 
ciliation, risk avoidance, and suspiciousness 
did not have significant correlations with the 
behavioral variables, the fact remains, how- 
ever, that personality is potentially fruitful in 
explaining differences in coalitional behavior. 
This is based on the findings of this study that 
these differences have a systematic component 
and is also based on other research findings. 
Terhune (1970) reviewed the research on the 
effects of personality in cooperation and con- 
flict situations and found that behavior in 
these situations correlated with (a) motive 
variables such as aggression, autonomy, abase- 
ment, deference, dominance, affiliation, and 
achievement; (b) cognitive structure vari- 
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FIGURE 2. Prediction linkages between chronologi- 
cally unadjacent variables. 


ables such as abstractness, tolerance for am- 
biguity, dogmatism, trust variables such as 
Machiavellianism; and (c) composite vari- 
ables such as authoritarianism. These per- 
sonality variables along with other individual 
characteristics variables should be included in 
future research on coalitional behavior. 

The findings of the study permit the estab- 
lishment of prediction linkages of coalitional 
behaviors. Figure 1 represents the prediction 
linkages between the variables that are 
chronologically adjacent. Thus, external con- 
trol as a personality variable could be used to 
predict partner power preference, and partner 
power preference could be used, in turn, to 
predict initial bargaining demands, etc. Figure 
2 represents the prediction linkages between 
chronologically unadjacent variables. Thus, 
external control as a personality variable could 
be used to predict a variable such as bargain- 
ing concession. Similarly, partner power pref- 
erence could be used to predict bargaining 
endurance. 

The procedure and findings of this study 
have methodological implications for future 
research on cooperation and conflict. Rapoport 
and Chammah (1965) and Terhune (1970) 
Suggested that behavioral differences in co- 
Operation and conflict situations that are at- 
tributable to personality and other individual 
characteristics variables could be detected in 
(a) simple one-trial games, (b) multitrial 
games with controlled strategy of the partner, 
and (c) experimental designs in which part- 
ners are matched in personality. The pro- 
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cedure and the findings of th tudy 
suggest a fourth approach to dete matic 


individual differences. This app 
on the averaging of individua havior 
across trials in multitrial games. I W ap- 
proach should be included in futu carch 
investigating individual differences jopera- 
tion and conflict. 


based 
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Effects of “Mere Exposure” on Learning and Affect 


David J. Stang 
Queens College of the City University of New York 


The mediating role of learning in the relationship between repeated exposure 
and affect was explored and supported in three experiments involving a total 
of 229 undergraduate participants. It was found that both learning and affect 
measures behaved in essentially the same way as a function of exposure dura- 
tion (Experiments 1 and 3), serial position (Experiments 1 and 2), rating 
delay (Experiment 1) and stimulus properties (Experiment 1). These results 
suggest learning may be intrinsically rewarding and clarify one of the mecha- 
nisms involved in the relationship between exposure frequency and affect, 
extending Berlyne’s two-factor theory of the effects of stimulus familiarity. 


Of the various theories of the effects of 
repeated exposure on affect, perhaps the most 
promising is the two-factor theory proposed 
by Berlyne (1970) and Stang (1973a, 1973b). 
The theory posits that a learning and a satia- 
tion factor have additive, antagonistic roles 
in determining the effects of repeated expo- 

_ sure on affect. The theory suggests that re- 
peated exposure is accompanied by learning 
about the stimulus, which in turn increases 
the pleasantness of the stimulus exposed; 
once the stimulus is learned, a boring un- 
pleasant state of satiation is hypothesized to 
develop, causing a depression of affect ratings. 
Conditions of repeated exposure which favor 
the operation of the learning factor and mini- 
mize satiation (e.g., distributed exposure of 
complex, novel stimuli with delayed affect 
ratings) would be expected to produce a 
frequency-affect function resembling the 
learning curve, while conditions favoring the 
operation of first the learning and then the 
satiation factors would theoretically result in 
an inverted U-shaped function. 

There have been several previous demon- 
strations of links between learning and affect. 
There are reports that recognized paralogs 
were rated as more pleasant than unrecog- 
nized ones (Matlin, 1971), that recalled para- 
logs were rated as more pleasant than un- 
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recalled ones (Stang, 1973b), that first 
syllables were liked better when followed by 
recalled than by nonrecalled syllables (Cran- 
dall, Note 1), that preference for differing 
orders of approximation to English was cor- 
related with recall (Munsinger & Kessen, 
1966), and that rated goodness of trigram 
pairs was positively correlated with degree of 
paired associate learning (Rajecki & Hodes, 
Note 2). In spite of this evidence, the role of 
learning in the relationship between familiar- 
ity and affect has been largely ignored by 
previous investigators. The present article 
focuses on the mediating role of learning in 
the familiarity-affect relationship. 


EXPERIMENT 1 
The first experiment investigated the effects 
of exposure duration, serial position, and 
rating delay on learning and affect. From the 
two-factor theory, it might be expected that 
each of these variables might have effects on 
learning, and hence on affect. 


Method 


Subjects 

Subjects were 46 male and female students drawn 
from the introductory psychology subject pool at 
Syracuse University. 


Stimuli 


Stimuli were 16 of the Turkish adjectives used by 
Solomon and Postman (1952), Johnson, Thomson, 
and Frincke (1960), Zajonc (1968), and others. 


Procedure 


After being told they were participating in a study 
of verbal learning, pairs of subjects viewed the 


or 


Turkish words on a DEC VB10C display screen 
linked to a PDP-10 computer and rated them for 
pleasantness on a 7-point scale. The 16 Turkish 
words were systematically counterbalanced against 
four exposure durations (1, 4, 16, or 64 consecutive 
one-second exposures with effectively no interexpo- 
sure interval) and four rating delays (1, 4, 16, or 64 
seconds between last exposure and rating). Follow- 
ing this first set of affect ratings, subjects completed 
a first recall measure in which they were requested 
to list as many of the words as they could. A second 
recall measure was then administered, this one re- 
quiring subjects to complete the spelling of each 
of the Turkish words, given the first five of the 
seven letters in each word. Finally, about five min- 
utes after completing the first affect measure, sub- 
jects were given a second 7-point pleasantness scale, 
Two weeks later, subjects returned and completed a 
third measure of recall and a third measure of affect, 
both identical to the second measures. 


RESULTS 


The first type of analysis to be described 
examines the effects of the manipulation of 
exposure duration and interval between expo- 
sure and first affect rating on the dependent 
variables of recall and affect. Data from each 
measure were analyzed by sorting the ratings 
obtained into the appropriate cells of a 
16 X 16 matrix, with the rows representing 
the 16 Turkish words and the columns repre- 
senting the 16 combinations of the four levels 
of each of the two independent variables. All 
values within a given cell of the matrix 
were pooled. 

A separate 4 X 4 repeated measures analy- 
sis of variance was performed for each of the 
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six measures, treating the words as “subjects” 
in the analysis. Rating delay (1, 4, 16, or 64 
seconds) was expected to have an effect only 
on the first affect measure, but was included 
as a variable in all six analyses. These analy- 
ses indicated that neither exposure duration 
nor rating delay nor the interaction of these 


two variables had any effect approaching sig- 
nificance on any of the three affect measures. 
Analyses of variance for each of the three 


recall measures indicated that neither expo- 
sure duration nor rating delay had signifi- 
cant main effects, but the interaction of these 
two factors was significant. This interaction 
appeared to be the result of words at the ends 
of the series being recalled with a higher 
probability than words occurring in the 
middle. Although a serial position effect had 
not been anticipated, the design of the experi- 
ment made it possible to examine statistically | 
such an effect. This is described below. 
Stimuli had been exposed in such a way 
that each exposure frequency occurred once 
in each of four blocks of trials. By ignoring 
the initial variable of rating delay (1, 4, 16, 
or 64 seconds), it was possible to examine the 
independent effects of exposure duration and 
serial position. In order to compare directly 
learning and affect scores, scores on each of 
the six measures were standardized, M = 50, 
SD = 10. All six measures were then included 
in a single analysis of variance by considering 
each measure as one of six possible combina- 
tions of two variables: type of measure 
(learning or affect) and rating delay (cither 


obtained immediately after exposure oxi- 
mately 5-10 minutes after exposur two 
weeks after exposure). Again treati words 


as subjects, a 4X 4X 2X3 repeated mea- 
sures analysis of variance was then performed, 
with the within subjects variables being expo- 
sure duration, serial position, type of measure, 
and rating delay. 

As it had appeared from an examination of 
the means, there was a significant, F(3,45) 
= 7.413, p < .01, main effect due to serial 
position, items being learned and liked better 
when they occurred at the ends of the list 
than when they occurred in the middle. The 
main effect of serial position can be inferred 
from Figure 1. 
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Ficure 2. Interaction of exposure duration and 
rating delay in Experiment 1. 


Further, as originally hypothesized, there 
„Was a significant, F (3,45) = 2,847, p < .05, 
main effect due to exposure duration. This 
effect was apparently swamped by serial posi- 
tion in the earlier analyses of variance re- 
ported. This main effect can be inferred from 
Figure 2. 

Because scores on each measure had been 
standardized, no main effect could be expected 
for type of measure (recall versus affect) or 
< rating delay between exposure and measure 
(no delay, 5-10 minutes, or two weeks) and 
none was found. The interaction of serial 
position and exposure duration was not 
significant, F = .962. 

The interaction of serial position and type 
of measure was significant, 7(3,45) = 4.284, 
p< 01, (see Figure 1) and seems to be the 
result of a less pronounced bowing in the 
_ affect curve than in the recall curve. If the 
two-factor theory is correct, and serial posi- 
tion determines learning which in turn deter- 
mines affect, the theory accounts for this 
finding that the effect of serial position on 
learning is stronger than the effect of serial 
position on affect. 

Further, the interaction between serial posi- 
tion and rating delay was significant, F (9, 60) 
= 2.430, p < .05. This interaction is pre- 
sented in Figure 3 and reveals that the serial 
position effect on learning and affect dis- 


sipates over time, being strongest when learn- 
ing and affect are measured immediately and 
weakest when they are measured with a two- 
week delay. 

One remaining interaction approached sig- 
nificance, F(9, 60) = 2.071, p < .08, and is 
rather remarkable. This is the interaction of 
exposure duration and rating delay. Exposure 
duration evidently had no main effect on 
learning and affect when ratings were made 
immediately, but after 5-10 minutes the 
predicted effect occurred and remained strong 
after two weeks (see Figure 2). It has been 
shown previously that exposure duration and 
recall covary only with some interval between 
exposure and rating (Hellyer, 1962) and that 
exposure duration and affect covary only with 
such an interval (Harrison & Crandall, 1972; 
Stang, 1974). The present simultaneous dem- 
onstration of both effects lends plausibility to 
the hypothesis that differential recall mediates 
the relationship between exposure frequency 
and affect. 

Two other interactions which are not sig- 
nificant are of interest: the lack of a triple 
interaction of serial position with type of 
measure and delay and the lack of a triple 
interaction of exposure duration, type of mea- 
sure, and delay. Lack of interaction here in- 
directly indicates that all of the measures are 
positively correlated and behave in essentially 
the same way when affected by serial position 
and exposure duration. This covariation of the 
six measures is discussed below. 
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10 
TABLE 1 
RELATIONSHIPS BETWEEN THE CANONICAL 
VARIATES IN EXPERIMENT 1 
enna Re df | > Raaffect | Rarecall 
= Z {pete 
1 5 | .0416| 3014 | 3749 
2 3 .0086 | .0509 
ki 1 0242 | -0089 


Since it would be difficult to think about 
all of the correlations between the six mea- 
sures simultaneously, a canonical correlation + 
was computed to facilitate generalizations re- 
garding the extent and nature of the inter- 
relationships between the three recall mea- 
sures and the three affect measures. As can be 
seen from Tables 1 and 2, a single canoni- 
cal correlation was significant, R.(5) = .78, 
p < .05, Examination of the correlations be- 
tween the six measures and their respective 
canonical factors for this first root suggests a 
straightforward relationship between the two 
sets of measures, with all correlating posi- 
tively and substantially with their canonical 
factors. Further, the third affect and third 
recall measures, as evidenced by the weights 
in the present analysis, are most closely 
related to these factors. The redundancy of 
the measures which is displayed in the first 
canonical factors is Ry affect = 3014: Ry re- 
call = 3749. The magnitude and similarity of 
these values suggests that the recall and affect 
factors simultaneously possess discriminant 
validity and are useful mutual predictors, 


with affect being a slightly better predictor 
of recall than vice versa, 


EXPERIMENTS 2 AND 3 


Two other experiments provide conceptual 
replications of Experiment 1. Experiment 2 
used different stimuli, a different testing situa- 
tion, a different mode of presenting the list 
and different measures of learning and affect. 
Subjects (115 undergraduates) were given, 
en masse, a list of trigrams printed on a sheet 
of paper to study. Across stimulus pages, 


* Discussions of the canonical correlation can be 
found in Cooley and Lohnes (1971), Stewart and 


Love (1968), Van de Geer (1 
(1967), = eer (1971), and Veldman 


TABLE 


RELATIONSHIPS BETWEEN THE ĪNDIVIDUAL MEASURES 
AND THEIR RESPECTIVE CANONICAL VARIATES 
FOR THE Ì'IRST JT 


Measure | oean wit Weights 
Affect #1 | | ~.0099 
Affect #2 | 5903 | — —=.5199) 
Affect #3 | 867! | 8546 
Recall #1 | .7300 | 4866 
Recall #2 7152 | 3250 
Recall #3 9052 —.8109 


stimuli were counterbalanced with serial posi 
tion. One half were then tested for recal 
while the other half made good—bad rating: 
on 7-point scales. Items were recalled bes 
and liked most when they occurred at the 
beginning of the list, while items were recalle 
and liked least when they occurred g 
(see Figure 4). This deviation from th 
typical serial position effect may have resulte 
from greater attention to early items than t 
later ones, but the link between learning and 
affect was again’ confirmed. 

Experiment 3 provided 68 undergraduate 
subjects with a stimulus page of haphazardly 
scattered Turkish words in the frequencies of 
0, 1, 2, 4, 8, or 16 occurrences. Subjects 
studied the page for five minutes, then tried 
to recall the words, and finally made affect 
ratings. Both affect and recall measures de- 
scribed the typical learning curve as a 
function of exposure frequency. 
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FIGURE 4. The effect of serial position on recall and 
affect in Experiment 2, 
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GENERAL DISCUSSION 


The three experiments reported here sup- 
port the hypothesized mediating role of learn- 
ing in the relationship between exposure fre- 
quency and affect, since both learning and 
affect measures behaved in essentially the 
same way as a function of exposure duration 
or frequency (Experiments 1 and 3), serial 
position (Experiments 1 and 2), rating delay 
(Experiment 1), and stimulus properties (cf. 
the canonical correlation of Experiment 1). 
The results of the present experiments, there- 
fore, can be viewed as evidence that learning 
is intrinsically rewarding. 

With little difficulty, the two-factor theory 
can be extended to explain exploratory behav- 
ior (see Stang, 1973a, 1973b). First, since 
novel stimuli are less well learned than 
familiar stimuli, more learning about them is 
possible, and learning about them is conse- 
quently more reinforcing than learning 
about other relatively familiar stimuli. The 
learning factor would thus be expected to 
cause the organism to approach novel stimuli. 
Second, when an organism has learned to 
recognize an object, satiation begins to build 
and motivates the organism to expose itself 
to other less well learned stimuli. Thus the 
satiation factor induces the organism to avoid 
familiar stimuli. The two-factor theory thus 
provides a point of convergence for those 
theorists (and their data) who explain explo- 
ration in terms of approach tendencies (e.g., 
Dember, 1961) and those who explain explo- 
ration in terms of avoidance tendencies (e.g., 
Glanzer, 1958). 

Several important questions await answers. 
First, there is some previous evidence that 
stimulus affect plays a role in learning and 
memory, and one might speculate that sub- 
~ jects in these experiments learned what they 
liked, rather than vice versa. This seems an 
unlikely explanation of the similar effects of 
serial position, exposure duration, and rating 
delay on recall and affect, since counter- 
balanced designs were used, but may account 
for some portion of the covariation repre- 
sented by the canonical correlation. Logical 
rather than experimental analysis may be 
required to disentangle the causal direction 
between these two intervening variables. 


A second and perhaps more promising 
direction for research may lie in determining 
the relative merits of the response competition 
hypothesis (e.g., Harrison, 1968; Matlin, 
1971) and the two-factor theory. Further 
research may show that the various measures 
of response competition are merely unusual 
measures of learning, merging the data if not 
the rhetoric of these two theoretical positions. 

Other research might be profitably directed 
at the satiation factor. The physiological 
process underlying satiation in the present 
theory may ultimately prove to be the same 
as the process underlying Hull’s notion of 
reactive inhibition (Hull, 1943), Glanzer’s 
notion of stimulus satiation (e.g., Glanzer, 
1958), and Lambert and Jakobovits’s notion 
of semantic satiation (Jakobovits & Lambert, 
1964; Lambert & Jakobovits, 1960). 
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Cross-Modal Effects of Anticipatory Bogus Heart Rate 


Feedback in a Negative Emotional Context’ 


Richard Hirschman * 
State University of New York at Buffalo 


The goal of the present investigation was to examine the effect of altering 
the perceptions of one’s own physiological arousal on the elicitation of emo- 
tional behavior as defined by galvanic skin response, the nonspecific galvanic 
skin response, and verbal report measures of discomfort and unpleasantness. 
After an initial three-minute rest period, 48 subjects were exposed to 10 suc- 
cessive slides of people who died violently and continuous auditory tones. Re- 
sults showed that the groups simultaneously exposed to the noxious slides and 
tones labeled as bogus heart rate feedback responded to the slides with in- 
creased electrodermal activity as compared to subjects exposed to the identical 
auditory feedback labeled as extraneous noise. An increase in bogus heart rate 
tended to elicit the most pronounced electrodermal responses. As expected, 
bogus heart rate conditions resulted in greater self reports of discomfort and 
slide unpleasantness as compared to the noise conditions, These findings were 


explained as a function of a cyclic cognitive-visceral link. 


Emotional states have been described vari- 
ously by James (1890) and by numerous suc- 
cessors (Lazarus, 1966; Schachter, 1966) as 
the products of interactions between cogni- 
tive activity and peripheral autonomic func- 
tioning. Much of the recent empirical work 
in this area (Fehr & Stern, 1970) also sug- 
gests that emotional states have both central 
nervous system and peripheral autonomic de- 
terminants. Schachter (1964), for instance, 
has postulated that an individual identifies 
his emotional state by a cognitive labeling 
process after experiencing a state of physio- 
logical arousal which requires explanation. 
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Valins (1966) supplemented Schachter’s 
theory by suggesting that in the determina- 
tion of one’s emotional state, one’s physio- 
logical reactions may not be as important as 
one’s cognitive interpretation of these reac- 
tions. He presented 10 slides of seminude 
females to a group of male subjects while at 
the same time presenting the subjects with 
auditory beeps. Half of the subjects were told 
that the sounds were their own heart beats, 
and the other half were told that the sounds 
were extraneous noise. In neither case was the 
feedback actually heart rate. Those slides 
which were accompanied by an increase or de- 
crease in the bogus heart rate feedback were 
selected by the subjects as being significantly 
more attractive than those slides which were 
accompanied by no change in the bogus heart 
rate feedback. No such differences in slide 
attractiveness were found in groups who were 
told that the same sounds were extraneous 
noise. Valins claimed that the bogus feedback 
was enough to influence the attractiveness 
ratings of the pictures independent of the 
actual physiological state, but he offered little 
physiological data to support this. 

The physiological consequences of the feed- 
back procedure were examined by Stern, 
Botto, and Herrick (1972). Using essentially 
the same paradigm as Valins, they discovered 
that subjects exposed to sound which was 
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labeled as their own heart rate displayed 
higher actual heart rates than those subjects 
exposed to the sound labeled as extraneous 
noise. No analyses were presented comparing 
physiological responses to pictures accompa- 
nied by feedback of no change in heart rate 
and pictures accompanied by an increase in 
heart rate feedback. The heart rate analyses 
used by Stern et al. reflected only whether 
the feedback was labeled as noise or as heart 
rate, regardless of frequency. Although their 
study suggests that heart rate feedback may 
be physiologically arousing, this conclusion 
must be tempered by the fact that Stern et 
al.’s skin resistance data did not reveal dif- 
ferences comparable to the heart rate data. 
In another recent study Goldstein, Fink, 
and Mettee (1972) reported that the rate of 
bogus feedback accompanying slides of female 
nudes did influence actual heart rate. Male 
subjects responded with larger actual heart 
changes to the bogus heart rate speedup con- 
dition than to the bogus no-change heart rate 
condition. This mimic effect was not found 
when presumably more offensive male nude 
slides were used. In addition, a positive cor- 
relation of actual heart rate and ratings of the 
emotional potency of the slides was found 
only for the male nude slide conditions. 
Goldstein et al. concluded that if actual 
~ physiological activity does mediate emotional 
attribution of external stimuli, then the effect 
is probably a function of the emotional po- 
tency of the stimulus. However, the mim- 
icking effect was defined by any changes in 
heart rate which followed the bogus feedback 
stimuli, Bidirectionality of the heart rate re- 
sponse would have been expected to influence 
the use of feedback cues. In fact, when only 
actual heart rate increases were analyzed the 
differential effect of rate washed out. Lacey, 
Kagan, Lacey, and Moss (1963) proposed 
that the intake of environmental stimuli may 
be facilitated by a heart rate decrease, while 
environmental rejection, particularly of nox- 
ious stimuli, may be facilitated by heart rate 
increase. Consequently, if rate of feedback 
does elicit discriminative heart rate responses, 
then it would be appropriate to look in detail 
at a cross-modal, unidirectional indicator of 
arousal such as the electrodermal response. 
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To examine more closely the temporal order 
of the cognitive—visceral link and to examine 
its modulating effect on electrodermal re- 
sponse to emotionally arousing stimuli, an 
analysis of the time period prior to stimulus 
presentation also is required. It has already 
been shown that conditions antecedent to 
stimulus presentation can be important 
shaping the physiological response to 
stimulus (Geer & Klein, 1969; Monat 
Furthermore, Schachter’s work i 
onstrated that complex cogn een 
ing different stimuli also are important in 
shaping the emotional responses t 
stimuli particularly those cognitions wh 
may occur in anticipation of the stimulu 
in question (Nisbett & Schachter, 1966). 

The major theme of this study was to ex- 
amine the effects of the perception of arousal, 
via false feedback, on actual electrodermal 
changes while anticipating and responding to 
emotionally arousing stimuli. It was expected 
that a bogus increase in heart rate feedback 
in anticipation of the presentation of a nox- 
ious slide would be followed by a consequent 
increase in electrodermal responsivity during 
an anticipatory period. Furthermore, the bo- 
gus heart rate increase condition was expected 
to elicit increased electrodermal responses 
and verbal reports of emotionality to nox 
ious stimuli as a result of this elevated emo 
tional state when compared to the bogus no- 
change heart rate condition. Subjects who 
received sound stimuli not labeled as heart 
rate feedback were not expected to respond 
with increased indications of emotion. 


METHOD 
Subjects 


Subjects were 48 Caucasian undergraduate fe- 
males at the State University of New York at Buf- 
falo who participated to meet a requirement for an 
introductory course in psychology. Twelve subjects, 
randomly assigned, constituted each of the four ex- 
perimental groups. They were told that should they 
decide to leave at any time during the experiment 
they would still receive full credit for participa- 
tion. Seven additional subjects were run but not 
included because of equipment problems or proce- 
dural irregularities. Two other subjects also were 
not included because of their skepticism about the 
genuineness of the feedback procedure, 
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Apparatus 


Two tape recordings were made, one of which 
was used for the increase bogus heart rate feedback 
condition (heart/increase) and the increase extra- 
neous noise condition (noise/increase), and the 
other which was used for the no-change bogus heart 
rate feedback condition (heart/no change) and the 
no-change extraneous noise condition (noise/no 
change). The tapes were played in the experimenter’s 
room and heard through a small speaker located in 
the subject’s room. Wires leading from the tape re- 
corder to the speaker were hidden. 

The visual stimuli were color slides of people 
who died violently (Geer & Klein, 1969). Timing 
of the presentation of the slides as well as the heart 
rate or noise sounds was controlled automatically 
using a preprogrammed tape and sound sensitive 
relays. 

Skin resistance was recorded using two silver- 
silver chloride (Ag-AgCl) electrodes. Amplification 
of the skin resistance measure was obtained using 
a Grass Model 7P1A preamplifier. This apparatus 
produced a current density of 33 microamperes per 
square centimeter for exosomatic recording. Re- 
sponses were recorded on a Grass Model 7 Poly- 
graph. 


Procedure 


Initial period. Upon arriving at the laboratory 
subjects were told that their electrodermal reactions 
to slides would be measured. After attaching the 
electrodes the experimenter left the room supposedly 
to check if the responses were being recorded 
properly. 

Experimental period. After a three-minute rest 
period the experimenter returned to the subject’s 
room and presented the next set of instructions. 
Subjects in the heart group were told: 


We also have been trying to measure heart 
rate reactions, This is accomplished by getting 
a gross measure of heart rate, not so much the 
detail of each heart beat but simply the time 
between each beat to give us an indication of 
the rate. To do this I will attach two more 
electrodes to your middle finger. [The experi- 
menter then attached the electrodes.] The heart 
rate will be picked up by the amplifier located 
on the table which will amplify the sound loud 
enough so it can be heard through this speaker 
located to your left. I then will be able to re- 
cord the sound in the adjoining laboratory and 
have a record of your heart rate. Just try to 
ignore these heart rate sounds. Since the task 
of viewing the slides does not require a great 
deal of activity I do not anticipate that this 
audible sound will interfere with the data in 
a significant way. 


Subjects in the noise group instead were told: 


We also have been trying to measure heart 
rate reactions. This is accomplished by getting 
a gross measure of heart rate, not so much the 


detail of each heart beat but simply the time 
between each beat to give us an indication of 
the rate. To do this I will attach two more 
electrodes to your middle finger. [The experi- 
menter then attached the electrodes.] In pre- 
vious experiments we have had reason to sus- 
pect that extraneous’ sounds have interfered 
with responding to the slides which you will 
be shown soon. To see if this is true during the 
rest period and during the presentation of the 
slides, you will be hearing, through this speaker 
on the table, some extraneous sounds which 
will have no meaning, Just try to ignore them. 


The following instructions were then given to all 
subjects: 


The purpose of the next part of this study 
is to examine your physiological and behavioral 
reactions to slides of people who died violently, 
Ten of these slides will be presented to you at 
irregular intervals. After each slide presentation 
will you please examine the rating scale in front 
of you and say out loud a number which cor- 
responds to the feeling you have as a result of 
viewing the slide. For example if you feel that 
the slide made you overwhelmingly uncom- 
fortable, then you would say number one, 
Please do this after each slide. Do not say the 
number out loud until the image has left the 
screen, Shortly after I leave, the slide presenta- 
tions will begin. It is very important that you 
try not to move during the experiment, espe- 
cially the arm with the apparatus on it, as 
this will make it difficult to obtain accurate 
measurements. I will signal you when the ex- 
periment is over. Do you have any questions? 


Questions were answered by paraphrasing the pre- 
vious instructions. The experimenter then left the 
room, 

The 10 slides used during the experimental period 
were shown in random order on a Kodak Ekto- 
graphic projector located in the room in which the 
subject was seated. Presentation time was 5 sec- 
onds at intertrial intervals of 50-70 seconds with 
a mean intertrial interval of 60 seconds. The first 
intertrial interval began after a 2-minute adapta- 
tion period. For the heart/increase or noise/increase 
conditions, during the adaptation period and inter- 
trial intervals, subjects heard a pulse rate vary be- 
tween 66 and 72 beats per minute every 5 seconds; 
however, 20 seconds preceding stimulus onset (slide 
presentation) the rate increased in 5-second seg- 
ments from 72 to 78 to 84 to 94 to 100 beats per 
minute. A slide then was presented for 5 seconds 
during which time the beats per minute were main- 
tained at 100. At the time of slide offset the beats 
per minute decreased in 5-second segments from 
100 to 94 to 84 to 78 to 72. It then continued to 
vary between 66 and 72 beats per minute until the 
20-second period preceding the next slide presenta- 
tion at which time the beats per minute increased 
as has just been described, In contrast, subjects in 
the heart/no-change or noise/no-change groups heard 
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TABLE 1 
MEAN ELECTRODERMAL AND VERBAL RESPONSES FOR THE FouR EXPERIMENTAL GROUPS 


| 


Response 
z à Slid! | Intertrial i 
3 ary ts ide nte: al 
Group Anticipatory Anticipatory vero aa Slide Silde 
Cae nas GSR GSR discomfort | unpleasantne 
| frequency amplitude ampurude Teste 
Heart increase 2.5 -0087 | -0240 5.6 | 2.0 38.3 
Heart no change 2.0 0043 0165 5.6 3.5 30.3 
Noise increase 1.2 0032 0114 3.4 | 3.4 | 19 
Noise no change 1.6 0028 -0109 3.8 | 3.7 
a GSR = galvanic skin response. . 
V Lower numbers represent increasing levels of discomfort, 
© Higher numbers represent increasing levels of unpleasantness, 


a pulse rate vary between 66 and 72 beats per min- 
ute every 5 seconds throughout the adaptation pe- 
riod, intertrial intervals, and slide presentations, 

The verbal ratings which were recorded after 
each slide presentation were based on a scale of 1 
to 5 representing decreasing levels of discomfort. 
After all of the slide presentations, subjects were 
asked to rate the overall unpleasantness of the slides 
with O being the lowest rating and 50 being the 
highest rating. 


RESULTS 


Relationships among Feedback C. onditions 
and Physiological Indexes 


The electrodermal indexes in the following 
analyses were expressed in terms of ampli- 
tude and frequency characteristics. The gal- 
vanic skin response (GSR) was defined as a 
decrease in skin resistance magnitude of at 
least 100 ohms, with a latency of one to five 
seconds after the onset of the stimulus. Am- 
plitude of the GSR was expressed as change 
in log conductance. The formula for this con- 
o version is logio (1/Rz) — logio (1/Rı) in 
Which R» was the peak response (resistance 
_ decrease) and R, was the level of resistance 
~ at stimulus onset, 

Rest period. An analysis of variance of 
nonspecific GSRs during the initial three- 
minute rest period yielded no significant dif- 
ferences between the heart and noise groups 
(F = 05, df= 1/44) and the increase and 
no-change groups (F = AS, df = 1/44) and 
no interaction among these groups (F = .03, 
df = 1/44). These analyses suggest that sub- 
Jects did not differ significantly in their spon- 
taneous electrodermal activity preceding the 
exposure to the four experimental conditions. 

Anticipatory period. The means of the fre- 


quency of nonspecific GSRs during the 20 
seconds preceding each stimulus onset are 
shown in Table 1. Subjects exposed to the 
heart feedback exhibited greater nonspecific 
GSR activity than subjects exposed to the 
noise feedback regardless of rate (F = 9:70 
dj=1/44, p<.01). A significant Pulse 
Rate X Trial interaction (F = 2.43, dj = 
9/396, p < .01) also was found, but no other 
interactions were obtained. 

Examination of the change in log conduc- 
tance of nonspecific GSRs during the antici- 
patory period revealed patterns similar to 
those found for the frequency data (see Table 
1). Mean respon for the combined heart 
groups were greater than the mean responses 
for the combined noise groups. Analysis of 
the variance of these data yielded significant 
main effects for heart/noise feedback (F= 
17.52, df = 1/44, p< .001), for increase /no- 
change pulse rates (F = 8.09, df = 1/44, 
< .01) and for the Type of Feedback 
Pulse Rate interaction (F = 5.54, df = 1/4 
$ < .05). 

Subsequent analyses of the data indicated 
that the significant Type of Feedback x 
Pulse Rate interaction and both of the sig- 
nificant main effects were accounted for pri- 
marily by the high amplitude responses of 
the heart/increase subjects. A Scheffé test 
(Petrinovich & Hardyck, 1969) was used 
to compare the four experimental groups. 
Scheffé’s method (Kirk, 1968) revealed sig- 
hificant differences between the heart /increase 
and heart/no-change groups (S = .0035, df 
= 1/44. < 05) as well as significant dif- 
ferences between the heart/increase and 
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noise/no-change groups and the heart/in- 
crease and noise/increase groups (S = .0047, 
df = 1/44, p < .01). No other comparisons 
were statistically significant. 

Response period. Mean amplitude of re- 
sponses to the slide presentations is presented 
in Table 1. Those subjects exposed to the 
heart rate feedback responded to the slides 
with greater electrodermal activity than sub- 
jects exposed to the noise feedback irrespec- 
tive of pulse rate. Although responses from 
both groups habituated across trials, subjects 
exposed to the heart feedback consistently 
maintained higher amplitude response levels 
across trials as compared to subjects exposed 
to the noise feedback. These observations 
were substantiated by an analysis of variance 
yielding a significant main effect for heart/ 
noise feedback (F = 9.18, df = 1/44, p< 
.01) and a significant main effect for trials 
(F = 15.96, df = 9/396, p < .001) but no 
other significant main effects or interactions. 
The main effect of heart/noise feedback and 
the main effect of trials are graphically de- 
picted in Figure 1. 

Intertrial intervals. An inspection of Table 
1 indicates that more GSRs were produced 
by subjects exposed to the bogus heart rate 
feedback during the intervals between slide 
offset and the start of the 20-second antici- 
patory period than subjects exposed to the 
same sound labeled as extraneous noise (F = 
8.69, df = 1/44, p< 01). Subjects in the 
heart group produced more GSR responses 
than subjects in the noise group during all 
intertrial intervals, but responses in both 
groups habituated across trials (F = 2.85, 
df = 9/396, p < .01). 


Relationships among Feedback Conditions 
and Verbal Report Measures 


Subjects were asked to respond on a dis- 
comfort continuum in an attempt to identify 
changes in the affective state associated with 
the slide viewing and feedback. In contrast, 
the unpleasantness rating was more akin to 
Valins’ (1966) use of an attractiveness con- 
tinuum to identify changes in the subject’s 
judgment of an external stimulus. 

Table 1 reveals that the ratings of discom- 
fort from the heart/increase group were lower 
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Ficure 1, Mean response amplitude in change in 
log conductance (X 100) across trials for the heart 
and noise conditions. 


(greater discomfort) than the ratings from 
the other three groups. An analysis of vari- 
ance of these data yielded a significant Feed- 
back X Pulse Rate interaction (F = 2,09, 
df = 9/396, p < .05) and a significant main 
effect of trials (F = 6.20, df = 9/396, p< 
001). The analysis also yielded main effects 
for type of feedback (F = 26.47, df = 1/44, 
p < .001) and pulse rate (F = 33.54, df = 
1/44, p < .001). Subsequent analysis indi- 
cated that these main effects were an artifact 
of the Feedback X Pulse Rate interaction 
and were largely determined by the heart/ 
increase group. Scheffé’s method yielded sig- 
nificant differences between heart/increase 
and each of the three other conditions (S = 
1.03, df = 1/44, p < .01), but no other sta- 
tistically significant comparisons. 

The data for the mean overall slide un- 
pleasantness ratings paralleled the data for 
the mean discomfort ratings (see Table 1). 
Although the most pronounced responses ap- 
parently occurred for the heart/increase sub- 
jects, no interactions were found among the 
four conditions. Significant main effects were 
found for type of feedback (F = 5.72, df = 
1/44, p < .05) and for pulse rate (F = 4.70, 
df = 1/44, p < 05). 
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Relationship between Physiological and 
Verbal Responses 


To examine the relationship between phys- 
iological and verbal report indicators of emo- 
tionality, Pearson product-moment correla- 
tions were computed for responses to each 
slide for each condition. The correlations be- 
tween discomfort and GSR slide response 
magnitude revealed a moderate relationship 
between these two variables. The two high- 
est correlations were found for the heart/ 


increase condition (r= —.29) and for the 
noise/no-change condition (r = oh). 
Discussion 


The contention that perceived arousal 
would increase cross modal physiological re- 
sponding to emotionally laden stimuli was 
Supported. Subjects simultaneously exposed 
to bogus heart rate feedback and noxious 
slides responded to the slides with increased 
electrodermal activity as compared to sub- 
jects exposed to identical auditory feedback 
labeled as extraneous noise. The heart/in- 
crease subjects tended to exhibit the most 
pronounced electrodermal responses, although 
no significant interactions were found between 
pulse rate and type of feedback. 

These findings do not confirm Valins’ 
(1966) conclusion that physiological response 
systems are selectively unaffected by the 
feedback procedure, However, the physiologi- 
cal changes appeared to be unrelated to the 
verbal ratings as attested to by the low 
within-group correlations. The relationships 
between these variables might be affected by 
gradations of stimulus potency and individual 
differences in response capability. For ex- 
ample, if more ambiguous less emotionally 
arousing stimuli were employed, greater con- 
gruence between physiological and verbal re- 
sponses might occur. Subjects who interpret 
such stimuli as arousing might attend to 
finer differences in actual Physiological feed- 
back cues to help identify their own subjec- 
tive reactions. It also might be expected that 
high-anxiety subjects would be more vul- 
nerable to such stimulus manipulations than 
low-anxiety subjects. Mandler, Mandler, and 
Uviller ( 1958) found a positive correlation 
between Scores on the Manifest Anxiety Scale 
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and scores on a questionnaire designed to re- 
veal ability to perceive one’s own autonomic 
activity (r = +.70). Furthermore, the high 
autonomic perceivers demonstrated greater 
physiological activity than the low-autonomic 
perceivers in a stress situation. Thus, high- 
anxiety subjects might be able to discrimi- 
nate physiological feedback cues more easily 
than low-anxiety subjects, particularly if the 
effects of individual differences in anxiety 
not washed out by the use of potent 
tive stimuli. 

As expected, differences were fo 
tween the heart and noise groups in 
the amplitude GSR measure and the fi 
quency of nonspecific GSR measure during 
the anticipatory period. With respect to GSR 
amplitude, the heart/increase group was sig- 
nificantly more reactive than the three other 
groups. Although not significant, a similar 
trend was found for the frequency measure. 
Clearly, the arousal cues preceding stimulus 
onset altered electrodermal activity in antici- 
pation of the slides. These data lend credence 
to the idea that one’s self-perception of being 
aroused is capable of influencing the physi- 
ological component of reactions to antici- 
pated emotionally laden stimuli. 

Differences also were found between sub- 
jects exposed to the heart and noise conditions 
in the slide unpleasantness ratings. Although 
there was a tendency for the heart/increase 
condition to elicit the most pronounced re- 
sponses, this was not substantiated statis- 
tically. These data, together with Valins’s 
(1966) related data for attractiveness rat- 
ings, suggest that perceived arousal can in- 
fluence judgments of both strongly positive 
and strongly negative emotional stimuli, 

As predicted, the heart conditions resulted 
in greater self-reports of discomfort in re- 
sponse to the slides as compared to the noise 
conditions, The most pronounced responses 
were associated with the heart/increase con- 
dition. These findings indicate that the ver- 
bal description of one’s own emotional data 
can be influenced by manipulations of per- 
ceived arousal. 

The cognitive-visceral links which would 
account for these findings are no doubt com- 
plex. Mandler (1962) suggested that the in- 
itial “cognitive arousal” may occur as a 
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response to an external, predetermined con- 
ditioned stimulus rather than as a response 
exclusively to actual physiological feedback 
cues. Once established, the perception of being 
aroused thus could be influenced further 
not only by a bogus feedback but also by 
cues from actual physiological activity. This 
hypothesis suggests a cyclic cognitive—visceral 
link occurring in anticipation of an emotion- 
ally arousing stimulus. The initial percep- 
tion of being aroused could sufficiently in- 
crease actual physiological activity so that 
this activity, in turn, could provide feedback 
cues to reinforce the initial perception. How- 
ever, being able to perceive one’s own physi- 
ological activity may not necessarily be the 
same phenomenon as using such information 
to define one’s own emotional state. In the 
present study, feedback perception was opti- 
mized selectively by instructing subjects to 
ignore or not necessarily ignore the control 
and experimental feedback conditions, re- 
spectively. If there are specific sources of 
actual physiological cues, it would be help- 
ful to examine additional criteria of autono- 
mic nervous system activity and their rela- 
tionships to subjects’ perceptions of this 
activity. 
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Reactions to Male and Female Success and Failure 
in Sex-Linked Occupations: 


Impressions of Personality, Causal Attributions, and 
Perceived Likelihood of Different Consequences ' 
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Female subjects responded to three short verbal cues in which either a male 
or female character succeeded or failed in an examination qualifying the male 
(or female) for entry into an occupation. Three occupations were involved 


varying in masculine dominance 


(medicine, teaching, and nursing). 


In re- 


sponding to each cue, subjects first rated the character in the cue on seman- 


tic differential scales 


to provide impressions of personality, then rated the 
importance of different Possible causes of the outcome 

and finally rated the likelihood that each of a set of pi 
might follow the outcome. Results indicated a fairly per 
the female subjects to upgrade successful males in relation to 


sal attribution), 
Me consequences 
ye tendency for 
unsuccessful 


males but to downgrade successful females in relation to unsuccessful females, 


Results were discussed in relation to sex roles in society 


as they relate to 


permissible achievements for males and females. 


Success or failure at occupations may have 
different consequences within our society de- 
pending upon whether the persons involved 
are male or female and whether the occupa- 
tions are traditionally the province of the 
male or the female. There is a division of 
labor along sex lines so that some jobs (us- 
ually the higher status ones) are more likely 
to be filled by males, while other jobs are 
more traditionally held by females. In our 
society, for example, males are more fre- 
quently found in occupations such as medi- 
cine, science, and politics; while females are 
more likely to be involved in occupations 
Such as nursing, care of children, and secre- 
tarial work. There is no easy answer to the 
question of why this division of labor has oc- 
curred, One could advance historical, eco- 
nomic, political, ecological, sociological, and 
psychological Teasons, all at different levels 
of analysis, The Present article makes no at- 
tempt to deal with these complex issues. It 
reports research concerned with the following 
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question: To what extent is one’s evaluation 
of a person along various dimensions (¢.¢., 
evaluation, potency, activity) and one’s as 
sumptions about the causes and consequences 
of his (or her) success or failure at an occu- 
pation influenced by the degree to which the 
occupation is traditionally masculine or femi- 
nine? For example, how is a male perceived 
if he succeeds at medicine, and how do people 
perceive the causes and consequences of his 
success? Would these perceptions alter if he 
failed at medicine or if it was a female in- 
stead of a male who succeeded or failed at 
medicine? What differences might be expected 
to occur if the occupations were teaching or 
nursing rather than medicine, that is, if they 
were occupations which are more likely to 
be taken up by women? 

These questions were suggested by recent 
research into the so-called “fear of success” 
(Feather & Raphelson, 1974; Feather & 
Simon, 1973: Horner, 1968, 1970, 1972a, 
1972b). In these studies subjects wrote 
Stories to verbal cues, For example, in the 
original study by Horner (1968), female sub- 
jects from a large midwestern university 
wrote stories to the cue, “After first-term 
finals, Anne finds herself at the top of her 
Med. School class,” and male subjects from 
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the same university wrote stories to the cue, 
“After first-term finals, John finds himself 
at the top of his Med. School class.” Stories 
were scored for fear of success if there was 
evidence that the central character in the 
story (Anne or John) was concerned about 
the possible negative consequences of success. 
For example, there might be evidence that 
the central character suffered social rejec- 
tion, wondered about normality and/or femi- 
ninity, denied or misrepresented the success 
in various ways, or acted in a bizarre and 
inappropriate way. Horner found that ap- 
proximately 65% of her female subjects 
wrote fear of success stories to the female 
cue, whereas less than 10% of her male 
subjects wrote fear of success stories to the 
male cue. 

Horner’s (1968) study raises some inter- 
esting questions. The cues that she used 
specified a male-dominated occupation. Fe- 
males who succeeded at medical school would 
do so in direct competition with males and 
would end up occupying a position that was 
usually held by males. Under these condi- 
tions success for the female was not an un- 
mixed blessing. But success for the male in 
the same situation had consequences that 
were virtually always positive. Yet one can 
imagine situations where the results might 
be different. For example, one would expect 
much less evidence of fear of success stories 
among females if the cue indicated that Anne 
came top of the nursing school. In this case 
success would involve a female-dominated 
occupation, and fear of success stories might 
be more evident among male subjects than 
among female subjects. 

This line of argument suggests that part 
of the definition of sex roles in society in- 
volves assumptions about the types of occu- 
pations which are held to be appropriate for 
males and females. Horner’s procedure may, 
in fact, be providing evidence about social 
stereotypes relating to sex roles at the occu- 
pational level, about what are and what are 
not appropriate achievements for males and 
females. The general principle that is sug- 
gested by the fear of success studies is that 
success at an occupation is more valued when 
this success is consistent with societal concep- 
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tions about the sex role than when it is in- 
consistent with these conceptions. For ex- 
ample, the consequences of succeeding at 
medical school are more positive for males 
(success is consistent with the masculine 
image) than for females (success is incon- 
sistent with the feminine image). This type 
of analysis can also be applied to the ques- 
tion of how the consequences of failure relate 
to stereotypes about what occupations are or 
are not appropriate for males and females. 
Failure at an occupation would seem to be 
more distressing when this failure is incon- 
sistent with societal conceptions about the 
sex role than when it is consistent with these 
conceptions. A female who fails at nursing 
school may be more distressed about her 
failure than a male would be, since she has 
failed at an occupation commonly held to be 
appropriate for her sex (failure is inconsis- 
tent with the feminine image). The male 
could excuse his failure in terms of the in- 
appropriateness of the occupation (failure is 
consistent with the masculine image). It 
should be evident that this type of analysis 
assumes that people prefer consistent states 
of affairs, an assumption that is central to 
the various consistency theories (Abelson, 
Aronson, McGuire, Newcomb, Rosenberg, & 
Tannenbaum, 1968; see also Feather, 1971). 

These considerations suggest some new 
areas of enquiry into the consequences of 
success or failure at different occupations for 
males and females. But one need not be re- 
stricted to the study of consequences. It is 
also possible to explore subjects’ impressions 
of the personalities of those who succeeded 
or failed and the causal factors that subjects 
believe are responsible for this success or 
failure, These questions lead one into the 
study of impression formation and causal at- 
tribution, both fields of research that have 
been extensively investigated in the past 
(Brown, 1965; Feather, 1969; Feather & 
Simon, 1971, 1972; Simon & Feather, 1973; 
Weiner, Frieze, Kukla, Reed, Rest, & Rosen- 
baum, 1971). In regard to impression forma- 
tion, one can ask subjects to use the semantic 
differential scales to rate males or females 
who succeeded or failed at different occupa- 
tions in regard to the usual factors of evalua- 
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tion, potency, and activity, as well as other 
more specific factors (e.g., masculinity—femi- 
ninity). In addition, one can investigate the 
degree to which subjects perceive different 
causal factors (ability, task diffculty, luck, 
and effort) as responsible for the success or 
failure of males or females at different occu- 
pations. 

These procedures would provide answers to 
the following types of question. Are persons 
who succeed evaluated more positively and 
are they perceived as stronger, more active, 
and more masculine than persons who fail, 
irrespective of their sex and the occupation 
involved? Or are these judgments also influ- 
enced by the sex of the person who succeeds 
or fails and by the degree to which success 
or failure at the occupation is consistent with 
societal conceptions about the sex role? Are 
subjects more likely to assign causal respon- 
sibility for success or failure at an occupa- 
tion to external factors (ease or difficulty of 
the task, good or bad luck) than to internal 
factors (ability or inability, effort or lack of 
effort) when success or failure at that occu- 
pation is not consistent with societal con- 
ceptions about the sex role than when it is? 
For example, when a female succeeds at a 
male-dominated occupation, is her success 
more likely to be attributed to good luck than 
would be the case for a male who succeeds 
at the same male-dominated occupation? 
These and many other similar questions await 
exploration, 

The present study may be seen as a start 
in this direction. It involved presenting sub- 
jects with short verbal cues (similar to those 
used by Horner) in which male or female 
characters succeeded or failed, but a range 
of occupations was used so as to sample male- 
dominated and female-dominated occupations, 
Subjects’ responses to these cues were stud- 
ied by objective rather than projective means 
to facilitate scoring and to sample a wide 
range of possible answers. Subjects used se- 
mantic differential scales to provide their 
impressions of the personality of the char- 
acter involved in each cue; they rated the 
degree to which various causal factors (in- 
ternal and external, stable and variable) may 
have been responsible for the success or fail- 
ure; and they judged the likelihood of a 


number of possible consequences that mig 
follow the success or failure. The study w 
essentially exploratory in nature, openi: 
up new territory in the psychology of s: 
roles, in particular the way subjects’ conce; 
tions about sex roles influence reactions ti 
male and female success and failure at differ- 
ent occupations. The general prediction wa 
that success at an occupation would be m 
attractive (in various ways) for a perso 
success was consistent with the social st: 
type about the sex role than if it was ir on- 
sistent, And failure at an occupation would 
be more distressing (in various way: 

person if failure was inconsistent wi 

sex role stereotype than if it was consis: 

It was expected that the effects of the degre 
to which outcome was consistent with sex 
role stereotypes would be evident in subjects’ 
judgments about the characters in the cues 
in regard to personality, causal factors, and 
the possible consequences of success or failure 


METHOD 
Subjects 


Subjects were 48 girls from a private, fee-cha 
ing, Church of England girls’ school in metropolit: 
Adelaide. Forty-four of these girls were in their 
fourth year of study at secondary school, and 4 
girls were in their fifth year of study (the final 
year). The girls came from families of relatively 
high socioeconomic status. 


Test Administration 


These subjects were administered a questionnaire 
which could take any one of four possible forms 
depending upon whether it involved cues concern- 
ing male success, male failure, female success, or 
female failure. The four forms were randomly dis- 
tributed so that 12 subjects completed one form 
of the questionnaire. Subjects were tested in two 
groups in regular classroom sessions during the 
school day. They were told only that the Psychology 
Department at Flinders University was intereste:| 
in their attitudes toward various issues and that 
testing would be anonymous, In particular, no in- 
dication of the relevance of the sex-of-cue variable 
was given. Both groups were administered by the 
same person, a female teacher at the school, with 
assistance from a second female teacher. Temporal 
overlap across the two testing sessions precluded 
communication between groups. Testing involved 
minimal disruption of normal school activity, 


Questionnaire 


Three cues were used in each of the four forms 
of the questionnaire. In the form concerning male 
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success the cues were “After first term finals, John 
finds himself at the top of the Med. School class”; 
“After first term finals, Peter finds himself at the 
top of his Teacher’s College class”; and “After first 
term finals, Jim finds himself at the top of his 
Nursing School class.” In the form concerning fe- 
male success, the cues were exactly the same except 
that female characters were involved: Anne, Sue, 
and Jill, respectively. In the form concerning male 
failure the cues were “After first term finals, John 
finds himself at the bottom of his Med. School 
class”; “After first term finals, Peter finds himself 
at the’ bottom of his Teacher’s College class”; and 
“After first term finals, Jim finds himself at the 
bottom of his Nursing School class.” Again the cues 
were identical in the form concerning female fail- 
ure except that the three characters were Anne, Sue, 
and Jill, respectively. In each form, the three cues 
were arranged in all six counterbalanced orders, and 
it was possible to present each order twice within 
each group of 12 subjects who took that form, the 
different orders being randomly allocated across sub- 
jects. In summary, the study was designed as a 
2X2 3, three-factor experiment with sex of char- 
acter in cue (male or female) as the first factor, 
outcome (success or failure) as the second factor, 
and occupation (medicine, teaching, or nursing) as 
the third factor with repeated measures on the last 
factor (since the same subjects responded to each 
occupation), with order of occupations counter- 
balanced across subjects, and with equal Ns in the 
four experimental groups. 

In the general instructions to the questionnaire 
subjects in the success (or failure) conditions were 
told: 


You are going to see a number of statements, 
and your task is to answer some questions about 
the situation described in each statement. We are 
concerned with the general problem of how people 
react to various kinds of academic and occupa- 
tional success [or failure]. Consequently, each of 
the statements describes a situation in which some- 
one has done particularly well [or failed] on a 
course of studies associated with a particular 
occupation, 

Please note that the statements are brief and 
supply only a limited amount of information. In 
answering these questions, therefore, you will have 
to use quite a lot of imagination in thinking about 
the person who succeeded [or failed]. Spend a 
few seconds trying to form in your own mind an 
idea of what is involved in each situation. 

There is no need to write your name at the 
top of the page. Answers are anonymous. Work 
rapidly, and rely on your first impressions. There 
are no right or wrong answers, so feel free to 
give whatever answers you think are most ap- 
propriate. Treat each new situation quite sepa- 
rately. Do not be influenced by your answers to 
questions about previous situations. 

As already mentioned, each statement concerns 
a person who has achieved success (or failed) on 
a course of studies. The questions about each 


statement are of three types. One type asks you 
to describe the successful [or unsuccessful] per- 
son, while the second type asks you to think 
about the possible reasons or causes that might 
have contributed to the person’s success [or fail- 
ure]. A third type asks you to consider some of 
the possible consequences or effects of that suc- 
cess [or failure] for the person, 

Remember to use your imagination when you 
read each short sentence. Then go on to answer 
the questions, Read the instructions carefully and 
make sure you answer all questions. Do not leave 
any blanks. Please work quickly giving your first 
impressions, 


Subjects then read their first cue and proceeded to 
rate the person involved on a set of twenty-one 
7-step bipolar adjective scales on the basis of “. . . 
what you think the person might be like.” The usual 
detailed instructions for the semantic differential 
procedure were used in relation to the first cue but 
were not repeated for the remaining two cues. The 
bipolar adjective scales were selected so as to in- 
clude the dimensions of evaluation, potency, and 
activity. In addition, there were scales concerned 
with masculinity/femininity and with rule following 
and some filler items, The order of the scales was 
as follows: pleasant-unpleasant, weak-strong, lucky- 
unlucky, tense-relaxed, sweet-sour, soft—hard, active- 
passive, tough-tender, lazy—busy, good-bad, polite- 
impolite, worthless-valuable, feminine-masculine, 
wise-foolish, quiet-restless, logical-intuitive, honest- 
dishonest, rugged-delicate, disobedient-obedient, fast- 
slow, clean-dirty. The four scales defining evalua- 
tion were pleasant-unpleasant, sweet-sour, good- 
bad, and worthless-valuable, and they were scored 
from 1 to 7 in the direction of positive evaluation 
to give a possible total range of scores from 4 to 
28. The four scales defining potency were weak- 
strong, soft-hard, tough-tender, rugged~delicate, 
scored from 1 to 7 in the direction of potency to 
give a possible total range of scores from 4 to 28. 
The four scales defining activity were tense-relaxed, 
active-passive, quiet-restless, fast-slow, scored from 
1 to 7 in the direction of activity to give a possible 
total range of scores from 4 to 28. Two scales were 
used to define rule following, polite-impolite and 
disobedient-obedient, scored from 1 to 7 in the di- 
rection of rule following (polite and obedient) to 
give a possible total range of scores from 2 to 14, 
The one scale feminine-masculine was used to de- 
fine femininity and was scored from 1 to 7 in the 
direction of femininity. The bipolar adjectives used 
to define evaluation, potency, and activity were 
those commonly used in studies employing the se- 
mantic differential procedure (Nunnally, 1970, pp. 
440-441). 

Subjects were then asked to consider the possi- 
ble reasons or causes for success (or failure) at a 
course of studies. The set of reasons mainly con- 
sisted of causal factors discussed by Heider (1958) 
and investigated by the authors in previous stud- 
ies of attribution of responsibility (e.g., Feather & 
Simon, 1972; Simon & Feather, 1973). The reasons 
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Were presence (or lack) of natural ability and in- 
telligence, presence (or lack) of application and hard 
work, good (or bad) luck, and easy (or difficult) 
course. In addition, causal factors relating to various 
forms of illegitimate behaviour including cheating, 
alleged cheating, and examiner’s error were included. 
Thus a person might succeed because he cheated and 
be failed because it was alleged that he cheated. 
Or the outcome might be the result of an error by 
the examiner, These various reasons were then re- 
lated to the character who succeeded or failed in 
the verbal cue. For subjects responding to success 
cues, six 8-step rating scales were provided under 
the following headings: Ability, Hard Work, Luck, 
Ease of Course, Cheating, and Examiner’s Error. 
For subjects responding to failure cues the scales 
were presented under the following headings: Lack 
of Ability, Lack of Hard Work, Bad Luck, Diffi- 
culty of Course, Alleged Cheating, and Examiner's 
Error, Each scale was labeled Not important at 
all as a cause at one extreme and Very important 
as a cause at the other extreme, Responses were 
scored from 1 to 8 in the direction of increasing 
importance. 

Finally, subjects were presented with a set of 17 
possible events that might follow the success [or 
failure] of the character in the cue. For the success 
cues the possible consequences were as follows: 


[The character in the cue] continues to top the 
class, isn’t invited to parties as often as before, 
works less hard in future, wonders if he [or she] 
is “normal,” feels thoroughly contented, drops out 
of (medicine, teaching, nursing) to become a so- 
cial worker, is praised by his [or her] parents, 

- quits his [or her] course to get married, wins a 
scholarship, has a nervous breakdown, later be- 
comes famous in his [or her] occupation, wonders 
if it is all worthwhile, worries that his [or her] 
girlfriend [or boyfriend] might be jealous, throws 
a wild party, feels he [or she] has sacrificed too 
“much, worries that others might accuse him [or 
her] of studying too hard, and thinks that it isn’t 
true. 


For the failure cues the possible consequences were 
as follows: 


[The character in the cue] continues to come 
bottom of the class, isn’t invited to parties as 0°- 
= ten as before, works harder in the future, wonders 
if he [or she] is “normal,” feels thoroughly un- 
happy, drops out of (medicine, teaching, nursing) 
to become a social worker, is criticized by his [or 
her] parents, quits his [or her] course to get mar- 
tied, loses his [or her] scholarship, has a nervous 
breakdown, later becomes famous in his [or her] 
occupation, wonders if it is all worthwhile, wor- 
- ries that his [or her] girlfriend [or boyfriend] 
might reject him, throws a wild party, feels he 
[or she] has sacrificed too much, worries that 
others might accuse him [or her] of not studying 
hard enough, and thinks that it isn’t true. 


1 These different possible outcomes of success or 
= failure were devised to cover the main types of con- 
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sequence described by Horner (1970) in her a 
of fear of success stories, namely social re 
concern about normality, denial of the 
affect, and instrumental activity, Subjects ra 
likelihood of each consequence on a § ster 
labeled Very Likely, Likely, An Even Chance. 
likely, and Very Unlikely. Their responses 
scored from 1 to § in the dirc 
likelihood that the consequence would occur 

When subjects had completed their answers to /} 
first cue, they moved on to the second and t 
cues, again providing for each cue semantic i 
ential ratings concerning their impressions c 
person involved in the cue, ratings of the import hce 
of causal factors involved in this person’s su- ess 
or failure, and judgments about the likelihood of 
possible consequences of the person’s succe or 
failure. It took subjects about 35 minutes to n= 
plete the questionnaire, 


of increasing 


RESULTS 
Impressions of Personality 


Table 1 presents mean semantic differential 
ratings for evaluation, potency, activity (sep- 
arate scales), rule following, and femininity 
in relation to the sex of the character in the 
cue, outcome (success or failure). and occu. 
pation. Results are presented íc our 
separate scales concerned with activity, 
the data analysis indicated that there was : 
consistent pattern in these ratings across all 
four scales. In fact, the results for the scales 
tense-relaxed and restless-quiet went in the 
opposite direction to the results for the scales 
active-passive and fast-slow (see below). In 
contrast, the results of the data analysis wer: 
in the same direction for all of the component 
scales making up the evaluation, potency, and 
rule-following total scores. Hence data for 
these component scales are not reported sepa- 
rately; only the mean total scores based on 
the scales are given. 

Data were analyzed using 2 X 2 X 3 anal- 
yses of variance with sex of character in cue 
(male or female) as the first factor, outcome 
(success or failure) as the second factor, and 
occupation (medicine, teaching, or nursing) 
as the third factor and with repeated mea- 
sures on the last factor, since the same sub- 
jects were involved (Winer, 1962). Some 
significant main effects were present involy- 
ing occupation and outcome. There were sig- 
nificant main effects of occupation in regard 
to potency, rule following, and femininity. 
A person was seen as less powerful (p< 
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TABLE 1 
MEAN SEMANTIC DIFFERENTIAL RATINGS IN RELATION TO SEX OF CHARACTER IN CUE, 
ü OUTCOME, AND OCCUPATION i 
c Success Failure 
ating | Set, ; 
| Medicine Teaching Nursing Medicine Teaching Nursing 
Evaluation Male 22,58 20,58 22.58 19.00 20.25 19.00 
Female 18.50 5 18.83 22.67 22.00 22.50, 
Potency Male 19.50 16.67 15.42 14.58 12.08 
Female 16.00 12.33 16.42 16.58 16.58 
Activity 

Tense-relaxed Male 3.58 3.92 5.58 5.00 4.75 
Female 3.92 5.00 3.17 3,50 3.33 
Restless-quiet Male 3.92 4.25 5.17 4.50 4.17 
Female 4.92 5.75 4.17 3.92 3.33 
Active-passive Male 5.25 4.83 3.92 4,00 3.50 
Female 4.00 4.75 5.58 5.33 5.83 
Fast-slow Male 5.92 5.58 4.50 3.08 3.17 
Female 4,58 4.08 5.67 5.25 5.50 
Rule Following Male 11.17 10.92 8.83 9,42 9.17 
Female 8.50 7.15 11.08 10.17 11.00 
Femininity Male 1.67 1.83 5.17 5.83 5.58 
Female 3.08 | 2.50 | 4.75 5.42 5.50 


Nolte: The higher the score, the more the character was seen as positive, powerful, active, a rule follower, and feminine in the 


order of the dimensions listed, 


,001), more obedient and polite (p< .05), 
and more feminine (p < .01) if he (or she) 
succeeded or failed at nursing than at teach- 
ing or medicine. The significant main effects 
of outcome occurred in regard to potency and 
femininity. A person was seen as more 
powerful (p < .05) and less feminine ($ < 
.001) if he (or she) succeeded than if he (or 
she) failed, irrespective of occupation. 

In regard to interactions there were signifi- 
can Sex X Outcome effects which were fairly 
general. Males were evaluated more posi- 
tively, seen as more powerful, viewed as more 
obedient and polite (rule following), and 
perceived as less feminine if they succeeded 
than if they failed. But females were evalu- 
ated more positively, seen as more powerful, 
viewed as more obedient and polite, and per- 
ceived as more feminine if they failed than 
if they succeeded. This pattern of differences 
also occurred for ratings of the scales active— 
passive and fast-slow, but it was in the re- 
verse direction for ratings of the scales tense— 
relaxed and restless-quict, the male being 
seen as less tense and restless when he suc- 
ceeded than when he failed and the female 


` 


being seen as less tense and restless when she 
failed than when she succeeded. All of thèse 
Sex X Outcome interaction effects were highly 
significant (p < .01 or p < .001). 

There was one significant Occupation X 
Outcome effect due to the fact that success 
at nursing by either a male or female char- 
acter was seen as relatively more feminine 
than success at either medicine or teaching in 
that order, whereas failure at teaching was 
seen as relatively more feminine than failure 
at nursing and medicine in that order (p < 
.05). 

Finally, there were a number of other sig- 
nificant effects involving scales other than 
those included in Table 1. Most of these addi- 
tional effects involved significant Sex X Out- 
come interactions (all at p < .001). Males 
who succeeded were seen as luckier, busier, 
wiser, more logical, and more honest than 
were males who failed. But these differences 
went in the opposite direction for females. 
The female characters in the cues were seen 
as luckier, busier, wiser, and more honest if 
they failed than if they succeeded. Tn addi- 
tion, the main effect of occupation was sig- 
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TABLE 2 


MEAN RATED ĪMPORTANCE OF DIFFERENT CAUSAL FACTORS AFFECTING OUTCOME IN RELATION TO 
SEX OF CHARACTER IN CUE, OUTCOME, AND OCCUPATION 


Success 
Causal factor oaeen —— ~- 
Medicine Teaching | Nursing | Medicine 
Ability (or lack of ability) | Male 7.50 
Female 5.67 
Hard work (or lack of Male 7.08 | 6.67 
hard work) Female 6.50 7.6 
Good luck (or bad luck) Male 3.17 4,17 
Female 3.83 4.17 | 
Ease of course (or difficulty of Male 3.08 5.50 5.67 | 7 
course) Female 6.17 5.17 6.42 1 
Cheating (or alleged cheating) Male | 2.17 3.58 3.0 3 
x Female | 3.83 ere ee ees 1.58 
Examiner’s error Male 3.08 | 2.50 | 2.4 15 
Female 2.83 | 3.00 3 58 
Note. The higher the score, the more the possible causal factor was seen as important. 


nificant for ratings of cleanliness (p < .01). 
Persons who succeeded or failed at nursing 
were seen as cleaner than persons who suc- 
ceeded or failed at medicine or teaching. 


Causal Attribution for Success and Failure 


Table 2 presents mean attribution scores 
for different causal factors in relation to the 
sex of the character in the cue, outcome (suc- 
cess or failure), and occupation. The higher 
the score, the more that factor was seen as an 
important cause. As before, data were an- 
alyzed by 2 x 2X3 analyses of variance. 

The following effects were significant. In 
regard to ability, the Sex X Outcome inter- 
action and the Sex X Outcome X Occupation 
interaction were both statistically significant 
(P< .01 and p< .05, respectively). These 
interactions were due to the fact that subjects 
tended to see ability as a more important 
cause of male success than female success. In 
contrast, lack of ability was seen as a more 
important cause of female failure than of male 
failure. These differences were esp-cally evi- 
dent when the occupation was medicine. The 
main effect of occupation was significant p< 
05) in regard to hard work. Hard work or 
lack of hard work was seen by subjects as 
more important as a cause of success or failure 
at medicine, teaching, and nursing, in that 
order. In regard to ease of course, the main 
effect of outcome and the Sex x Outcome X 


Occupation interaction were both statistically 
significant (p< .05, and p< .01, respec- 
tively). The main effect was due to the fact 
that failure was more likely to be attributed 
to the difficulty of the task than success was 
to the ease of the task. The interaction effect 
reflected the following differences: When the 
female character succeeded at medicine, sub- 
jects were more likely to explain her success 
in terms of an easy course of studies, whereas 
an easy task was seen as a less important 
cause when the male character succeeded at 
medicine. In contrast, there was a slight tend- 
ency for the difficulty of the course of stud 
to be seen by subjects as a more import 
cause when the male character fai! di- 
cine than when the female charac‘ dat 
medicine (see Table 2). Finally, was a 
significant Sex X Outcome interaction effect in 
regard to cheating (p < .05). Cheating was 
seen as less important as a cause when males 
succeeded than when females succeeded. Al- 
leged cheating leading to failure as a penalty 
Was seen as more important as a cause when 
males failed than when females failed. 

There were no other significant effects aris- 
ing from the analyses of variance of the at- 
tribution ratings. 


Consequences of Success and Failure 


Table 3 presents mean likelihood scores for 
possible consequences in relation to the sex of 
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- the character in the cue, outcome (success or 


failure), and occupation. The higher the score, 
the more likely it was that subjects considered 
that consequence would occur. Again the data 
were analyzed by 2 X 2 X 3 analyses of vari- 
ance. 

The results indicated two significant main 
effects. The first, a main effect of outcome 
(p < .05), was due to the fact that subjects 
saw persons who failed at occupations as more 
likely to quit to get married than persons who 
succeeded. The second was a main effect of 
occupation (~ < .01). It occurred because 
subjects saw those involved in the nursing 
course as more likely to quit and get married 
than those who were involved in teaching and 
medicine, in that order. 

As in the previous analyses, there were sev- 
eral significant Sex X Outcome interaction ef- 
fects which are indicated by the following dif- 
ferences. The male who succeeded was seen by 


subjects as more likely to continue to top the 
class, more likely to be praised by his par- 
ents, more likely to later become famous in 
his occupation, and more likely to throw a 
wild party than the female who succeeded. 
He was also seen as less likely to work less 
hard in the future, less likely to wonder if 
he was “normal,” less likely to wonder if it 
was all worthwhile, less likely to worry that 
his girl friend might be jealous, less likely 
to feel that he has sacrificed too much, and 
less likely to worry that others might ac- 
cuse him of studying too hard than the fe- 
male who succeeded, All of these differences 
were in the reverse direction for the male 
who failed and the female who failed. The 
male who failed was seen as less likely to 
continue to come at the bottom of the class, 
less likely to be praised by his parents, less 
likely to become famous in his occupation, 
and less likely to throw a wild party than the 


TABLE 3 


MEAN LIKELIHOOD OF DIFFERENT POSSIBLE CONSEQUENCES OF OUTCOME IN RELATION TO SEX OF 
CHARACTER IN CUE, OUTCOME, AND OCCUPATION 


Success 
Possible consequence oped = aman fe | Bi: 
Medicine Teaching Nursing Nursing 
Continues t r bott Male 4.00 3.58 4.00 3.17 2.92 3.25 
sper aa Female 3.42 3.00 3.33 3.92 3.67 3.50 
Works less hard (or harder) Male 2.08 | 2.17 1.92 4.25 442 3.67 
in future Female 3.83 3.83 3.83 2.00 2.33 1.92 
sif “ ui Mal 1.33 1.42 1.67 2.08 2.67 2.92 
a Sea reals 2.50 2.42 3.17 2.00 1.92 2.08 
Praised (or criticized Male 4.75 4,83 4.25 3.67 3.67 3.92 
by AE ! Female 3.83 4.00 3.75 4.75 4.83 4.67 
Jui g ied Male 1.75 2.08 2,42 2.25 2,50 2.92 
bap saa Female | 2.08 2.33 2.33 2.17 2.58 2.83 
Male 3.67 3.00 3.00 242 2.42 2.17 
e nos Female | 2.75 2.58 2.67 3.67 3.00 3.33 
if it i il Male 1.92 2.25 2.75 3.33 3.58 3.83 
ee heen Kee 3.92 3.92 3.75 2.08 2.58 2.42 
3 ; 2.50 2.33 
Worries about jealousy (or Male 1.50 1.92 1.92 2.08 2.5 3 
rejection) from friend Female 2.92 2.83 3.00 2.17 2.50 1.92 
awi y Male 3.75 3.33 3.33 2.00 1.50 2.00 
i sa aaa Female 1.83 2.50 242 2.83 3.42 2.92 
y i 1.75 2.00 1.92 2,50 2,50 2.33 
Reine eee ee reais 2 2.67 3.17 2.08 2.00 2.25 
i 5 3.00 2.92 2.83 
Accused of studying too hard Male 2.00 2.58 2.33 00 9; 

a anA AES Female 3.92 3.50 3.25 2.25 2.50 2.75 


Note. The higher the score, 
analysis of variance yielded at 


the more that consequence was seen as likely to occur. Only scores for those con 
Teast one significant effect are included. 


sequences whose 
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female who failed. He was also seen as more 
likely to work harder in the future, more 
likely to wonder if he was “normal,” more 
likely to wonder if it was all worthwhile, 
more likely to worry that his girl friend 
might reject him, more likely to feel that he 
has sacrificed too much, and more likely to 
worry that others might accuse him of not 
studying hard enough than the female who 
failed. All of these Sex X Outcome interaction 
effects were highly significant (p< .01 and 
p< .001) except the item concerned with 
worry about jealousy or rejection from boy 
friend or girl friend, which was at a lower 
level of significance (p < .05), 

There was one significant Outcome X Oc- 
cupation interaction effect (p< .05) due to 
the fact that characters in the cues were seen 
as increasingly more likely to be thoroughly 
unhappy if they failed at nursing, teaching, 
and medicine, in that order; whereas they 
were seen as increasingly more likely to be 
thoroughly contented if they succeeded at 
teaching, nursing, and medicine, in that order, 

Finally, there was one significant Sex x 
Outcome X Occupation interaction effect (p 
< .05) due to the fact that females who suc- 
ceeded at medicine were seen as especially 
more likely to be accused by others of study- 
ing too hard in comparison to males who suc- 
ceeded, while males who failed at medicine 
were seen as especially more likely to be ac- 
cused by others of not studying hard enough 
in comparison to females who failed (see 
Table 3). 


DISCUSSION 


The most striking finding of the present 
study was the strength and generality of the 
Sex of Character in Cue X Outcome inter- 
actions. Fully 70% of these interactions 
achieved statistical significance, the major- 
ity at p < .001. They ranged from global 
_ judgments about the cue person (e.g., suc- 
= Cessful males and unsuccessful females were 
= tated significantly higher on the composite 
Er measure of evaluation than either unsuc- 
cessful males or successful females) to as- 
= Sumptions regarding quite trivial conse- 

quences of succeeding or failing (e.g. 
Successful males and unsuccessful females 
were rated more likely to throw a wild party 


than either unsuccessful males or successful 
females). 

Taken together, these interactions consti- 
tute compelling evidence that success and 
failure in competitive achievement situations 
are differentially perceived and evaluated ac- 
cording to the sex of the successful or unsuc- 
cessful person. More particularly, there were 
pervasive indications of a tendency to up- 
grade successful males in relation to unsuc- 
cessful males, whereas the opposite tendency 
applied for females: Successful females were 
depreciated in relation to unsuccessful fe- 
males. Put another way, successful males were 
judged more positively than successful fe- 
males, but unsuccessful males were judged 
less positively than unsuccessful females. 
These trends were most directly apparent in 
the semantic differential results, but they 
were also seen, at least by implication, in 
many of the judged consequences of male 
and female success and failure. Moreover, 
there was surprisingly little evidence of in- 
teractive influences involving occupations 
with respect to these findings. The foregoing 
trends typically applied across all three oc- 
cupations represented in the study. Nor was 
there much evidence to suggest that success 
or failure in itself affected subjects’ ratings. 
Only four main effects involving outcome at- 
tained significance, and of these the strongest 
was in regard to femininity: A successful per- 
son was seen as less feminine than an unsuc- 
cessful person. 

In general, then, the present study indi- 
cates that a person who succeeds (or fails) 
may be described in favorable or unfavorable 
terms, and his or her success (or failure) 
may be seen as having positive or negative 
consequences solely on the basis of that per- 
son’s sex. If it is assumed that subject’s re- 
sponses reflected their conceptions about 
male and female sex roles, then the present 
results provide support for Horner’s (1970) 
assertion that femininity and individual 
achievement are two desirable but mutually 
exclusive goals of women in our society. An 
important facet of the social definition of 
approved role behavior seems to involve as- 
sumptions about what are appropriate or nor- 
mal levels of performance on achievement- 
related activities for males and females. 
Women who pursue success in competitive 
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achievement situations face the prospect of a 
wide range of negative consequences in addi- 
tion to whatever positive consequences are 
usually contingent upon success. Success for 
females is decidedly a package deal, having 
unpleasant as well as pleasant implications 
(see Feather & Simon, 1973). Males face 
no such problem: Success in competitive 
achievement situations is consistent with so- 
cietal conceptions of the male sex role, so 
that the consequences of success for a male 
are almost always positive. 

Of course, these conclusions are no longer 
novel, and in this respect the present findings 
concerning successful males and females serve 
mainly to reinforce previous findings (e.g., 
Horner, 1968; Feather & Raphelson, 1974; 
Feather & Simon, 1973). On the other hand, 
an important contribution of the present 
investigation is its extension to the failure 
condition. There is a paucity in the litera- 
ture of studies which focus explicitly on the 
question of sex differences regarding failure 
in achievement situations. It is significant 
in this regard that Horner (1972a) points 
out that her formulations are specifically di- 
rected toward success experiences. To say 
that women have a motive to avoid success 
is not to say that they also have a motive 
to approach failure. The two are independent, 
both conceptually and in respect of their be- 
havioral implications (Horner, 1972a). The 
existence of a motive to approach failure—a 
will to fail—remains, however, a logical pos- 
sibility. Its behavioral manifestations would 
presumably entail the active seeking out of 
failure experiences because of the anticipa- 
tion of associated positive consequences of 
failure. Although it is clearly not possible to 
assume the existence of such a motive from 
the present results, there was a wealth of 
evidence that, for females, failure in com- 
petitive achievement situations may have a 
multitude of favorable concomitants, irre- 
spective of the intrinsically negative aspects 
it carries with it. But males who fail can 
expect no compensatory positive conse- 
quences. Our society, it may be argued, has 
designated failure in important competitive 
achievement situations as inappropriate to 
the male sex role but as quite consistent with 
approved role behavior for females. Horner 
(1970a), following Mead (1949), has ob- 


served that whereas men are -unsexed by 
failure, women tend to be unsexed by suc- 
cess. Previous studies (e.g., Horner, 1968) 
have lent support to the latter assertion. The 
present study is somewhat unique in provid- 
ing empirical support for the former assertion 
as well. 

Tt is noteworthy that the present investi- 
gation found very little evidence of general- 
ized devaluation of females, For example, 
there were virtually no main effects due 
to sex. Indeed, for unsuccessful outcomes, 
females were viewed more favorably than 
males, as detailed above. Only when assign- 
ing causality for outcome did subjects reveal 
any overall downgrading of females relative 
to males, and then only for the factors of 
ability (or inability), cheating (or alleged 
cheating), and task ease (or difficulty). Thus 
ability was seen as a more important cause, 
and cheating and (for medicine only) task 
ease were seen as less important causes of 
male success than of female success; but lack 
of ability was seen as a more important cause, 
and alleged cheating and (for medicine only) 
task difficulty were seen as less important 
causes of female failure than of male fail- 
ure. There were indications, therefore, that 
males more so than females were assigned 
personal credit and responsibility for suc- 
cess, while females more so than males were 
assigned personal blame and responsibility 
for failure. For success, males were rated 
more favorably than females; for failure, 
males were rated less unfavorably than fe- 
males. Note, however, that there was no evi- 
dence of corresponding trends for the factors 
of hard work, luck, and examiner’s error. 
Even within the restricted area of causal at- 
tribution, then, the results of the present 
study are far from uniform with respect to 
the tendency to devalue females in relation 
to males. 

It was mentioned earlier that there was 
hardly any evidence that assumptions about 
sex-appropriate occupations influenced sub- 
jects’ responses in the present study as had 
been expected. The occupations of medicine, 
teaching, and nursing were chosen specifically 
to be representative of male-dominated oc- 
cupations, mixed-sex occupations, and female- 
dominated occupations, respectively. Why 
then the negative findings regarding occupa- 
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tion in interaction with sex and outcome? 
What can explain the many indications that 
female success at nursing was not perceived 
more positively than male success at nurs- 
ing or that female failure at nursing was not 
perceived more negatively than male failure 
at nursing, and so on? Although the nature 
of the present study prohibits the formula- 
tion of definite answers to these questions, 
some tentative suggestions can be offered. It 
might have been, for example, that subjects 
were focusing not so much on different types 
of occupation as on the fact that common to 
all cues was a concern with academic per- 
formance and, moreover, with extremes of 
academic performance. For them, the out- 
standing success (top of the class) and the 
disastrous failure (bottom of the class) may 
have been salient in addition to the sex of 
the character in each cue. The different oc- 
cupations—access to which presumably de- 
pends upon demonstrating a satisfactory 
level of performance on the relevant course 
of studies—perhaps had diminished salience 
in such circumstances. On the other hand, 
it is not suggested that the occupation vari- 
able was given no weight at all. On the con- 
trary, the occurrence of main effects involv- 
ing occupation, mostly in the direction of 
occupational stereotypes, implies that sub- 
jects in fact did discriminate, to some extent 
at least, between cues for different occupa- 
tions. But perhaps a revised procedure not 
presenting extremes of achievement but re- 
taining the suggestion of competition would 
be worth investigating. 

Another possible explanation for the rela- 
tive paucity of interaction effects involving 
occupation is somewhat more speculative: 
Perhaps success at competitive achievement 
activity of any kind is inconsistent with our 
social conception of femininity, so that fe- 
males suffer few negative consequences for 
working in a particular occupation provided 
they do not do too well in that occupation. 
Could it be that it is a high level of achieve- 
ment within a given occupation rather than 
the nature of the occupation as such which 
is seen to be role inappropriate for women 
in our society? Twenty-five years have passed 
since Margaret Mead asserted in this regard 
that while there exists relatively high ac- 
cessibility of most occupations to women, 


such accessibility is accompanied by “a bitter 
fight that is put up, even in those fields where 
women are the best trained . . . , against 
giving women jobs that carry high salaries 
or administrative powers over people—the 
two most usual ways in which men demon- 
strate their success [1949, p. 286; italics 
added],” resulting in “the contradictory pic- 


ture of a society that appears to throw its 5 


doors wide open to women, but translates 
her every step towards success as having heen 
damaging—to her own chances of marriage, 
and to the men whom she passes on the road 
[p. 287].” 

In conclusion, it is important again to 
stress the essentially exploratory character of 
the present investigation. Both theoretically 
and in terms of the assessment procedures 
used, relatively fresh ground was being 
broken. More definite formulations of many 
of the issues raised here must await further 
research. There is, for example, a clear need 
for more extensive study of the assessment 
procedures used in the present study. The 
use of a variety of objective rating scales 
rather than Horner’s (1968) projective test 
with its simple present-absent scoring sys- 
tem would seem to be an important advance, 
given the apparently systematic trends un- 
derlying most of the significant effects in the 
present study. That is, the broad nature of 
the sex-linked tendencies postulated by Hor- 
ner were also apparent in the present study, 
but a more refined scrutiny of these tenden- 
cies was possible. Further development of the 
assessment technique might involve the elimi- 
nation or modification of certain scales, the 
combining of scales (e.g., certain of the con- 
sequence items might form a composite “so- 
cial rejection” measure), and the addition of 
new ones. 

Perhaps most important is the urgent need 
to go beyond student populations to the 
community at large. The female subjects in 
the present study came from a fairly exclu- 
sive girl’s school associated with the Church 
of England. An interview with the headmis- 
tress indicated that most of the girls would 
have fathers in higher status occupations and 
that most of the mothers would be nonwork- 
ing and, if working, in part-time jobs only. 
She believed that the girls would tend to see 
tertiary study as the means of providing them 
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with a general education and that they would 
be more interested in marriage than a career 
and more likely to marry men in relatively 
high status occupations compared, say, with 
girls from a large general state high school 
or from a technical school. She indicated that, 
in addition to its academic aims, her school 


tried to foster Christian values and “lady- - 


like” behavior, good manners and courtesy, 
neatness of dress, correct speech, and polite- 
ness and service to others. Being a fairly 
small school, it would not be difficult to 
transmit the accepted values and standards 
of behavior as, for example, at daily school 
assemblies where the whole school was ad- 
dressed as a group, and these values and 
standards of behavior would probably be re- 
inforced in the home environment (Feather, 
1972). The fact that the school was a single- 
sex girls’ school rather than coeducational 
would also decrease the possibility of testing 
the accuracy of traditional male and female 
stereotypes against reality, since interaction 
between the sexes in an academic setting was 
minimal. These various characteristics sug- 
gest that the group of girls may have had a 
fairly well-defined and traditional concept of 
their sex role in which the valued achieve- 
ment was being a good wife and mother, sup- 
ported by a husband in a financially secure 
occupation of relatively high status. 

But would this be true of other groups in 
society? The generality of the present results 
remains in doubt until subzequent studies 
using different populations are undertaken. 
These studies should include both male and 
female subjects and cover a broad spectrum 
of society, since any differences discovered 
may also be a function of social and demo- 
graphic characteristics that are correlated 
with differences in socialization (e.g., sex, 
age, and socioeconomic status). Future stud- 
ies might also examine what effects the every- 
day social environment (e.g., school or work 
setting) has in maintaining social stereotypes 
once developed. 


3 Senior girls had some contact with boys at a 
nearby Church of England boys’ school, especially 
in joint science classes. 
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Time Perception and Stimulus Preference as a Function 
of Stimulus Complexity | 


H. Wayne Hogan * 
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The present research joins previous efforts to define more clearly the phe- 
nomenon of perceived time as an important social-psychological dimension. 
In this study, time was considered a socially existenti: variable that can most 
meaningfully be measured by comparing two or more subjectively experi- 
enced durations, each with the other(s). Combining this view of time judg- 
ment with the adaptation level/optimal arousal principle, two hypotheses were £ 
tested: (a) that test stimuli differing in degree of complexity elicit from the 
individual differential degrees of affect, and (b) that intervals during which 
the individual is exposed to either overly simple or overly complex stimuli 
are judged as comparatively longer than objectively equal durations filled 


with moderately complex stimuli. Both hypotheses were affirmed. 


There is general agreement that people 
require a certain amount of environmental 
variety and stimulation (Fiske, 1961). Just 
how much stimulation is required and used is, 
according to one line of thought, largely the 
function of an “adaptation level” principle 
which proposes that each individual’s need 
may be satisfied with differing types and 
quantities of stimulus input, depending upon 
such factors as the individual’s past experi- 
ences and future expectations (Helson, 1947). 
Moreover, “human beings and higher animals 
.., normally strive to maintain an intermedi- 
ate amount of arousal potential [| Berlyne, 
1960, p. 200, italics added].” 

Thus the satisfaction of the individual’s 
need for stimulation is a function of both the 
perceiver himself and the nature of the stim- 
uli experienced. The adaptation level/stimu- 
lus need principle therefore suggests that up 
to a point, increases in stimulus complexity 
(e.g., more versus fewer experiential units; 
less versus more experiential redundancy; 
more versus less random ordering of experi- 
ential units) are associated with increased 
positive affect. Beyond the point of optimal 
complexity, stimulus affect decreases to near 
the level experienced by exposure to the least 
complex stimuli. That is, the curvilinear 


1 Requests for reprints should be sent to H. 
Wayne Hogan, who is now at the Department of 
Sociology and Philosophy, Tennessee Technological 
University, Cookeville, Tennessee 38501. 


shape of the relationship between stimulus 
complexity and affective response is an in- 
verted-U (Nicki, 1972). 

Coupled with the theoretical assumption 
that people are optimally stimulated by ob- 
jects and events whose stimulus potentials 
are moderately expressed is the corollary 
hypothesis that time periods encompassing 
either too little or too much stimulation are 
both judged as comparatively longer than 
objectively equal durations filled with mod- 
erately affective (complex) stimuli. Thus, 
while much of the literature (cf. Fraisse, 
1963; Harton, 1939; Loehlin, 1959) reports 
the dichotomous emphasis on “empty” time 
(e.g., time during which stimuli are relatively 
absent) versus “filled” time (e.g., time dur- 
ing which stimuli are relatively abundantly 
present), it is presently proposed that though 
empty time may be conceptually unidimen- 
sional, this is not the case with filled time. 
Time that is too filled may therefore affect 
the individual’s perception of duration in 
much the same way it is affected by empty 
time. That is, just as stimuli having too little 
stimulation potential inherently limit the de- 
gree of information processing that is possi- 
ble, so it is that stimuli with too much stimu- 
lation value prove aversive, hence are avoided 
to the degree that circumstances permit. It 
therefore follows that avoided stimuli can- 
not be cognitively and perceptually processed 
to anywhere near the fullest possible extent. 
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Durations filled with avoided stimuli should 
therefore be experienced as of at least equal 
length to durations objectively empty of 
stimuli. Sensory deprivation (understimula- 
tion) and sensory overload (overstimulation) 
may be viewed, then, not as polar opposites 
but rather as two expressions of the same 
phenomenon. 

But while the literature indicates that time 
perception is of increasing concern to many 
researchers,? it nevertheless remains a con- 
cept associated with much definitional. im- 
precision and measurement inconsistency (cf. 
Clausen, 1950; Loehlin, 1959), For present 
purposes, time perception was considered a 
socially existential variable that is most 
meaningfully measured by comparing two or 
more subjectively experienced durations, each 
with the other(s) (Ornstein, 1970). This view 
takes into account two theoretical principles 
central to the disciplines of psychology and 
social psychology: selective perception and 
differential rates of processing environmental 
stimuli. Again, how stimuli are perceptually 
and cognitively processed may be expected to 
affect directly the individual’s subjective 
judgment of the passage of time. 


METHOD 

Subjects 

From a subject pool of 318, the subjects who 
voluntarily and anonymously contributed data for 
the present study were 137 (42 male and 95 female) 
undergraduates at a small liberal arts college in 
South Carolina. The subjects’ median age was 21 
years. 


Test Stimuli 


The test stimuli consisted of two sets of materials: 
(a) six color slides of line drawings—a standard, 
neutrally complex slide and five companion slides 
varying in complexity (defined as the number of 
each drawing’s interior angles; the more angles, the 
greater the complexity) from least to most com- 
plex, and (b) six color slides of abstract paintings— 
also consisting of a standard slide and five others 
varying from least to most complex (defined as the 
number and randomness of the units making up 


2 It appears, however, that this concern is largely 
limited to psychologists. A recent sampling of in- 
dexes for 43 major texts written by sociologists and 
social psychologists and published between 1896 and 
1972 revealed that no fewer than 36 failed to con- 
tain even one reference either to “time” or to a 


“time”-related term. 


the paintings; the more units and the more random 
their ordering, the greater the complexity). The 
complexity ordering of both sets of stimuli had pre- 
viously been determined by two separate groups of 
student judges. For the line drawings, the ordering 
used was that agreed on by 91% of the judges; 
for the abstract paintings, the complexity ordering 
used was that agreed on by 85% of the second 
group of judges, 


Procedure 


The subjects were tested in groups of 5 to 10 per- 
sons each. Each group was seated slightly to the 
left and right of the testing room’s center aisle and 
some 25 feet in front of a ceiling-attached projec- 
tion screen. A Kodak Carousel 850-H slide pro- 
jector with a 15-second seli-timer was centered 
about 10 feet behind the subjects, Prior to the en- 
try of each group of subjects into the testing room, 
the projector’s circular magazine was loaded with 
two slides: either the standard line drawing slide 
and one of its least-to-most complex test slides, or 
the standard abstract painting slide and one of its 
least-fo-most complex test slides, But whether for 
given groups either drawings or paintings were the 
stimuli shown, the standard slides were always pro- 
jected first (for 15 seconds), followed immediately 
by their corresponding test slides (also shown for 
15 seconds). The 10 pairs of slides therefore made 
up 10 test conditions, or treatments, with each con- 
dition being experienced by different groups of 
subjects, 

As soon as both slides in each condition had been 
seen, each group of subjects indicated on a 9-point 
interval scale how much they considered the time 
interval associated with the second-presented test 
slide shorter than, equal to, or longer than the in- 
terval during which the first-presented standard 
slide was shown, The subjects then responded to 
three questions asking which of the two slides they 
thought was most complex, which one they liked 
best, and which one they thought was most aestheti- 
cally pleasing. Responses to these instruments con- 
stituted the measurement of the study’s dependent 
variables of time perception and cognitive-per- 
ceptual affect. 


RESULTS AND Discussion 


Male and female responses were combined 
for all analyses, since their time perception 
scores did not appreciably differ. Further, 
since the direction of the duration scores was 
nearly identical for drawings and paintings 
and since the magnitude of the scoring dif- 
ferences between these two types of stimuli 
was not statistically significant, the decision 
was made to combine the subjects’ responses 
to the slides of drawings and paintings. 

As anticipated, when the second-presented 
test stimuli were less complex than the first- 
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TABLE 1 


MEAN Time PERCEPTION SCORES BY STIMULUS 
COMPLEXITY LEVEL 


Complexity level M n 
1 (least) 44 29 
2 43 32 
3 4.0 27 
4 43 24 
5 (most) 4.5 25 


presented standard slides, the subjects indi- 
cated that they thought the latter were in 
fact most complex; when the test stimuli 
were more complex than the standard slides, 
the former stimuli were considered the more 
complex of the two stimulus sources (x° = 
37.8, df= 1, p< 05). 

Tn line with the subjects’ subjective assess- 
ment of the objective complexity levels of 
the test stimuli, they also predictably differ- 
entiated between the slides they liked best. 
Thus, when the test stimuli were less com- 
plex than the standard stimuli, the latter 
were liked better than the former; corre- 
spondingly, when the test stimuli were more 
complex than the standard slides, the former 
tended to be liked best (x? = 6.3, df =1, 
p <05). 

A somewhat unexpected finding was the 
near zero association between the complexity 
of the test slides and the subjects’ choice of 
either the standard slides or the test slides as 
the ones they thought were most aesthetically 
pleasing (x? = .01, df=1, p>.05). Vir- 
tually the same proportion of subjects chose 
as most aesthetically pleasing the standard 
slides, regardless of whether the test stimuli 
were more or less complex than the standard 
slides. 

Theoretically opposed to this finding, how- 
ever, is the reasoning that a subject’s choice 
of the stimuli he likes best is consistent with 
his choice of stimuli he considers most pleas- 
ing. By way of accounting for the failure of 
the data to support this proposition, three 
possible explanations are offered: (a) because 
stimuli may be liked and considered aestheti- 
cally pleasing for different reasons, they may 
also be differentially evaluated; (b) uncer- 
tainty as to the “correctness” of their aes- 
thetic judgments may have contributed to the 


subjects’ ambivalent evaluations; or (c) 
subjects may simply have misunderstoo: 
meaning of the term “aesthetically.” 
of these incidental hypotheses merit a 
tional study. 

Table 1 illustrates the affirmation of the 
hypothesis that stimuli both least and most 
complex are experienced as involving lengt) 
durations than stimuli of moderate comp! 
ity (F=9.10, df = 4/132, p<.01). He 
ever, while this finding is in agreement w 
theoretical expectations, it does not corr: 
orate the recent results reported by Ornstein 
(1970). Testing similarly sized groups with | 


the same set of six line drawings and the 
same complexity ordering as were used in 
the present study, Ornstein’s data indicate 


a positive association between stimulus com- 
plexity and time perception. This relation- 
ship, as predicted by the “storage size” p 
ciple of information processing, was genet 
linear, though there was a tendency for 
Ornstein’s subjects to judge the intervals 
filled with the most complex slides as shorter 
than the duration filled with the slide oj 
medium complexity. In related audio and 
visual experiments, however, Ornstein con- 
sistently found a positive and linear asso- 
ciation between stimulus complexity and 
perceived time duration. Relative to the 
experimental use of the line drawings, it is 
to be noted that Ornstein exposed his subjects 
to the various stimuli for 30 seconds per 
slide, while the present subjects viewed their 
stimuli for 15 seconds per stimulus. While 
the literature does not suggest that different 
time periods of less than 1 minute are dif- 
ferentially experienced, this is nevertheless 
perhaps the most reasonable explanation for 
the divergent results of these two studies that 
in all other respects appear to be methodo- 
logically identical. Future research empiri- 
cally relating different amounts of clock time 
to the subjective and comparative judgment 
of intervals filled with varyingly complex 
stimuli should prove theoretically productive. 
The present results relating to line draw- 
ings, however, do corroborate Ornstein’s data 
in their failure to show the consistently re- 
ported negative time-order effect: In both 
sets of data there was an overall tendency 
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for the subjects to judge the second-presented 
stimulus as filling Jess time than the first 
slide. Ornstein offers no explanation for this 
result. One possible interpretation of both 
sets of data is that there is an equation in 
which the effect of cumulative simpler stim- 
uli is comparable to a single complex 
stimulus.® 

This study has empirically linked with 
time perception the literature relating to the 
preference for and aesthetic judgment of en- 
vironmental stimuli varying in degree of com- 
plexity. The present subjects were generally 


„found to be optimally stimulated by mod- 


erate rather than extreme degrees of stimulus 
intensities. Correspondingly, time intervals 
that were relatively empty and intervals that 
were “excessively” filled were both experi- 
enced as longer than “moderately” filled in- 
tervals of objectively equal duration. 

8 This hypothesis is currently being tested, Analy- 
sis of relevant data will shortly be submitted for 
publication consideration. 
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Differential Self-Concept in American Indian Children 


as a Function of Language and Examiner’ 


Harriet P. Lefley? 
Department of Psychiatry, University of Miami 


The hypothesis that minority self-concept may vary as a function of im- 
plicit ethnic and nonethnic evaluative criteria was investigated by testing 
40 Mikasuki Seminole Indian children in their native language (Mikasuki) 
with Indian examiners and in English with Anglo examiners. Two reser- 
vation groups, matched for age and sex, received four administrations of a 
personal (Piers-Harris) and an Indian self-concept scale, in a repeated mea- 
sures counterbalanced design, varying language and order. Significant test- 
retest correlations indicated response stability across languages and time 
Results, homogeneous for all subgroups, showed significantly higher personal 
self-concept in Mikasuki, significantly higher Indian self-concept in English. 
It was suggested that minority testing may involve culturally patterned and 


context-bound response tendencies. 


Disparate responses to the same personal- 
ity items, when administered to bilinguals 
in their own and a second language, have 
been reported by Ervin-Tripp (1964b) and 
Lambert, Havelka, & Crosby (1958), among 
others. Such differences have generally been 
attributed to the implicit role-defining con- 
ditions of the two languages, rather than to 
cognitive or structural dissimilarity of the 
linguistic stimuli. Assuming semantic equi- 
valence, it would appear that each language 
evokes a specific culture frame, with selective 
demand characteristics for the assessment of 
one’s own behavior or the projection of per- 
sonality variables. In an interactional setting, 
research by Ervin-Tripp (1964a) indicates 
that projective responses to the same verbal 
materials may vary widely not only with re- 
spect to the presenting language but as a 
function of the ethnicity of respondent and 
interviewer. 
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Annex, Miami, Florida 33152, 


In this connection, a number of investiga- 
tors (e.g., Heiss & Owens, 1972; Hodgkins & 
Stakenas, 1969) have questioned findings of 
negative self-concept in minority group chil- 
dren on the grounds that the subjects are 
typically asked to evaluate themselves within 
a particular cultural context, that is, in a 
school setting, in standard English, and us- 
ually by Anglo teachers. These authors con- 
tend that children in ethnic cultures may 
have multiple self-evaluations, contingent on 
perceived role expectations and implicit com- 
parison groups. They suggest that when eval- 
uative criteria are perceived as those of the 
membership group, self-concept testing is 
likely to produce a normal distribution of 
personality variants. Against the implicit cri- 
teria of the dominant society, however, the 
children may perceive themselves as inade- 
quate, with the group distribution skewed 
toward devaluation. Accordingly, self-concept 
should be more favorable when testing con- 


ditions imply a within-culture frame of refer- 
ence. 


that the self-concept of disadvantaged native 
American children would vary as a function 
of perceived ethnic and nonethnic evaluative 
criteria by administering native language and 
English versions of an instrument tapping 
personal and Indian self-concept in a re- 
peated measures counterbalanced design. The 
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The present research tested RTE. 
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subjects were Mikasuki Seminole reservation 
children who speak the Mikasuki language 
exclusively outside of the school setting and 
restrict English usage to their interactions 
with Anglo teachers. It was assumed that 
items administered in Mikasuki by Mikasuki 
examiners would be perceived in an Indian 
evaluative context, while those administered 
in English by non-Indian examiners would 
be perceived in a nonethnic or Anglo con- 
text. Under these conditions, it was hypothe- 
sized that both personal and ethnic self- 
concept wəuld be higher in the native lan- 
guage/native examiner administrations. To 
determine the generalizability of the findings, 
testing was done on two reservations over 100 
miles apart. 


METHOD 
Subjects 


A total of 72 children, aged 7-15, were tested 
on two Mikasuki-speaking reservations. These sub- 
jects comprised the full populations of two un- 
graded reservation schools, Levels 1-6. In each 
school, the children were divided into two sex- 
equivalent, age-graded groups. Subjects from each 
group were then randomly assigned to one of two 
language-order conditions. The final analysis was 
based on four groups of subjects (N = 40), exactly 
matched for age, sex, and reservation, with each 
subject exposed to a total of four language-order 
conditions. 


Materials 


The subjects were administered a battery of self- 
concept instruments which had been selected as the 
most culturally appropriate, for this population, by 
Indian informants. These included a verbal self- 
report measure, the Piers-Harris Children’s Self- 
Concept Scale (Piers, 1969) and a modified version 
of the Sarason and Ganzer (1971) Word-Rating 
Scales, a semantic differential consisting of 14 cul- 
turally adapted bipolar adjectives. A nonverbal 
measure, the Ziller, Hagey, Smith, and Long (1969) 
Self-Social Esteem symbols task, was piloted and 
found to be culturally inapplicable. Since ethno- 
graphic accounts (e.g, Garbarino, 1972) suggested 
that behavioral correlates of self-esteem in the An- 
glo culture may not apply in this Indian popula- 
tion, there was no further attempt to devise a be- 
havioral measure. However, the Coopersmith (1967) 
Behavior Rating Form, assumed to tap self-esteem 
indicators, was completed by the teachers and prin- 
cipal for each child, for correlation with the self- 
report scores, The teachers also gave a global esti- 
mate of self-esteem on a 5-point scale, and self- 
concept scores were correlated with school achieve- 
ment (grade point average) for each child. 


The language manipulation was done on the 
Piers-Harris scale, since this scale permitted com- 
parison with Anglo norms. Ethnic self-concept was 
tapped by a 12-item Indian self-esteem scale, ration- 
ally developed and piloted with the aid of Indian 
informants. The Indian self-esteem scale contained six 
pairs of equivalent items, matched to elicit positive 
and negative responses, tapping ethnic pride and 
satisfaction with Indian life-style, personality, in- 
telligence, and body image; examples: “I am glad 
I am a Mikasuki/I am sometimes ashamed to be 
an Indian; I think white children are smarter than 
we are/We know many things that white children 
don’t know,” Weighted mean correlations of posi- 
tively and negatively phrased items were .89 in 
English, .71 in Mikasuki, both significant at the .01 
level. These items were randomly interspersed with 
the Piers-Harris items to mitigate their stimulus 
value and thus were functionally part of the same 
test. Another Indian self-esteem measure (Indian 
stimulus scale), in which the subjects rank-ordered 
their preference for Indian and Anglo symbols, em- 
bedded in neutral stimuli, was also administered. 

Both the Piers-Harris scale and Indian self-esteem 
scale showed good concurrent validity with the 
other instruments used to tap the same variables, 
with all intercorrelations significant beyond the 01 
level. (The Word-Rating Scale and Indian stimulus 
scale were also administered bilingually but not as 
independent sequential tests.) Additionally, split- 
half correlations (Spearman-Brown) for each mea- 
sure were all significant beyond the .01 level, in- 
dicating that the instruments were internally con- 
sistent for this population. 


Translation 


All Piers-Harris and Indian self-esteem scale items 
were initially translated by two Indian teachers’ 
aides at one reservation, who spent three days work- 
ing on appropriate formulations. They were then 
taped, brought to the second reservation, and inde- 
pendently back-translated, Questioned items were 
then reread to the back-translator in English, who 
either agreed that the original translation conveyed 
the central meaning or alternatively taped her sug- 
gested revisions. The revisions were satisfactorily 
back-translated by a third Indian aide. The tapes 
were then returned to the first reservation for dis- 
cussion. They were played to three bilingual Indian 
judges, in English and Mikasuki, and each item was 
discussed until consensual agreement was reached, 
Decentering, the multistage procedure suggested by 
Werner and Campbell (1970) which yields a modi- 
fied back-translated English version, was not at- 
tempted since this would have precluded com- 
parison with Anglo norms, However, on a few 
Piers-Harris items, for which there were no exact 
Mikasuki equivalents, the English was changed to 
correspond with its best translation (eg, “I am 
cheerful” to “I act happy”). Partial piloting of the 
final two language versions, on a small bilingual 
adult sample, indicated that the items appeared to 
be conceptually equivalent. 


38 Harriet P. LEFLEY 


TABLE 1 


LANGUAGE SCORES BY SEX, RESERVATION, 
AND PRESENTATION ORDER 


Piers-Harris scale P DENS 
Variable | 
English | Mikasuki | English | Mikasuki 
Sex* 
Male 
M 37.65 39,25 8.80 7.46 
SD 7.67 7.97 1.54 1.58 
Female 
M 43.12 44.72 9.06 8.31 
SD 7.40 8.24 1.62 1.85 
Reservation» 
CM 40.56 | 42.18 | 909 | 7.96 
SD 8.01 9.27 1.83 1.98 
B 
M 40.22 41.78 8.77 7.81 
SD 7.83 7.93 1.85 1.61 
Order® 
1 
M 40.24 41.84 8.96 7.75 
SD 9.16 8.18 1.70 1.56 
2 
M 40.52 42.12 8.90 8.04 
SD 6.98 9.21 2.02 2.04 
Trial totals 
d 40,38 41.98 8.93 7.89 
SD 8.15 8.71 1.86 1.80 


Note. F ratios are based on four trials. 

s Subjects were age matched within sex groups; for 
= Harris scale, F = 4.43, df = 1/32, p < .05; for Indian 
: esteem scale, P = 4.18, df = 1/32, p <.05. 

b df = 1/32 for both scales. 

© df = 1/32 for both scales. 

“For Piers-Harris scale, F = 2.93, df = 3/96, p < .05; for 

Indian self-esteem scale, F = 9.71, df = 3/96, p < .01. 


Procedure 


Because of the fatigue involved in four adminis- 
trations, the tests were given in a pretest-posttest 
design. In each school, the two age-sex matched 
groups were randomly assigned to two language- 
order conditions: English first, Mikasuki second 
(E1, M2), and Mikasuki first, English second (M1, 
E2). The tests were administered sequentially and 
took approximately two hours. The same groups 
were then retested about two months later with 
order counterbalanced. In the posttest, the E1, M2 
groups became M1, E2, and the M1, E2 groups be- 
came E1, M2. Thus, each subject was exposed to 
four conditions: E1, E2, M1, and M2. 

Testing was oral in both languages, administered 
by two Anglo teachers and two Mikasuki teachers? 
aides in each school, with item order counterbal- 
anced for the two experimenters in each language 
condition, All examiners were familiar figures and 
well liked by the children. Since Mikasuki is an un- 
written language, tapes were played and stopped 
after each item to ensure exact repetition of the 
standardized translation. The subjects responded by 
circling yes or no on answer sheets to the appro- 
priate number. Per manual instructions, teachers 


and aides acted as proctors to ensure coni 
and correct matching of item numbers. T: 
ditions were identical in the two schools. 


Analysis 

For each language-order group and for « 
vation, test-retest correlations were cov i to 
determine response stability across lang d 
across time. Separate analyses of variance 
puted for the Piers-Harris scale and I 
esteem scale to determine the effects o 
reservation, order, and trials (language 
and their interactions. For each interac 
effects were based on orthogonal comparis: 
lish versus Mikasuki in pretest and po l 
pretest versus posttest) as a within-subje: e. 
As noted, language and examiner were 15 
a unitary effect. 


RESULTS 


Test-retest correlations for both me 
computed separately for each languag 
group within each reservation, rang 
.71 to .92 for language administrati 
from .65 to .79 for temporal (pre-post) ad- 
ministrations. All correlations were 
cant, most beyond the .01 level. These fnd- 
ings tend to support the assumption of 
content equivalence in the two languages. 


order 


from 


The results indicated that across erva- 
tions females were significantly higher in 
self-concept than males (see Table 1), and 
younger (7-10 years) subjects were is 
cantly higher in self-concept than older ‘| 1- 
14) subjects (F = 10.98, df = 1/: p< 


01). Thus, it is noteworthy that consistently 
across all age-sex groups and regardless of 
order of presentation or geographical location 
(reservation), the subjects were significantly 
higher in personal concept in their native 
language (see Table 1). 

The ethnic self-concept measure, however, 
showed completely opposite results. On the 
Indian self-esteem scale, both reservation 
groups were significantly higher in the Eng- 
lish language testing. This response was 
also consistent for all subgroups and was 
even more pronounced than the Piers-Harris 
differences, although the items were admin- 
istered in the same instrument. 

As anticipated, correlations between Coop- 
ersmith’s (1967) Behavior Rating Form 
and self-report scores were in the zero range 
(from —.08 to .10). Many of Coopersmith’s 
indicators of low self-esteem, for example, 
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shyness, subdued speaking manner, failure to 
express Opinions, are approved behaviors in 
the Mikasuki culture. However, the teachers’ 
ratings yielded a weighted mean correlation 
of .43 (p < .05), indicating that the intuitive 
judgment of those who know the culture may 
be superior to an objective but culturally 
inappropriate measure. With respect to school 
achievement as a behavioral indicator, the 
correlations between self-esteem and grade 
point average on the two reservations were 
15 and .16, both considerably lower than 
Coopersmith’s correlation of .30 for a middle- 
class Anglo population (Coopersmith, 1967, 
p. 129). 

Despite the significant differences in the 
language-examiner conditions, it may be noted 
that in both languages, the Indian subjects 
were significantly below Piers-Harris norms 
for children of the same age (Piers, 1969, p. 
12). Differences between the normative mean 
of 51.57 (N = 540), for both the Mikasuki 
mean of 41.98 and the English mean of 40.38, 
were significant at the .001 level (¢ = 6.35 
and 7.82, respectively, df = 578). Addition- 
ally, the Indian children were significantly 
more homogeneous with respect to their self- 
concept with standard deviations of 8.71 in 
Mikasuki and 8.15 in English, vis-a-vis a 
normative standard deviation of 14.56 (F = 
2.79 and 3.19, respectively, df = 539/39, p 
< .001 for both). This variability was far 
lower than any standard deviation reported 
for Anglo groups, many of much smaller N, 
in the Piers-Harris manual (Piers, 1969). 


Discussion 


The hypothesis that perceived role expec- 
tations in an Anglo context would evoke 
lower self-esteem than those in an ethnic 
(Mikasuki) context appeared to be confirmed 
with respect to personal, but not to Indian, 
self-concept. 

Tt was noted that language and examiner 
were treated as a unitary effect in order to 
create optimal conditions for perception of 
ethnic evaluative criteria. Thus, there was 
no attempt to control the independent effects 
of linguistic stimuli and examiner ethnic- 
ity, Retrospective evaluation suggests that 
the latter may have been a critical variable 
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in the Indian self-esteem scale response, 
since the subjects were likely to be defen- 
sive, in rejecting such items as “I am some- 
times ashamed to be an Indian,” when these 
words were uttered by Anglo teachers. Al- 
ternatively, however, it is possible that while 
the subjects may have felt more inadequate 
personally under conditions implying Anglo 
evaluation, conditions implying Mikasuki 
evaluation may have generated guilt about 
their self-concept as Indians. Thus within 
the Anglo or “school” context they were 
likely to be defensive about their Indianness; 
while within the Mikasuki or “tribal-family” 
context they may have felt that they were 
“bad Indians,” both because of the feelings 
of personal inadequacy evoked by the Anglo 
context and because of their feelings of in- 
adequacy in fulfilling the Indian role expec- 
tations of their family and tribe. According 
to Indian informants, the latter is particu- 
larly likely because the Mikasuki parents 
are concerned about cultural erosion and 
stress Indianness to their increasingly alien- 
ated children. Since it was postulated that 
self-concept in both identity areas would vary 
as a function of perceived Mikasuki and An- 
glo context, it is believed, in retrospect, that 
these findings are more in accord with the 
basic hypothesis than the initial prediction, 

The finding of significant intragroup age 
and sex differences is contrary to the norma- 
tive data offered by Piers (1969) on Anglo 
groups. It is believed that the sex difference 
may be due in part to the fact that Mikasuki 
Seminole kinship structure is matrilineal and 
uxorilocal; this has always afforded high 
status for women and respected role models 
for girls. Contemporary ethnographers (Gar- 
barino, 1972; Skafte, 1969) have observed 
increasing family disorganization and eroded 
clan structure on the Mikasuki reservations, 
resulting in fewer or unhealthy role models 
for boys. The disappearance of the avuncu- 
late (mother’s brother as teacher-disciplinar- 
ian), combined with increasing powerlessness 
and absenteeism of fathers, has tended to 
lower the status of males in the formerly 
cohesive tribal society. 

With respect to the age difference, it is be- 
lieved that a primary factor in the lower self- 
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esteem of older children may be their greater 
exposure to the status, economic, and experien- 
tial discrepancies existing between Indian and 
Anglo cultures, correlative with increasing 
psychological distance from the family. This 
is the first generation exposed to formal educa- 
tion, television, and off-reservation travel 
(80% of the mothers do not speak or under- 
stand English). Several anthropologists (Free- 
man, 1965; Garbarino, 1972; Skafte, 1969) 
have commented on the experiential gap be 
tween the generations and the inability of par- 
ents to control or communicate with their 
older children, particularly boys. While 
younger children are still family oriented, 
aimlessness, deviant behavior, and rejection 
of tribal tradition seem to be increasingly 
prevalent among preadolescents and adoles- 
cents. Observation of the social and material 
benefits of the dominant culture, without com- 
pensatory familial supports, may well underlie 
the lower self-esteem of the older children. 

With respect to the group as a whole, child- 

rearing practices which Coopersmith (1967) 
has found correlated with low self-esteem, for 
example, inconsistent regulatory behaviors, 
ambiguous authority roles, and lack of struc- 
ture and limits, have been reported by the 
cited ethnographers and were found in mater- 
nal interviews in the present research (Lefley, 
1973). Thus, it is quite possible that the In- 
dian children indeed have a negative self- 
concept. Moreover, since the subjects were 
significantly lower than Anglo norms in their 
own culture frame, the hypothesis of the au- 
thors cited earlier, namely, that a perceived 
ethnic context would generate a normal dis- 
tribution of personality variants, does not 
seem to be confirmed. 

Alternatively, however, it has been con- 
tended (e.g., Heiss & Owens, 1972) that stan- 
dard self-concept instruments are heavily 
weighted for behaviors that may be differen- 
tially valued in mainstream and ethnic cul- 
tures, for example, classroom proficiency, need 
for achievement, satisfaction of parental ex- 
pectations, and the like. Thus, self-report on 
these items may bear little functional rela- 
tionship to actual self-esteem. In terms of this 
explanation, the Indian children’s responses 
may not indicate negative self-concept, but 
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rather a normal distribution for those Piers- 
Harris items which represent legitimate be- 
havorial correlates of self-esteem in the Mika- 
suki culture. The findings that Anglo be- 
havioral indicators were not applicable and 
that school achievement was unrelated to self- 
esteem, tend to make some of the items on the 
standardized instruments rather suspect as 
veridical measures of the construct in all 
populations. It is quite possible that had the 
Piers-Harris scale contained items such as “I 
can do good beadwork” or “I am very co- 
operative”—behavioral and dispositional vari- 
ables known to be valued in the culture— 
rather than “I can draw well” or “I am well 
behaved in school,” the correlations between 
self-report and observer ratings may have 
been considerably higher. 

Even within this framework, it would ap- 
pear from the dichotomous directionality of 
the findings that when a minority group child 
is asked to report on his self-concept, both 
the identity area tapped and the implicit 
evaluative criteria of the testing conditions 
may be salient variables. These data tend to 
highlight the complexity of measuring self-per- 
ception in ethnic cultures, and indicate a need 
for refining and norming instruments in many 
groups, independently, to elicit appropriate 
behavioral correlates and to eliminate cul- 
turally patterned and context-bound response 
tendencies. 
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Humorous Laughter in Children’ 
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the notion that the sharing of humor enhances 
(laughter and smiling) and humor ratings. Inde- 
pendent groups of seven- to eight-year-old children listened on headphones 
to amusing material. They were tested alone or in dyads or triads with con- 
federates of the same sex. In triads, duration of laughter and smiling was 


This experiment examines 
overt expressive responses 


inversely related to the amount that confederates looked at one another; this 
was the case whether confederates were thought to be listening to the same 
or different recordings. Laughter and smiling scores support the notion that 
sharing the social situation is crucial in the facilitation of “humorous laughter.” 


A theory of socially facilitated laughter 


is proposed which draws upon social- 


facilitation drive theory and the tension-reduction aspects of humor theory. 


When young children are listening to laugh- 
ter-provoking material, they laugh more when 
with a companion than when alone, whether 
or not their companion is able to hear the 
material; they also consider the material to 
be funnier in dyadic situations (Chapman, 
1973b). Even when a confederate companion 
has been trained to ignore subjects and react 
blankly to humorous material, children in 
dyadic situations tend to laugh and smile more 
than alone children and they rate the hu- 


The presence of companions has some! 
(e.g., Calvert, 1949; Malpass, 1955; Om 
1939, 1942; Perl, 1933), but not always 
McKibben, 1969; Young & Frye, 1966) 
to higher humor ratings for jokes and cart: 
The discrepancies may be attributable to a 
failure on the part of experimenters to ap 
preciate the influence of their own presence 
and behavior upon the subjects; the experi- 
menter’s role is not discussed in any of the 
earlier articles. There has been no comple- 


morous material as funnier (Chapman, 1972). mentary laughter study on small groups, but 

It is likely that laughter and smiling influ- Morrison (1940) and Andrus (1946) e 
ence cognitive evaluations of humor through shown that frequency of laughter tends to in- 
dissonance (Festinger, 1957) or persuasion crease with size of adult theater audiences. 


due to exposure to self-actions (Bem, 1967). 
There is some experimental evidence that 
measures of overt mirth and subjective humor 
ratings are equivalent indicators of humor ap- 
preciation (e.g., Martin, 1905; Shultz, 1972), 
but there is also evidence that this is not 
always the case (e.g, Chapman, 1973a; 
Dresser, 1967; Leventhal & Mace, 1970). 


1 This experiment is one conducted for a doctoral de- 
gree from the University of Leicester, England. It was 
sponsored by a studentship from the Social Science 
Research Council, The author wishes to express spe- 
cial thanks to Derek Wright for his supervision of 
this research. Anthony Gale and Dylan Jones kindly 
commented on an earlier draft of the article. 

? Requests for reprints should be sent to Antony 
J. Chapman, Department of Applied Psychology, 
University of Wales Institute of Science and Tech- 
nology, Llwyn-y-Grant Road, Cardiff, Wales, United 
Kingdom. 
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Also, Malpass and Fitzpatrick (1959) state 
that presentation of jokes and cartoons to 
small groups resulted in more spontaneous 
overt reactions than did presentation to in- 
dividuals or large groups; however, laughter 
and smiling were not measured formally in 
their study. 

This experiment is designed to examine ihe 
notion that humor evokes more overt :nirth 
and is rated as funnier when it is shared with 
others. Degree of sharing between children is 
manipulated indirectly by the systematic 
variation of the amount that pairs of con- 
federate companions look at each other in 
triadic conditions when their laughter, smiling, 
and looking at subjects are constants for all 
conditions. It was predicted that subjects 
would laugh and smile less and would give 
lower humor ratings when companions looked 
at each other relatively more, since the com- 
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panions’ sharing of the experience with the 
subjects would be reduced. 

An equally plausible explanation of such an 
outcome would be that sharing similar circum- 
stances rather than sharing the humor is the 
crucial factor in the facilitation of laughter. 
In order to distinguish between the two hy- 
potheses, recordings were presented through 
headphones and instructions were varied so 
that half of the subjects considered that their 
companions were listening to the same material 
and the other half considered that they were 
listening to different humorous material. 

Sex is included as a third independent vari- 
able in this study because under some natural- 
istic conditions girls have been found to smile 
significantly more than boys (Chapman, 
1972). Although other interpretations are pos- 
sible, this could indicate that girls at this age 
tend to be more concerned than boys in shar- 
ing the situation as a whole, which boys tend 
to be more interested than girls in the humor 
stimuli, 


S METHOD 
Subjects 


The subjects were 70 girls and 70 boys between 
7 and 8 years of age. They were selected on the basis 
of age alone, from an English junior school in Leices- 
tershire. The school’s pupils are aged between 7 and 
11 years, and entry to the school is dependent only 
upon age and location of the home. The school is in 
a middle-class area. 


Material 


The humorous material comprised a story and a 
song which were of 8 minutes 5 seconds and 2 minutes 
33 seconds duration, respectively.* The story was en- 
titled The Funny Green Hair and was professionally 
recorded for this research.* The song was The Laugh- 
ing Policeman by Charles Penrose.5 In terms of 
Kappas’s (1967) classification system which com- 
prises 10 categories of humor, the story and song 
each contain absurd and incongruous elements, but 
the story occasionally relies upon surprise for its 
humor. The material was chosen from a batch of pro- 
fessional recordings after pilot work with more than 
200 children: With the experimenter present, it 
elicited laughter from most children, and it did not 
seem to give rise to any sex differences. 


Apparatus and Environment 


Subjects were tested under standard physical con- 
ditions in a children’s dual-compartment mobile lab- 
oratory which was parked near their school. The 
humorous material was presented to subjects through 


a stereophonic tape recorder and headphones. Con- 
federates received taped instructions through head- 
phones which were connected to the second (inde- 
pendent) channel of the tape recorder. The subjects 
were observed through a one-way screen which was 
an integral part of the dividing wall and was about 
30 centimeters above their seated eye level. The 
experimenter recorded their responses with an event 
recorder in the other compartment, 


Confederates 


Five girls and five boys performed as companions 
to individual subjects in like-sex triads and dyads, 
They were nine years of age and were from the 
same school as subjects, (In a naturalistic experiment, 
the two-year age difference between subjects and 
companions has been found to have little or no effect 
on laughter and smiling, Chapman, 1972.) Each 
confederate received several days of training in groups 
of varying size before the start of the experiment." 
The prime purpose of the training was to ensure that 
the children later adhered to the prerecorded verbal 
instructions which were presented continuously dur- 
ing experimental sessions. The instructions related to 
their direction of gaze and laughter and smiling be- 
haviors. Looking behavior varied according to experi- 
mental conditions, but pairs of companions within 
triadic groups received the same directions. It was 
emphasized during practice runs that their behavior 
should appear natural and that, therefore, members 
of a pair should rarely begin to respond to taped 
directions at precisely the same instant, especially 
when directions related to looking behavior. 

Confederates were also given guidance on how to 
maintain the deception outside the laboratory en- 
vironment and, in particular, how to behave im- 
mediately before and after testing sessions. It was 
arranged that the confederates and subjects would 
each approach the laboratory from a different direc- 
tion but arrive at the same time. Confederates were 
thoroughly acquainted with procedures and method- 
ology, but the experimenter did not discuss aims or 
hypotheses with them. 


Design 

A 5X22 factorial design was employed. There 
were 5 degrees of looking between confederates and 
2 instructional sets (same recording/different re- 
cordings) for both boys and girls, 

Two baseline alone groups and two dyadic condi- 
tions were also incorporated in the testing schedule, 
Some girls and boys were tested on their own during 

8No child reported having previously heard either 
the story or the song. 

4 The story is from Bobby Brewster's Camera by 
H. E. Todd. Leicester, England: Brockhampton Press, 
1959. 

© Columbia Records #SEG 7743. 

6Full details of selection and training procedures 
are reported in Chapman (1972). 
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their only visit to the laboratory (unfamiliar-alone 
condition), and others were tested on their second 
visit (familiar-alone conditon); the latter were the 
only subjects who had prior laboratory experience. 
In the dyadic conditions, male and female subjects 
were paired with like-sex confederates who laughed, 
smiled, and looked (at subjects) according to the in- 
structions recorded for the triadic conditions. The 
instructional set was varied for the pairs of triadic 
conditions: In one dyadic condition subjects were led 
to believe that the companion was listening to dif- 
ferent humorous material (a different-recording con- 
dition), and in the other condition subjects thought 
that their companion was listening to the same mate- 
rial (a same-recording condition). 


Dependent Measures 


Laughter was defined as inarticulate vocal sounds, 
and smiling was defined as an upward stretching of 
the mouth occurring without vocal sound; it was 
recognized, however, that a smile can sometimes be 
accompanied by a loud exhalation of breath at its 
genesis. 

Measures were taken of total time spent laughing 
and total time spent smiling for each subject. Mirth 
scores were computed from these data, being defined 
as laughter plus smiling. Except for the alone condi- 
tions, measures of time spent looking at the com- 
panion(s)’ face(s) were obtained for the first minute 
and for sessions as a whole: No distinction was made 
in the triadic conditions between the subject’s looking 
at one companion rather than the other. 

Humor ratings were obtained by asking subjects 
whether they thought that the material was not very 
funny, quite funny, or very funny; in the course of 
pilot work it was found that some children were not 
able to use more sophisticated scales. Subjects were 
asked a number of additional questions, including 
questions relating to (a) whether the companion(s) 
had looked at them not very much, quite a lot, or 
lots, (b) whether the companions had looked more 


or less at each other than at them, and (c) what had 
made them laugh. 


Procedure 


Subjects listened to the humorous material once 
only. They were told that they were cooperating in 
the selection of material for a children’s library. Boys 
and girls were assigned randomly to conditions, there 
being five boys and five girls tested in each of the 
10 triadic, 2 dyadic, and 2 alone conditions. Experi- 
mental group sessions were sequentially randomized, 
except that all familiar-alone subjects were tested on 
the third day after they had spent 90 minutes to- 
gether listening to stories in the laboratory. 

The confederates’ five sets of instructions were 
based upon previous observations of children who 
listened to the same material in naturalistic dyads; 
the patterns of behavior were derived specifically 
from children who engaged in little laughter and smil- 
ing. There were no variations in laughter or smiling 
instructions: Confederates were instructed to laugh 
on 3 occasions for a total of 6 seconds and to smile 
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on 10 occasions for a total of 20 seconds. All 10 smiles 
were intended to be interpreted as low-intensity 
laughs, and they therefore concurred with amusing 
points on the subjects’ tape, as did their laughter. 

Confederates were directed through prerecorded on- 
line instructions to look at subjects’ faces for ap- 
proximately 30% of the total time. Instances of look 
ing were the same in all conditions: They were asso- 
ciated with amusing points in the humorous r ial, 
and they always accompanied the confederat wn 
laughter and smiling. Levels of looking betw 
of confederates defined the five pairs of triad 
ment groups: Under Conditions A, B, C, D 
they looked at one another’s faces for appro 
60%, 45%, 30%, 15%, and 0%, respectively 
total session time. In Conditions A to D, con $ 
looked at each other as well as at subject 
each of their own laughs and smiles, Cond T 
instructions were also used for dyadic sessior 

The pretesting instructions to subjects w 
ported by a practical exercise which invo! 
inspecting the (two or three) sets of headp! 
listening to music for up to 90 seconds. Depi 
the experimental instruction, subjects (and « 
ates) either heard the same music through 
the headsets, or else they were able to confi 
the sets were connected to two different cha: 

The experimenter allocated seating positions and 
headphones in an apparently arbitrary fashion. In 
fact, it was arranged that in triadic conditions sul 
jects sat facing the screen forming an equilateral 
triangle with the confederates. Alone subjects Jike- 
wise faced the screen. In the dyadic conditions, sub- 
jects sat sideways to the screen face to fac» with 
companions. 

There is no reason to suspect that they w 
of being watched or of participating in a lau 
periment. Nor is there any reason to su 
subjects knew that their companions were either 
federates or frequent visitors to the laboratory 


RESULTS 

Mean laughter, smiling, and looking 3 
for boys and girls tested in the triadic co ci- 
tions are presented in Table 1, and the < 
responding scores for subjects in the alone 
dyadic conditions are presented in Table 2 

Three-way analyses of variance were com- 
puted on the sets of data from the tria on- 
ditions. They indicated that as confederates 
looked at each other less, so subjects’ laughter 
(F = 3.59, df = 4/80, p < .01), smiling (F = 
2.94, df = 4/80, p < .05), and mirth (F = 
3.85, df = 4/80, p< .01) increased. There 
were also significant differences in looking 
scores across Conditions A to E for sessions 
as a whole (F = 3.27, df = 4/80, p < .05), 
but ¢ tests indicated that no two conditions 
were different with respect to time spent look- 
ing at companions in the first minute of test- 


r- 
and 
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TABLE 1 
MEAN SCORES FOR TRIADIC CONDITIONS 
Percentage of time confederates looked at one another ie 
Condition 60% (A) 45% (B) 30% (C) | 15% (D) 0% (E) 
Boys | Girls | Boys | Girls | Boys | Girls | Boys | Girls | Boys | Giris 
Same recording 
Laughing ee 3.4 7.2 9.0 14.4 19.2 18.6 22.2 16.0 18.4 
Smiling 31.0 26.8 42.2 36.4 49.2 52.6 50.2 66.2 62.8 74.6 
Looking 67.2 52.2 85.2 77.4 | 106.6 | 132.8 | 136.8 | 149.0 | 150.6 | 162.4 
Different recording 
Laughing 8.0 7A 7.8 8.2 13.2 9.0 17.8 12.6 17.4 16.4 
Smiling 28.6 38.2 38.8 48.4 46.4 63.4 38.0 69.4 51.0 82.6 
Looking 75.6 107.6 | 108.4 | 142.8 | 114.4 | 152.4 | 108.6 | 149.8 | 100.2 | 146.4 
gh RR ee y iy 
ing. Girls tended to smile (F = 2.71, df= raters. No significant associations were found 


1/80, ns) and look at companions (F = 2.76, 
df = 1/80, ns) more than boys, especially in 
different-recording conditions but the instruc- 
tional set did not significantly affect any be- 
havioral measure, and there were no signifi- 
cant sex or interaction effects, 

Similarly, there were no significant differ- 
ences resulting from the instructional set in 
the dyadic conditions. Subjects’ laughter and 
smiling scores in the latter conditions ap- 
proximated those of subjects in’ the triadic 
Conditions A and B. Except for same-record- 
ing Condition A, looking scores in the triadic 
conditions were significantly higher than in 
the dyadic conditions. 

t tests revealed no significant differences in 
laughter (¢ = 1.08, df = 18, ms) or smiling (¢ 
= .89, df = 18, ms) scores for familiar-alone 
and unfamiliar-alone subjects. The familiar- 
alone subjects laughed less than subjects in 
same-recording Condition C (¢ = 2.53, df= 
18, p < .05) and therefore less than subjects 
in Conditions D and E, but other differences 
in laughter were not statistically significant. 
Familiar-alone subjects smiled less than sub- 
jects tested in any of the dyadic or triadic 
conditions. 

The chi-square statistic was applied to data 
obtained from subjects in Conditions A to E 
to test whether there were any significant as- 
sociations between their humor ratings and 
the amount of time spent in laughter, smiling, 
or mirth, Some 3 X 2 tables were devised by 
dividing subjects into above- and below- 
median responders with respect to behavioral 
measures and into high, medium, and low 


when data from all triadic conditions (n = 
100) were contained in the 3 X 2 analyses. 
However, when data from Conditions A and 
B were analyzed separately from others, it was 
found that the higher humor raters were also 
the children who generated the most laughter 
(x? = 8.31, df=2, p< .02), most smiling 
(x? = 6.42, df = 2, p < .05), and most mirth 
(x? = 8.97, df = 2, p < .02). Likewise, in the 
dyadic conditions, the higher humor ratings 
were given by the above-median laughers 
(x? = 8.58, df= 2, p< .02), above-median 
smilers (x? = 6.30, df=2, p< .05), and 
above-median mirth scorers (°= 7.95, df = 
2, p< .02). 

Although subjects’ laughter, smiling, and 
mirth increased as pairs of confederates looked 
at each other less, the associations with humor 
ratings diminished, and all were nonsignificant 
in analyses in which the number of subjects 
was greater than or equal to 60. Chi-square 


TABLE 2 
MEAN Scores FOR ALONE AND 


Dyapic CONDITIONS 
Laughter Smiling Looking 
Condition — => ae a 
Boys | Girls | Boys | Girls | Boys | Girls 
Unfamiliar- 
alone 6.2 | 1.8 | 7.8} 11.6 
Familiar-alone 8.0 | 5.4 | 13.2 | 13.6 
Same-recording 
dyadic 7.5 | 9.5 | 27.0 | 33.0 | 48.6 | 50.2 
Different- 
recording 
dyadic 7.7 | 6.1 | 28.6 | 40.3 | 53.1 | 67.2 


Note. Scores are given in seconds. 
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values became progressively smaller as the fre- 
quency scores for subjects in pairs of adjacent 
triadic conditions were added to those from 
A and B and analyses were conducted on cells 
of 60 (A, B, & C), 80 (A, B, C, & D), and 
100 (A, B, C, D, & E). Fisher exact probabil- 
ity tests and chi-square tests revealed nonsig- 
nificant associations between ratings and be- 
havioral responses for subjects in Conditions 
C, D, and E. Subjects’ ratings were not af- 
fected by instructional set (x? = 1.29, df = 2, 
p > .50) or by changes in companions’ look- 
ing at one another (x? = 8.30, df = 4, p< 
10): Subjects were divided into high and 
medium-low raters in the latter analysis. 


DISCUSSION 


This experiment shows that the degree of 
interaction between pairs of companions, as 
defined by amount of looking at one another’s 
faces, is influential in determining subjects’ 
responses to humor. In particular, laughter 
and smiling were found to vary inversely with 
amount of interaction between companions 
under conditions in which the latter’s laughter, 
smiling, and looking at subjects were con- 
stants. Furthermore, the companions’ influence 
was not diminished through informing subjects 
that they were listening to different humorous 
material, The results are consistent, therefore, 
with the notion that sharing the social situa- 
tion per se rather than sharing humor is im- 
portant in the social facilitation of children’s 
humorous laughter. 

The significant differences in response levels 
between alone subjects and subjects in dyadic 
and triadic conditions are not readily explic- 
able in terms of conventional social-facilita- 
tion mechanisms (cf. Cottrell, 1972). Nor can 

they be explained in terms of allied concepts 
such as evaluation apprehension, competition, 
collaboration, imitation, or social reinforce- 
ment (cf. Simmel, Hoppe, & Milton, 1968). 
In the absence of other viable explanations, it 
is suggested that the facilitation effects in the 
triadic conditions were a function of the chil- 
dren’s cognitive structuring of the social situa- 
tions, particularly the impression which they 
formed of their own prominence within the 
group. Their answers to the question concern- 
ing how much the companions had looked at 
them indicated that the less confederates 
looked at each other the more subjects felt 
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that confederates had looked at them. But 
confederates always looked at subjects for 
30% of the total session time, and so subjects 
were not blatantly excluded from any of the 
group situations. However, during test ses- 
sions, casual observation indicated that vary- 
ing the amount of time confederate com 
panions looked at one another did influence 
the degree to which subjects perceived that 
they were accepted as members of the group 
and permitted to share the same circum- 
stances. 

After the first few minutes of companions 
looking more at each other than at subjects, 
the latter typically appeared dejected, and 
they seemed to become lethargic as sessions 
progressed. Some slight indication of this is 
apparent in the contrast between the differ- 
ences in looking scores across conditions for 
the first minute of sessions and for sessions as 
a whole: There were clearly no group differ- 
ences in the first minute, but there were sig- 
nificant group differences overall. In the A 
conditions especially, subjects appeared grad- 
ually to isolate themselves psychologically 
from their companions, perhaps because they 
realized after the first few minutes that they 
were not able to achieve equality of member- 
ship within the group. It is interesting in this 
context that additional analyses of data re- 
vealed that group differences in laughter, smil 
ing, and mirth were not statistically significan 
until after the third minute of sessions, but it 
is conceivable that the first few minutes of the 
humorous story were not particularly amusing 
to children. 

Malpass and Fitzpatrick (1959) have men- 
tioned that “intimacy of social relationship” 
might be a factor affecting reactions to humor. 
This suggestion has yet to be explored, but it 
is similar to the “sharing” proposal which is 
examined in this experiment and which has 
been derived from the phrase “to share a 
joke” which exists in the English language. 
There is no established concept which is 
equivalent to sharing and is in current psy- 
chological usage, although the term “dein- 
dividuation” (Cannavale, Scarr, & Pepitone, 
1970; Festinger, Pepitone, & Newcomb, 1952; 
Singer, Brush, & Lubin, 1965) has been used 
to describe the state in which persons do not 
feel that they stand out as individuals: Dein- 
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dividuation leads to the expression of normally 
inhibited behavior. The concept has not been 
applied in the context of very small group 
situations, however, and it is not appropriate 
to this analysis since laughter is not a be- 
havior which is normally inhibited. 

Although the taped instructions were iden- 
tical within sessions for pairs of companions, 
there may nevertheless have been a tendency 
for the presence of a second companion to 
facilitate further the subjects’ laughter and 
smiling, especially when the extra companion 
looked at the subject more than at the fellow 
companion. In practice, the subjects’ laughter, 
smiling, and humor ratings in the dyadic con- 
ditions approximated those of subjects in the 
triadic Conditions A and B in which con- 
federates looked at one another more than 
they looked at the subjects. Overall, therefore, 
there is little evidence to suggest that the 
addition of a second companion necessarily 
enhances children’s responses to humor. 

It was considered beforehand that responses 
may sometimes be inhibited, in the alone con- 
ditions especially, because of apprehension at- 
tributable to the novelty of experimental ar- 
rangements. No sign of this was detected dur- 
ing testing, however, and the familiar-alone 
subjects, who were acquainted with most as- 
pects of the experimental procedures and en- 
vironment, did not laugh or smile significantly 
more than unfamiliar-alone subjects. 

This experiment provides equivocal data 
concerning the relationship between ratings 
and amount of laughter: The association be- 
tween the two measures was significant in 
Conditions A and B only, although there was 
a nonsignificant tendency for ratings to in- 
crease across Conditions A to E. It is pos- 
sible that children in Conditions C, D, and E 
sometimes recognized that they were laughing 
and smiling more than the laughter stimuli 
demanded. Virtually all subjects in Conditions 
C, D, an E, unlike those in A and B, stated 
that their companions had looked more at 
them than at each other, and laughter and 
smiling were facilitated more in these condi- 
tions than in A or B. They might therefore 
have been aware of their facilitated response 
when, for no apparent reason, they found 
themselves the central figure in a group of 
three persons. When subsequently asked to 
judge the funniness of material, they may 


have disregarded their previous responses and 
made genuinely objective appraisals. 

There were no significant sex or interaction 
effects in any of the statistical analyses, but 
the range of mean smiling scores was numer- 
ically greater for girls (see Table 1). This is 
explicable in terms of the hypothesis that girls 
tend to be more interested than boys in shar- 
ing the social situation: Girls would be ex- 
pected to engage in proportionally more non- 
mirthful smiling than boys, and nonmirthful 
smiling would presumably have been relatively 
low in A conditions and relatively high in E 
conditions. Also, girls tended to look more and 
to smile more, especially in different-recording 
conditions. The latter observations are also 
consistent with the above hypothesis, since it 
might be expected that girls would attempt to 
engage in more eye contact and mutual smil- 
ing in order to alleviate the isolating effect of 
the instructional set. The instructional set did 
not affect laughter scores, but nonsignificant 
differences which emerged were in line with 
the hypothesis: The set had virtually no effect 
on the amount boys laughed, but there was a 
tendency for girls to laugh more in the same- 
recording conditions, especially in Conditions 
C, D, and E where they invariably reported 
that their companions had looked more at 
them than at each other. 


A Theory of Socially Facilitated Laughter 


The remainder of this discussion is concerned 
with a new proposal, namely that the tension- 
reduction aspects of humor and laughter can 
be successfully combined with social-facilita- 
tion theory to provide an explanation of so- 
cially facilitated laughter. According to social- 
facilitation drive theory (cf. Cottrell, 1968; 
Weiss & Miller, 1971; Zajonc, 1965) and a 
number of humor and laughter theorists (e.g., 
Berlyne, 1960, 1972; Freud, 1905, 1928; 
Spencer, 1860), levels of responses are related 
to motivational arousal. There are behavioral 
data (cf. Cottrell, 1972; Zajonc, 1972) and 
limited psychophysiological data (e.g., Chap- 
man, 1974; Latané & Cappell, 1972; 
Martens, 1969a, 1969b) to support social- 
facilitation drive theory. Regrettably, there 
appear to have been no psychophysiological 
studies relating to the notion that humor and 
laughter are tension reducing and most of the 
behavioral data are unsupportive (e.g., Byrne, 
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1958; Goldstein, 1970; Lamb, 1968; O’Con- 
nell, 1960; Porr, 1961; Sears, Hovland, & 
Miller, 1940; Singer, Gollob, & Levine, 1967; 
Young & Frye, 1966). However, it has been 
reported that sexual arousal can enhance ap- 
preciation of sexual humor (Dworkin & Efran, 
1967) and that hostile subjects prefer car- 
toons depicting hostile scenes (Strickland, 
1959). Practically all studies have employed 
subjective scales of funniness as the sole 
means of measuring humor appreciation, how- 
ever, and it may be that this measure is too 
insensitive; the results of this and other ex- 
periments (e.g. Chapman, 1973a; Dresser, 
1967; Leventhal & Mace, 1970) suggest that 
different results could possibly have been ob- 
tained by incorporating laughter and smiling 
in measures of humor appreciation. Also, the 
experimental situations and humor stimuli 
may not have been conducive to laughter or 
anything approaching natural humor responses. 

Tt is suggested here that the primary and 
evolutionary function of laughter may be the 
alleviation of various forms of motivational 
arousal in a manner which is socially accept- 
able and physically harmless to others. Super- 
imposed upon this function may be a number 
of others: For example, laughter may inci- 
dentally convey information, or it may some- 
times be used as a means of gaining attention, 
but these are secondary functions. 

Increases in arousal may not be sufficient in 
themselves to induce laughter, but it is pos- 
tulated that a high level is necessary for 
laughter to be triggered, though why arousal 
might be reduced in this particular manner is 
far from obvious. Different types of motiva- 
tional arousal can be recognized from the 
arousal-producing stimuli through which they 
are operationally defined, and they may be 
alleviated by laughter which is prompted by 
various appropriate stimuli. These stimuli may 
sometimes promote short-term increases in 
arousal until the level is reached where 
laughter can be triggered. The arousal which 
is reduced by a particular kind of humor may 
specifically relate to the theme of the humor: 
That is, humor may tap specific areas of mo- 
tivational arousal and may rarely if ever oper- 
ate on certain types of arousal states. This 
would explain why a joke might produce 


laughter under some circumstances and not 
others. 


On the other hand, social situations may 
induce a more generalized “social arousal” and 
this might also be alleviated through laughter, 
via humor and other stimuli: hence, the spon- 
taneous laughter to be witnessed in a stress- 
ful interview situation. Laughter can be a 
means of raising or lowering arousal to an 
optimum level. Under normal circumstances 
in a small group situation, humor stimuli may 
not always be the most potent laughter-pro- 
voking stimuli but may constitute a form of 
background noise against which laughter is 
perceived as a natural, acceptable response and 
which serves to reduce social arousal. It could 
also be the case, as Berlyne (1969) and others 
suggest, that increases and decreases in arousal 
which accompany changes in stimulation are 
pleasurable in themselves. Humor may be em- 
ployed in a social situation where arousal is 
near the laughter threshold to supplement 
levels of arousal so that laughter is triggered 
and a rapid drop in arousal is experienced, 
perhaps to below the prior “social level,” since 
laughter may provide an opportunity to with- 
draw attention and hence to reduce tem- 
porarily the psychological presence of others. 
Common experience suggests that it is rare for 
a humor stimulus to be of such intensity that 
it induces sufficient arousal to trigger laughter 
in a nonsocial situation. Rather, humor may 
be a trigger for the release of socially induced 
arousal. 
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Causal Attribution in the Mixed-Motive Game: 
Effects of Facilitory and Inhibitory 
Environmental Forces* 

Michael E. Enzle, Ranald D. Hansen, and Charles A. Lowe? 
University of Connecticut 
The effects of environmental forces in a matrix game on subjects’ causal at- 


tributions for a simulated other’s behavior were investigated using female 
undergraduates in a 2X 2 factorial design (100% cooperative versus 100% 


competitive strategy, competition-facilitating versus competition-inhibiting ` 
matrix). Significant Matrix X Strategy interactions for attribution measures P 
indicated that personal attributions were made for behavior which occurred 4 


despite inhibitory environmental forces and that environmental attributions 
were made for behavior which occurred in line with facilitory environmental 
forces. There was no evidence that causal attributions produced by the pro- 


cedures mediated either game behavior or trust and liking for the other, 


Kelley (1971) has suggested in a recent 
conceptual analysis of causal attribution proc- 
esses that both personal and environmental 
forces may be viewed as either facilitating or 
inhibiting behavior. Given that a behavior has 
occurred, inhibitory personal forces are as- 
sumed to be absent. Whether or not facilitat- 
ing personal forces are seen to cause the be- 
havior is determined largely by the strength 
and direction of concomitant environmental 
forces. Specifically, when the behavior occurs 
in the presence of inhibitory environmental 
forces, strong personal attribution should be 
made. When the behavior occurs in the pres- 
ence of facilitory environmental forces, causal 
attribution to both personal and environ- 
mental forces is likely (Kelley, 1971), since 
both forces could have facilitated the be- 
havior. Since facilitory environmental forces 
provide sufficient cause for the behavior’s oc- 
currence and are readily observable (personal 
forces must typically be inferred), we might 
expect environmental attribution to exceed 
personal attribution in this instance (Heider, 
1958), 

The present study applied this conceptual 
analysis to the mixed-motive game situation. 
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of Connecticut Research Foundation Grant 5.171-*- 
6-0215-35-179 to the third author. 

2 Requests for reprints should be sent to Charles 
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sity of Connecticut, Storrs, Connecticut 06268. 


Because causal attributions are assumed to be 
systematically related to molar behavior and 
to be more sensitive than the latter to experi- 
mental manipulations, this focus on the sub- 
jective meaning of game behavior (i.e., on 
causal attribution) should be highly informa- 
tive. The game situation was structured such 
that subjects would unambiguously interpret 
the environment as either facilitating competi- 
tion and inhibiting cooperation or inhibiting 
competition and facilitating cooperation. Spe- 
cifically, subjects interacted with a simulated 
player whose cooperative or competitive be- 
havior occurred in the context of a matrix de- 
signed to either facilitate or inhibit that be- 
havior. Strong personal attribution was ex- 
pected whenever the matrix inhibited the 
observed behavior; strong environmental at- 
tribution was expected whenever the matrix 
facilitated the observed behavior. Further, 
within experimental conditions, personal at- 
tributions were expected to exceed environ- 
mental attributions when the matrix inhibited 
the observed behavior and conversely when 
the matrix facilitated the observed behavior. 

Although the major focus was on causal 
attributions, predictions regarding traditional 
game measures were also made. Specifically, 
more cooperative choices were expected when 
the simulated other’s strategy was cooperative 
rather than competitive (see Oskamp, 1971) 
and when the matrix inhibited rather than 
facilitated competition (e.g., Bixenstine & 
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* Blundell, 1966). Subjects were also expected PERSON 2 

to evaluate (trust and like) the simulated BLACK RED 
other more positively when he cooperated than = el 

when he competed. Finally, attributi YOUR PTS. © OUE PTEIN 

r pete ally, ributions should BLACK | HER PTS r6 -| HER PTS. 26 

mediate evaluative ratings such that attribu- PERSON I 

tion to personal forces should produce more RED | YOUR PTS.=6 | YOUR PTS.=0 
extreme evaluations (Heider, 1958; Jones & HER IPTS 54i RERIPTAN 
Davis, 1965). Thus, it was expected that the 

cooperative other would be trusted and liked PERSON 2 

more and the competitive other would be BLACK RED 
trusted and liked less when the matrix in- 

ibi An ; YOUR PTS.=6 | YOUR PTS.=0 
hibited rather than facilitated the other’s be- BLACK | HER PTS. <6 | HER PTS. <6 
havior. In two experimental conditions, espe- person | 
cially strong correlations between evaluative RED | YOUR PTS.=6 | YOUR PTS. =-4 
ratings and personal attribution measures AER ETS AAO EER RTS 


should obtain—positive correlations with en- 
vironmentally inhibited cooperative behavior 
and negative correlations with environmentally 
inhibited competitive behavior. 


METHOD 
Subjects 


Subjects were 124 females who participated in 
order to partially fulfill an introductory psychology 
course requirement. Eighty-four subjects were ran- 
domly assigned in pairs to the four experimental con- 
ditions. The remaining 40 subjects were employed to 
check the matrix manipulation. 


Procedure and Manipulations 


Dyad members were isolated from each other and 
provided with a typical game apparatus consisting of 
a black button and a red button and four lenses 
which displayed the payoff matrix. Recorded instruc- 
tions introduced the experiment as a study of dyadic 
interaction and stressed the accumulation of points, 
but avoided producing a strong competitive or co- 
operative orientation. Each matrix shown in Figure 1 
was employed for one half of the dyads. For both 
matrices, if the other person had made the black 
choice, the competitive choice (red) led to the high- 
est relative gain, while the cooperative choice 
(black) led to the higher mutual gain. While maxi- 
mum values. for the red and black choices in both 
matrices were identical, the minimum values differed. 
For the competition-facilitating matrix (CFM), the 
lower payoff for the red choice was zero but was 
minus four for the black choice. This relationship 
was reversed in the competition-inhibiting matrix 
(CIM). 

Subjects responded to a trial light on each of 10 
trials by pushing one of the two buttons. Although 
each subject was led to believe she was interacting 
with the other, the experimenter simulated the 
other’s game choices by illuminating the appropri- 
ate matrix cell for each subject. One half of the 
subjects in each matfix condition received 100% red 


Ficure 1, Payoff matrices displayed to subjects 
from the point of view of Person 1. Competition 
facilitating matrix (CFM) is above and competition 
inhibiting matrix (CIM), is below, with red choice 
representing competition. 


choices (competitive strategy), and the other half 
received 100% black choices (cooperative strategy). 


Measures 


The primary dependent measures, contained in a 
postexperimental questionnaire, were two factors 
specified as potential forces influencing the other 
player’s behavior: a personal factor and an environ- 
mental factor. Subjects rated the influence of each 
factor on the other’s behavior on 7-point scales after 
reading both statements. Subjects also responded on 
O to 6 scales to two sentiment items (trust and 
liking). As a check on the strategy manipulation, 
subjects rated the other’s behavior on a 7-point scale 
where —3 was very cooperative and 3 was very 
competitive. 

Two control groups were used to check the mat- 
rix manipulation. Twenty-two subjects viewed a 
videotaped instruction session for the CFM condi- 
tions, and a second group of 18 subjects viewed a 
similar videotape for the CIM conditions. After the 
videotape presentation, subjects indicated which 
game choice, made over a series of trials, would “in- 
sure the least possible loss and the greatest possible 
gain regardless of the other’s choices.” 


RESULTS AND DISCUSSION 


The competitive other was rated as more 
competitive than the cooperative player (M 
= 2.21 versus M = —1.33; F = 92.09, dj = 
1/80, p < .001). No other effects reached sig- 
nificance for the competitiveness measure. 
Data from control subjects indicated that 
the two matrices were also perceived differ- 
ently. Eighty-two percent of the subjects who 
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TABLE 1 
MEANS FOR PERSONAL AND ENVIRONMENTAL ATTRIBUTIONS 


Personal Environmental FS 
Experimental condition spana attep on (PA versus EA) 
Competition facilitating matrix f: R 
Cooperative strategy 4.25 3.19 pie 
Competitive strategy $ 3.33 4.71 —3. 
Competition inhibiting matrix z 
Cooperative strategy 2.86 3.86 - 7a 
Competitive strategy 3.81 3.14 4 
Note. For each mean, » = 21. Attribution means are based on 0 to 6 scales with higher numbers indicating stronger attributions 
s Two-tailed tests on 40 dfs, 
*p <.05. 
*p <.01. 


received simulated CFM instructions, and 
6% of those who received CIM instructions 
indicated red as the more rational choice 
(x? = 22.51, df = 1/38, p < 001). 

Table 1 presents the means for causal at- 
tributions to personal and environmental fac- 
tors. Analyses of variance revealed no signifi- 
cant main effect for either variable. In line 
with predictions derived from Kelley’s (1971) 
model, both analyses yielded significant Mat- 
rix X Strategy interactions (F = 8.92, df= 
1/80, p < .01 for personal attributions, and 
F = 10.24, df= 1/80, p < .01 for environ- 
mental attributions), indicating that personal 
attributions tended to be greater when en- 
vironmental forces inhibited rather than fa- 
cilitated another’s behavior and that environ- 
mental attributions tended to be greater when 
the environment facilitated rather than in- 
hibited an observed behavior. 

Planned comparisons between these means 
were conducted to determine the effects of 
each matrix on causal attributions. As ex- 
pected, environmental attributions for com- 
petitive behavior and personal attributions 
for cooperative behavior were greater for the 
CFM than for the CIM (F = 10.27, df= 

1/80, p < .01 for environmental attributions, 
and F = 9.85, df = 1/80, p < .01 for per- 
sonal attributions). Although the means were 
in the expected direction, the effects of the 
CIM on personal attributions for competitive 
behavior and environmental attributions for 
cooperative behavior did not reach signifi- 
cance. These failures to obtain significance 
suggest that this matrix (CIM) was not per- 
ceived to have a strong effect on the simu- 
lated other’s behavior, Indeed, that the level 


of cooperation failed to exceed 50% with 
this matrix lends some support to this in- 
terpretation. However, the measure employed 
in the control conditions focused on the ra- 
tional aspects of choosing a particular game 
alternative and unfortunately did not assess 
the perceived facilitory and inhibitory effects 
of each matrix. While this shortcoming pre- 
cludes a clear interpretation of these pre- 
dictive failures, it underscores the necessity 
for attribution researchers to obtain thorough 
and independent assessments of the perceived 
causal properties of the actual situation pre- 
sented to the subjects. 

Individual ¢ tests were employed to deter- 
mine the relative strengths of attributions 
within each experimental condition. As ex- 
pected, environmental attributions were 
greater than personal attributions for com- 
petitive behavior with the CFM and for co- 
operative behavior with the CIM. Also, per- 
sonal attributions were greater than environ- 
mental attributions for cooperative behavior 
with the CFM. However, the difference be- 
tween personal and environmental attribu- 
tion means for competitive behavior with the 
CIM, although in the expected direction, did 
not reach an acceptable level of significance 
(p> 2). 

These results contradict the suggestion 
(Kelley, 1971) that the two different facili- 
tory forces (environmental and personal) 
should produce causal ambiguity. When both 
personal and environmental forces could have 
produced the observed behavior, subjects in 
the present study preferred to attribute the 


cause of the behavior primarily to environ- 
mental forces. 
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Consistent with previous research (Galio 
& McClintock, 1965; Oskamp, 1971), the 
analysis of variance performed on the game 
choice scores revealed that game behavior 
was influenced by both strategy and matrix. 
More cooperative choices were made in re- 
sponse to a cooperative than a competitive 
other (M = 5.24 versus M = 3.36, F = 9.56, 
df = 1/80, p< .01). Subjects also made 
more cooperative choices with the CIM than 
with the CFM (M = 4.95 versus M = 3.64, 
F = 4.48, df = 1/80, p< .05). The inter- 
action effect was not significant. 

No evidence was found for the assumption 

hat attribution to personal forces leads to 
increased reciprocal cooperation or competi- 
tion (Nemeth, 1972). When internal attri- 
butions regarding the simulated other’s be- 
avior were pitted against the facilitating 
ffects of the matrix, the latter proved to be 
more powerful determinant of subjects’ 
ame behavior (i.e., main effect for matrix). 
f supporting evidence for the assumption 
hat attributions mediate behavior is to be 
obtained, such evidence will not be found em- 
ploying situations which exert strong facili- 
tory environmental forces. 

The expected extreme ratings of trust and 
liking were not obtained in the two relevant 
conditions (i.e. cooperative behavior with 
the CFM and competition with the CIM). In 
fact, analyses of variance on trust and liking 
scores produced only one significant effect— 
a main effect for strategy on trust scores 
(F= 7.25, df = 1/80, p<.01). In agree- 
ment with theoretical game considerations 
(Deutsch, 1960), the cooperative other (M 
= 4.07) was trusted more than the competi- 
tive other (M = 3.09). The failure to obtain 
a similar result for liking was surprising in 
view of the positive findings reported by 
Kleinke and Pohlen (1971) and Wilson 
(1969). This apparent inconsistency might 
be accounted for by potentially important 
procedural variations, Subjects in these latter 
studies were seated near each other and were 
in visual contact while dyad members in the 
present study were completely isolated. Tf, as 
Mehrabian (1968) suggests, proximity and 
visual contact are important determinants of 
attraction, the failure of the liking predic- 
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tions in the present study is more under- 
standable. Some minimal amount of nongame 
contact between subjects may be basic to the 
development or the expression of differences 
in liking. The results for trust and liking pro- 
vide little support for the proposition that 
attributions mediate evaluations (Heider, 
1958; Jones & Davis, 1965). Although Pear- 
son product-moment correlations did show 
that personal attribution was positively re- 
lated to both trust (r = +.62, df = 20, p< 
01) and liking (r = +.54, df = 20, p < .01) 
for cooperative behavior with the CFM, the 
expected negative relations between these 
measures for competitive behavior in the 
CIM were not found (r = —.04 for trust and 
r= —.22 for liking). Of course, these data 
may mean that attribution measures are more 
sensitive to game manipulations than are 
trust and liking measures. 

More importantly, these results together 
with those for game behavior emphasize the 
possibility that causal attributions may con- 
stitute a response class parallel but unrelated 
to behavior and attitudes (Bem, 1972). How- 
ever, these results should probably not be 
taken as an indication that attributions never 
play a mediational role. Attributions may be 
quite important when the attributor’s þe- 
havior is not highly determined by environ- 
mental forces and/or when sufficient cues for 
evaluative discriminations are available. 

In conclusion, the present study provides 
basic support for Kelley’s (1971) model, with 
the revision that when both environmental 
and personal forces are plausible causes for 
behavior, attribution to environmental forces 
predominates. Although this study was not 
designed specifically to assess the hypothe- 
sized mediational role of causal attributions, 
the data bearing on this issue question the 
use of attribution theory for predicting be- 
havior and sentiment. 
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Sets of three propositions (A, B, and C) were selected. In each set, B was 
more likely to be true if A was true than if A was false, and C was more 
apt to be true if B was true than if B was false. After receiving information 
that other persons either agreed or disagreed with B, subjects wrote an essay 
either supporting or opposing this proposition. Both essay writing and others’ 
opinions affected beliefs that B was true (Pz) in the direction of the position 
advocated, Effects of these variables upon beliefs that C was true (Pc) 
were in the same direction but less in magnitude. Essay writing affected 
beliefs that A was true (Ps) only when subjects were told that other persons 
agreed with B. Changes in Ps and Pe did not depend upon whether A and 
C were mentioned in the essays about B. Changes in beliefs were similar to 
those predicted by Wyer’s subjective probability model of cognitive organiza- 
tion and were attributed to a tendency for subjects to regain cognitive con- 


sistency as defined in terms of this model. 


Two general methods have often been used 
to induce a subject to change his beliefs. In 
one procedure, the subject is induced to pre- 
pare and deliver a communication advocating 
the position that one wishes him to adopt 
personally (cf. Brehm & Cohen, 1962; Col- 
lins & Hoyt, 1972; Rosenberg, 1965; Scott, 
1957; Sherman, 1970; Wallace, 1966). In 
the second, most often used in studies of 
conformity, the subject is informed that an- 
other person or group (O) advocates this 
position (cf. Harvey, Kelley, & Shapiro, 
1957; Hollander, 1958; Jackson & Saltzstein, 
1958; Sampson & Insko, 1964; Tannenbaum, 
1967; Wyer, 1966, 1967). At times this in- 
formation is accompanied by a verbal com- 
munication from O containing arguments in 
support of the position being advocated. In 
other cases, only O’s position is conveyed. 
Belief change has been successfully induced 
through each procedure. However, the com- 
bined effects of these procedures are unclear, 
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particularly under conditions in which their 
implications differ (e.g., when a subject is 
asked to defend a proposition with which O 
disagrees). Equally unclear is the generaliza- 
tion of these effects to beliefs other than the 
one to which the procedures directly pertain. 
The study reported here was concerned with 
these issues. 

A technique that changes a belief in Prop- 
osition B may also affect beliefs both in 
propositions that if true, imply B (premises) 
and in propositions that are likely to be 
true if B is true (conclusions), McGuire 
(1960) has hypothesized that when beliefs 
in logically related propositions are made sa- 
lient to a subject by presenting them in a 
questionnaire, cognitive reorganization, in- 
volving change in one or more of these be- 
liefs, occurs in order to eliminate inconsis- 
tencies among them. Empirical support for 
this hypothesis has been obtained when con- 
sistency is defined by subjective probability 
models of cognitive organization (Rosen & 
Wyer, 1972; Wyer & Goldberg, 1970). It 
follows that if a set of related beliefs is made 
salient in an initial questionnaire, and if one 
of these beliefs is then modified using either 
of the influence techniques described above, 
corresponding changes in the other beliefs 
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should occur in a manner that maintains their 
internal consistency. 

Research based upon other theories of cog- 
nitive consistency (e.g, Festinger, 1957; 
Heider, 1958; Osgood & Tannenbaum, 1955) 
also indicates that changes in one cognition 
are accompanied by changes in other, related 
cognitions, To give but a few examples, Tan- 
nenbaum (1966) found that information 
bearing upon the favorableness of a concept 
X affected not only evaluations of a person 
whose attitude toward X was known, but 
also evaluations of a second concept, Y, 
which this person favored or opposed. Gerard 
(1965) found that information about the sim- 
ilarity of a subject’s judgment of a stimulus 
to that of another person affected the sub- 
ject’s attitude toward this person. Finally, 
research designed to test implications of cog- 
nitive dissonance theory (for recent summa- 
ries, see Collins & Hoyt, 1972; Zajonc, 1968) 
suggests that behavior-related cognitions 
(e.g., one’s belief that he has decided to en- 
gage in a certain activity) may often affect 
other cognitions (beliefs that the chosen ac- 
tivity is desirable or that alternative behav- 
iors are undesirable). 

While the tendency to regain cognitive 
consistency may be one factor underlying the 
effect of social influence procedures upon be- 
liefs to which they do not directly pertain, 
there are other possible reasons for these 
effects. For example, when a subject writes 
an essay pertaining to B, he is likely to make 
statements that, if true, support the posi- 
tion he is advocating. In the course of pre- 
senting these arguments as persuasively as 
possible, the subject may become personally 
more convinced of their validity, and this 
may be reflected in his subsequent responses 
to questionnaire items that pertain to these 
arguments. One implication of this hypothe- 
sis is of course that a subject’s beliefs in 
propositions related to B will be affected by 
essay writing only when these propositions 
are used in preparing his essay. Thus, if A 
implies that B is true, it should be used more 
often in pro-B essays than in anti-B essays, 
and beliefs in A should change more when 
the first type of essay is written, Moreover, 
beliefs in A are less likely to change when 
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a conformity paradigm is used to affect be 
liefs in B; if a subject is simply told that O 
advocates B without being given any basis 
for O’s belief, there is no reason to expect 
that his beliefs in other propositions will 
change except through a process of cognitive 
reorganization. 

The effects of cognitive reorganization ar 
often difficult to isolate because simultaneou 
changes in several different beliefs may b 
involved. The relative magnitudes of thes¢ 
changes are hard to predict a priori. For ex- 
ample, a subject may believe that B is true 
because A is true and A implies B. If he sul 
sequently is induced to believe that B is not 
true, this could lead to a change in his be 
lief that A is true. However, if it does not 
one would not know whether the subject ha 
failed to reorganize his cognitions accordin 
to the laws of logic or whether he has simp! 
changed other relevant beliefs (e.g., his b 
lief that A implies B). To understand the 
dynamics of cognitive reorganization when 
many beliefs are potentially involved, a pre 
cise description of the manner in which thes« 
beliefs are theoretically interrelated is re- 
quired. The subjective probability model pro- 
posed by Wyer and Goldberg (1970) may 
serve this purpose. According to this formu 
lation, the beliefs associated with two proj 
ositions, A and B, are related according to 
the equation: 


Pp =PasPayat+ (1—Pa)Pryw, [1] 


where P4 and Ppr are the subjective proba- 
bilities (beliefs) that A and B are true, re 
spectively, and Pz/4 and Prya: are the beliefs 
that B is true if A is and is not true, respec 

tively. Similarly, beliefs associated with B 


and a third proposition are related according 
to the equation 


Po = PpPoyn + (1—Px)Poje, [2] 


where Po is the belief that C is true and 
Pon and Pe, are the beliefs that C is true 
if B is and is not true. Several studies (Rosen 
& Wyer, 1972; Wyer, 1970, 1972, 1973: 
Wyer & Goldberg, 1970) suggest that equa- 
tions similar in form to those above provide 
accurate descriptions of the manner in which 
beliefs are related and of the effects of one 
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belief upon others. If Equations 1 and 2 are 
valid, changes in P, as a result of the influ- 
ence techniques to be considered in this ar- 
ticle may affect not only Py and Pe but also 
the “conditional beliefs” relating A, B, and 
C (Ppa, Pozn, etc.). However, the similarity 
between obtained and predicted changes in 
Ps (based upon Equation 1), and between 
obtained and predicted changes in Pe (based 
upon Equation 2), should indicate the extent 
to which changes in beliefs other than Py are 
attributable to cognitive reorganization of the 
sort implied by these equations. 

An additional concern of the present study 
was to determine the combined effects of role 
playing and conformity techniques. It seemed 
possible that the effects of the two procedures 
would be additive. On the other hand, if the 
belief that O disagrees with a proposition is 
dissonant with the cognition that one is pub- 
licly advocating it (Festinger, 1957) and if 
a change in one’s belief that the proposition 
is true results from an attempt to reduce this 
dissonance, then essay writing should have 
less effect when subjects are told that O sup- 
ports the position advocated in the essay 
than when they are told that O opposes this 
position. Alternatively, information that O 
disagrees with the position to be advocated 
may affect the effort expended in role play- 
ing and therefore may affect the change in 
beliefs produced by it for this reason. These 
factors would produce interactive effects of 
the two influence procedures, 


METHOD 
Overview 


One hundred twenty-eight introductory psychol- 
ogy students participated in the experiment to ful- 
fill a laboratory requirement. Subjects were initially 
administered a belief questionnaire containing seven 
items associated with each of four issues. The items 
associated with each issue concerned three proposi- 
tions (A, B, and C) which were related in the 
manner described by Equations 1 and 2. After com- 
pleting the questionnaire, subjects received informa- 
on that either most other college students or very 
few other students agreed with Proposition B per- 
taining to one of the four issues and then wrote an 
essay either supporting or opposing B. The origi- 
nal belief questionnaire was then readministered. 
Eight subjects participated under each of the 16 
combinations of “test” issue, others’ position (pro-B 
versus anti-B) and the position advocated in the 
essay (pro-B versus anti-B). 
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Construction of Belief Items 


Four sets of three statements (A, B, and C), cach 
pertaining to a different contemporary issue, were 
selected: 


Issue 1. 

A. Cigarette sales will be severely restricted 
by 1978. 

B. Cigarette smoking will decrease greatly 
over the next 10 years. 

C. Lung cancer and respiratory ailments will 
become less common during the next 10 
years, 

Issue 2. 

A. People unconsciously imitate the behavior 
of persons they see on television. 

B. Watching television causes people to be- 
have more violently. 

C. Television programs showing unnecessary 
violence should be banned. 

Issue 3. 

A. The number of students per faculty mem- 
ber at UICC [University of Illinois at 
Chicago Circle] will increase over the next 
5 years. 

B. The personal contact between students 
and faculty at UICC will decrease during 
the next 5 years, 

C. The quality of undergraduate education 
at UICC will decrease during the next 5 
years. 


Issue 4. 

A. Marijuana is physically and psychologi- 
cally harmless. 

B. The use of marijuana will be legalized by 
1978. 

C. Marijuana will become more widely used 
than alcoholic beverages within the next 
10 years, 


In each set of statements, it was expected that B 
would be believed more likely to be true if A was 
true than if A was false and that C would be be- 
lieved more likely to be true if B was true than if 
B was false, 

Seven items were constructed for cach set of prop- 
ositions, beliefs which corresponded to the sub- 
jective probabilities Pa, Px, Po, Puja, Pujar, Pem, 
and Poy» as defined in Equations 1 and 2. (For 
example, the belief items corresponding to Pa, Pa, 
and Po for the first issue were identical to the three 
propositions, A, B, and C listed above; the items 
corresponding to Paya and Prw were “Tf cigarette 
sales are severely restricted by 1978, cigarette smok- 
ing will decrease greatly during the next 10 years” 
and “Even if cigarette sales are mot severely re- 
stricted by 1978, cigarette smoking will decrease 
greatly during the next 10 years”; similarly, the 
items corresponding to Po; and Pe» were “Tf 
cigarette smoking decreases greatly over the next 10 
years, lung cancer and respiratory ailments will be- 
come Jess common during the next 10 years” and 
“Eyen if cigarette smoking does not decrease greatly 
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TABLE 1 


MEAN CONDITIONAL PROBABILITIES ASSOCIATED 
WITH EAcH Issue 


| 
Pcja' | Difference 


Issue | Paya | Paya’ | Difference | Poze 
1 | 603 | 420 183 -598 | .383 215 
2 | 452 | 294 AS8 -609 | -406 | 203 
3 | .697 | .391 306 574 | 381 193 
4 | .622 | .429 +193 586 | 417 -169 


over the next 10 years, lung cancer and respiratory 
ailments will become less common during the next 
10 years.”) The resulting 28 items were ordered 
randomly and presented in a questionnaire; each 
item was followed by an 1l-point scale from 0 
(not at all likely) to 10 (extremely likely) for sub- 
jects to use in recording their belief that it was true. 
These responses were subsequently divided by 10 to 
convert them to units of subjective probability, 


Procedure 


Subjects, run in groups of 8 to 10, were told that 
the experiment was concerned with students? atti- 
tudes and beliefs about various contemporary issues. 
They initially completed the belief questionnaire 
described above with instructions to estimate the 
likelihood that each statement was true along the 
scale provided, Then they were told that a second 
part of the study was to determine how well per- 
Sons could prepare persuasive communications on 
the spur of the moment, without much opportunity 
to think about the issues before developing their 
arguments, Each subject was then given a sheet of 
Paper, at the top of which was written the state- 
ment labeled B pertaining to one of the four issues 
described above, and was told to prepare a com- 
munication that would argue as persuasively as pos- 
sible that the statement was either true or false, as 
indicated on the sheet, Before beginning, subjects 
were told to note the proportion of introductory 
psychology students who had expressed agreement 
with the statement in an earlier experiment, since 
this information might be helpful to them in decid- 
ing what sorts of arguments would be most effec- 
tive. This proportion, which was typed on the 
answer sheet below the Proposition each subject 
Was asked to discuss, varied slightly over issues but 
was either between 89% and 91% (under pro-B 


conditions) or between 9% and 11% (under anti-B 
conditions), 


W person reports his be- 
liefs more accurately after he has had a chance to 
has become familiar with the 
To decrease commitment ef- 
were told to consider each 
answer carefully, since their responses to this ques- 
tionnaire rather than the preceding one would be 


used to represent their true beliefs about the issue 
described. 


RESULTS 
Preliminary Analyses 


Analyses oj conditional probabilities. To 
validate the assumption that the propositions 
pertaining to each issue were related, the rela- 
tive magnitudes of Py, and Pha, and of 
Poyn and Peyx, were determined for each 
test issue. These values, collapsed over influ- 
ence conditions and the two administrations 
of the questionnaire, are shown in Table 1, 
While the difference between conditional 
probabilities varied over issues, in all cases 
Paya > Prya and Pozn > Pew as expected 
(F > 125.0, df = 1/112, p < 001). 

Analyses of essay content. Each communi- 
cation was scored for whether the arguments 
contained in it pertained to premises (prop- 
ositions which if true implied B) or conclu- 
sions (propositions which would be true if 
B were true) and also for whether any of the 
arguments pertained to the specific proposi- 
tions about which beliefs were assessed (A 
or C). One subject considered both premises 
and conclusions in writing his essay, and 
two others wrote essays that were not classi- 
fiable. As might be expected, arguments per- 
tained to premises a greater proportion of 
times (.84) than to conclusions (.16), re- 
gardless of whether the essay supported or 
opposed B. Of those cases in which essays 
pertained to premises, Proposition A was 
mentioned a greater proportion of times in 
pro-B essays (.59) than in anti-B essays 
(39; xX = 4.21, p< 05), and a nonsignifi- 
cantly lower proportion of times when other 
persons were ostensibly pro-B (.44) than 
when they were anti-B (Sa NSIS, $> 
-10). Finally, the Proportion of times that 
arguments pertained to A rather than to 
other premises depended greatly upon the 
test issue involved (xX? = 28.85, p< 01), 
The implications of these data are considered 
presently, 


Effects of Influence Procedures 


To determine the effect of influence pro- 
cedures upon the belief in B (Px), the “rela- 
tive change” in this belief was calculated for 
each subject separately by subtracting the 
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mean change in Pp for each of the three 
“control” issues to which questionnaire items 
pertained from the change in Py for the test 
issue. (Pooled over subjects in each influence 
condition, each issue appeared the same pro- 
portion of times as a test issue and as a 
control issue.) Relative changes in P4 and 
Po were calculated similarly, An analysis of 
variance was then performed on these change 
scores as a function of belief (P4, Ps, or 
Po), others’ position (pro or anti), essay po- 
sition (pro or anti), and test issue. 

Averaged over beliefs, both influence pro- 
cedures had the expected effects (in each 
case, F > 6.31, df = 1/112, p < .02). How- 
ever, the triple interaction of belief with the 
two influence variables (F= 6.17, df= 
2/224, p < .01) indicated that these effects 
depended upon the belief involved. Data rele- 
vant to this interaction, which was not con- 
tingent upon the test issue to which influence 
procedures pertained (F = 1.54, df = 6/224, 


‘p> .10) are shown in Table 2. To help 


clarify the nature of this interaction, relative 
changes in each belief were analyzed sepa- 
rately. Analyses of change in Pz yielded main 
effects of both others’ position (F = 5.59, 
dj=1/112, p<.05) and essay position 
(F =4.10, df = 1/112, p < .05). Although 


‘the interaction of these variables only ap- 


proached significance (F = 3.42, df = 1/112, 
p< .10), data in Table 2 indicate that Pz 
decreased only when subjects wrote anti-B 
essays after being told that others were 
anti-B; the change under this condition 
(M = —.127) differed substantially from 
the change occurring in each of the other 
three combinations of essay position and 
others’ position (in each case, F > 7.50, df 
= 1/112, p<.01), while changes in the 
latter three conditions did not differ from one 


_ another (in each case, F < 1.0). 


The effects of influence variables upon Po 
were less in magnitude than their effects upon 
Pp but were similar in direction. A separate 
analysis of change in Pe yielded no signifi- 
cant main effects or interactions involving 
these variables (in each case, F < 1.45, df 
= 1/112, p>.10). However, a post hoc 
analysis of data pertaining to Pr and Po in 
combination indicated that the main effects 
of others’ position and essay position were 
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TABLE 2 


Mean ReLAtive Ci IN BELIEFS AS A 
FUNCTION OF INFLUENCE VARIABLES 


Others’ position 
Belief -= w 
Pro-B Anti-B M 
Pa | 
Pro-B essay 167 005 O86 
Anti-B essay —.051 005 —.023 
058 005 032 
Px 
Pro-B essay 076 052 064 
Anti-B essay 068 —.127 —.030 
M 072 —.038 O17 
Pe 
Pro-B essay -057 018 038 
Anti-B essay 018 —.036 | —.009 
M -038 —.009 O15 
(Para + Pea’) 
Pro-B essay -086 -035 -061 
Anti-B essay 020 —.036 —.009 
M .053 —.001 026 
(Pern + Pere) 
Pro-B essay —.005 100, 048 
Anti-B essay — .064 —.042 —.053 
M —.035 -029 —.003 
I 


both significant (F > 4.34, df = 1/112, p < 
.05) but did not depend upon the belief 
involved (F < 2.58, dj = 1/112, p > .10). 
In other words, the effects of influence vari- 
ables upon Po, although not themselves sig- 
nificant, did not significantly differ from the 
effects of these variables upon beliefs to which 
they directly pertained. 

Analysis of the relative change in Pa 
yielded an interaction of others’ position and 
essay position (F = 6.97, dj = 1/112, p< 
.01). This interaction indicated that writing 
essays about B affected P4 only when other 
persons ostensibly agreed with B; when 
others disagreed with B, essay writing had no 
effect whatsoever upon P4. 

Effects of type of argument. It was specu- 
lated that subjects who write essays advocat- 
ing a position on B call forth specific argu- 
ments in defense of this position and become 
more personally convinced of the validity of 
these arguments as a result of their attempt 
to present them persuasively. However, two 
comparisons suggest that this cannot account 
for the effects of influence variables upon Pa. 
First, Table 2 shows clearly that the effect 
of essay writing affected P4 only when others 
agreed with B. However, the proportion of 
essays which pertained to A was slightly 
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lower when others agreed with B (.38) than 
when others disagreed with B (.44). Second, 
the effect of essay writing upon P4 was not 
significantly contingent upon the issue about 
which essays were written despite the fact 
that essays about Issue 1 mentioned A in 
only 3 of 32 cases, while those about Issue 
2 mentioned A in 20 of 32 cases. More spe- 
cifically, A was mentioned in .125, .625, .812, 
and .438 of the pro-B essays pertaining to 
Issues 1-4, respectively, but the relative 
changes in P4 were +.026, —.071, +.062, 
and +.267. Similarly, the proportion of times 
A was mentioned in anti-B essays was .062, 
-625, .187, and .375 when Issues 1-4 were 
involved, yet the changes in P, were —.068, 
—.011, —.029, and +.017, respectively. The 
proportion of times A was mentioned and the 
mean change in P4 in the direction implied 
by the essay were correlated .091 over the 
experimental conditions (n= 8, p> .25). 
While these data are too limited to be con- 
clusive, there is no evidence of a relationship 
between the likelihood that a propozition is 
mentioned in the essay pertaining to B and 
change in beliefs that this proposition is 
valid. 

Effects of cognitive reorganization. If 
changes in P4 and Pg are the results of the 
cognitive reorganization required to main- 
tain consistency among beliefs and if Equa- 
tions 1 and 2 are valid descriptions of the 
relations among beliefs, then the values of 
Py and Po predicted by these equations 
should approximate the obtained values of 
these beliefs, and in addition, obtained and 
predicted values of these beliefs should be 
related similarly to the two influence varia- 
bles. Predicted values of Py both before and 
after influence were therefore calculated for 
each subject separately on the basis of his 
questionnaire responses to items pertaining 
to the test issue and also items pertaining to 
each of the three control issues. The relative 
change in predicted values was determined 
for each subject separately in a manner anal- 
ogous to that used to estimate the relative 
change in obtained values. An analysis of 
variance was then performed on the relative 
change in P» as a function of others’ position, 
essay position, test issue, and “dummy” vari- 
able, (predicted versus obtained). If Equa- 
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tion 1 is valid and if the changes in beliefs 
related to Pg are the consequence of cogni- 
tive reorganization, all main effects and in- 
teractions involving this dummy variable 
should be nonsignificant. This was in fact the 
case (in each instance, F < 2.02, df = 1/112, 
p> .10). 

A similar analysis was performed on ob- 
tained and predicted values of the relative 
change in Pg based upon Equation 2. Unlike 
the analysis of obtained values alone, the 
main effect of others’ position was significant 
(F = 6.40, df = 1/112, p < .02). However, 
no main effects or interactions involving pre- 
dicted versus obtained were reliable (F < 
2.16, df = 1/112, p > .10). 

The preceding analyses indicate that Equa- 
tions 1 and 2 provide accurate descriptions of 
the relationships among the beliefs to which 
they pertain and give indirect support to the 
hypothesis that beliefs other than P changed 
primarily as a result of cognitive reorganiza- 
tion in a manner implied by these equations. 
Some additional implications of this hypothe- 
sis should be noted. Table 2 shows that when 
other persons disagreed with B, essay writing 
changed Py substantially but had no effect 
upon P4. Moreover, when subjects wrote anti-B 
essays after being told that other persons sup- 
ported B, Pp increased while P4 decreased. If 
Equation 1 is valid, these differences could 
occur only if, under these experimental con- 
ditions, Pp, and Ppa both changed in 
the same direction as Py (i.e, if subjects 
changed their beliefs that B was valid for rea- 
sons unrelated to their beliefs in A). Data in 
Table 2 indicate that the relative change in 
the mean of Py, and Prya was positive 
under each of the three conditions in which 
Pg increased and was negative under the con- 
dition in which Py, decreased. Whether the 
magnitudes of these changes are sufficient to 
account for the observed changes in Pp is un- 
clear; * however, the fact that obtained 
changes and predicted changes did not sig- 


® Comparisons of magnitude are somewhat mis- 
leading, since according to Equation 1 the relations 
among various beliefs are multiplicative, The mean 
change in the product of two beliefs (PaPpja) can- 
not be estimated from the mean change in each 
belief separately. 


nificantly differ supports this possibility. It 
might be argued that the similarity between 
change in Py and change in the two condi- 
tional beliefs is simply a result of the subjects’ 
failure to understand the difference between 
conditional and unconditional probabilities. 
However, if this were so, the mean change in 
Poyn and Poy should also be similar to the 
change in Pe. Data shown in Table 2 indi- 
cate that this was not the case. 


Supplementary Analyses 


Subjects may be less influenced by the 


“opinions of others when they are initially in- 


clined to disagree with these opinions than 
when they are predisposed to agree. On the 
other hand, defending a position with which 
one personally disagrees may be dissonance 
arousing (Festinger, 1957) and therefore may 
produce greater change than arguing a position 
with which one already agrees. To explore these 
possibilities, 12 initially pro-B subjects (those 
whose initial estimates of Pr were greater than 
'5) and 12 initially anti-B subjects (those 
whose initial estimates of Py were less than 
.5) were selected under each combination of 


_ influence variables, controlling insofar as pos- 


sible for the proportion of times that each test 
issue was represented. An analysis of variance 
was then performed on the relative change in 
beliefs as a function of initial attitude, belief 
(Pa, Pn, or Po), and the two influence vari- 
ables. While the significant effects obtained in 
the main analyses were again reliable, only 
the interaction of belief and others’ position 
was contingent upon initial attitude (F = 
3.53, df = 2/224, p > .01). Data pertaining 
to this interaction, shown in Table 3, are diffi- 
cult to interpret psychologically. However, 
supplementary analyses revealed that the mean 
initial (pretest) belief was greater than .5 (the 
scale midpoint) in each cell of Table 3 in 
which a decrease in beliefs occurred but was 
less than .5 in each cell in which an increase 
occurred. Thus, the contingencies described in 
this table may well be an artifact of regres- 


sion of beliefs toward the scale midpoint, and 


not an indication that the effects of experi- 
mental variables per se were contingent upon 
initial opinions. 
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TABLE 3 


RELATIVE BELIEFS AS A FUNCTION OF INITIAL 
ATTITUDE AND OTHERS’ POSITION 


à Change in 
Initial Others" 
opinion potion [| ~~ ne 
| Pa | Pu Pe 

Pro-B Pro-B —.007 | ~.031 | —.011 
Anti-B 2051 — 154 —.046 

Anti-B | Pro-B 160 096 044 
Anti-B | —.024 048 048 

Discussion 


The results of this study can be summarized 
briefly. Subjects’ beliefs that Proposition B 
was true (Pr) were affected both by other 
persons’ beliefs in B and by writing a per- 
suasive communication about B. The com- 
bined effect of these variables was positive 
when subjects either wrote a pro-B essay or 
received information that other persons agreed 
with B, a negative change in Pp occurring 
only when subjects wrote anti-B essays under 
the illusion that others also disagreed with B. 
The effects of influence variables upon beliefs 
in a proposition implied by B (Pe) were sim- 
ilar in direction to their effects upon Pp but 
were less in magnitude. Writing an essay about 
B affected beliefs in a premise which if true 
implied B (P4) only when subjects were told 
that other persons agreed with B. Changes in 
P, under various influence conditions were 
unrelated to the proportion of times that A 
was mentioned in the essays subjects had writ- 
ten. Obtained changes in Py and Po under 
these conditions did not significantly differ in 
magnitude from changes predicted by the sub- 
jective probability model of cognitive organ- 
ization proposed by Wyer (see Equations 1 
and 2). The latter two findings, considered in 
combination, suggest that the changes in be- 
liefs other than Pp were attributable to a 
tendency for subjects to reorganize sets of re- 
lated beliefs in a manner that maintains con- 
sistency among them as defined by the prob- 
ability model. 

Although changes in Po under different in- 
structional conditions did not differ signifi- 
cantly from predicted values based upon 
Equation 2, analyses of changes in Po alone 
yielded no significant results. However, as Mc- 
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Guire (1960) points out, changes in the con- 
clusion of a syllogism should theoretically be 
less than changes in beliefs in the premises. 
(The reason for this is apparent from Equa- 
tion 2; that is, a change in Pe would be equal 
in magnitude and direction to a change in Pg 
only if Poyg =1 and Pep =0. This was 
clearly not the case in this study; see Table 
1.) Perhaps if the direct effects of influence 
variables upon Py, had been greater, the 
changes in Po would have been sufficiently 
large to be detected statistically. In other 
studies, where greater changes in beliefs 
analogous to Pp have been induced, signifi- 
cant changes in beliefs analogous to Po have 
typically been detected (cf. Wyer, 1970, 
1972). 

Two findings in this study are difficult to 
interpret. First, Pg increased both when sub- 
jects wrote pro-B essays after receiving in- 
formation that others opposed B and when 
subjects wrote anti-B essays after learning 
that others supported B. This suggests that 
each influence procedure was more effective in 
gaining support for a proposition than in gain- 
ing opposition to it, regardless of the nature 
of this proposition. Although this interpreta- 
tion would seem reasonable if subjects were 
predisposed to agree with B before they were 
exposed to influence, the fact that the effec- 
tiveness of influence procedures was not con- 
tingent upon the subjects’ initial attitudes 
argues against this interpretation. It is con- 
ceivable that subjects have an “agreement” bias 
which leads them to accept propositions as 
true more readily than to reject them as false, 
all other things being equal. To this extent a 
given procedure should be more effective in 
increasing beliefs in the proposition “mari- 
juana will not be legalized by 1978” than in 
decreasing beliefs in the proposition ‘“mari- 
juana will be legalized by 1978.” Although 
this interpretation is not very convincing, no 
alternatives seem any more compelling. 

The second curious aspect of the data re- 
ported here concerns the different effects of 
influence variables upon Py and Py. When 
subjects were told that other persons dis- 
agreed with B, the changes in Pg produced by 
essay writing were not accompanied by 
changes in beliefs in A, but rather by changes 


in beliefs that B was true for reasons unre- 
lated to their beliefs in A. (This does not 
mean that subjects changed their beliefs that 
A was relevant to B under these conditions; 
the difference between Py,4 and Py, 4» was not 
significantly affected.) The reason for this, 
which is apparently not attributable to differ- 
ences in essay content, can only be speculated. 
One possibility is suggested by self-perception 
theory (Bem, 1967) which assumes that sub- 
jects infer their beliefs from observations of 
their own behavior. Thus, in the present study, 
a person who writes a pro-B (anti-B) essay 
may infer that he personally agrees (disagrees) 
with B both if A is true and if A is false but 
would not necessarily change his belief in A. 
This explanation is not wholly satisfactory, 
however, since it does not account for the fact 
that Pa was affected by essay writing when 
others ostensibly agreed with B. 

Difficulties are also encountered in applying 
cognitive dissonance theory (Festinger, 1957) 
to these results. It is conceivable that changes 
in P4 occur in order to increase consonance 
with one’s decision to advocate a position re- 
garding B. However, why this should occur 
only when other persons agree with B is un- 
clear. Moreover, in this study subjects were 
not told that they could refuse to participate 
in the essay-writing task. Since evidence for 
dissonance reduction is typically not obtained 
in counterattitudinal advocacy situations un- 
less the freedom to refuse is made explicit 
(cf. Collins & Hoyt, 1972), it may not be a 
valid explanation of behavior in the present 
experiment. Thus, while the conclusion seems 
justified that procedures intended to influence 
beliefs in B affect beliefs in related proposi- 
tions through the process of cognitive reorgan- 
ization, an understanding of which cognitions 
are affected in any given instance remains 
elusive. 

In this regard, some caution should be 
taken in generalizing the results of this experi- 
ment. As noted previously, numerous situa- 
tional variables affect the magnitude of change 
induced by the two influence procedures in 
beliefs to which they directly pertain. Similar 
variables conceivably affect the generaliza- 
tion of this influence to peripherally related 
beliefs. The formulation of cognitive organiza- 
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tion considered in this article may help to 
identify the nature of these effects. 
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To Comply or Not Comply: 
Testing the Self-Perception Explanation of the 
“Foot-in-the-Door” Phenomenon ' 


Mark Snyder? and Michael R. Cunningham 
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A field experiment was conducted to test the self-perception explanation of 
the “foot-in-the-door” phenomenon of increased compliance with a substan- 
tial request after prior compliance with a smaller demand. In this study, 
some subjects were first approached with a small request (answer 8 questions 
in a telephone survey) the size of which was virtually certain to guarantee 
compliance. Other subjects were first approached with a request sufficiently 
large to guarantee noncompliance (answer 50 questions). Subjects in both 
of these conditions were subsequently approached with a moderately sized 
request (30 questions sponsored by a different public service organization), 
As predicted by self-perception theory, subjects in the small-initial-request 
condition showed a higher rate of compliance to the second request (,519), 
whereas subjects in the large-initial-request condition showed a lower rate 
of compliance (.219) than subjects in the no-initial-request control con- 


dition (.333). 


In an attempt to determine how a person 
can be induced to do something he might 
otherwise not do, Freedman and Fraser (1966) 
demonstrated that a powerful determinant of 
compliance is prior compliance itself; A per- 
son who can be induced to comply with an 
initial small request is then more likely to 
comply subsequently with a larger and more 
substantial demand. 

To demonstrate this “foot-in-the-door” phe- 
nomenon experimentally, Freedman and Fraser 
(1966) had experimenters contact suburban 
housewives in their homes. The housewives 
were first asked either to place a small sign 
in their window or to sign a petition on the 
issue of either keeping California beautiful or 
safe driving. Two weeks later a different ex- 
perimenter returned to each home and asked 
each housewife to place a large and unattrac- 

tive billboard Promoting auto safety on her 
front lawn for the next couple weeks. A con- 
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University of Minnesota to Mark Snyder. The authors 
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in this investigation. 
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Minneapolis, Minnesota 55455. 
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trol group was contacted only about the second 
large-billboard request. The results showed a 
very strong foot-in-the-door effect. Subjects 
who had complied with the earlier request were 
much more likely to comply with the larger 
one several weeks later. More intriguing yet 
was the apparent generality of the effect. It 
did not matter which action or which issue 
had been involved in the initial request: The 
compliance generalized equally in all four 
conditions to the later larger request. 
Freedman and Fraser (1966) offered the 
following explanation of the findings: 


What may occur is a change in the person’s feelings 
about getting involved or about taking action. Once 
he has agreed to a request, his attitude may change. 
He may become in his own eyes, the kind of person 
who does this sort of thing, who agrees to requests 
made by strangers, and who takes action on things 
he believes in, who cooperates with good causes 
[p. 201]. 


Such an explanation is congruent with the self- 
Perception and attribution theories of Bem 
(1972) and Kelley (1973) who suggest that 
people frequently make self-attributions about 
their attitudes and dispositions on the basis of 
the experience of their own overt behavior and 
the context in which it occurs, Thus a house- 
wife who observes herself taking action on a 
good, if innocuous cause, without any obvious 
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external pressure, infers that she must be the 
kind of generous, compliant person who be- 
comes involved with such causes. This inferred 
change in self-perception would then lead to a 
change in her subsequent likelihood of engag- 
ing in o‘her compliant acts. Although self-per- 
ception theory itself is well supported by ex- 
perimental evidence in numerous domains of 
self-attribution (Bem, 1972), little research 
has directly concerned the self-perception in- 
terpretation of the foot-in-the-door effect. 

Lepper (1973) has demonstrated that an 
effect conceptually analogous to that of Freed- 
man and Fraser (1966) is most likely to oc- 
cur under conditions of minimal external pres- 
sure, as would be expected on the basis of self- 
perception theory. Lepper proposed that in the 
classical “forbidden toy” situation (Aronson 
& Carlsmith, 1963), the child who avoids the 
forbidden toy under a mild threat might infer 
that he is one who can successfully resist 
temptation and that this increased self-control 
should generalize to a different temptation 
situation. As predicted, children who complied 
with the initial prohibition under a mild threat 
were less likely to cheat in a game situation 
several weeks later than were control subjects 
or subjects who had initially complied under 
a severe threat. An attempt to assess changes 
in children’s self-perceptions of honesty and 
ability to resist temptation, which had been 
proposed to mediate these effects, provided 
qualified support for the self-perception analy- 
sis. 

In a conceptual replication of Lepper’s 
(1973) demonstration, Uranowitz (in press) 
has reported that women in a shopping center 
who were induced to help an experimenter un- 
der low justification were more likely to help 
a second experimenter than were subjects who 
had been asked to help the first experimenter 
under high justification. No attempt was made 
in this investigation to assess intervening 
changes in self-perceptions of helpfulness. 

The converging results of Lepper (1973) 
end Uranowitz (in press) mzke the self-per- 
ception hypothesis a highly plausible account 
of the foot-in-the-door phenomenon. However, 
there is as yet no more direct experimental 
test of this interpretation. Our confidence in 
this explanation, as well as the positive heuris- 


tic value of self-perception theory, would be 
bolstered by examining an experimental situa- 
tion in which subjects’ compliance experience 
is controlled so that they come to infer that 
they are noncompliant individuals who do not 
do favors. According to the self-perception 
hypothesis, such individuals should be much 
less likely to comply to subsequent requests 
than subjects without such prior experience, 
Thus a subject who had previously been in- 
duced not to comply with a request ought to 
be less likely to comply with another reason- 
able request than a subject who only receives 
the second request. 

Accordingly, a field experiment was con- 
ducted to test the self-perception interpreta- 
tion of the foot-in-the-door technique. In this 
study, some subjects were first approached 
with a small request, the size of which was vir- 
tually certain to guarantee compliance but not 
so small as to appear inappropriately trivial. 
Other subjects were first approached with a 
request sufficiently large to guarantee non- 
compliance but not so large as to appear 
incredible. 

These constraints in the size of the request 
are necessary to ensure that the compliance or 
noncompliance behavior could function as 
credible cues to inferring corresponding dis- 
positions. Subjects in both of these conditions 
were subsequently approached with a moder- 
ately sized request. A third control group was. 
contacted only with the second request. The 
self-perception explanation predicts that sub- 
jects in the small-initial-request condition 
should show a higher rate of compliance to 
the second request than subjects in the control 
condition, whereas subjects in the large-initial- 
request condition should show a lower rate of 
compliance than subjects in the control condi- 
tion. 

This experiment additionally allows a test 
of an alternative explanation of the foot-in- 
the-door effect. It may be that the initial 
small request serves as a standard of com- 
parison against which to evaluate the second 
larger request. Thus, the second request may 
appear smaller in the context of previous 
compliance experience than in the context of 
no such experience. As an analogy, a request 
for $10 may not seem so very large to one 
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who has become accustomed to being asked 
for $5 loans, but it may be perceived as a 
major demand to another who has never be- 
fore been asked for money. Stated more for- 
mally, prior requests for compliance may sim- 
ply define one’s adaptation level (Helson, 
1959) for perceiving the magnitude of current 
compliance requests. Such reasoning suggests 
that the subjects in the large-initial-request 
condition of the present experiment should 
show a higher rate of compliance to the sec- 
ond request than control subjects. For these 
individuals, the second request should appear 
small by comparison with the initial larger re- 
quest which was refused. Again as an analogy, 
a request for only $10 may seem very small 
and reasonable to one who has just refused 
another request for $100. 


METHOD 


Subjects were 78 female and 14 male Minneapolis 
residents randomly chosen from the telephone direc- 
tory and randomly assigned to conditions. 

The subjects were contacted by a male or female 
experimenter on a Wednesday afternoon and assigned 
at random to either the large-initial-request or small- 
initial-request condition. The requests made of the 
subjects involved answering a specified number of 
questions for a telephone survey. The small-initial- 
request subjects (#=30) were asked if they would 
be willing to answer 8 questions, and the large-initial- 
request subjects (n= 32) were asked to answer 50 
questions for a fictitious public service organization, 
Pretesting had indicated that these particular request 

~ magnitudes guaranteed a high or low initial compli- 
ance rate, respectively. 

After verifying and recording the identity of the 
respondent, the experimenter identified himself with 
either of two fictitious public service organizations 
and explained his purpose as follows: 


My name is » of the Consumer's 
Interest Group. We are compiling a guide to aid 
consumers and are calling you today to ask if 
you would be willing to answer —— questions 
over the teleplone concerning your use of various 
Paper products. Would you be willing to cooper- 
ate in this survey? for] My name is r 
of the Bureau of Civic Safety. We are calling 
you today to ask if you would be willing to aid 
us in our study of motor vehicle safety by 
answering — questions over the telephone con- 
cerning your response to highway laws and 
winter driving hazards in Minnesota, Would you 
be willing to cooperate with us in this survey? 


If the subject agreed to Participate in the survey, 
he or she was informed that the interviewer was 
Presently lining up people for the survey which would 
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be conducted at an unspecified later date. These sul 
jects were told that they would be called back if they 
were needed, 

Two days after the first request, on a Friday 
afternoon, each subject was called back by a second 
experimenter of the other sex. This experimenter was 
ignorant as to both the magnitude of the initial re- 
quest and the subject’s initial response. 

In the second contact, the experimenter first asce 
tained that the original subject had been reached and 
then without mentioning the earlier request presented 
himself as a representative of the service organizatio 
with which the subject was not previously acquainted 
The experimenter then asked if the subject would bi 
willing to answer 30 questions for his survey, and 
the subject’s response was noted. Once again subje 
were informed that the interviewer was only lining 
up participants for a survey to be conducted at » 
unspecified later date. 

Subjects in the no-initial-contact comparison con 
dition (n= 30) were contacted on the same day’: 
the second request was made of the experiment: 
subjects and asked if they would answer 30 question 
for either the Consumer Interest Group or the B 
reau of Civic Safety survey.’ 


RESULTS AND DISCUSSION 


Of the 62 experimental subjects, 3 could 
not be reached for the second contact. These 
subjects were eliminated from ‘further con- 
sideration in the experiment. 

According to the self-perception hypothesis, 
the small-initial-request condition should pro 
duce a higher rate of compliance with th: 
second request than the no-initial-request con- 
dition, whereas the large-initial-request con- 
dition should produce a lower rate of com- 
pliance than the comparison condition, 

The results are in line with the self-per- 
ception hypothesis. Over one half (.519) of 
the subjects who had initially been approached 
with a small request subsequently complied 
with the moderately sized request, whereas 
only one-third (.333) of the comparison group 
of subjects who had only been approached 
with the final moderately sized request com- 
plied (2 = 1.76, p= -039, one-tailed test) 


There is a possible sam 
the controls only at Time 2 
be contacted at Time 1 had 
experimental grou 
However, 


only 3 could not be reached at Time 2, This suggests 
that the subject Populations available at both Time 


1 and Time 2 and that available only at Time 2 are 
virtually identical. 


pling error in contacting 
+ Subjects who could not 
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By contrast, subjects who had initially been 
approached with a large initial request were 
less likely to comply (compliance rate = .219) 
with a second moderately sized request than 
were subjects who had had no initial contact 
(z = 1.43, p = .076, one-tailed test). Finally, 
as a test of the overall self-perception pre- 
dicted effect, the final compliance rate in 
the small-initial-request experimental condi- 
tion was significantly larger than that in the 
large-initial-request condition (z = 2.46, p = 
.007, one-tailed test).* 

Thus the results of this experiment provide 
further experimental support for the self-per- 


4The self-perception hypothesis assumes that the 
increase in compliance in the small-initial-request con- 
dition is due to previous compliance with the small 


-request, whereas the decrease in compliance in the 


large-initial-request condition results from prior non- 
compliance with the large request. However, 5 sub- 
jects in the small-initial-request condition refused to 
comply on the first contact, and 8 subjects in the 
large-initial-request condition actually complied with 
that very large request. If these subjects are elimin- 
ated from consideration in the data analysis, a more 
striking confirmation of the self-perception predic- 
tions ensues. Almost two thirds (.636) of the sub- 
jects who had initially agreed to a small request 
subsequently complied with the moderately sized 
request, whereas only one third (.333) of the com- 
parison group complied (s=2.17, p=.015, one- 
tailed test). By contrast, subjects who had been 
initially led to refuse a large request were consider- 
ably less likely to comply (compliance rate = .125) 
with a second moderately sized request than were 
subjects who had had no initial contact (z= 1.78, 
p= .038, one-tailed test). Although in some sense it 
is appropriate to eliminate these 13 subjects because 
the self-perception predicted effects could not have 
possibly worked on them, this analysis unfortunately 
suffers from a possible subject self-selection problem. 


ception interpretation of the foot-in-the-door 
phenomenon, Although it was not possible to 
assess changes in actual self-attributions con- 
cerning generosity and willingness to get in- 
volved in this field experimental situation, the 
pattern of behavioral compliance rates does 
corroborate the predictions from self-percep- 
tion théory. In addition to replicating the 
Freedman and Fraser (1966) demonstration 
of a technique by which “a person [can] be 
induced to do something he would rather not 
do [p. 195],” the study also suggests an 
inoculation procedure to increase resistance to 
external compliance demands and influence 
attempts. 
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Combined Effects of Interpersonal Attraction and Goal-Path 


Clarity on the Cohesiveness 


of Task Oriented Groups' 


Alonzo B. Anderson 2 
Michigan State University 


This research examined possible differential effects of value simil 


goal-path clarity on the cohesiveness 
formed, 20 high and 20 low in value 


sessions. The first session asked subjects to get 


and 
triads were 
and examined during two 
to know each other. The 


of small groups, Forty 
similarity, 


second session presented a specific task that permitted the manipulation of 


goal-path clarity. Results indicate that 


(a) in the first session, reported inter- 


personal attraction was a function of value similarity, and (b) in the sec- 
ond session, only goal-path clarity (and not value similarity or prior at- 


traction level) affected cohesion. The importance of 


the extent of goal 


orientation in determining the variables that affect group cohesiveness was 


discussed. 


Festinger, Schachter, and Back (1950), in 
their theoretical treatment of the concept of 
group cohesiveness, distinguish two major 
classes of forces that “act on members to re- 
main in the group”: (a) the attractiveness of 
the group for its members and (b) the extent 
to which the group mediates goals for its 
members. These forces, acting in conjunction, 
are postulated to be immediate determinants 
of cohesiveness. The first class of forces has 
reference to the needs of the person for 
affiliation, recognition, security, and other in- 
dividual variables which can be satisfied by 
some facet of group participation. The second 
class of forces has reference to such properties 
of the group as its goals, programs, size, type 
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of organization, and position in the com- 
munity. 

Most empirical research (e.g., Back, 1951; 
Exline, 1957; Libo, 1953), however, while 
providing support for the position introduced 
by Festinger et al} (1950), has focused on at- 
tractiveness. When investigators have desired 
to manipulate the cohesiveness of groups, re- 
gardless of their nominal definition of the con- 
cept, the operations performed have typically 
involved telling the members of some groups 
that they would probably like each other, be 
congenial, etc., while telling others the op- 
posite or nothing. Further, in measuring the 
cohesiveness of experimental or naturally ex- 
isting groups, some sociometric instrument is 
generally used to determine how much each 
member likes or is attracted to the other 
members. For the most part, the other forces 
which are postulated to determine, or be re- 
flected by, the cohesiveness of a group have 
received less attention. Given the large number 
of these studies that have yielded positive re- 
sults, it is easy to understand why cohesive- 
hess and interpersonal attraction per se are 
accepted as being nearly synonymous. How- 
ever, theoretical grounds exist which point to 
the possibility that this accepted relationship 
does not hold for all groups. 

In developing this position it is important 
to consider that groups display a wide variety 
of properties. Of these, perhaps the most im- 


portant is the primary purpose for the group’s 


TINAR 


COHESIVENESS OF TASK-ORIENTED GROUPS 69 


development and existence. Researchers in 
group dynamics (e.g., Bales, 1950; Cattell, 
1948; Homans, 1950) suggest that there are 
two major purposes, and groups can be dis- 
tinguished by the extent to which one or the 
other is of primary concern. One type of group 
has been variously referred to as friendship 
groups, psyche groups, or socioemotional 
groups. By whatever name these groups are 
called, the reference is clearly to groups whose 
members belong to them for the emotional 
satisfaction they derive from participation, 
that is, support of one’s self-conception, con- 
sensual validation about the world, etc. The 
other type of group is the task-oriented group. 
These groups exist to accomplish a task whose 
primary purpose is to manipulate the environ- 
ment. However, it is necessary to point out 
that the two classes of purposes are not mu- 
tually exclusive; that is, both are present in 
all groups, but usually one is dominant de- 
pending on the situation, characteristics of the 
members, etc. 

This distinction between types of groups is 
crucial to understanding group cohesiveness. 
It is assumed that to the extent a group is 
primarily task oriented, its members will place 
more weight on those factors that contribute 
to successful task completion, while placing 
ess emphasis on those factors that contribute 
to interpersonal attraction. On the other hand, 
it is assumed that to the extent a group’s pur- 
pose is primarily socioemotional in nature, its 
members will place greater emphasis on those 
factors which they perceive as contributing to 
an increase in interpersonal attraction. In the 
context of exchange theory (Thibaut & Kelley, 
1959) it could then be postulated that the pri- 
mary basis for a group member’s comparison 
level and comparison level for alternatives 
would be the extent to which the group effi- 
ciently fulfills its primary mission. Simply 
stated, group cohesiveness can be conceived 
of as something other than a special case of 
interpersonal attraction. 

Support for this point of view can be found 
in a study conducted by Raven and Rietsema 
(1957), who, by using tape recordings, ma- 
nipulated work groups’ perceptions of the 
clarity of their goal and path. Their data 
indicate that the incentive value of a particu- 


lar group goal for a particular person will de- 
pend not only on its content but also on how 
explicitly the goal is formulated and how 
clear the paths for goal attainment are. 
Where goal and path clarity were minimal, 
disinterest in the task and hostility were 
greater relative to high goal and path clar- 
ity. In addition, feelings of group belonging- 
ness and sympathy for group emotions were 
greater in the high-clarity groups. Unfortu- 
nately, because of the manipulation used in 
this study, it is not clear which condition 
caused the effect. That is, there is no way to 
determine whether the condition of the goal 
or the condition of the path, or both, gen- 
erated the obtained results. 

It is the purpose of the present study to 
investigate the differential influence of value 
similarity and goal-path clarity—here defined 
as the extent to which there exists, among all 
members, only one possible sequence of ac- 
tions available for the group to reach a de- 
sired outcome—on cohesiveness in task-ori- 
ented groups. It was hypothesized that in 
groups of this type the only variables salient 
to cohesiveness are those which affect task 
performance. Therefore, value similarity 
should be less relevant, and goal-path clarity 
should be central to a member’s decision to 
remain in or leave a particular group, Within 
this framework two hypotheses were derived: 

1. It was expected that primarily task- 
oriented groups under clear goal-path condi- 
tions would experience significantly more co- 
hesion than groups under unclear goal-path 
conditions. 

2. It was expected that value similarity 
would have a minimal overall effect on level 
of cohesion in primarily task-oriented groups. 
However, this variable should influence inter- 
personal attraction during a preliminary ses- 
sion designed to foster socioemotional con- 
cern.* 


METHOD 
Subject Recruitment and Selection 


Three hundred eighty-six female subjects were 
recruited through the use of sign-up sheets posted 


3 The purpose of this session was (4) to generate 
prior differences in interper: l attraction and (b) 
to be sure that value similarity was operating as 
intended. 
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TABLE 1 


MEAN STANDARD DEVIATION, AND 
COEFFICIENT OF THE 
E SCORES OF GROUPS 


Group composition | | | 


R lea ee M Me | Mr 
Goal-patti | uw | 
clarity | similarity | | 
High High | 39.79 | 9.75 a 
Low | 40.70 | 9.13 10 
Low | High | 40.76 | 9.85 69 
40.80 | 8.89 18 


| Low | 


in introductory level psychology classrooms at the 
university. The sign-up sheets presented a number 
of times and places where preliminary testing would 
take place. This testing, which lasted for about an 
hour, involved the administration of the Allport- 
Vernon-Lindzey (1960) Study of Values. Every 
subject’s total score on each of three values (theo- 
retical, social, and religious) was rated either high, 
medium, or low based on the national average of 
college females reported in the Allport-Vernon- 
Lindzey (1960) testing manual. Based on these 
scores, 120 subjects were selected. 


Measurement Materials 


Two types of dependent measures were employed 
in this study to test the effects of value similarity 
and goal-path clarity on the functioning of the 
group: (a) a questionnaire designed to examine sub- 
jects perception of various dimensions of group 
oning and (b) a behavioral measure of co- 

hesiveness. : 

Questionnaire. The first measure was a 28-item 
member reaction questionnaire constructed for this 
study. The questionnaire used a 7-point Likert-type 
response scale for each item. A preliminary factor 
analysis indicated that the questionnaire contained 
six subscales, On the basis of the factor loadings, 
the subse: were named general cohesion, inter- 
personal liking (used to assess the validity of the 
manipulation of interpersonal attraction described 
below), general group evaluation, leadership/atmo- 
sphere evaluation, task evaluation, and interaction 
evaluation.* 

Behavioral measure. The second measure was a 
group transfer form constructed by the author 
and administered at the end of the second session. 
The form explained that some subjects in other 
groups had requested that they be changed to a 
group where they would have a better chance of 
winning the prize offered. Then it invited the re- 
spondent to make a choice as to whether or not 
she wanted to change her group assignment. It 
also invited the subject to state a preference, if 
any, as to which other subject from their group she 


4 Copies of this instrument can be obtained by 
writing to the author. 
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would like to work with in the next group. The 
subjects were under the impression, at the time, of 
administration, that there was going to be at least 
one more session. 

This measure was thought to be a direct test of 
the cohesiveness of a group in that if cohesiveness 
is the resultant of all forces working on members 
to remain in the group, then choosing to remain 
would be the most appropriate measure of the phe- 
nomenon. Thus, the unit of analysis was the group, 
and the behavioral test was the number of groups 
disrupted by at least one person choosing to leave 
the group. This measure was only used after the 
second session and is seen as the major test of 
the hypothesis of the conditions that contribute 
significantly to the cohesiveness of task-oriented 
groups. 


Design 


The subjects were run in 40 three-person groups, 
20 groups in which there was high value similarity 
among the three members and 20 groups in which 
there was low value similarity among the three 
group members. The groups were later exposed to 
a high-goal-path condition or a low-goal-path con- 
dition, This procedure yielded a 2 X 2 experimental 
design whose dimensions were value similarity and 
goal-path clarity. There were 10 groups in each cell. 

To establish that groups differed only on value 
similarity and were otherwise similar, the means, 
standard deviations, and correlation coefficients for 
all four conditions were computed prior to the first 
experimental session, Table 1 presents these data 
which indicate that groups were composed as in- 
tended; only the correlation coefficients differed be- 
tween groups in the two value similarity conditions, 


Procedure 


The first session. This session was designed to 
create psychological groups, here defined as a num- 
ber of persons who interact and influence each other 
and who are recognized in some special way be- 
cause of the interaction. Previous research, by ma- 
nipulating a subject’s perception of the other group 
members, has varied interpersonal attraction in 
laboratory situations, In groups outside of the lab- 
oratory there is rarely, if ever, any public state- 
ment to the effect that the members would probably 
like one another, or be congenial, etc, but rather 
this knowledge generally results from actual social 
interaction, Therefore, the first experimental session 
was devoted to allowing subjects within a group to 
become acquainted. This was accomplished through 
intensive discussion of meaningful topics. The first 
session was also planned to allow a determination 
of the effects of assigning subjects to groups based 
on their value similarity or dissimilarity. One mea- 
sure of interpersonal liking was used, a subscale in 
the questionnaire discussed above, 

As the subjects arrived, the experimenter, who 
was unaware of the composition of the group, seated 
the subjects in their preassigned seats and told 


them the session would begin as soon as everyone 
arrived. When all subjects were seated the experi- 
menter passed out instruction sheets and requested 
“that the subjects read the instructions to themselves 
as the experimenter read them aloud. The instruc- 
tions informed the subjects that the purpose of this 
part of the study was to “find out how people com- 
municate in a group of strangers.” They were asked 
to imagine that they were a panel of experts on the 
subject of human relations and that they were to 
discuss’ comprehensively the following issues: (a) 
euthanasia, (b) capital punishment, and (c) the 
morality of war. After the instructions were read, 
‘the experimenter left the room and returned after 
45 minutes. Upon returning the experimenter asked 
the subjects to conclude their discussion and com- 
plete a short questionnaire. When the questionnaires 
were completed, the experimenter told the subjects 
‘that they would be debriefed when the study was 
completed, and she excused them. 

The second session, As the subjects arrived the 
‘experimenter, who was unaware of the hypotheses, 
again seated the subjects in their preassigned seats 
which were the same as in the first session. When 
all subjects were seated, the experimenter told them 
that what they were going to do would probably be 
interesting and profitable. She told them that their 
A group had a chance to win a $15 prize. After this 
introduction, she passed out the instruction sheet 
and asked them to read it to themselves as she read 
it aloud. The instructions informed the group that 
the purpose of this part of tfe study was “to in- 
vestigate decision-making communication patterns 
in small groups.” They were told that they would 
first be given some background information to 
serve as a basis for their decision. Then they would 
be given an opportunity to put this information to 
work by making a group decision about a hypo- 
thetical situation. They were also told that all the 
infggmation they would need to know to make their 
decision was summarized in one deck of cards. They 
were told what the deck represented and were given 
a demonstration of the proper procedure to follow 
in going through the training deck and recording 
their predictions, Each subject was given a training 
deck, a pencil, and an answer sheet and then asked 
to move to one of the desks in the room. The ex- 
perimenter was present during the entire training 
session. 

This procedure permitted the manipulation of 
goal-path clarity. It is based on a research paradigm 
developed by Hammond (1965). His method is a 
laboratory technique which involves two stages: 
(a) a training stage in which the subjects are trained 
in such a way that each learns to think differently 
about a common problem and (b) a conflict stage 
in which the subjects are brought together to at- 
tempt to arrive at a joint decision concerning the 
problem, The aim of the training stage is to de- 
velop different sets of cue dependency in subjects. 
In the present study a three-cue probability task 
was employed. The cues were: (a) educational fa- 
cilities, (6) recreational facilities, and (c) resi- 
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dential facilities, all for a living/learning dormitory. 
Subjects were trained to become dependent upon 
one of these cues by being required to learn to 
estimate the “level of desirability” of a particular 
university’s dormitories. The cues were presented in 
a deck of 60 data cards in bar graph form, 

Subjects were led to believe that the decks repre- 
sented a sample of universities throughout the coun- 
try, and each one of the cards represented one 
particular school in the nation. Each card had three 
bar graphs on the front of it which represented the 
amount of a given facility the dormitories on that 
campus had. That is, it represented the makeup of 
the dorms at one school in terms of the amount of 
each facility in all dorms at that school, The num- 
ber on the back of the card represented how the 
dormitories had been rated in terms of “their de- 
sirability as a total living/learning situation.” Sub- 
jects were told that the ratings were based on the 
responses of students, faculty, and administrators 
from all over the country as reported in a recent 
issue of the magazine American Universities and 
Colleges. 

Each of the three bar graphs had five levels and 
two steps within each level. One of the graphs pro- 
vided information about educational facilities, an- 
other about recreational facilities, and the third 
about residential facilities. On the basis of this 
information, subjects were instructed to predict how 
the dorms were rated. The procedure was to predict 
the rating on each trial (i.e, with each card) and 
record this prediction on the answer sheet provided. 
Then a subject could turn to the back of the card, 
where the mean student rating was given, so that 
she could see how accurate her prediction was. By 
learning how the values on. the graphs related to 
the ratings on the back, the subject learned how to 
make accurate predictions of those ratings. 

In the unclear goal-path situation the valid cue 
was different in cach of the three decks (i.e. each 
group member received different training), To illus- 
trate, Subject 1 was given a deck in which the 
educational facility variable accounted for 98% of 
the variance in the level of desirability (criterion) 
variable. Furthermore, the educational facility vari- 
able was related to the criterion variable in a non- 
linear manner. That is, both low and high degrees 
of educational facilities indicated a low level of 
desirability and a moderate level of educational fa- 
cilities indicated a high level of desirability. For 
Subject 1, the other variables (recreational and 
residential facilities) were randomly related to the 
criterion variable. Thus, Subject 1 built up a high 
degree of dependency on educational facilities as 
an indicator of level of desirability. In the same 
manner, Subject 2 learned to depend most heavily 
on residential facilities, and Subject 3 learned to 
depend most heavily on recreational facilities. Cog- 
nitive differences were thus generated in the three 
subjects through their individual training experi- 
ences. In the clear goal-path situations the training 
experience was exactly the same except that the 
valid cue was the same in each of the three decks 
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(i.e, all group members were trained to depend on 
the same cue for the accuracy of their predictions). 

At the completion of training, which took about 
45 minutes, the subjects were told that they had 
mastered the training task and that their next prob- 
lem was to apply what they had learned; further- 
more, they were to carry out the second part of 
the task with the other two subjects, and they were 
to make a joint decision. The specific task was to 
plan and sketch a living/learning dormitory. They 
were told that it was important for them to decide 
and include in their plan, “the most important fa- 
cility to hold at a moderate level as compared to 
the other facilities.’ 5 They were first to spend five 
minutes working on their plan for the ideal living/ 
learning dorm; then they would come together to 
work on the group plan. Thus, the goal was the 
same in both the high- and low-clarity conditions. 
However, the path (procedure) to the goal was 
clear in the high-clarity conditions and unclear in 
the low-clarity conditions, since in the former but 
not the latter all members of a group had the same 
conception of what was the important variable to 
consider in designing dormitories; in the lo: -clarity 
conditions each subject had a different conception 
of what this variable was. 

After working five minutes on their individual 
plan, the experimenter asked the subjects to please 


come back to the table d work on their group 
plan. The experimenter then collected the training 
decks, turned on the tape recorder, advised the 

t5 minutes for their de- 


group she would return in 
cision, and left the roo 


At the end of 45 he experimenter re- 


turned, ed the group to conclude discussion, 

and turned off the tape record erimenter 

BD) then collected all mate exce i id asked 
Mihe subjects to move back to desks. ne then 

1 out the questionnaire to be ¢ mpleted. When 

rst person completed the questionnaire, it was 


unced that she 
to come to the 
complete the last 
r then took each 
e and gave her 
ng its use. 


lected and the experimenter 
Would like each person separate 
"desk just outside the door to 
Step for the day. The experimente 
Subject into the hall one at 
a group transfer form, explain u 

When the subject completed the form she was 
told that she had just completed the last session of 
the research (each subject previously thought there 
would be a third session) and was advised of the 
time and place of the debriefing session. Each sub- 
ject was then thanked for her participation and ex- 
cused. This procedure was followed for each subject 


individually. 


RESULTS 
Value Similarity and Interpersonal Attraction 
in Session 1 
An analysis of variance on the first session 


data revealed a significant difference (F = 
7.13, df= 1/36, p < .025) between high- 
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similar groups (X = 37.87) and low-similar’ 

groups (X = 35.62) on the measure of inter- 

personal liking.” Persons in high-similar 

groups liked one another more than did those i 
in the low-similar groups. This result indi-4 
cates that the preexperimental assignments), 
were valid, and it tends to support the posi- 7 
tion that value similarity is an antecedent 
of interpersonal liking in groups whose pri 
mary purpose is socioemotional in nature. 


Cohesion in Task-Oriented Groups K 


Hypothesis 1 predicted that groups under: 
clear goal-path conditions would experience 
significantly more cohesion than groups un- 
der unclear goal-path conditions. Data which 
examined this prediction are presented below. 

Behavioral test. To test Hypothesis 1, the 
number of groups in each condition of goal- 
path clarity that were disrupted by at leas 
one member expressing a desire to leave ie 
group was examined. Table 2 presents thi 
data, which were examined through a Fish 
exact test. This analysis revealed a signi 
cant difference (p < .01) between the clear 
goal-path and the unclear goal-path condi- 
tions. As predicted from Hypothesis 1, the 
number of groups that would have been dis- 
rupted by at least one person choosing to 
leave the group was significantly higher in 
the unclear goal-path than in the clear goal- 
path conditions, 


ë This phrase and the use of the word control were 
intentionally made to be somewhat ambiguous and 
contradictory, since it was assumed that the state- 
ment of a group's goals is rarely unequivocal, given 
that usually there are individual differences in the 
way people define and/or interpret words, 

EA debriefing session was held two weeks after the 
completion of the research. At the beginning of the 
session, subjects were asked if they knew what the 
research was about (none did) and what they thought 
about it. Questions were asked for and answered. 
A description of the entire Project was given, along 
with the announcement of the winners of the $15 
prize (a team chosen at random), and subjects were 
thanked again for their participation, 

7TA preliminary test was performed to examine the 
effects of groups within conditions (a nested factor). 
The results of this test indicated that groups did 
generate significant differences (p< .05) in the sub- 
ject’s response. Therefore, the error term of groups 
within conditions (df = 36) was employed in further 
tests. 


A 


TABLE 2 


BEHAVIORAL MEASURE OF COHESIVENESS 
CLASSIFIED BY CONDITIONS OF 
GoaL-PATH CLARITY 


Groups not 


Goal-path clarity disrupted 


Groups disrupted 


Clear i 17 
Unclear 11 9 


_ Subjective perception test. Examination of 
We subscales derived from the questionnaire 
“administered after the second session tended 

to support the results of the behavioral data. 

Table 3 presents the relevant cell means for 

‘each of the dependent variables, as well as 

the ¢ values derived from analysis of vari- 

ance (df = 1/38) that were performed on 
these data.® Table 3 indicates that the means 
were in the predicted direction for all six de- 
‘pendent measures (p < .02, binomial test) 
ind that two of these differences—for inter- 
rsonal liking and general satisfaction—were 
‘of sufficient magnitude to yield marginally 
significant (p < .10) main effects. Thus, Hy- 
pothésis 1 receives some support from both 
the behavioral and subjective perception de- 
pendent measures. 


Test of Hypothesis 2 
Hypothesis 2 predicted that in task- 
oriented groups, value similarity would have 
` a minimal overall effect on level of cohesion. 
Data which examined these predictions are 
presented below. 
Behavioral test. To test Hypothesis 2, the 
number of groups in each condition of value 
similarity that were disrupted by at least 


. * one member expressing a desire to leave the 


group was examined. Table 4 presents these 
data, which also were subjected to a Fisher 
exact test. This analysis revealed no signifi- 
cant difference between the high- and low- 
interpersonal-attraction conditions (p < .50). 

Subjective perception test. Examination of 
the subscales derived from the questionnaire 


8Since the hypothesis (that groups under clear 
goal-path conditions would experience significantly 
more cohesion than groups under unclear goal-path 
conditions) was directional; comparisons were per- 
formed by computing t values for the main effects of 
goal-path clarity, a procedure suggested by Winer 
(1962, p. 208). 
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administered after the second session tended 
to support the results of the behavioral data. 
Table 5 presents the relevant cell means for 
each of the dependent variables, and the F 
values for the main effects of value similarity 
(no significant Value Similarity X Goal-Path 
Clarity interactions were obtained) derived 
from the analyses of variance that were per- 
formed on the data. Table 5 indicates that 
there were no significant differences between 
high- and low-value-similarity means for 
either condition of goal-path clarity. 


DISCUSSION 


The results of the present study lend sup- 
port to the conceptualization of group co- 
hesiveness advanced by Festinger et al. 
(1950), who distinguished between two 
classes of forces working on the individual 
to remain in the group, those of the individ- 
ual and those of the group. The lack of any 
statistically significant interactions between 
the two independent variables examined in 
the present research indicates that the two 
classes of forces do in fact affect group co- 
hesiveness quite independently and that a 
variable representing either class cannot be 
considered as a singular determinant of group 
cohesiveness. Therefore, these results should 
serve to emphasize the fact that a broader 
and more complex approach, involving the 
effect of several variables on cohesiveness, is 
suggested if a truer understanding of the 
phenomenon is to be achieved. Thus, it is ap- 
parent that group cohesiveness cannot be de- 
fined in terms of interpersonal attractiveness 
alone. 


TABLE 3 


| MEAN QUESTIONNAIRE RESI 
VALUES RELEVANT TO HYPOTHE: 


Dependent measure — 


Clear | Unclear j, 

Cohesion 5.28 4.92 1,67* 
Interpersonal liking 6.26 5.88 | 1.70** 
Satistaction 

General 6.12 5.16 | 2.00** 

Leadership/atmosphere | 5.74 5.71 

Task performance 5.80 5.41 

Interaction 6.18 5.94 

*p <.075. 

** p <05. 
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TABLE 4 


BEHAVIORAL MEASURE OF COHESIVENESS CLASSIFIED 
BY CONDITIONS OF VALUE SIMILARITY 


| ae 
Value similarity | Groups disrupted | Gyonps net 
oe ee | 
High 7 | 13 
Low 7 13 


While supporting the existence of two in- 
dependent classes of forces which act on the 
individual to remain in the group, this re- 
search also points to an additional distinction 
that must be made. The results suggest that 
one class of forces, particularly interpersonal 
attraction, may be dominant in determining 
group cohesiveness in socioemotional groups, 
while the other class for forces, particularly 
goal-path clarity, may be dominant in deter- 
mining group cohesiveness in task-oriented 
groups, 

The data from this study clearly indicate 
that the level of cohesiveness of a task-ori- 
ented group is a function of goal-path clarity. 
As predicted from Hypothesis 1, the number 
of groups disrupted by at least one person 
leaving was significantly greater for groups 
with an unclear goal path than for groups 
with a clear goal path. These differences were 
consistent across conditions of value simi- 
larity. Furthermore, these data suggest that 
the effects of value similarity are less salient 
to group cohesiveness than are the effects of 
goal-path clarity in task-oriented groups. 

Two results should be considered here. 
First, groups that acquired through Session 
1 a high amount of prior interpersonal liking 
were identical to groups that acquired a low 
amount of prior interpersonal liking in terms 
of the number of groups that were disrupted 
by at least one person leaving. This suggests 
that prior interpersonal liking based upon 
value similarity (and, perhaps, other socio- 
emotional factors) has little to do with co- 
hesiveness in highly task-oriented groups. It 
can also be noted in this context that the 
introduction of an unclear goal path to groups 
high on prior interpersonal liking caused a 
decrease in the mean level of liking experi- 
enced in these groups. Second, consider that 
a very strong trend existed for groups with 
a clear goal path to be consistently higher 


on cohesiveness and general satisfaction than 
groups with an unclear goal path. 

These results, taken together, indicate that 
goal-path clarity plays an important role in 
the determination of cohesiveness in task 
oriented groups. Although this conclusion at 
first may seem surprising and contrary to a 
great deal of research in this area, it is not. 
The relationships derived in this study were, 
in the past, difficult to examine, since most 
researchers had restricted their examination” 
to groups that were low on task orientation. 
Most researchers (e.g., Berkowitz, 1956, 
1957; Festinger, 1953; Libo, 1953) have 
been concerned with friendship groups, and 
those who were not (e.g., Back, 1951; Berko- 
witz, 1954; Exline, 1957; Schachter, Eller- 


ton, McBride, & Gregory, 1951) used no“ 


direct measures of cohesiveness. (It should 
also be noted that even those concerned with 
friendship groups, with the exception of Libo, 
used nondirect measures of cohesiveness.) 
Therefore, the differential importance of 
value similarity and the group property of 
goal-path clarity on task-oriented groups was 
never examined directly. 

Certainly this is not to say that value 
similarity is not important to group cohesive- 
ness. This and similar variables have been 
shown to be very important to the cohesive- 
ness of friendship groups, where people have 
come together for socioemotional support and 
consensual validation of the world. However, 
as was previously assumed, the centrality of 
value similarity (and many other antecedents 
of interpersonal attraction) to group cohesive- 
ness does not apply to all groups. When there 
is a task to be performed, variables relevant 


TABLE 5 


MEAN QUESTIONNAIRE RESPONSES AND F 
VALUES RELEVANT TO HYPOTHESIS 2 


Interpersonal attraction 
Dependent measure -E 


| High Low | i 

Cohesion 5.04 Bib 3 
Interpersonal liking 6.01 6.13 24 
Satisfaction 

General 5.25 6.03 1.54 

Leadership/atmosphere 5.67 5.78 24 

Task performance 5.40 5.81 -61 

Interaction | 5.94 618 | 1.23 

| 


y 
fe 
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to the successful completion of the task are 
more salient. 
Certainly more research is needed in this 


“area; the present research does not answer 


many of the questions regarding the deter- 
minants of cohesiveness. However, it does 
establish the need to acknowledge the distinc- 
tion between the two types of groups when 
conducting research of this type. Furthermore, 
it points to the possibility that group prop- 
erties, particularly goal-path clarity, deserve 
more attention in developing an understanding 
of the phenomenon of cohesiveness in task- 
oriented groups. The present study also 
strongly indicates that interpersonal attraction 
(based on value similarity) is not central to 
the cohesiveness of all groups. 

Further research is needed to define more 
clearly the conditions under which interper- 
sonal attraction is important, and this issue 
may be more complex than previously con- 
sidered. For example, as the results of previous 
research (Bales, 1950) suggest, in groups 
which extend further into time than those in 
this study, interpersonal attraction may 
emerge as a significant variable in task- 
oriented groups as well. As the task becomes 
more routinized in high-clarity groups, change 
to a greater emphasis on interpersonal attrac- 
tion could occur. The possibility also exists 
that in low-clarity groups, negative reaction to 
continued failure could generate greater 
emphasis on interpersonal attraction. The 
consequences of this change may be a mani- 
festation of slower deterioration of high-inter- 
personal-attraction groups and the occurrence 
of mutually ingratiating behavior in low-inter- 
personal-attraction groups. If so, such findings 
would support the general premise of this re- 
search; group process variables such as co- 
hesiveness are the product of a complicated 
and extensive set of variables whose interrela- 
tionships are just beginning to become clear. 
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deCharms has hypothesized that increasing extrinsic rewards may lead in- 
dividuals to perceive their behavior as under the control of the rewards and 
that this, in turn, may reduce their intrinsic motivation. Recently, Deci has 
reported several studies dealing with this interaction between intrinsic and 
extrinsic motivation. A number of methodological problems with Deci’s 
experiments are discussed. Support for deCharms’s hypothesis is critically re- 
viewed in order to direct future research. 


An important question concerns how in- 
trinsic and extrinsic motivation combine to 
determine an individual’s overall level of mo- 
tivation. Although common sense would lead 
one to expect that intrinsic and extrinsic mo- 
tivation summate, deCharms (1968) has 
argued that intrinsic and extrinsic motivation 
may interact. Several recent studies have ex- 
plored the effects of extrinsic rewards on an 
individual’s intrinsic motivation to perform a 
task. Deci, one of the more active researchers 
in this area, has followed deCharms in predict- 
ing that intrinsic and extrinsic rewards are not 
additive in their effect on motivation, and that 
the introduction of contingent monetary re- 
wards or punishment reduces intrinsic motiva- 
tion to perform an activity (Deci, 1971; Deci 
and Cascio, 1972). However, unlike deCharms, 
Deci has also predicted that verbal reinforce- 
ment increases intrinsic motivation to perform 
a task (Deci, 1971, 1972a), while noncon- 


t Requests for reprints should be sent to Bobby J. 
Calder, Organizational Behavior Program, 194 Com- 
merce West, University of Illinois, Urbana, Illinois 
61801. 

2 A field experiment reported by Deci (1971, Ex- 
periment 2) is not discussed since it provides evidence 
neither for nor against the interaction of intrinsic 
and extrinsic motivation. The experiment showed a 
marginally significant decrease in the dependent mea- 
sure of intrinsic motivation when subjects were paid 
on a piece-rate basis, but the sample was extremely 
small (n= 4) and there was a 50% mortality rate 
among the unpaid control group. Also, it is not clear 
theoretically how the dependent measure (the speed 


> writing headlines) is related to intrinsic motiva- 
on. 
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tingent financial rewards leave intrinsic mo- 
tivation intact (Deci, 1972b). Unfortunately, 
there are a number of methodological prob- 
lems associated with Deci’s experiments. The 
purpose of this article is to review critically 
the support for each of these propositions in 
order to direct future research. 

Research on the effects of contingent mone- 
tary rewards on intrinsic motivation is most 
relevant to the deCharms hypothesis. To ex- 
plore these effects, Deci (1971, 1972a) had 
subjects work on a series of interesting puz- 
zles. The experimental sessions were divided 
into work periods and free-time periods. Dur- 
ing a free-time period subjects could read 
magazines, remain idle, or continue to work 
on the puzzles; the amount of free time spent 
on the puzzles was taken on face validity as 
a dependent measure of intrinsic motivation. 
The results of these studies indicated that 
subjects who were paid contingent on their 
task performance (the number of puzzles 
solved) spent less free time on the puzzles than 
did unpaid controls. Deci interprets this find- 
ing as demonstrating that the contingently 
paid subjects lose intrinsic motivation for the 
activity. Their reevaluation of the activity is 
produced by the perception that “it is mo- 
tivated by the money” rather than “it is in- 
trinsically motivated [Deci, 1972a, p. 114].” 

Although these contingent reward studies do 
seem to support deCharms’s hypothesis, there 
are problems with their interpretation which 
should be made explicit in order to direct fur- 
their research.2 First, in none of the Deci 
studies are the performance data reported for 
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the experimental task. It is thus unclear 
whether any change in free time spent on the 
task is due to a change in intrinsic motiva- 
tion or merely to differences in performance. 
That is, in terms of a causal model, the per- 
- formance of the subjects is an uncontrolled 
variable which possibly mediates the relation- 
ship between the manipulated variable and 
the dependent measure of intrinsic motivation. 
One would expect, for example, that the in- 
troduction of contingent rewards increased ef- 
fort in solving puzzles during the experimental 
sessions. Therefore, the decreased amount of 
free time spent on the puzzles after the ex- 
perimental sessions could be due to factors 
such as satiation or fatigue rather than any 
cognitive reevaluation of why one is perform- 
ing the task.* 

A second ambiguity in the contingent re- 
ward studies is the magnitude of the reward. 
Since the rewards were administered contin- 
gent on performance, we have no information 

. about the amount of reinforcement which ac- 
tually constituted the manipulated variable. 
This omission is unfortunate, since Deci 
(1972b) later compares the data from a con- 
tingent reward experiment (Deci, 1972a) to 
data from a study using fixed, noncontingent 
reinforcement. In a comparison of intrinsic 
motivation resulting from these two manipula- 
tions, it is thus impossible to tell whether any 
differences are due to the contingent versus 
noncontingent factor or to possible differences 

‘in the amount of reinforcement delivered. 


*Deci (1971, Experiment 1) employed three ses- 
sions over which a control group was never rewarded 
and an experimental group was rewarded during the 
second session but “told that they would not be paid 
for the third session because there was only enough 
money available to pay them for one of the sessions 
[p. 1091.” A free-time period was embedded in the 
middle of each session as a measure of intrinsic 
motivation. Although this dependent variable is not 
the same as experimental task performance, it does 
point up the confounding of performance effects. 
Unlike the controls, in the second session, subjects’ 
free-time performance increased markedly and then 
dropped in the third session. Deci analyzed this data 
with difference scores between the first and third ses- 
sions. Obviously the difference for the experimental 
group could be due to satiation, fatigue, etc. pro- 
duced by their heightened performance in the second 
session, Moreover, note the confounding effect of 
“withdrawing” a previous reward in the third session. 


A third ambiguity concerns the timing of 
the reward. In the contingent reward studies, 
extrinsic rewards decreased intrinsic motiva- 
tion when subjects expected the reward but 
were not actually paid until after the comple- 
tion of the entire experiment, including the 
free-time period. However, in one of these 
same experiments, Deci (1972a) reports data 
showing that intrinsic motivation increased 
when contingent payment was made after the 
task but before the free-time period. This lat- 
ter finding was interpreted as supporting 
equity theory, since increased free time spent 
on the task could have provided a means of 
resolving overpayment inequity. Whatever the 
merits of this interpretation, it still should be 
noted that the data provide a relevant, albeit 
nonconforming, test of the deCharms hypothe- 
sis. That is, one would expect payment before 
the free-time period to have made the extrinsic 
reward even more salient, thereby reducing 
intrinsic motivation. In short, there does not 
seem to be any obvious theoretical rationale 
for limiting changes in intrinsic motivation to 
contingent rewards presented after the free- 
time period. Equity considerations provide 
only a post hoc explanation for the absence of 
the effect when the reward was presented 
before the free-time period. 

Another issue relevant to the occurrence of 
rewards is whether or not they are expected. 
In the experiment just described, Deci manip- 
ulated the timing of the reward but subjects 
always expected to receive the reward. Lepper, 
Greene, and Nisbett (1973) varied the ex- 
pectation as well as the level of rewards. An 
extinsic reward (a good player award consist- 
ing of a gold star and red ribbon) was prom- 
ised to some children before they performed 
an interesting task (playing with magic mark- 
ers), while other children were not told of the 
award until after completion of the task, The 
extent to which the children played with the 
magic markers was later recorded in a free- 
play situation as a measure of intrinsic mo- 
tivation. The amount of free time spent on the 
markers was lowest for the group expecting 
the reward and highest for the group not ex- 
pecting the reward, with a no-reward control 
group intermediate between the two extremes. 
Thus it is possible that any decrease in in- 


trinsic motivation is limited to expected ex- 
trinsic rewards. Furthermore, the Lepper et al. 
finding suggests the possibility that it is not, 
as Deci contends, the self-perception of in- 
trinsic motivation which is the crucial factor 
but rather the perception of the offer of the 
reward. That is, it is possible that the extrinsic 
reward is perceived as a bribe (cf. Steiner, 
1970) or as conyeying information that the 
experimenter does not view the activity as en- 
joyable enough to be performed without an 
extra reward. More research is needed on this 
possibility before we can be confident that the 
decline in performance is produced by a de- 
Crease in intrinsic motivation as opposed to a 
reaction to the offer of the reward. 

Several other studies are also relevant to 
the interaction between intrinsic and extrinsic 
motivation. In order to determine the effect 
of verbal reinforcement on intrinsic motiva- 
tion, Deci (1971) manipulated verbal rewards 
using the same design employed for monetary 
rewards. Subjects again worked on a puzzle- 
solving task and then were exposed to a free- 
time period. Some subjects, but not others, 
were told that they had performed much bet- 
ter than average on the puzzle task. The re- 
sults of this experiment showed that the ver- 
bally reinforced subjects spent more free time 
on the puzzle than controls. A second experi- 
ment (Deci, 1972a) combined the verbal re- 
ward variable with the previously discussed 
manipulation of the timing of the contingent 
monetary reward (money after the free-time 
period, money before the free-time period, no 
money). In this study, the effect for verbal 
reinforcement is nonsignificant by conven- 
tional standards. Nontheless, Deci attempts to 
salvage the verbal reinforcement effect by in- 
terpreting a nonsignificant Verbal Reinforce- 
ment X Sex interaction, and verbally rein- 
forced male subjects do appear to spend more 
time on the puzzles during the free-time pe- 
riod. In any case, Deci (1972a, 1972b) in- 
terprets these two experiments as indicating 
that verbal reinforcement is not phenomeno- 
logically distinct from intrinsic rewards and, 
therefore, adds to one’s intrinsic motivation to 
perform a task. Certainly the evidence for this 
Proposition is most ambiguous. Furthermore, 
stimulus generalization or secondary reinforce- 
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ment appear to be alternative interpretations 
to any hypothesized changes in intrinsic mo- 
tivation.* 

Let us now turn to a recently reported 
study (Deci, 1972b) on the effects of noncon- 
tingent rewards on intrinsic motivation. As in 
earlier experiments, subjects participated in an 
experimental work session and a free-time pe- 
riod. In this study, however, subjects in the 
experimental group were paid $2 regardless of 
their performance on the puzzle task, while 
control subjects were not paid at all. he re 
sults indicated no significant difference in t 
amount of free time spent on the puzzle J 
tween the experimental and control groups 
This failure to find a significant difference 
was interpreted as demonstrating that noncon- 
tingent monetary rewards do not change in- 
trinsic motivation, and that with noncontingent 
rewards subjects are less likely to perceive 
themselves as being motivated by the reward 
This conclusion is not justified by the data. 
Notice that Deci has essentially affirmed ihe 
null hypothesis. Because we can never know 
what factor, if any, accounts for a lack of 
change, it is logically impossible to prove the 
absence of an effect. Was the receipt of non- 
contingent rewards in this experiment the 
same as receiving no treatment at all, or were 
there other variables which caused the sub- 
jects’ intrinsic motivation to remain intact? 

With respect to this same noncontingent re- 
ward study, we should also note that Deci at- 
tempts to make his results more meaningful 
by comparing them with two cells (money 
after—no verbal reinforcement and no money 
no verbal reinforcement) from an earlier con- 
tingent monetary rewards experiment (Deci, 
1972a). Unfortunately there are severe prob- 
lems in comparing two studies in this way: 


4 Deci and Cascio (1972) also tested the hypothe- 
sis that punishment (a noxious buzzer) for poor task 
performance would decrease intrinsic motivation. The 
results showed that subjects who were threatened 
with contingent punishment spent (marginally sig- 
nificant) less free time on a puzzle task than did con- 
trols. However, as in the verbal reinforcement studies 
(Deci, 1971, 1972a), the aversive stimulus, threat of 
punishment, may have generalized to puzzle solving 
during the experimental sessions with the negative as- 
sociation maintained for the same activity during the 
free-time period. 
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Any changes in the two experiments could 
have produced the different results. Deci 
states in passing that the small, nonsignificant 
difference in the two no-reward (control) 
groups could be due “to the fact that a differ- 
ent experimenter conducted the two studies 
[1972b, p. 226],” but it is also clear that the 
difference in the two reward groups could be 
due to the same kind of factors. In any event, 
the more appropriate interaction test is not 
reported. 

A recent experiment by Kruglanski, Fried- 
man, and Zeevi (1971) provides better evi- 
dence about the effects of noncontingent re- 
wards or intrinsic motivation. In this study, 
some children, but not others, were offered an 
extrinsic reward (an interesting laboratory 
tour) for participating in an experiment. The 
reward, as in Deci’s (1972b) experiment, was 
not contingent upon high performance but 
upon participation in the activity. Contrary to 
Deci’s hypothesis, a decrease in intrinsic mo- 
tivation was obtained for the extrinsic, non- 
contingent reward condition. Children who 
were offered the extrinsic reward were less 
satisfied with the experimental task and less 
likely (marginally significant) to volunteer for 
similar experiments. In addition, the extrinsi- 
cally rewarded group did not perform as well 
on the experimental task (in terms of recall, 
creativity, and the Zeigarnik effect) as the 
nonrewarded children. While more research is 
needed to determine whether contingent and 
noncontingent rewards both decrease intrinsic 
motivation, this study and the Lepper et al. 
study discussed earlier indicate that they do. 
If contingency makes rewards more salient as 
extrinsic forces, perhaps the effect of extrinsic 
rewards on intrinsic motivation is actually an 
interaction such that both contingent and non- 
contingent rewards decrease intrinsic motiva- 
tion, but contingent rewards produce the larg- 
est change. In fact, the salience of extrinsic 
rewards may be one of the strongest factors 
influencing intrinsic motivation. 


CONCLUSIONS 


What then is the status of the deCharms 
hypothesis that there is an interaction between 
intrinsic and extrinsic motivation? Obviously 
the present experimental evidence is incon- 


clusive, though it does provide a basis for fur- 
ther research. We should note, however, that 
in any future research there also needs to be 
more attention given to the operationalization 
of intrinsic motivation as a dependent vari- 
able. Deci and Lepper et al. used persistence 
on a task as their single measures of intrinsic 
motivation. Nonetheless, there are other in- 
dicators which can and should be used in as- 
sessing intrinsic motivation. Perhaps the most 
obvious indicator is reported task satisfaction, 
since one certainly should like a task if he is 
willing to perform it for no other apparent 
reward. Kruglanski et al. did report that task 
enjoyment of the nonrewarded subjects ex- 
ceeded that of subjects who received an ex- 
trinsic reward. Deci reported that subjects 
rated the puzzle task for interest and enjoy- 
ment at the end of the experimental sessions 
in his 1971 study. Although Deci found that 
rated task satisfaction did not differ between 
the experimental and control groups or among 
the experimental sessions, he did not mention 
the apparent inconsistency of this attitudinal 
data with his observed behavioral change. 

Attention should also be given to the the- 
oretical status of intrinsic motivation as a de- 
pendent variable. Researchers have frequently 
failed to maintain a distinction between in- 
trinsic motivation as a perception and as an 
actual psychological process. Deci implicitly 
contends that extrinsic rewards decrease the 
perception of intrinsic motivation and this in 
turn decreases actual intrinsic motivation, We 
would argue that a clear distinction should 
be made between the two. Too little is known 
about intrinsic motivation as a psychological 
process even to assume a direct relationship 
with the perception of intrinsic motivation. 
The attributed cause of a behavior need not 
be veridical with its objective cause. Thus, re- 
search should attempt to relate the self-per- 
ception of intrinsic motivation to task per- 
formance and attitudes without making any 
premature assumptions about the actual exist- 
ence or nature of intrinsic motivation. 

This review has sought to specify some of 
the difficulties with research on intrinsic and 
extrinsic motivation and to point out some 
possibilities for advancement. In view of the 
obvious practical importance of the problem 
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and the theoretical significance of self-percep- 
tion in general (cf. Bem, 1972), one can only 
conclude that this area warrants considerable 
conceptual as well as empirical attention. 
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Cognitive Evaluation Theory and Some Comments 
on the Calder and Staw Critique 
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In criticizing our work about the effects of extrinsic rewards on intrinsic 
motivation, Calder and Staw point out that additional data are necessary 
to substantiate several of our assertions. Those data are presented here and 
they are seen to be consistent with our earlier findings. Other criticisms are 
considered within the context of cognitive evaluation theory, and areas are 
pointed out where additional research is necessary. 


Calder and Staw (1974) have made a num- 
ber of criticisms about our work dealing with 
the effects of extrinsic rewards on intrinsic 
motivation. Their criticisms concern both 
methodological issues and issues of interpreta- 
tion. We would like to respond by considering 
each point in turn. 

The first criticism was that performance 
data (i.e., the amount of time spent by sub- 
jects working on the puzzles during the first 
part of the experiment) were not reported in 
the contingent payment studies (Deci, 1971, 
1972b). Calder and Staw suggested that the 
paid subjects may have worked harder to earn 
the rewards, so the apparent decrease in in- 
trinsic motivation may have been simply a 
satiation effect for the paid subjects. Their 
point is a good one; however, there were no 
significant differences between the average 
amount of time spent by experimental sub- 
jects and control subjects in the two con- 
tingent payment studies. These data are re- 
ported in Table 1. Since there are no signifi- 
cant differences in the performance data, there 
is no evidence that the lower level of dis- 
played intrinsic motivation in the contingently 
paid subjects could be a satiation or fatigue 
effect. 

Our primary hypothesis in this program of 
research was that expected, contingently ad- 


1 Requests for reprints should be sent to Edward 
L. Deci, Department of Psychology, University of 
Rochester, Rochester, New York 14627. Gis 

2 Now at Florida International University, Miami, 


Florida. r) 
3 Now at Children’s Psychiatric Center, Inc., Mon- 


mouth County, New Jersey. 
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ministered rewards would cause a decrease in 
intrinsic motivation. The evidence lends clear 
support to the hypothesis, since this basic find- 
ing has been replicated many times in many 
situations (Deci, 1971, 1972b; Deci & Cascio, 
1972; Lepper, Greene, & Nisbett, 1973). Al- 
though any one of these five studies which 
support the hypothesis could be criticized, the 
important support comes not from any one 
study, but rather from the fact that this find- 
ing has been replicated several times. 

The second criticism made by Calder and 
Staw was that the amount of contingent pay- 
ments which subjects received in the Deci 
(1972b) study were not reported, yet the re- 
sults of this study were compared to a non- 
contingent payment study (Deci, 1972a). 
They suggest that it is not possible therefore 
to tell whether the differences in the findings 
of the two studies were due to different 
amounts of money earned or were due to the 
contingent/noncontingent variable. The aver- 
age earnings of subjects in the contingent pay- 
ment study was $2.38, whereas the earnings 
in the noncontingent study was $2.00 per sub- 
ject. It seems very unlikely that this differ- 
ence of $.38 would account for the differential 
findings in the two studies. 


TABLE 1 
AVERAGE NUMBER OF SECONDS SPENT ON 
Eacu PUZZLE 


Deci Control Paid after | Paid before 
1971 428.4 372.5 
1972b 307.7 382.3 314.2 


Note. All three differences between paid and control are 
nonsignificant. 
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Calder and Staw also take issue with our 
reasoning in the Deci (1972b) experiment 
where rewards were administered both before 
and after the free choice period. This experi- 
ment developed out of anecdotal observations 
in the first payment study (Deci, 1971) that 
subjects seemed very uneasy when they were 
actually handed the money, though this un- 
easiness did not appear until they received the 
money. Our interpretation of this uneasiness 
and reluctance to accept “all that money” 
was that they were feeling inequitably over- 
paid. 

It was hypothesized in the next experiment 
(Deci, 1972b) that if subjects were paid be- 
fore the free choice period that there would be 
an inequitable overpayment effect, whereas if 
subjects were paid after the free choice period 
this effect would not be operative so the de- 
creased intrinsic motivation would be appar- 
ent. We asserted that intrinsic motivation 
would decrease even for the subjects paid be- 
fore the free choice period and that the in- 
creased activity was a short-term inequity 
effect, not an increase in intrinsic motivation. 
Conceptually it seems quite reasonable to ex- 
pect that if a behavior which initially was 
intrinsically motivated comes under the con- 
trol of extrinsic rewards, then questions of 
equity would come into play. Unfortunately, 
our measure of intrinsic motivation and the 
measure of performance to reduce inequity 
were the same. Therefore, although the data 
support our hypotheses, they do not rule out 
the alternative interpretation made by Calder 
and Staw that payment before the free choice 
period increased intrinsic motivation. 

We think that Calder and Staw have a valid 
point in that the use of free choice time both 
as a measure of intrinsic motivation and as a 
measure of performance to reduce inequity 
leaves the results open to multiple interpreta- 
tions. But their statement that the data dis- 
confirm the hypothesis that rewards decrease 
intrinsic motivation is only an alternative ex- 
planation of the data and is not a definitive 
interpretation as they imply. In addition, their 
assertion that our explanation was post hoc is 
simply wrong. In sum, then, our hypotheses 
concerning inequity effects were confirmed by 
the predicted data, though the results are open 


to an alternative interpretation. Further work 
is required to rule out the alternative inter- 
pretation. 

In order to respond adequately to the re- 
mainder of the criticisms it is necessary to 
present cognitive evaluation theory, so that 
the criticisms can be evaluated in relation to 
the theory. 

Cognitive evaluation theory assumes that 
intrinsically motivated behavior is behavior 
which allows a person to feel competent and 
self-determining (cf. deCharms, 1968; White, 
1959). The theory then asserts that there are 
two processes by which extrinsic rewards can 
affect intrinsic motivation: (a) a change in 
perceived locus of causality and (b) a change 
in one’s feelings of competence and self-deter- 
mination. 

When a person is intrinsically motivated the 
locus of causality (deCharms, 1968; Heider, 
1958) is within himself. However, when he re- 
ceives external rewards he begins to perceive 
that he is doing the activity for the external 
rewards, so the perceived locus of causality 
changes from within himself to the external 
reward leaving him with less intrinsic mo- 
tivation. 

People are intrinsically motivated to per- 
form activities which make them feel com- 
petent and self-determining; therefore, re- 
wards or feedback can affect their intrinsic 
motivation by affecting their feelings of com- 
petence and self-determination. Rewards or 
feedback that strengthen these feelings en- 
hance intrinsic motivation, and feedback (or 
punishment) that weakens these feelings de- 
creases intrinsic motivation. Experiments have 
shown that positive feedback (if it does not 
initiate the change in perceived locus of 
causality process) increases intrinsic motiva- 
tion, and negative feedback decreases it (Deci, 
Cascio, & Krusell, 1973). 

The fact that these two processes are initi- 
ated at different times is because every reward 
has two aspects. The first is a controlling as- 
pect, which initiates the change in perceived 
locus of causality process. The other aspect 
to every reward is the information it gives a 
person about his competence and self-deter- 
mination. Therefore, whether the “change in 
perceived locus of causality” process or the 
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“change in feelings of competence and self- 
determination” process is invoked depends on 
which aspect of the reward is more salient. 
Calder and Staw raised the question of 
whether a reward needs to be expected in or- 
der to cause a decrease in intrinsic motivation. 
Lepper et al. (1973) found that when per- 
formers expected rewards before they began 


: the task, there was a decrease in intrinsic 


motivation, but when they did not expect 
them there was no decrease. On the other 
hand, Kruglanski, Alon, and Lewis (1972) 
found that intrinsic interest for an activity 
decreased when people received unexpected 
rewards after they had performed interesting 
activities. The empirical results then are 
mixed. 

Cognitive evaluation theory predicts that 
expected rewards would lead to a larger de- 
crease in intrinsic motivation than unexpected 
rewards would. If a person expects a reward 
while he is doing the activity, then it is very 
likely that he perceives that he is doing the 
activity for the reward (i.e., it is very likely 
that the change in perceived locus of causality 
is operating). On the other hand, if he does 
not expect the reward, there is less likelihood 
that he comes to perceive that the reason he 
is doing the activity is to get the extrinsic 
reward. It is possible though that when he 
gets unexpected rewards he retrospectively re- 
evaluates the task from one that he does for 
intrinsic interest to one that he does to get 
rewards. This would of course leave him with 
less intrinsic motivation. So either an expected 
or an unexpected reward could decrease in- 
trinsic motivation, but the unexpected reward 
would decrease it the least and would be less 
likely to do so. 

Calder and Staw have said that the Lepper 
et al. failure to get a decrease from unex- 
pected rewards raises the possibility that the 
decreases in our experiments were due to the 
fact that the extrinsic reward is perceived as 
a bribe and conveys to the subject that the 
task is not very interesting. We see this 
criticism as a straw man, When a subject is 
offered a reward for performing an activity, 
the reward leads to a reevaluation of the task 
from one that he does for intrinsic reasons to 
one that he does for the reward. He is then 


TABLE 2 


MEAN NUMBER OF SECONDS OF FREE CHoIcE 
TIME SPENT WORKING ON THE PuzzLES 


Female subjects Male subjects 
Group 7 i 
Female | Male | Female | Male 
experi- | experi- | experi- | experi- 
menter menter menter menter 
Positive 
: feedback 136.50 | 454.50 | 340.25 
Control 354.88 | 239.75 | 275.25 
Note. Each cell n = 8, 
ipso facto less intrinsically motivated to do 


it. Whether the offer alone without any per- 
formance is enough to produce a change in 
perceived locus of causality, and therefore a 
decrease in intrinsic motivation, is an empir- 
ical question. We suspect that the offer would 
not be a sufficiently powerful manipulation to 
lead to the reevaluation, though indeed it may 
be. If it does, the decrease would be less than 
in situations where the person performed the 
activity for the reward. 

Calder and Staw then point out that the 
findings in relation to the effects of positive 
feedback (or verbal reinforcement) are am- 
biguous. In the experiment which they criti- 
cized (Deci, 1972b) the results suggested that 
positive feedback may increase the intrinsic 
motivation of males and decrease it for females, 
though the results were equivocal. Therefore, 
we performed an additional experiment to 
clarify this issue. This experiment used both a 
male and a female experimenter who gave 
positive feedback to both male and female 
subjects. The results appear in Table 2 and 
an analysis of variance in Table 3. Female sub- 
jects who received positive feedback spent less 
free choice time working on the puzzles than 
subjects who got no feedback regardless of 
whether the experimenter was male or female. 
In other words, females who received positive 
verbal feedback showed less intrinsic motiva- 
tion following the puzzle-solving experience 
than females who received no feedback. 

On the other hand, positive feedback in- 
creased the intrinsic motivation of males just 
as it did in previous experiments (Deci, 
1971, 1972b). This phenomenon was pro- 
duced when the experimenter was female just 
as it was when the experimenter was male. 
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TABLE 3 


ANALYSIS OF VARIANCE ON THE MEAN NUMBER 
OF SECONDS OF FrEE CHOICE Time SPENT 
WORKING ON THE PUZZLES 


Source df MS F 
Feedback (A) 1 172.4 | <1 
Experimenter sex (B) $ 2,438.0 | <1 
Subject sex (C) 1 | 207,138.9 5.714* 
AXB 1 | 102,319.9 2 
AXC 1 | 298,528.0 8.235** 
BX C 1 42,797.1 1.181 
AXBXC 1 412.8 | <1 

Error 56 36,252.41 | 
*p <.05. 
+p <01. 


The analysis of variance summary is pre- 
sented in Table 3 and shows this Sex of Sub- 
ject X Feedback interaction to be highly 
significant. The summary table also shows a 
main effect (p < .05) for sex of subjects sug- 
gesting that males are more intrinsically mo- 
tivated than females. However, this is some- 
what misleading, in that this main effect is 
caused entirely by the positive feedback con- 
dition, The more valid test of whether males 
and females differ is done by comparing 
control conditions. Here we see no difference. 
Women control subjects spent an average of 
280.3 seconds while males spent an average 
of 257.5 seconds of free choice time work- 
ing on the puzzles. The main effect for sex 
of subject simply underscores the strength of 
the differential effect of positive feedback on 
males and females since all of the main effect 
is accounted for by the feedback condition. 
In sum, positive feedback increases the in- 
trinsic motivation of males, whereas it de- 
creases the intrinsic motivation for females. 
Our interpretation is that for males, the 
positive feedback strengthened their feelings 
of competence and self-determination, whereas 
for females it changed their perceived locus 
of causality. This can be accounted for in 
terms of the socialization of males versus fe- 
males in our society. The role traditionally 
ascribed to women is a more dependent one. 
Further, they are encouraged to be more sen- 
sitive to what other people say. Consequently, 
they would be more likely to be sensitive to 
the positive feedback from others, and they 
are more likely to become dependent on it. 
So the controlling aspect of this verbal re- 


inforcement is more salient than the infor- 
mational aspect, so it initiates the change in 
perceived locus of causality process, leaving 
them with less intrinsic motivation. Males, 
however, evaluate the feedback differently. 
They are less sensitive to and dependent on 
this reinforcement so it does not change 
their perceived locus of causality, it simply 
strengthens their feelings of competence and 
self-determination, leaving them with more 
intrinsic motivation. 

We agree with Calder and Staw that the 
question of whether noncontingent monetary 
rewards decrease intrinsic motivation is not 
fully settled at this time. Our study (Deci, 
1972a) showed that noncontingent rewards 
did not decrease intrinsic motivation, though 
Kruglanski, Friedman, and Zeevi (1971) 
found a decrease from noncontingent re- 
wards. Cognitive evaluation theory asserts 
that a change in perceived locus of causality 
is one way to decrease intrinsic motivation. 
If a reward is contingent on performance, 
then the performance is instrumental for get- 
ting the reward, so the person perceives that 
he is doing the activity to get the reward— 
his perceived locus of causality has changed. 
If the reward is noncontingent, the link be- 
tween the activity and the reward is less 
apparent, so it is less likely to lead to a change 
in perceived locus of causality. Nonetheless, 
it could lead to a change and therefore a 
decrease in intrinsic motivation, but the de- 
crease would be smaller and less likely to 
occur. 

Both with unexpected rewards and noncon- 
tingent rewards, the situation is more am- 
biguous than with expected, contingent re- 
wards, so the experimental findings have been 
less clear-cut. It seems important that both 
these questions be investigated further, 

We agree with Calder and Staw that more 
attention should be given to operationaliz- 
ing intrinsic motivation. In a recent study 
from our laboratory (Deci, Benware, & 
Landy, 1974), we used a pencil-and-paper 
measure to assess attributions of intrinsic 
and extrinsic motivation. The use of such 
measures could be useful as a supplemental 
measure for studying the effects of rewards 
on intrinsic motivati 
behavioral a ee eee 

ly preferable to 
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a pencil-and-paper measure as the primary 
measure. 

Finally, we agree with Calder and Staw 
that more attention needs to be given to an 
understanding of the psychological basis of 
intrinsic motivation. This has been done in 
great detail elsewhere (Deci, in press), 
though space limitations prevent us from 
dealing with the question here. 
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In two separate experiments, attributions of success and failure experiences to 
the four factors of effort, ability, luck, and task difficulty were investigated. 
Results of both experiments confirmed (a) a tendency to attribute success 
more than failure to internal factors and to unstable factors, (b) a strong 
interactive tendency to attribute success to effort as opposed to ability, but to 
attribute failure to lack of ability as opposed to lack of effort, and (c) no 
effect for a manipulation of expected difficulty of task. The significance of these 


results is discussed. 


The manner in which individuals attribute 
the cause for their success and failure is re- 
ceiving increased attention. Among numerous 
unresolved issues, there is the question of 
whether an individual attempts to maintain 
his self-esteem through the mechanism of 
attributing success internally and failure ex- 
ternally. Some studies have yielded evidence 
consistent with this possibility (Fitch, 1970; 
Frieze & Weiner, 1971; Johnson, Feigenbaum, 
& Weiby, 1964) while evidence from other 
studies is unsupportive (Chaikin, 1971; 
Feather, 1969). 

Another issue concerns the manner in which 
a person’s attributions of external and inter- 
nal causation vary as a function of his expec- 
tations regarding task performance. It has 
been shown that unexpected success or failure 
is attributed more externally than is expected 
success or failure (Feather & Simon, 1971a, 
1971b). However, other evidence suggests 
that unexpected success or failure is attrib- 
uted more internally than is expected success 
or failure (Frieze & Weiner, 1971; Luginbuhl, 
1972). 

Finally, recent work (Feather & Simon, 
1971; Frieze & Weiner, 1971; Weiner, Frieze, 
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Kukla, Reed, Rest, & Rosenbaum, 1972) 
indicates that earlier conceptions of what 
constitutes internal versus external attribu- 
tion have been inadequate and to some extent 
misleading. In the past, an internal attribu- 
tion has usually been interpreted as referring 
to the actor’s personal ability, while an ex- 
ternal attribution has most frequently been 
interpreted as assignment of causality to 
chance factors. As Frieze and Weiner (1971) 
have pointed out, this conceptualization con- 
founds an internal-external dimension with 
a stability—instability dimension. To remedy 
this difficulty, they proposed a two-dimen- 
sional scheme which gives rise to four attribu- 
tional factors: effort (internal and unstable), 
ability (internal and stable), luck (external 
and unstable), and task difficulty (external 
and stable). Preliminary work by Frieze and 
Weiner (1971) and by Feather and Simon 
(1971b) confirms the use of these four 
attributional categories. 

The present article reports two separate 
experiments. Experiment 1 examined three 
aspects of the interaction between success— 
failure and internal-external attribution. The 
first was to ascertain whether individuals 
tend to attribute success internally and fail- 
ure externally. While people do attempt to 
maintain self-esteem, they also need to behave 
in a manner consistent with reality. There- 
fore, while under most conditions success may 
be attributed more internally (to effort and/or 
ability) than failure, this might not occur in 
a situation where there were compelling fac- 
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tors which suggested that success was not 
internally caused. 

It was therefore predicted that (a) success 
would be attributed more internally than 
failure, and (b) success would be attributed 
more internally than externally, except (c) 
under conditions of great uncertainty. For 
example, in the case where the determining 
causes for a subject’s success or failure were 
thoroughly unclear, we expected internal at- 
tribution for success to decline. We were par- 
ticularly interested in determining how read- 
ily subjects would attribute their success to 
“luck” under conditions of great uncertainty. 

The second purpose of Experiment 1 was 
to attempt to differentiate more fully the 
types of attributional patterns individuals 
would make regarding their success or failure. 
For example, if an individual attributes his 
success to internal causes, to what extent will 
Heider’s (1958) two factors of effort and abil- 
ity dominate one over the other? No specific 
predictions concerning the nature of this 
attributional pattern were made. 

Finally, it was felt that an individual’s per- 
ception of his own degree of control over his 
environment might affect his reactions to, and 
attributions of, success and failure. The 
simple prediction was made that those who 
scored in the internal direction on Rotter’s 
Internal—External (I-E) scale (Rotter, 1966) 
would attribute both their successes and fail- 
ures more internally than those who scored 
in the external direction. Other research (e.g., 
Crandall, Katkovsky, & Crandall, 1965; 
Weiner & Kukla, 1970) has shown little cor- 
relation between attributions for success and 
those for failure. However, these researchers 
used an T-E scale other than the one devised 
by Rotter, and it was felt that additional 
information would be provided by using the 
Rotter scale in the present experiment. 


EXPERIMENT 1 
Method 
Design 


Two independent variables were manipulated. 
The first consisted of the outcome on the experi- 
mental task (success or failure), The second con- 
sisted of four levels of task ambiguity. The design 


was thus a 2 X 4 factorial with six males and four 


females in each cell. 


87 


Subjects 


Introductory psychology students volunteered to 
Serve as subjects as one manner of satisfying an 
experimental requirement. In addition to the 80 
subjects for whom data is reported, 2 subjects were 
eliminated from the analysis because they failed to 
follow instructions, and 2 were eliminated because 
they perceived the nature of the deceptions. 


Procedure 


Each subject entered the experimental room indi- 
vidually and was seated by the experimenter (the 
second author) at the end of a long table. About 
60 centimeters in front of the subject was a 90 X 150 
centimeter rectangular partition. After completing 
the I-E scale (Rotter, 1966), subjects read a set 
of instructions which described the upcoming task 
as having to do with “signal identification under 
varying conditions of interference.” The task was 
analogous to that of a radar operator who must 
discriminate among objects such as airplanes, birds, 
and clouds which appear on his screen. These objects 
possess overlapping characteristics which make it 
difficult to tell them apart. In addition, interference 
in the form of sunspots and high frequency radio 
waves add to the radar operator’s difficulty, The 
subject was told that the experimenters were inter- 
ested in how people cope with varying types and 
degrees of interference and that he (she) would 
perform a task analogous to that of the radar 
operator. The instructions continued: 


You will be presented with a series of patterns, 
and your task will be to identify each pattern 
as representing one of three types of objects. 
To simplify the procedure somewhat, instead of 
aircraft, birds, or clouds, we have labeled the 
objects A, B, and C. As each sequence of pat- 
terns is presented to you, you will tell whether 
you think it is an A, a B, or a C. You will then 
be told whether you are correct or incorrect, 


Subjects were told they would be right a third of 
the time just by guessing. 

They were then informed of the particular level 
of interference they would experience, after which 
they viewed a series of 30 slides through an opening 
in the partition behind which they sat. The slides 
were projected on a wall approximately five meters in 
front of the subject, and a screen covering the open- 
ing in the partition was raised so that each slide 
could be viewed for about four seconds, Subjects 
indicated after viewing each slide whether the pat- 
tern represented an A, a B, or a C, after which they 
received feedback appropriate to their experimental 
condition. 

The patterns consisted of different arrays of nine 
geometric forms. The patterns had been photo- 
graphed and were presented on 35 millimeter slides 
by means of a carousel projector. Following presen- 
tation of the 30 slides, the dependent measures were 
obtained. 
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Independent Variables 


Outcome. After every trial subjects were told 
whether they were right or wrong. Those in the 
success condition were told they were right on 23 
out of the 30 trials, while those in the failure condi- 
tion were told they were wrong on 23 out of the 
30 trials. 

The pattern of these reinforcements was ran- 
domly determined with the restriction that success 
(failure) subjects experienced 11 rights (wrongs) 
during the first 15 trials and 12 rights (wrongs) 
during the last 15. 

Interference. There were four levels of interfer- 
ence: 0%, 33%, 66%, and 100%. Interference was 
manipulated through the simple process of changing 
the focus on the projector so as to blur the slides 
almost beyond recognition. When blurred, the indi- 
vidual geometric shapes were indistinguishable from 
each other, although there were still some vague 
differences in overall effect from slide to slide. At 
0% interference, no slides were blurred; one third 
were blurred at 33% interference; two thirds at 
66% interference; and every slide was blurred at 
100% interference. Before the experimental trials, 
subjects were shown an example of a clear and of a 
blurred slide. 

It is quite possible that subjective difficulty of the 
task increased along with increasing interference. 
However, this first experiment was not designed to 
explicitly manipulate difficulty and no assessment of 
subjective difficulty was obtained. 


` Dependent Measures 


The primary dependent measure was a question 
asking the success subjects to attribute (in percent- 
ages totaling 100%) their performance as due to 
effort, good luck, ability, or ease of task, and the 
failure subjects to attribute their performance in the 
same manner to lack of effort, bad luck, lack of 
ability, or task difficulty. In addition to checks on 
the success—failure manipulation, subjects were asked 
to estimate the percentage of their total effort which 
was devoted to the first 10 trials, the second 10 
trials, and the final 10 trials. This latter measure 
yielded no effects. 


Transformed Variables 


In order to investigate the differential attribution 
of performance in success and failure conditions, the 
repeated and dependent measures of attribution to 
effort, luck, ability, and task difficulty were trans- 
formed to three orthogonal measures for each sub- 
ject. The first of these variables, called INT-EXT, 
was the sum of the attributions made to internal 
factors (effort and ability) minus the sum of the 
attributions made to external factors (luck and task 
difficulty). A score of zero on this variable indicates 
equal amounts of internal and external attribution; 
& positive score indicates relatively more internal 
attribution, while a negative score indicates rela- 
tively more external attribution. The second derived 
Variable, called UNS-STA, measures temporal stabil- 
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ity and was created by taking the sum of the 
attributions to unstable factors (effort and luck 
minus the sum of the attributions to stable factors 
(ability and task difficulty). A positive number indi 
cates greater attribution to unstable factors, whereas 
a negative number indicates greater attribution tc 
stable factors. A third variable represents the onl) 
remaining orthogonal measure and is the sum of 
effort and task difficulty minus the sum of ability 
and luck. Its composition was dictated by mathe- 
matical considerations, and it was included in the 
multivariate analysis for the sake of completeness 
rather than for psycho!ogical interest. 


Results 


All analyses revealed no significant rela 
tionships between sex and any of the inde- 
pendent or dependent measures. Therefore, 
the data from males and females are com 
bined for all of the results presented. 

The evidence strongly indicates that the 
success—failure manipulation worked as in 
tended. Subjects indicated on a 13-poin 
scale how well they thought they performe: 
how satisfied they were with their perform- 
ance, and the degree of aptitude they fel 
they possessed for this task. On all three mea 
sures the F ratio from the multivariate analy 
sis of variance was significant at p < .001, 
indicating better estimated performance, X 
success = 8.75 and X failure = 2.43, greater 
satisfaction, X success = 8.58, and X failure 
= 2.80, and greater degree of aptitude, Ñ 
success = 6.90 and X failure = 2.40, for those 
subjects who succeeded as compared to those 
who failed. 

The means for the original four attribution 
measures and the two derived measures of 
interest (INT-EXT and UNS-STA) are pre 
sented in Table 1. Multivariate analyses, 
using the two derived variables as dependent 
measures and the dimensions of success—fail 
ure and interference as independent measures 
were performed and are shown in Table 2. 

It can be seen that the multivariate F for 
the success-failure manipulation was highly 
significant as were the univariate Fs for the 
two derived variables of INT-EXT and 
UNS-STA. Subjects who succeeded attrib- 
uted their performance to internal factors and 
to unstable factors to a significantly greater 
degree than did subjects who failed. Follow- 
ing this significant between-condition result, 
tests were conducted within the success condi- 
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TABLE 1 
ATTRIBUTION MEANS ror THE Four Factors AND Two DERIVED VARIABLES IN EXPERIMENT 1 


Outcome Effort Luck Ability Task INT-EXT UNS-STA 
Success 44.750 19.500 23.875 11.875 37.250 28.51 
Failure 9.425 8.075 39.875 42.625 —1.400 65.000 


Note. Attributions to the four fact 
possible 100%. Attribution to INT-E. 
attribution to UNS-STA is the sum 


tion and within the failure condition on the 
two derived variables (“grand mean” tests). 

Inspection of Tables 1 and 2 reveals a sig- 
nificant tendency for subjects experiencing 
success to attribute this success to internal 
causes and to unstable causes, while there 
was a significant tendency for the subjects 
experiencing failure to attribute their failure 
to stable causes. A closer inspection of the 
individual factor means illuminates these re- 
sults. While subjects who experienced success 
attributed significantly more of their per- 
formance to internal rather than external 
factors, attribution to effort was almost twice 
as great as attribution to ability. On the other 
hand, for subjects experiencing failure, at- 
tribution to lack of ability was over four 
times as great as attribution to lack of effort. 
This effect, whose presence is implied by the 
patterns of significance of the analysis of vari- 
ance, is portrayed in the left-hand portion of 
Figure 1. The relevance of these results is 
discussed below, but it is clear that subjects 
differentially attributed their performance to 
two types of internal factors as a function 
of experiencing success or failure. 

A look at the right-hand portion of Figure 
1 reveals the same basic pattern for external 
attributions. Within the success condition, 
there is more attribution to unstable (luck) 
factors than to stable (task) factors, while 
the pattern for those who failed is just the 
opposite. The results portrayed in the left- 
and right-hand portions of Figure 1 are cor- 
roborated by the grand mean tests for the 
UNS-STA variable (see Table 2). For both 
success and failure the grand mean is shown 
to be significantly different from zero, but for 
subjects who experienced success the devia- 
tion is in the direction of ascribing causality 
to unstable factors, while those subjects ex- 
periencing failure overwhelmingly attributed 


causality to stable factors. 


s of effort, luck, ability, and task are in terms of percent attribution to that faci 

o , luck, „and a s at factor out of 

T is the sum of the attributions to effort and ability minus the attributions fa 
the attributions to effort and luck minus the attributions to ability and task. 


luck and task; 


The subjects’ perception of their locus of 
control was unrelated to any of the depen- 
dent measures. 


Discussion 


The finding of major interest concerns the 
manner in which subjects allocated their 
attributions to the two internal factors of 
effort and ability. While the attributions of 
subjects who experienced success were more 
internal than were those of subjects who ex- 
perienced failure, the subjects who failed 
still ascribed their failure as much to internal 
as to external causes. There was a marked 
difference, however, in the way success versus 
failure subjects subdivided this internal at- 
tribution. Success subjects saw effort as the 
primary internal factor responsible for their 
performance, while failure subjects saw lack 
of ability as the primary factor. Those who 
succeeded attributed the success to a factor 
over which they had control (effort); those 
who failed attributed their failure to a factor 
over which they had no control (ability). 


TABLE 2 


MULTIVARIATE AND UNIVARIATE ANALYSES OF 
VARIANCE ON ATTRIBUTIONS IN 
EXPERIMENT 1 


<,001 

<,001 

<.001 

<.327 

« <,334 

we <.090 

+43 <.177 

J <.040 

TA interaction k <.246 
ate success grand M 5 <.001 
XT success grand M 27.002 <.001 
A success grand M 23.915 <.001 

ure grand M 41.158 <.001 

XT failure grand M 0.035, <.851 
TA failure grand M 124.782 <.001 


Note. Attribution to INT-EX 
to effort and ability minus 
attributions to U 
effort and Juck mi 


T is the sum of the attributions 
attributions to luck and task; 
$ s the sum of the attributions to 
s the attributions to ability and task. 
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FIGURE 1, Percent attribution to the four attribu- 
tional factors in Experiment 1. 


Some results obtained by Weiner and 
Kukla (1970) complement those just de- 
scribed. “Teachers” were required to reward 
hypothetical “learners” on the basis of their 
performance. The teachers were given infor- 
mation as to the learner’s ability and motiva- 
tion. The results showed that the teachers 
dispensed more reward when success appeared 
to result from the student’s effort rather than 
when it resulted from his ability. Further- 
more, they were less harsh when failure ap- 
peared to result from lack of ability than 
when the perceived cause was an absence of 
motivation. Weiner and Kukla’s subjects re- 
warded others on the basis of a “success due 
to effort-failure due to lack of ability” prin- 
ciple; our subjects apparently “rewarded” 
themselves according to the same principle. 
In contrast to the provocative effects of 
the success—failure experience, the differing 
levels of interference had virtually no effect. 
There was a tendency for success to be in- 
creasingly attributed to luck as the level of 
interference increased, but this tendency was 
significant at only about the .10 level. Since 
it seemed reasonable to expect subjects to 
attribute success more to luck as the task 
became more ambiguous, this result was per- 
plexing. In discussing the experiment with the 
subjects, however, we found that even those 
in the 100% interference condition perceived 
minute differences in the various slides, which 
they inferred were the bases for their success. 
These differences included specks on the 
slides, differences in shading, slight differences 


in overall configuration, etc. In othe rds 
subjects went to great lengths t 
rational reason for their success i 
doing did not have to resort to a luc ed 
interpretation for their success. 

One post hoc analysis supports this inte 
pretation. Subjects were asked whether they 
perceived a pattern in the various stimuli 
and they were divided into those who 
they definitely did, thought they did, or \ 
not sure (or did not) see a pattern. For thes: 
groups, the percentage of their su 
tributed to luck, respectively, was 
18.5%, and 35.0%. Correspondingly, there 
were decreasing attributions to ability and 
effort. Unfortunately, there was no systematic 
relationship within the failure condition be 
tween attribution and perception of a pattern 


il 


aid 


EXPERIMENT 2 


A second experiment was conducted í 
principle reasons. The first was to atte: 
replicate the results concerning the 
attributional patterns of success versu 
subjects. These patterns were sufficien: 
teresting to deserve the additional confirm 
tion of replicability. 

The second reason was to manipulate ex 
perimentally the difficulty of the task (and 
hence the subjects’ expectations regardi 
success or failure). While interference was 
manipulated in the first experiment, it is not 
clear that subjects perceived this as affecting 
the level of task difficulty. We hypothesized 
(a) some relationship between difficulty and 
attribution and (b) an increasing ati i 
of success to good luck as the difficy 
the task increased. 


Method 


The overall procedure was quite similar to that 
employed in the first experiment, The Subjects were 
48 introductory psychology students. The manipula- 
tion of success and failure was accomplished in the 
same manner as in Experiment 1. Three levels of 
difficulty were created by varying the clarity of the 
slides. Subjects in the “easy” condition viewed well- 
focused slides; subjects in the “moderately difficult” 
condition viewed blurry but recognizable slides; and 
subjects in the “difficult” condition viewed slides 
which were so out of focus as to be completely un- 
recognizable. Furthermore, subjects were informed, 
respectively, that 95%, 50%, or 5% of the other: 
who had engaged in this task had met with success, 
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TABLE 3 
MULTIVARIATE AND UNIVARIATE AN 
VARIANCE ON ATTRIBUTION IN 
EXPERIMENT 2 


YSIS OF 


Source of variation t 
Multivariate success-failure <.001 
INT-EXT success-failure <.001 
UNS-STA success-failure <.001 
Multivariate interference <.453 
INT-EXT interference <.764 
UNS-STA interference <.180 
interaction <.257 

T interaction <.874 

i j <.034 

<.001 

<.001 

‘A success grand M <.015 
Multivariate failure grand M <.001 
INT: T failure grand M <.007 
UNS-STA failure grand M <.001 


Nolte. Attribution to INT-EXT is the sum of the attributi 
to effort and ability minus the attributions to luck and tas 
attribution to UNS-STA is the sum of the attributions to 

effort and luck minus the attributions to ability and task. 


Thus, we hoped to establish expectations regarding 
the difficulty of the upcoming task and the expected 
performance of the subjects, This yielded a 2 X3 
design with 8 subjects per cell, 4 males and 4 females. 

The major change in the dependent variables was 
the addition of two questions designed to assess the 
effectiveness of the difficulty manipulation before the 
subjects engaged in the task and a question at the 
end of the experiment regarding how difficult the 
task seemed to them, 


Results 


Using exactly the same procedure described 
in Experiment 1, two derivative variables, 
INT-EXT and UNS-STA were created. 
Analyses of variance (see Table 3) revealed 
that the success-failure manipulation had the 
same effect on these variables as in the pre- 
vious experiment. Subjects who succeeded 
attributed significantly more of their per- 
formance to internal factors and to unstable 
factors than did those who failed. Table 4 
presents the means of proportional causality 
attributed to the four factors of effort, luck, 
ability, and task difficulty, as well as the 
means for the two derivative variables; it can 
be seen that the means in this experiment are 
very similar to those in the first experiment. 


Furthermore, the interactions regarding attri- 
bution by those who succeeded to unstable 
factors and those who failed to stable factors 
were convincingly replicated. 

Turning next to the effects of the difficulty 
manipulation, subjects were asked before be- 
ginning their 30 experimental trials to esti- 
mate their probability of success and how 
difficult they expected the task to be. On both 
of these measures the difficulty manipulation 
had a highly significant effect, p < .001, As 
ambiguity increased, the subjects anticipated 
a decreased probability of success and ex- 
pected the task to be more difficult, Given 
that perceived difficulty appears to have been 
successfully manipulated, it is disconcerting 
to note that the difficulty of the task had no 
discernable effect on subjects’ attributions. 
There was a slight tendency for success to be 
attributed more to effort with increasing task 
difficulty, but within the failure condition the 
level of difficulty had no effect whatever. As 
before, there was no effect for sex of subject, 
and the results presented combine the data 
for males and females. 


Discussion 


As stated earlier, we expected some rela- 
tionship to occur between the increasing dif- 
ficulty of the task and attribution. Due to the 
nature of the difficulty manipulation (making 
the task more obscure), it was our expecta- 
tion that subjects who experienced success at 
the difficult task would attribute their success 
to luck to a greater degree than would those 
who experienced success when the task was 
easy. A second (and opposite) prediction 
could have been that as task difficulty in- 
creased, success would be increasingly at- 
tributed to ability (e.g., Luginbuhl, 1972). 

The failure of the difficulty manipulation 
to result in either of these effects, or any 
other effect, is puzzling and for several rea- 


TABLE 4 


ATTRIBUTION MEANS FOR THE Four FACTORS AND Two DERIVED VARIABLES IN EXPERIMENT 2 


Outcome Effort Luck | Ability Task -EXT | UNS-STA 
41.13 17.33 28.79 12.75 39.833 16.917 
ae 6.67 6.13 34.17 53.04 —18.333 74.417 


Note. Attribution to IN’ 
attribution to UNS-STA is t! 


‘T-EXT is the sum of the attributions to effort an 
he sum of the attributions to effort and luck minus 


d ability minus the attributions to luck and task; 
the attributions to ability and task, 
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sons: (a) Feather and Simon (1971b) found 
a manipulation of expected difficulty to have 
an effect on subjects’ attributions; (b) our 
data indicate that the expectation of an easy 
versus difficult task was successfully induced; 
(c) when subjects were asked after the entire 
experiment about the difficulty of the task, 
they responded in a manner appropriate to 
the intended manipulation; and (d) the ma- 
nipulation of difficulty had considerable face 
validity. 

A careful analysis of the task and of the 
reactions of the subjects may shed some light 
on these results. Performance on the task in 
which the subjects engaged was defined in 
terms of skill. This was the kind of task per- 
formed by radar operators, and no one wishes 
to believe that radar operators are sometimes 
just lucky to get a plane down safely. Thus, 
the skill nature of the task may have been 
such that even when the “patterns” viewed 
by the subjects were utterly meaningless (the 
difficult condition), they felt unable to at- 
tribute success to luck. For the same reason, 
subjects who failed could not attribute their 
failure to bad luck. To attempt to absolve 
oneself of responsibility for failure on a skill- 
based task by attributing the failure to bad 
luck may simply be too crass a maneuver for 
the average person. Thus, the fact that the 
task was presented as requiring skill may 
have prevented subjects from attributing 
either success or failure to luck. 

Since the task was defined as one requiring 
skill, why was there no increase in attributing 
success to ability as the difficulty of the task 
increased? The phenomenology of the success 
experience may provide a partial answer. We 
received the impression, both from observing 
the subjects during the experiment and in 
talking with them afterwards, that many of 
those who experienced success were somewhat 
mystified as to why they had done so well. 
This is understandable since their “success” 
was experimentally manipulated. Most sub- 
jects thought they had deciphered a pattern 
but were not really sure, and this lack of cer- 
tainty became somewhat more pronounced as 
the difficulty of the task increased. As a re- 
sult of this uncertainty, subjects may not 
have felt they could attribute their success 
to ability since they were not sure which of 


their abilities were being called 
is interesting is that they refuse 
their successful performance ! 
Both experiments testify to the 
individuals to attribute a succes 
ance to external causes. 

It is quite possible that anot! pe of 
task and/or another type of diffic 
lation would result in a differen! 
attributions. Feather and Simon 
an anagrams task and found in 
bution of unexpected success to 
failure to bad luck. However, th 
ability attributions decreased w- 
was unexpected. It seems clear th 
of the task may be a critical v« 
termining the causal ascription ù 
and failure. 

Let us now turn to the robust that 
subjects who experienced succes 
their performance to unstable fac 
nificantly greater degree thar 
factors, with the attributions be 
for those subjects who experien: 

In the case of external factors, thos 

who experienced success attributed the ca 
slightly more to luck than to the ease of the 
task, while those who experienced 
ascribed the cause of their failure 
ingly to the difficulty of the tas! 

to bad luck. This latter finding 
prising since all subjects thought the task was 
relatively difficult. 

Considering attribution to internal factors, 
subjects indicated that they viewed sı 
resulting primarily from effort, whil 
was seen to result from lack of al 
findings are consistent with the | 
thet individuals wish to claim cre 
successes and not claim credit for 
failures, a notion that has received some 
support in the present study and which has 
been supported by other research (e.g., Beck- 
man, 1970; Johnson, Feigenbaum, & Weiby, 
1964). Subjects attributed success to a factor 
over which they had control (effort) and 
failure to a factor over which they had no 
control (ability). Ascribing failure to lack of 
ability may be a way of deferring personal 
responsibility since one obviously has little 
control over one’s ability. Thus, even though 
subjects who failed attributed their failure 


failure 
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equally to internal and external factors, the 
bulk of the internal attribution was to a 
cause over which they had no influence. 

Viewing the attributional results within this 
framework is not entirely satisfactory, how- 
ever, because it is not clear that the subject 
gains by attributing failure to lack of ability 
as opposed to lack of effort. Why is it not as 
detrimental to one’s ego to attribute failure 
to lack of ability as to lack of effort? 

We suspect that causal attributions are 
affected by the individual’s attention to a 
mini-max principle, which takes the form of 
a desire to maximize present and future posi- 
tive outcomes and minimize present and fu- 
ture negative outcomes. We further propose 
that the importance to the individual of a 
successful outcome on a given task directly 
affects the manifestation of the mini-max 
orientation, 

If a successful performance on a given task 
is not important to the individual, then it is 
of little benefit to attribute success to one’s 
ability. It may be more rewarding to attribute 
success to effort, since we have all been taught 
that one should try, regardless of the nature 
of the task. Failure at a similar task may most 
easily be attributed to lack of ability. The 
person feels he has tried his best, but he 
simply is not very good at this type of 
activity. 

‘However, if a successful performance on a 
given task is of utmost importance to the 
individual (for example, if performance on a 
particular task were thought to be an ex- 
cellent predictor of future success in graduate 
school), the mini-max principle might lead 
to a different set of causal ascriptions. An 
individual could maximize his potential future 
outcomes by attributing a successful perform- 
ance to ability, implying that he will perform 
just as adequately in the future. An unsuc- 
cessful performance, on the other hand, could 
be attributed to lack of effort, suggesting that 
on this very important task the individual has 
the capacity to improve his performance. 

Subjects in the present two experiments 
saw the tasks as meaningful, and while they 
obviously tried to do well, success or failure 
probably had little overall importance to the 
majority of the subjects. The analysis above 
is consistent with our findings, for internal 


attribution, that subjects attributed success 
primarily to effort and failure almost entirely 
to lack of ability. 
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Functional Measurement Methodology Applied to a Subjective 
Probability Model of Cognitive Functioning 


Robert S. Wyer, Jr, 
University of Illinois at Urbana-Champaign 


Functional measurement procedures were used to test Wyer and Goldberg’s 
subjective probability model of cognitive organization. Subjects estimated the 
likelihood that a hypothetical person had an attribute (Pu) on the basis of 
information that directly affected their beliefs that persons in general have a 
particular gene (Pa) and that persons who do and do not have this gene pos- 
sess the attribute (Pua and Paw). Pa interacted significantly with both 
Puja and Puja; each of these interactions was concentrated in the bilinear 
component, supporting the assumption that each pair of beliefs has multipli- 
cative effects upon Ps. A small but significant interaction of Px, and Puja 
was also detected, contrary to implications of the model. Although the model 
provided a good quantitative description of the relations among the beliefs 
involved without the necessity of introducing ad hoc curve-fitting parameters, 
small but significant discrepancies from prediction were detected. These dis- 
crepancies suggested that Pa and Pas were weighted appropriately, but that 
Prw should receive a weight inversely proportional to its magnitude. In sum, 
the results were interpreted as support for the general formulation proposed. 


« 


The subjective probability model of cogni- 


tive organization and information integra- 1974a). However, adequate statistical tests of ri 
tion, proposed initially by McGuire (1960) the fit of the model have not been made. Nor 
and since refined and extended by Wyer and has it been demonstrated that the various 
Goldberg (1970), has been used to describe beliefs comprising the model combine sub- 
the relations among beliefs about both real jectively in the general manner assumed, 
objects and events (Rosen & Wyer, 1972; The latter question may be investigated 
Wyer, 1972; Wyer & Goldberg, 1970) and through the use of procedures Suggested by 
fictitious objects formed on the basis of in- functional measurement theory (Anderson, N 
formation presented in the experiment (Wyer, 1970, 1971), Specifically, Wyer and Goldberg 
1970, 1973a, 1973b). The effects of change assume that 
in one belief upon other related cognitions as as nae hea 
have also been predicted, Good quantitative Po = PaPoaya + (1 — Pa) Pua, (1) 
descriptions of these relations, and of the where P4 and Py are the subjective proba- 
effect of change in one belief upon other re- bilities (beliefs) that statements A and B are 
lated cognitions, can often be generated by true, respectively, and Py, and Prya are the 
the model without the necessity of introducing beliefs that B is true if A is and is not true. 
ad hoe curve-fitting procedures. If valid, the Suppose P4, Pp,4, and Prya are manipulated 
model has broad and important implications experimentally and that subjects are asked to 
at estimate the likelihood that B is true (Pp) at 
This research was supported by National Science each combination of these variables. If Equa- 
Foundation Grant GS-29241, Appreciation = ex- tion 1 is valid and an analysis of variance is 
nded to Marti sii in collectin 3 : 
tio ihe suo al tet H abe Deomed on Py as a function of Pa Poy mg 
suggestion of the stimulus domain used in preparing B/an the tollowing should occur: 
materials for the experiment, and to Norman H. 1. Px should be a significant Positive func- 
Bee ce ey helpful comments on an tion of both Prya and Payas, but should not 
Requests for reprints should be sent to Robert S, Mecessarily be related to Py. 
Wyer, Jr., Department of Psychology, University of 2. The relation between Py and Pps at 
Illinois, Champaign, Illinois 61820. each level of P4 should differ by a constant 
94 q 


for attitude and belief processes (cf. Wyer, 


SUBJECTIVE PROBABILITY AND COGNITIVE FUNCTIONING 95 


slope (a positive function of P4). Statistically, 
this means that the interaction of P4 and 
Py a should be significant; however, this in- 
teraction should be concentrated in a single 
degree of freedom corresponding to the 
linear X linear (bilinear) component, and the 
residual should be nonsignificant (for an 
elaboration of the basis for this prediction, 
see Anderson, 1970), 

3. The relation between Py and Py). at 
each level of Pa should also differ by a con- 
stant slope (in this case, an inverse function 
of P4). Statistically, the interaction of P4 and 
Ppa should be significant but also should be 
concentrated in the bilinear component with 
the residual nonsignificant. 

4. The interaction of Ppya and Prya, and 
the triple interaction of these conditionals 
and P4, should be nonsignificant. 


The present experiment tested these hy- 
potheses using a procedure in which informa- 
tion bearing upon Pa, Prya, and Pry was 
manipulated independently, and subjects’ esti- 
mates of Pn were analyzed as a function of 
these probabilities. In this regard, previous 
tests of the accuracy of Equation 1 in de- 
scribing the effects of change in Pa upon re- 
lated beliefs (Pn) could often be criticized on 
the grounds that the information presented 
had direct effects upon Pn which were not 
mediated by changes in Pa per se. The pro- 
cedure used in the present study was pot 
sensitive to this criticism. 


METHOD 


Overview 


Subjects estimated the likelihood that a particular 
person possessed a hypothetical attribute (Pn) on the 
basis of information about the likelihood that per- 
sons in general possessed a gene (P4) and the likeli- 
hood that persons who possessed the gene did and 
did not possess the attribute (Prix and Pra, re- 
spectively). Twenty-seven sets of information were 
presented which represented all combinations of three 
levels of Pa (low, moderate, and high), three similar 
levels of Paya and three levels of Paya. The levels of 
these probabilities were conveyed through verbal 
descriptions in a manner to be described. After esti- 
mating Pa, subjects also made numerical estimates 
of Ps, Puja, and Paya. The sets of information were 
presented in 27 different permutations of a given 
random sequence; pooled over orderings, the mean 
serial position of each set was identical. 


Subjects 

Fifty-four introductory psychology students, mostly 
freshmen, participated to fulfill a course require- 
ment. Subjects had typically had no formal training 
in probability or statistics. Two subjects received 
information in each of the 27 sequences described 
above. 


Normative Data 


It seemed inadvisable to ask subjects to estimate 
Pn on the basis of numerical values of Pa, Poy, and 
Pras, since this procedure might predispose them to 
develop ad hoc arithmetic rules for combining these 
quantities that they might not normally use in 
organizing their beliefs, Therefore, these probabili- 
ties were manipulated through verbal labels convey- 
ing different frequencies of occurrence, To select these 
labels, 17 pilot subjects (not used in the main 
experiment) were presented eight items of the form 
“Persons have characteristic X, How likely is 
it that a particular person, Bob, has X?”, where the 
missing word was one of eight adverbs to be de- 
scribed, In each case, subjects reported their esti- 
mates using a number from 0 (not at all likely) to 
10 (extremely likely). These estimates were divided 
by 10 convert them to units of subjective proba- 
bility, The adverbs and the mean probability esti- 
mates based upon them were the following: nearly 
always (81), usually (.81), very often (.81), fre- 
quently (75), often (.65), sometimes (44), seldom 
(21), and rarely (.16), Based upon these data, 
usually, sometimes, and rarely were selected to 
represent high, moderate, and low levels of each com- 
ponent probability to be manipulated: 


Preparation and Presentation of Materials 


Twenty-seven sets of three statements were con. 
structed, each corresponding to a different gene- 
attribute pair and to a different combination of 
Pa, Puja, and Paar. Genes were designated by lower- 
case letters (a through z and aa) and the attributes 
paired with them by the corresponding capitals (A 
through Z and AA). The statements in cach set 
were of the same general form: 


Persons usually (sometimes, rarely) have gene x. 


Persons who have gene x usually (sometimes, 
rarely) have attribute X. 


Persons who do not have x usually (sometimes, 
rarely) have attribute X. 


Each set of statements was typed on a separate 
page of an answer booklet, Subjects were told that 
the purpose of the experiment was to find out how 
persons estimate the likelihood of different events on 
the basis of verbal information about them, They 
were told to read the three statements at the top 
of each page and then to estimate the likelihood that 
a particular person (designated by a different name 
when presenting each set of statements) had the 
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TABLE 1 


MEAN Estimates OF Pa, Pyya, AND P pia AS A 
FUNCTION OF EXPERIMENTAL 
MANIPULATIONS OF 


Estimates of |———— - — = 


| Low | Moderate High 
Pa | 168 Al7 | 761 
Pua 186 477. | 740 
Ppa’ 190 


481 | :733 


attribute described in the set of statements (Pa), 
had the gene mentioned in the set, (Px), had the 
attribute if he had the gene (Ps/s), and had the 
attribute if he did not have the gene (Psa). These 
estimates, recorded along an 11-point scale from 0 
(not at all likely) to 10 (extremely likely), were 
divided by 10 to convert them to units of subjective 
probability. 


RESULTS 


Two sets of analyses were performed. One 
determined the extent to which the various 
beliefs comprising Equation 1 combined in 
the manner implied by this equation using 
functional measurement procedures. The other 
investigated the quantitative accuracy of 
Equation 1 in describing the relations among 
these beliefs. 


Functional Measurement Analyses 


Effects upon component probabilities. Each 
experimental manipulation had the intended 
effect as shown in Table 1. While the manipu- 
lations of conditional probabilities did not 


TABLE 2 


MEAN Pg AS A FUNCTION OF MANIPULATED LEVELS 
oF Pa, Pasa, AND Ppjar 
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significantly affect estimates of Py, F 
the manipulation designed to increase P4 also 


increased both Pra, from .446 to .484, F 


(2,104) = 9.72, p < .01, and Prya’, from .463 
to .480, F (2,104) = 3.24, p < 01. 
latter effects were not considered large enough 
to affect substantially the interpretation of 
analyses pertaining to Pp. 

Analyses of Py. Mean obtained estimates of 
Py as a function of P4, Ppa, and Prya are 
shown in Table 2. Analyses of variance of 
these data yielded the two predicted main 
effects of Prya and Payan p < .001, and the 
two predicted interactions of Py with Prsa, 
F (4,212) = 49.80, p < .001, and Pa with 
Ppa, F (4,212) = 16.65, p < .001. These 
effects are shown graphically in Figure 1. 

Tf P4 and Pz,/4 combine multiplicatively in 
the manner implied by Equation 1, the inter 
action of these variables should be concen 
trated in the bilinear component, and the 
residual interaction should be zero, To test 
this hypothesis, the sum of squares pertain- 
ing to this interaction was partitioned into 
the bilinear component, SS = 5.536, and the 
residual, SS = .0875. The first component was 
significant, F (1,212) = 65.37, p< .001, 
MS, = .0282, but not the second, F (3,212) = 
1.03, p > .25.4 

Similarly, Equation 1 implies that Py and 
Pra should have multiplicative effects, Par- 
titioning of this interaction revealed that the 
bilinear component was significant, SS = 
2.098, F (1,212) = 63.41, p < .001, MS, = 
-03310, but not the residual, SS = 1068, F 
(3,212) = 1.08, p > .25, 

In addition, two unexpected effects—the main 
effect of P4, F(2, 106) = 19.91, p < .01, and 
the interaction of Pp;4 with Ppyar, F(4, 212) 


These 


Low Mo i 
Pa Paja Paa 

High 

High Pga -668 -722 804 

Moderate P g;4 496 -551 -611 

Low Paya 176 328 -387 
Moderate 

High Pas 493 -589 -730 

Moderate Ppa 391 -507 -596 

Low Ppa -198 .409 -491 
Low 

High Ppa 274 472 -676 

Moderate Pp) 4 285 487 659 

Low Pga 214 451 -698 


= 2.82, p< .05—reached significance. The 
effect of P4 is theoretically zero only if the 


1 Strictly speaking, the breakdown of this inter- 
action into its bilinear and residual components 
should be accompanied by a corresponding break- 
down of the error term. Since this must be done on 
a subject-by-subject basis, it was not performed in 
the present study. Althcugh the error term used to 
test the significance of the two components may 
therefore be somewhat biased, the magnitude of this 
bias is probably not sufficiently great to affect the 
conclusions drawn from the analyses (N.H. Anderson, 
personal communication, 1973). 
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a 
Booty Pay INTERACTION 


High Pa 
Moderate Pa 


low Pa 


low Modercte High 


C. PyxPayy INTERACTION 
low Pa 
fay Moderate Pa 


N High Pa 


Low Paa Moderate Pam High Paja 


D Poa”Paw INTERACTION 


High Pam 
Moderate Pam 
Low Pain 


T T T 


Low Paa Moderate Paw: High Paw 


Bia 


EAN T 


Low Paw Moderate Pay High Paya 


Ficure 1. Main effects and interactions pertaining to Ps as a function of 
Pa, Paya, and Paya’. 


range of subjective values assigned to low, 
moderate, and high Py,s are identical in mag- 
nitude to those assigned to corresponding 
levels of Px’. Table 1 indicates that these 
values did in fact differ slightly; whether 
these differences are sufficient to produce the 
small positive effect of Pa (see Figure la) 
is unclear. 

The interaction of the two conditional prob- 
abilities is more obviously inconsistent with 
Equation 1, which implies that these compo- 
nents have completely independent effects. 
However, this interaction, while significant, is 
small. Moreover, as Figure 1d shows, it ap- 
pears attributable primarily to a single devi- 
ant point (when both conditional probabilities 
were low). Possible explanations of this 
interaction are discussed presently. 


Quantitative Accuracy 


Predicted values of Pp at each combination 
of experimental variables were calculated for 
each subject separately by combining his esti- 
mates of Pa, Paya; and Ppa’ in the manner 
described by Equation 1. Obtained estimates 
of Pp, averaged over subjects for each combi- 
nation of these variables, are plotted as a 


function of the mean predicted values in Fig- 
ure 2. The correlation between mean predicted 
and mean obtained values was .972, n = 27, 
p < .001. More important, the standard error 
of the difference between mean predicted and 
mean obtained values * was .0479, or less than 
half a unit along the probability scale used 
to record judgments. 

A statistical indication of the accuracy of 
the model in describing the relation between 
mean predicted and mean obtained estimates 
of Pp can be obtained through the use of the 
“jackknifing” procedure developed by Mostel- 
ler and Tukey (1968, pp. 133-160). If the 
model is accurate, the best fitting regression 
function relating predicted to obtained values 
of Pp should have a slope of 1 and an inter- 
cept of 0. On the basis of jackknifing, neither 
the estimated slope (.890) nor the estimated 


2 This error is estimated from the equation 


ee 
ee ape ae | 


where O; is the mean obtained value of Pa for the 
i™ of n combinations of Pa, Paya, and Prjar, and Py 
is the mean predicted value. 
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Pa Faun +- Pa Pauw’ 


Ficure 2. Mean obtained estimates of Ps as a func- 
tion of predicted values based upon Equation 1. 


intercept (.071) of the best fitting function 
describing the grouped data in Figure 2 dif- 
fered significantly from its theoretical value, 
in each case, £(26) < 1.00, p > .25.° 
Systematic discrepancies between predicted 
and obtained values, and the extent to which 
the accuracy of Equation 1 depends upon the 
value of component probabilities comprising 
the predictor, may be clarified by introducing 
obtained versus predicted as a “dummy” vari- 
able in the analysis of variance of Pp as a 
function of P4, Prya, and Py;a:. If errors in 
predicting Py from Equation 1 are unsystem- 
atic, all main effects and interactions involv- 
ing obtained/predicted should be nonsignifi- 
cant. All reliable effects detected in the 
analysis of obtained values alone again reached 
significance except for the interaction of Pp/4 
and Prya. Predicted values of Pz, M = 411, 


* An alternative procedure would be to perform a 
regression analysis upon data obtained from each 
subject separately and determine the extent to which 
the mean slope and mean intercept parameters devi- 
ated from their theoretical values. However, since 
the data obtained from individual subjects were ex- 
pected to be less reliable than grouped data, an 
analysis based upon grouped data was expected to 
Provide a more sensitive test of the alternative 
hypothesis that the difference between theoretical 
and obtained values was significant. 


slightly but significantly underestimated ob- 
tained values, M = .490, F(1, 53) = 8.01, 
p <.01. However, the nature of this dif- 
ference was contingent upon P4, F(2, 106 = 
11.07, < .01, and upon the combination 
of Py and Prv, F(4, 212) = 7.05, p < .01. 
Data relevant to this latter interaction, de- 
scribed in Table 3, indicate that when Py,/,: 
was low, the model underestimated Pp by 
more as P4 decreased; however, when Pp/4 
was high, the model underestimated P} by 
less (or overestimated it more) as P4 de- 
creased. These discrepancies could be reduced 
if Prya» were assigned a weight that was in- 
versely proportional to its scale value, This 
would increase the contribution of the second 
component of Equation 1, 


(1 a P4) Paya, 


when Px 4 is low, and reduce its contribu- 
tion when Pp,4 is high; moreover, the effect 
of this weighting would be most apparent 
when P4 is low. Note that this differen- 
tial weighting would apply only to Pazar; no 
interactions involving obtained/predicted and 
Ppa were significant, F < 1.78. 


Discussion 


The data reported here are generally en- 
couraging for Wyer and Goldberg’s (1970) 
subjective probability formulation of cogni- 
tive organization. Beliefs in A and in the 
contingency of B’s validity upon A combined 
multiplicatively to affect beliefs in B in the 
manner implied by Equation 1. Moreover, 


TABLE 3 


MEAN OBTAINED AND PREDICTED ESTIMATES OF Ps 
AS A FUNCTION OF P4 AND Ppisa 


Low Mode: Hi 
meee oe 
Low 
Obtained -263 452 047 
Predicted | .220 459 -704 
Difference | .043 —.007 —.054 
Moderate 
Obtained -360 -502 -606 
Predicted 319 476 -610 
Difference -041 026 | —.004 
High 
Obtained 447 534 -601 
Predicted 435 496 +522 
Difference -012 -038 079 
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this equation provided a reasonably good 
quantitative fit of the relations among these 
beliefs. Systematic discrepancies from predic- 
tions based upon Equation 1 were in fact 
detected but were generally small in magni- 
tude. These discrepancies could be due to two 
factors. First, they could indicate that beliefs, 
defined as subjective probabilities, do not 
combine in exactly the same manner as objec- 
tive probabilities and that differential weight- 
ing of Ppys (but not Pp/a) on the basis of 
its scale value is necessary. Second, they could 
indicate that subjects’ ratings of Pp/4 (but 
not Py,4) differ from the operative scale 
values of these quantities (that is, from the 
functional values used subjectively by sub- 
jects in arriving at their estimates of Pz). In 
either event, however, the fit of Equation 1 
is sufficiently good as it stands that the in- 
crease in accuracy attained through revisions 
of either the weight or scale value of P4/r 
may not be great enough to offset the prac- 
tical advantages of generating quantitative 
descriptions of the relations among cognitions 
without recourse to curve-fitting or functional 
measurement procedures. 

The result most embarrassing to the model 
is of course the significant interaction of Pg/4 
and Py,4-. This interaction is clearly contrary 
to the implications of Equation 1. Figure 1d 
suggests that this interaction results from the 
fact that estimates of Pp were lower than ex- 
pected on the basis of additivity when both 
conditional probabilities were low and slightly 
higher than expected when both conditionals 
were high. These deviations, if reliable, 
could be attributable to the fact that 
subjects have greater confidence in their 
judgments, and thus make more extreme 
responses, when B is independent of A 
(Prya = Peya) and thus the information 
about A does not have to be considered. How- 
ever, the statistical significance of the inter- 
action is not high and could simply be due to 
minor nonlinearities in the response scale, 
There was no evidence of this interaction 
when predicted and obtained values were ana- 
lyzed in combination. (If the interaction is 
reliable, it either should apply to both ob- 
tained and predicted values or should be 
significantly contingent upon obtained /pre- 


dicted; however, neither was in fact the case.) 
Thus this interaction should not be regarded 
as a serious disconfirmation of the model 
pending replication. 

Considered in isolation, the analysis of vari- 
ance tests of Equation 1 are not completely 
unequivocal. The interactions obtained are in 
principle as consistent with a weighted aver- 
age model as with a multiplicative one. How- 
ever, an averaging model would require the 
assumption that the weight assigned to P4 
is an inverse function of its scale value when 
combined with Pga but is a direct function 
of its scale value when combined with Paya’. 
The psychological significance of this weight- 
ing is unclear. Moreover, a weighted average 
interpretation would be unable to account for 
the good quantitative fit of the multiplicative 
model. 

Two additional matters should be consid- 
ered in evaluating the implications of the data 
reported here. First, the postulation of Equa- 
tion 1 as a description of cognitive organiza- 
tion and social inference processes is based 
upon the general hypothesis that the relations 
among subjective probabilities conform to the 
laws of mathematical probability theory. It is 
also consistent with the syllogistic model pro- 
posed by McGuire (1960), which assumes 
that subjects apply logic in inferring the va- 
lidity of a conclusion (e.g., B) from the valid- 
ity of premises (“A” and “if A, then Bayh 
However, Equation 1 may alternatively be 
viewed as a weighted average model in which 
Ppa and Prya are two pieces of information 
upon which estimates of Pp are based, and 
P, is an empirical estimate of the relative 
weight attached to Prya. To this extent, the 
results of this study are consistent with 
Anderson’s (1971) theory of information 
integration as well as with the theoretical 
formulations of McGuire and Wyer. 

Second, it should be noted that Equation 1 
is less accurate in describing the relations 
among previously formed cognitions (Wyer, 
1972; Wyer & Goldberg, 1970) than in de- 
scribing the relations among beliefs about 
hypothetical objects and events (see also 
Wyer, 1970, 1973a, 1973b). However, there 
are theoretical reasons to expect that cogni- 
tive inconsistencies exist among previously 
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formed cognitions which have not recently 
been considered in combination (McGuire, 
1960, 1968). The critical question in such 
cases is whether, once beliefs which are theo- 
retically related in the manner implied by 
Equation 1 are made salient to a subject in 
close temporal proximity, they are reorganized 
in a manner that increases their consistency 
as defined by this equation. In fact, there is 
convincing evidence to support this hypothesis 
(Rosen & Wyer, 1972; Wyer, 1974b; Wyer 
& Goldberg, 1970). The continued considera- 
tion of Equation 1 as a model of cognitive 
organization and social inference processes 
(or, at least, a modification of it to account 
for the differential weighting of Prya) there- 
fore seems justified. 
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Determinants of Perceived Choice and the Relationship 
Between Perceived Choice and Expectancy 
About Feelings of Internal Control 


John H. Harvey and Ben Harris 
Vanderbilt University 


An experiment was conducted to test the hypotheses that both perceived choice 
and expectancy about feelings of internal control over own behavior in a 
future situation will be greater (a) when a decision involves positive options 
than when it involves negative options and (b) that perceived choice will be 
greater when there is a small difference in attractiveness of the outcomes of 
the options than when there is a large difference. The procedure involved 
asking male and female college students to make a selection from two types 
of visual stimulation. Subjects were led to believe that they would be exposed 
at a later time in the experiment to the type of stimulation they had selected. 
Information about the types of stimulation was varied in order to manipulate 
valence of the outcomes of the options and difference in attractiveness of 
the outcomes of the options. The results supported the hypothesis that both 
perceived choice and expectancy about feelings of internal control are greater 
when a decision involves positive options than when it involves negative 
options. The results also provided evidence for the predicted effect of difference 


in attractiveness upon perceived choice when the de 


ecision involved positive 


options but not when it involved negative options. 


A relatively new focus for research on 
decision-making processes concerns the condi- 
tions affecting feelings of choice in making a 
decision. Recent research has identifed a 
number of variables which influence per- 
ceived choice, and the present research was 
- designed to explore further the variables in- 
fluencing this perception. In particular, we 
were interested in the question of whether 
decision makers feel more freedom of choice 
when their decisions involve pleasant options 
than when their decisions involve unpleasant 
options. Although common-sense psychology 
would dictate an affirmative answer, there has 
been little research designed to examine sys- 
tematically this or related questions. 

The perception of choice has been consid- 
ered as an important variable in such major 
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in Chicago, May 1973. The authors wish to express 
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ments on an earlier version of this article. 
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social psychological approaches as dissonance 
theory (Brehm & Cohen, 1962; Zimbardo, 
1969), attribution theory (deCharms, 1968; 
Heider, 1958; Jones & Davis, 1965; Kelley, 
1967), and reactance theory (Brehm, 1966). 
Until recently, however, research has focused 
almost exclusively upon perceived choice as 
an independent variable. Recent studies by 
Harvey and Johnston (1973), Jellison and 
Harvey (1973), Harvey, Barnes, Sperry, and 
Harris (1974), and Harvey and Jellison 
(1974) have examined a number of variables 
which influence perceived choice. The princi- 
pal basis for a large part of this research was 
an analysis of perceived choice by Mills (J. 
Mills, personal communication, September 24, 
1970). 

In his analysis of perceived choice, Mills 
suggests that choice in taking an action is 
perceived to the extent that an alternative 
action is perceived to exist. Given one action, 
which is referred to as the preferred action, 
another action is perceived as an alternative 
if it satisfies two conditions. First, the ex- 
pected outcome of the other action must be 
perceived to be different in attractiveness 
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from the expected outcome of the preferred 
action. If it is not possible to differentiate 
between the outcome of the other action and 
the outcome of the preferred action, then the 
other action will not be perceived as an alter- 
native and perceived choice will be minimal. 
Second, there must be some likelihood that 
the other action will lead to a more desirable 
outcome than the preferred action. If there is 
no uncertainty about whether the preferred 
action will lead to the more desirable out- 
come, then the other action will not be per- 
ceived as an alternative and perceived choice 
will be minimal. 

Evidence supporting predictions based on 
Mills’s analysis is provided in studies by 
Harvey and Johnston (1973) and Jellison and 
Harvey (1973), The results of these studies 
show that perceived choice is greater (a) 
when there is a small difference in attractive- 
ness of the outcomes of the possible actions 
than when there is a large difference and (b) 
when a person is uncertain about the attrac- 
tiveness of each outcome than when he is 
certain. Both of these effects were interpreted 
in terms of uncertainty; for the former, it was 
assumed that uncertainty would be greater 
when there was a small difference than when 
there was a large difference. Further, in line 
with the assumption that there must be a dis- 
cernible difference between the options in 
order for choice to be perceived, Harvey and 
Johnston found some evidence that perceived 
choice will be less when there is no difference 
in attractiveness between outcomes of possible 
actions than when there is a small difference. 

In the present research, the investigation 
of determinants of perceived choice was ex- 
tended to include the variable of valence 
(positivity-negativity) of outcomes of possi- 
ble actions. Bramel (1969) hypothesized that 
a sense of subjective freedom is attributed 
more to people seeking positive incentives 
than to those seeking to avoid punishment. 
In this experiment, a slightly modified version 
of Bramel’s hypothesis was tested in an actor 
setting. It was hypothesized that when a 
person is faced with making a decision about 
which of a set of actions he will take, per- 
ceived choice will be greater when the selec- 
tion is to be made from positively valenced 
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actions than when it is to be made from nega- 
tively valenced actions. Based upon previous “® 
work (Harvey & Johnston, 1973; Jellison & 
Harvey, 1973), it was also hypothesized that 
perceived choice will be greater when there 
is a small difference in attractiveness of the 
outcomes of the possible actions than when 
there is a large difference in attractiveness. 

Another purpose of this experiment was to 
examine the relationship between perceived 
choice and expectancy about feelings of inter- 
nal control in a future situation. Perceived 
choice is frequently defined in terms of per- 
ception of control over own behavior (Kelley, 
1967) or the potentiality for such perceived 
control (Ruch & Zimbardo, 1971). There has, 
however, been little if any research aimed at 
examining the relationship between perceived 
choice and feelings of control. When an indi- 
vidual is making a decision about which of a 
set of actions he will take, the degree to 
which he thinks he will feel control over his 
behavior in the future situation (ie., in the ^ 
situation which results from his decision) 
should be directly related to his perception 
of choice in making the decision, In this con- 
nection, it should be noted that in his book 
Beyond Freedom and Dignity (1971), Skinner 
suggests that people often fail to recognize 
the environmental controls over their behav- 
ior when it is not under aversive reinforce- 
ment contingencies (which implies that they ~ 
feel personal control) and, significantly, that 
this lack of recognition is associated with a 
sense of freedom. In the present study, it was 
hypothesized that expectancy about feelings 
of internal control over own behavior in a 
future situation will be greater when the 
selection is to be made from positively val- 
enced actions than when it is to be made from 
negatively valenced actions, 


METHOD 
Overview 


In an experiment purportedly concerned with 
sensory processes, male and female college students 
were asked to select a type of visual stimulation 
to which they supposedly would be exposed at a 
later time in the experiment. Subjects were given 
information about two types of stimulation from 
which they could make a selection. The informa- 
tion was varied in order to manipulate valence of 
outcomes of the possible options (positive outcome 
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or negative outcome) and difference in attractiveness 
of outcomes of the options (small difference or large 
difference). After reading the descriptions of the 
types of stimulation and before making a selection, 
each subject was given a form assessing his feelings 
of choice in contemplating the decision and expect- 
ancy about feelings of internal control over own 
behavior. In all, there were four experimental 
conditions comprising a 2 X 2 design. 


Subjects 


The subjects were 80 male and female college 
students who participated in order to fulfill a re- 
quirement in general psychology. Subjects were run 
individually and were randomly assigned to condi- 
tions. There were 20 subjects in each of the condi- 
tions, and the sex ratio was approximately constant 
across conditions, 


Procedure 


At the beginning of each session, the subject was 
told that he was participating in a study concerned 
with “sensory apperception.” The experimenter indi- 
cated that the research was concerned with how 
people uniquely adapt to various stimuli and that 
visual stimuli were being used in this study. The 
subject was told that the study involved first a 
decision phase and then a testing phase, with the 
testing experience dependent on a decision that the 
subject would make for himself. He was told that 
the decision phase would involve the selection of a 
type of visual stimulation from two alternative 
types. The subject was told that he would be ex- 
posed to the type of stimulation which he selected 
from the two possibilities. After the importance of 
the selection was emphasized, the subject was pre- 
sented with written descriptions of two types of 
stimulation. At this time, he was explicitly asked not 
to make a final decision about which type he wanted 
to be exposed to until it was asked for at a later 
point in the procedure. 

For each type of stimulation, there was a one-half 
page description. The two descriptions were juxta- 
posed on a large page. Each description told about 
the type of stimulation (i.e., its name), the nature 
of the stimulation (i.e the nature of the visual 
stimulus to be used and the equipment which would 
be used to present it), and effects of the stimulation 
(i.e its physiological effects upon the subject). 
Each description contained terminology designed to 
make it exceedingly technical and abstruse, and the 
two descriptions were designed to be equally am- 
biguous.t Subjects in all of the conditions received 

The experimental manipulations were introduced 
via “normative information” which was presented 
on the page beside each description. The subject 
was told that this normative information referred to 
summary information from a pilot study which 
had been conducted at the same university some 
time in the past. This information was portrayed as 
indicating how pleasant a very large number of 
students had found each type of stimulation to be. 
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An average pleasantness rating was given, as well as 
a number supposedly referring to the number of 
students contributing to the rating (the number was 
225 for each rating). Also, the subject was provided 
with an example scale identical to the one used by 
the “pilot subjects.” It was explained that he could 
refer to this scale in order to get an idea about how 
pleasant the other students had found each type 
of stimulation to be. The example scale was com- 
posed of 12 points, with low numbers indicating 
extreme unpleasantness and high numbers indicating 
extreme pleasantness. In designing this study, it was 
expected that subjects would rely almost completely 
upon this normative information in their evaluation 
of the two types of stimulation.* 

In the positive outcome conditions, both types of 
stimulation were given very high average ratings. 
In the negative outcome conditions, both types of 
stimulation were given very low average ratings. In 
the small difference conditions, the average rating of | 
one type of stimulation was .03 higher than that of 
the other. In the large difference conditions, the 
average rating of one type of stimulation was 2.51 
higher than that of the other, The descriptions of the 
types of stimulation and their accompanying ratings 
were counterbalanced so that within each condition 
each rating for that condition appeared with each 
description of a type of stimulation an equal number 
of times. The ratings for the alternatives for the 
various conditions are summarized in Table 1. 

After the subject had been given seven minutes 
to evaluate the types of stimulation, the experi- 
menter interrupted him and said that before he 
made his selection, there were some questions on a 
form which he should answer. The experimenter 
said that his answers to the questions would facili- 
tate the investigator’s interpretation of the data 
collected in the testing phase of the study. The sub- 
ject was then given a form which contained questions 
asking (a) about his feeling of choice in deciding 
upon which type of stimulation to select, (b) his 
expectation about the extent to which he would feel 
that he had internal (personal) control over his 
own behavior in whichever stimulus situation he 
ultimately selected, (c) his expectation concerning 
enjoyment of the stimulation encountered in the 
future situation, and (d) his perception of similarity 
in attractiveness of the types of stimulation under 
consideration, Each of the questions was answered 
on a 21-point scale with high numbers indicating 
large amounts. 

After the subject had completed the form, the 
session was ended; but first he was queried for 
suspicion concerning the alleged testing phase of the 
study and other aspects of the procedure. None of 


1 Pilot data indicated that these descriptions are 
approximately equal in ambiguity. 
the same pair of descriptions. 

2 Information obtained in debriefing indicated that 
all of the subjects had relied almost exclusively 
upon the normative information in their assessment 
of the types of stimulation, 


TABLE 1 


RATINGS OF PLEASANTNESS ACCOMPANYING 
DESCRIPTIONS FOR THE ALTERNATIVE 
TYPES OF STIMULATION FOR THE 
EXPERIMENTAL CONDITIONS 


Difference in attractiveness of outcomes 


Valence of outcomes 


Small Large 
Positive 10.91 vs. 10.88 | 10.91 vs. 8.40 
Negative 2.72 vs. 2.69 2.72 vs. .21 


Note. The higher the rating, the greater indicated pleasantness 
of the stimulus, 


the subjects indicated any doubt that the testing 
phase would occur, nor did any express suspicion 
about other aspects of the study. Before he left, 
the subject was given a thorough debriefing. 


RESULTS 


Effectiveness of the Manipulations 


A check on the manipulation of valence of 
the outcomes of the possible actions is pro- 
vided by subjects’ ratings of expected enjoy- 
ment of the stimulation in the future situa- 
tion, As anticipated, subjects in the positive 
outcome conditions indicated that they 
thought they would feel more enjoyment, ¥ = 
13.8, than did subjects in the negative out- 
come conditions, X = 6.9. An analysis of 
variance for the measure of enjoyment for 
the four conditions showed that the main 
effect of valence was highly significant, F 
(1,76) = 58.8, p < .001. The analysis of vari- 
ance revealed no other significant effects. 
These results provide strong evidence that 
the valence manipulation was effective. 

Evidence about the effectiveness of the 
manipulation of difference in attractiveness of 
the outcomes is provided by the subjects’ 
ratings of the similarity in attractiveness of 
the two types of stimulation under considera- 
tion. As was expected, subjects in the small 
difference conditions perceived greater simi- 
larity, ¥ = 15.7, than did subjects in the large 
difference conditions, X = 12.3. An analysis 
of variance for the measure of similarity for 
the experimental conditions revealed a signifi- 
cant main effect of difference in attractiveness, 
F (1,76) = 14.9, p< .01. There were no 
other significant effects for the measure of 
similarity, These results indicate that the 
difference in attractiveness manipulation was 
effective. 
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Ratings of Perceived Choice 


It was hypothesized that perceived choice 
would be greater when a selection was to be 
made from positively valenced actions than 
when it was to be made from negatively 
valenced actions and under conditions of a 
small difference in attractiveness of the out- 
comes of the actions than under conditions of 
a large difference. The means for the measure 
of perceived choice are presented in Table 2. 
It can be seen from Table 2 that the means 
are higher in the positive outcome conditions 
than in the negative outcome conditions. 
Table 2 also shows that the difference in 
attractiveness variable had a sizable effect in 
the predicted direction in the positive outcome 
conditions but had little effect in the negative 
outcome conditions. An analysis of variance 
for the four conditions showed that the main 
effect of valence was highly significant, F 
(1,76) = 44.1, p < .001. These results pro- 
vide strong evidence that perceived choice 
was greater in the positive outcome conditions 
than in the negative outcome conditions. The 
analysis of variance also revealed that the 
main effect of difference in attractiveness did 
not approach significance. However, this anal- 
ysis of variance did show a significant inter- 
action between valence of outcomes and dif- 
ference in attractiveness of outcomes, F 
(1,76) = 5.8, p < .05. Individual comparisons 
among means relevant to this interaction 
showed that the mean for the positive out- 
come/small difference condition was signifi- 
cantly greater than the mean for the positive 
outcome/large difference condition, p < .05, 
two-tailed, but that the mean for the negative 
outcome/small difference condition was not 
significantly different from the mean for the 
negative outcome /large difference condition, 
These results indicate that the predicted effect 
of difference in attractiveness of outcomes 


TABLE 2 
MEANS FOR THE MEASURE OF PERCEIVED CHOICE FOR 
THE EXPERIMENTAL CONDITIONS 


Difference in attractiveness of outcomes 
Valence of outcomes 


Small Large 
Positive 15.3 
Negative 7.9 | oy 


Note, The high ò 
Hee igher the number, the greater the perceptiôn of 


DETERMINANTS OF 


upon perceived choice occurred in the positive 
outcome conditions but not in the negative 
outcome conditions. 


AA 
Ratings of Expectancy about Feelings of 
Internal Control over Own Behavior 


It was anticipated that subjects in the 
positive outcome conditions would expect to 
feel more internal control over their behavior 
in the future stimulus situation than would 
subjects in the negative outcome conditions. 
The means for the ratings of expectancy 
about feelings of internal control over own 
behavior are presented in Table 3. Inspection 
of Table 3 shows that the means are higher in 
the positive outcome conditions than in the 
negative outcome conditions. An analysis of 
variance for the four conditions showed that 
the main effect of valence was significant, F 
(1,76) = 7.3, p < 01. These results indicate 
that expectancy about feelings of internal 
control was greater in the positive outcome 
conditions than in the negative outcome con- 
ditions. There were no other significant effects 
for the analysis, although the interaction be- 
tween valence and difference in attractiveness 
approached significance, p < .10. It can be 
seen from Table 3 that the difference between 
the means for the small and large difference 
conditions is greater for positive outcomes 
than for negative outcomes. 

A comparison of the means in Table 2 to 
the means in Table 3 reveals that the results 
for the measures of perceived choice and 
expectancy about feelings of internal control 
over own behavior were similarly patterned. 
A correlational analysis showed that there was 
a significant positive correlation between the 
overall ratings of perceived choice and the 
overall ratings of expected feelings of internal 
control, r= 38, p < .01. The magnitude of 
the correlation within each condition was 
consistently similar to the overall correlation. 
These results provide support for the expected 
positive relationship between perceived choice 
and expectancy about feelings of internal 
control over own behavior. 


DiscussiION 


The results of this experiment provided 
good support for the hypothesis that perceived 
choice would be greater when a selection is 
to be made from positively valenced actions 


PERCEIVED CHOICE 


TABLE 3 


MEANS FOR THE MEASURE OF EXPECTANCY ABOUT 
FEELINGS OF INTERNAL CONTROL OVER OWN 
BEHAVIOR FOR THE EXPERIMENTAL CONDITIONS 


] 
Difference in attractiveness of outcomes: 


Valence of outcomes 


Small Large 
Positive 12.6 9.4 
Negative 8.6 8.3 


'he higher the score, the greater the expectancy about 


feelings of internal control, 


than when it is to be made from negatively 
valenced actions. The results also revealed an 
unexpected interaction between valence of 
outcomes of possible actions and difference in 
attractiveness of outcomes. It was found that 
the predicted effect of difference in attractive- 
ness of outcomes occurred in the positive out- 
come conditions but not in the negative out- 
come conditions. In the positive outcome 
conditions, perceived choice was significantly 
greater in the small difference condition than 
in the large difference condition; whereas in 
the negative outcome conditions, there was 
little difference in perceived choice as a func- 
tion of difference in attractiveness. This find- 
ing for the positive outcome conditions is con- 
sistent with evidence in studies by Harvey 
and Johnston (1973) and Jellison and Harvey 
(1973) and is based upon the assumption that 
a small difference in attractiveness will en- 
hance uncertainty about which outcome is 
better. Although valence of outcomes was not 
manipulated in these previous studies, the 
choice options were nevertheless relatively 
positive in valence. 

The failure to find a difference in perceived 
choice for the difference in attractiveness 
variable in the negative outcome conditions 
suggests that this variable may have little 
effect upon perceived choice when a decision 
involves negative options. From a theoretical 
standpoint, this finding is rather puzzling. 
One possible explanation is that relative to a 
situation involving positive options, a situa- 
tion involving negative options might retard 
the perception of differences in attractiveness 
and, further, that this could account for the 
failure to find an effect in the latter situation. 
However, evidence against this possibility is 
provided by the analysis of the results for the 
check on the effectiveness of the difference in 


. i’ = 


_ 


T 
; 


jons M. Havey 


attractiveness warlable (Le, the data for rat- 
ings of the similarity of the options), ‘These 
results revealed only a significant main et 
indicating a successful manipulation of this 
variable 

It was anticipated that perceived choice 
and expectancy about feelings of internal com: 
trol over own behavior would be directly ne 
lated, ‘The data provided considerable support 
for this expectation. The pattern of ratingi 
for perceived choice was very similar to the 
pattern of ratings for expectancy about feel- 
ings of loternal control, and there wae a 
relatively strong positive correlation letwren 
the two sets of ratings. Further, as peedicted, 
the results showed that there was a significant 
tendency for positive outcome subjects to et 
pect to feel more internal control aver their 
behavior in the future situation than did nega- 
tive outcome subjects, Alo, the revalts for 
the ratings of expectancy shout feelings of 
internal control showed a marginal interaction 
between valence and difference in attractive 
news, This marginal interaction bs very similar 
to the interaction between these vatlables for 
the measure of perceived choice 

‘The present revearch has mme relationship 
to Lewin’s (1055) distinctions coocerning de- 
cislonal coniilcts characterised by either ap 
proach.approach or avoldance-avoidance. In 
this study, the situation Involving positive 
options corresponds roughly to the approach 
approach type of malic, ard the stastia 
levolving «negative «options ernati 
roughly to the avoilance avoidance typt oa 
oaii Lesin sapgested that the former 
type of combbiet be mre eunily rewberd in real. 
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grocers o withdrawal (rome the condllet sitas 
Yee The premet resalta sangeet thet prvvebly 
cuntiDeating cach an corer aistia ia 


Masha 
2 The Gedling of è prviithve reletiomnbip be 


gweewived choice sad rvzectamcy abem 
behavior 


i 

5 

E 

i 

z 
R 

ZisF 


H 
; 
i 


i 
ir 
g- 

if 
Hi 


ark 
l 
i 


‘PHI EE! HIH 


a NEE He 
Er itini Halini. 
an eina i A hi 


n li HE Miik aluli nt 
te dunlin p ii nji! F 


ity ree He i i i 
ij HR lii RBH nt “ial ai it 


Smith College 


Caria Hubemtris 


Opposite-Sex Encounter 


Attractiveness, Perceived Attitude Similarity, and 


Physical 
Taterpersonal Attraction in an 


shal eee tages ne 
Robert E. Kiek 
bit 


at He 4a neip 
ae a it Ai Hi 


onan an S 


w 


l 
5 


108 


similar or dissimilar, self-report measures of 
liking are influenced by both sets of cues. 
Attitudinal similarity was manipulated within 
the present study to assess the generality of 
these previous results to situations involving 
actual social interaction, but under condi- 
tions in which the stimulus person’s behavior 
is independent of the particular combination 
of attractiveness and similarity which she 
represents, Although Byrne, Ervin, and Lam- 
berth (1970) have previously addressed a 
similar issue, their study does not permit an 
assessment of the behavioral implications of 
attitudinal similarity or physical attraction 
per se in that the behavior of the stimulus 
person was not under the experimenters’ con- 
trol and may have covaried in important ways 
with these characteristics. 

It was expected, consistent with the large 
body of previous research on similarity 
(Byrne, 1971), that perceived high similarity 
to a stimulus person would result in greater 
self-report and nonverbal indicators of attrac- 
tion to that person than would low perceived 
similarity. While the present design is not, 
strictly speaking, an appropriate test of the 
relative potency of similarity and physical 
attractiveness, it was anticipated that the 
latter variable would have a stronger impact 
on our dependent measures than the former. 
This expectation derived from the point noted 
earlier that physical appearance information 
is immediately available to another in social 
interaction and does not depend upon memory 
for its continued availabili y. 

The dependent measures of central in- 
terest consist of a number of self-report 
indicators of interpersonal attraction, includ- 
ing Byrne’s attraction index and three non- 
verbal measures previously demonstrated to 
covary with attraction, that is, eye contact 
(Exline & Winters, 1965), smiling (Rosen- 
feld, 1966), and interaction distance (Mehra- 
bian, 1969), In addition, person perception 
measures similar to those employed by Dion, 
Berscheid, and Walster (1972) to determine 
the presence of a physical attractiveness 
stereotype were collected. It was expected 
that the systematic differences in impressions 
elicited by these latter investigators on the 
basis of photographs of attractive and un- 
attractive individuals would also be elicited 
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by actual stimulus persons varying on the 


physical attractiveness dimension. 
METHOD 


Overview 


Four experimental conditions were def by the 
manipulation of the physical attracti of a 
confederate and by a parallel manipul f per- 
ceived attitudinal similarity. Subjects p ted in 
only one cell of the 2X2 design ar con- 
federates in all four. The experimental 
divided into two distinct and, from | int of 
view of the subject, unrelated periods. During the 
first part of the study the confederate and subject 
interviewed one another concerning a number 
campus issues. In the second phase they partici 
together in a “short-term memory ta 
pose of which was to provide a less consi: 
text in which social interaction could take 
physical attractiveness of the confederate 
nipulated through the use of makeup and 
similarity by having the confederate and - 
port on their responses to items drawn from ! s 
(1971) attitude scale. 


was 


Subjects 

Individuals were recruited during sprir id sum- 
mer terms at Dartmouth College to serve in a “study 
of interviewlike encounters.” The 48 male subjects 
received either course credit or money for their 
participation. 
Confederates 


Five female age peers of the subjects were re- 
cruited to serve as confederates. The central criterion 
for selection was whether or not the facial attractive- 
ness of the person could successfully be controlled 
through the use of unobtrusive makeup and hair 
styling. Those potential confederates who were re- 
jected were primarily persons whose facial attractive 
ness could not be masked without the use 
able amounts of makeup. The confederate: 
informed regarding the purposes of the stu 
the information that we were interested i: 


plications of varying physical appearance: al 
interaction. They were also not informed ing 
the dependent measures or that they eing 
videotaped. The importance of maintair con- 
stant but natural pattern of behavior acros- bjects 
was emphasized during the training of con- 
federates. 

Procedure 


Prior to any contact with the confederate each 
subject responded to 15 attitudinal items selected 
from Byrne’s (1971) 56-item Survey of Attitudes 
questionnaire. These items require the respondent 
to indicate which of six alternatives most clo: 
corresponds with his or her position on a given issue 
(e.g., premarital sex relations). The subset of items 
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taken from the scale was chosen by the authors on 
the basis of presumed high relevance to undergradu- 
ates, 

The subject arrived at the experimental room first 
and was greeted by the experimenter, who intro- 
duced herself and asked the subject to take one of 
two seats placed facing one another at a distance of 
5.5 feet in a 18X14 foot room. The confederate 
appeared almost immediately, was introduced to the 
subject, and was asked to take the other chair. The 
experimenter identified herself as an undergraduate 
working on a study of interviewlike situations and 
explained that it “facilitates interaction in such 
situations to know something about the other per- 
son.” She then reminded them that they had both 
previously completed an attitude survey and handed 
them the survey form which they had completed. 
She suggested that, before the interview began, they 
could exchange information about themselves by 
reading the attitude items they had endorsed on the 
survey. The confederate started and alternated with 
the subject in reading the responses to the 15 atti- 
tudinal issues. This procedure resulted in the con- 
federate always giving her opinion on a given item 
first. In the high-similarity condition, the confeder- 
ate’s responses corresponded with those of the sub- 
ject on 13 of the 15 items whereas in the low- 
similarity condition correspondence was present for 
only 2 of the 15, At the conclusion of the reading 
of the opinion statements the confederate said, “We 
seem to agree a lot,” (high-similarity condition) or, 
“We seem to disagree a lot” (low-similarity condi- 
tion). The experimenter responded by saying “Yes, 
as matter of fact, you seem to be one of the most 
similar (dissimilar) pairs we've had.” 

The experimenter then reiterated her interest in 
interviewlike encounters and explained that each per- 
son would serve as both interviewer and interviewee. 
Since she wanted the interviews to be as standard as 
possible across pairs of subjects, she had prepared 
a standard set of questions (e.g, “How could the 
social life at the college be improved ?”) which she 
wanted each person to ask. The interviewer's task 
was to pose the question and to record on the inter- 
view form what he or she considered to be the cen- 
tral points made by the interviewee. The confederate 
was always “randomly” chosen to assume the role 
of interviewer first. As soon as the experimenter 
handed the interview forms to the confederate and 
subject, she left the room with the comment that 
she would be in the adjoining room and someone 
should come and get her when they were finished. 

From the adjoining room the experimenter moni- 
tored the video and audio recording equipment. 
Neither the subjects nor the confederates were aware 
they were being recorded via hidden television 
cameras and microphones but were informed of 
this fact at the conclusion of the study. 

At the conclusion of the interview period the sub- 
ject and confederate were placed in separate rooms 
and were asked to answer a number of questions 
concerning the interview situation and the other 
person, When these were finished, the experimenter 


109 


asked the subject to go to the same room in which 
the confederate was filling out the questionnaires. 
When the subject arrived at this room, the con- 
federate, who had placed her questionnaire face 
down in front of her, was seated at the side and 
toward the end of a long table. Six other chairs were 
placed around the table at equal distances, and 
several were along the walls of the large (24 X 36 
foot) room. The experimenter explained that the 
second part of the study was unrelated to the first 
and was a very brief experiment in memory which 
she was running as a favor to a professor in the 
department. A list of 10 words was to be shown to 
the subjects twice, followed by a 10-minute interval, 
The experimenter stressed that during this 10-minute 
period the subjects were free to talk or move about 
the room but that they should not actively rehearse 
the word list. After starting the projector on an 
automatic sequence, she left the room with the ex- 
planation that she would be back in 10 minutes to 
test their recall. From an adjoining room she ob- 
served the interaction between the subject and con- 
federate without their knowledge. At the end of 10 
minutes she returned to the room, asked both per- 
sons to write down as many of the items from the 
word list as they could recall, and explained that she 
would like them to complete one final questionnaire, 
The Interpersonal Judgment Scale (Bryne, 1971) re- 
quired them to rate the other interactant on such 
dimensions as morality, adjustment, sexual attrac- 
tiveness, and intelligence. 

The primary purpose of the seccnd phase of the 
study was to provide the subjects with a context 
in which more spontaneous interaction with the con- 
federate would be possible. The confederate was 
instructed to not initiate interaction but to respond 
naturally to conversation initiated with her. Chair 
placement of the confederate was constant across 
subjects in order that seating distance could be 
assessed as a dependent measure, 

Between two and four weeks after the initial in- 
teraction, the experimenter contacted the subject in 
his dormitory to complete a postexperimental ques- 
tionnaire and to debrief the subject concerning the 
nature of the study. The questionnaire focused on 
the subject’s recall of his partner (the confederate) 
and probed his impressions of the study and any 
suspicions he might have had concerning it. At this 
time it was revealed to him that he had been video- 
taped during the interview, and his permission was 
requested to retain that record for analysis purposes." 


1No subject refused to have his video records 
made available for analysis, and only four expressed 
suspicion regarding the experimental procedures. In 
two of the cases suspicion focused on the similarity 
manipulation and in two on the attractiveness ma- 
nipulation. For the latter, the apparent source of 
suspicion was that subjects who had been paired 
with a confederate in the unattractive condition had 
later seen her on campus in her more typical, and 
hence more attractive, physical state. 
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RESULTS 
Manipulation checks 


The subjects rated the confederates on per- 
ceived attitudinal similarity and physical at- 
tractiveness, both during the experimental 
session and two to four weeks later as part of 
the follow-up interview. These ratings were 
done on 7-point scales, with the similarity 
dimension anchored by very similar and not 
at all similar and the attractiveness dimension 
anchored by extremely attractive physically 
and extremely unattractive physically. Two- 
way analyses of variance of these ratings 
revealed the manipulations to be highly effec- 
tive in determining the perception of simi- 
larity, F (1,40) = 54.4, p< .00001, and 
attractiveness, F (1,40) = 36.2, p < .00001, 
at the time of the interaction and also at the 
time of the follow-up interview (similarity: 
F (1,37) = 42.9, p < .00001; attractiveness: 
F (1,37) = 16.9, p < .0005. In that three 
subjects run during the summer term could 
not be contacted in time for the follow-up 
interviews, the number is reduced to 41 for 
the latter two analyses. There are no signifi- 
cant interactions in these analyses, which 
indicates that the manipulation of the con- 
federate on one dimension did not affect the 
subjects’ perception of her on the other. 

As a further check on the effectiveness of 
the modification of the confederates’ appear- 
ance, standardized color slides were taken of 
the confederates each time they were “made 
up” for the study. Two photographs of each 
confederate (one from the attractive and one 
from the unattractive condition) were ran- 
domly selected for presentation to indepen- 
dent male judges. Each of two groups of 12 
judges saw one photograph of each confeder- 
ate and rated her physical attractiveness on 
a 7-point scale. The confederate photographs 
were embedded in a longer sequence of photo- 
graphs of females of approximately the same 
age. For all confederates the high-attractive 
condition photograph resulted in significantly 
higher ratings of physical attractiveness than 
did the low-attractive condition photograph, 
the ¢ ratios ranging between £ (11) = 8.8, 
p < .00001 and ¢ (11) = 3.3, p < .005. 

An important aspect of the intended ma- 
nipulation is obviously the standardization of 
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the confederates’ behavior across the experi- 
mental conditions. In training the confeder- 
ates the importance of a natural and consis- 
tent behavior pattern had been emphasized, 
and they were kept blind regarding the ex- 
perimental hypotheses as well as the specific 
dependent measures under investigation. 
Without her awareness each confederate was 
videotaped during the conversational en- 
counter, and her gaze and smiling behavior 
were compared across conditions. Because of 
scheduling difficulties, the participation of the 
confederates could not be precisely balanced 
across the design. Four of the confederates 
each appeared at least twice and not more 
than three times in each of the four experi- 
mental conditions. The fifth confederate par- 
ticipated once in three of the conditions and 
twice in the fourth, Since this slightly unequal 
distribution made it impossible to treat con- 
federates separately in the analyses, each 
appearance of a confederate was entered as 
an independent observation, and two-way 
analyses of variance were conducted with the 
data. Gaze and smiling behavior were chosen 
as measures of confederate constancy because 
they have been demonstrated to be important 
nonverbal behaviors in the modulation of 
face-to-face interaction (e.g., Kendon, 1967), 
and they are also the nonverbal measures, of 
primary concern vis-a-vis the subjects. Thus, 
if the confederates were found to differ across 
experimental conditions in regard to the fre- 
quencies of these behaviors, such differences 
could have potential confounding implications 
for any observed differences in the subjects’ 
behavior. 

Scoring of the dependent measures was 
done by one coder for all of the confederate 
videotapes and for a random subset by a 
second coder. Intercoder reliabilities were 
computed for the overlapping sample and ex- 
ceeded .90 for all dependent measures. It 
should be noted, of course, that the coders 
could not remain blind regarding whether a 
confederate was in the attractive or unattrac- 
tive condition, and thus the possibility of 
coder bias cannot be completely excluded. 
Since the interaction consisted of two distinct 
parts during which the confederate was pri- 
marily either a listener-interviewer (Part 1) 
or a speaker (Part 2), the analyses were con- 
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ducted for each part separately as well as for 
the total interaction. No significant main 
effects or interactions were present for per- 
centage of time smiling, frequency of smiling 
per unit of time, percentage of gaze time, or 
for the average duration of gaze. 


Self-report measures 


Subsequent to the interviewlike interaction, 
subjects completed an impression scale made 
up of 10 polar adjectives. These were a subset 
of the items previously employed by Dion et 
al. (1972) to establish the presence of a 

. Physical attractiveness stereotype in young 
adults. Ratings were summed for those five 
trait items which had been identified by the 
previous investigators as traits associated with 
a socially desirable person (social, interesting, 
self-assertive, sincere, and warm). An analysis 
of variance revealed no differences in the rat- 
ings received by attractive and unattractive 
confederates though there was a tendency for 
the attractive confederate to be perceived as 
a more socially desirable person, p < .12. 
Similarity had no effect on the ratings, and 
there was no evidence of an interaction be- 
tween these variables. 

The Byrne (1971) 10-item Interpersonal 
Judgment Scale was completed by all subjects 
at the end of the short-term memory portion 
of the experimental session. The key items are 
ratings of liking and desirability as a work 
partner, which Byrne and Nelson (1965) and 
others have combined into a general attrac- 
tion index. The data are plotted in Figure 1. 
An analysis of variance of the ratings reveals 
a significant effect for physical attractiveness, 
F (1,40) = 8.39, p < .01, but not for per- 
ceived similarity, F = 1.6, or for the interac- 
tion, F = .9, Ratings of liking, not combined 
with ratings on desirability as a work partner, 
strongly discriminate the attractive from the 
unattractive conditions, F (1,40) = 8.9, p< 
01. Three other items on the Interpersonal 
Judgment Scale are face-valid indicators of 
approach responses toward others; dating, 
marriage, and sexual attractiveness. For the 
first two, the physical attractiveness manipu- 
lation again results in more positive ratings, 
dating: F (1,40) = 10.4, p < .005; marriage: 
F (1,40) = 12.7, p < .005, with no effects for 
similarity and no evidence of an interaction 
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ATTITUDINAL SIMILARITY 
Ficure 1. Attraction as a function of perceived 
similarity and physical attractiveness of the con- 
federates. (The data are derived from a summation 
of ratings on liking and desirability as a work part- 
ner, and high numbers correspond to high levels 
of attraction.) 


between attractiveness and similarity, In re- 
gard to ratings on the sexual attractiveness 
dimension, neither experimental manipulation 
has a consistent effect on the subjects’ ratings, 

The final self-report measures of interest 
were collected two to four weeks after the 
experimental session and prior to debriefing 
the subjects regarding the nature of the study. 
In addition to the manipulation checks re- 
ported earlier, subjects were asked (a) to 
indicate whether and how frequently they had 
thought about the person they had interacted 
with, (b) to describe what their partner had 
looked like, (c) to indicate their present de- 
gree of liking for the partner, and (d) to 
indicate how much they had enjoyed par- 
ticipating in the study. Analyses of variance 
of these responses indicate that subjects who 
had interacted with confederates in the physi- 
cally attractive condition reported thinking 
about her more frequently, F (1,37) = 3.93, 
p= .05, showed a tendency to remember 
more aspects of her clothing and appearance, 
F (1,37) = 2.7, p= .10, and continued to 
like her more, F (1,37) = 11.64, p < .005, 
than did subjects who had interacted with the 
unattractive confederates. Consistent with the 
earlier self-report data, perceived confederate 
similarity had no significant impact on these 
measures. 
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Nonverbal measures 


Gaze and smiling behaviors were coded 
from the videotapes for both the talking-in- 
terviewee and the listening-interviewer phases 
of the interaction, The coder was blind re- 
garding the experimental conditions repre- 
sented by any particular subject, and only 
one behavior (smiling or gaze) was scored 
at a time, Since the video cameras were in 
approximately the same position as the ob- 
server in the Exline and Winters (1965) 
study, it was assumed that this procedure 
would index actual looking as reliably as their 
widely used method, A second coder inde- 
pendently scored a randomly selected subset 
of the videotapes and achieved a reliability of 

.93 for smiles and .91 for gaze behavior with 

the first coder, Both smiling scores and gaze 
scores were converted to percentages to cor- 

rect for the different interaction times upon 
which they are based. 

During the period in which the subjects 
were being interviewed, those talking to phys- 
ically attractive confederates spent more time 
looking at her, F (1,36) = 6.73, p < .02, and 
smiling, F (1,36) = 5,94, p < .02, than did 
those talking to less attractive confederates.? 
These differences are not present when the 
subject is in the role of listener-interviewer 
though the direction of the differences is the 
same. The absence of an effect when the sub- 
ject is in the interviewer role can in part be 
accounted for by the fact that subjects took 
notes on what the confederate was saying, 
and thus their gaze was diverted away from 
the other interactant for much of the time. 
Neither of the nonverbal measures are related 
to perceived attitude similarity for either the 
talking or listening phases of the interaction. 

Two final behavioral measures were derived 
from the short-term memory part of the inter- 
action. No significant effects were obtained in 
regard to seating distance primarily because 
78% of the subjects elected to sit one chair 
position removed from the confederate. The 
time required for the subject to initiate verbal 
interaction with the confederate was also re- 


* For these analyses, the total sample is reduced 
to 40 (10 per cell) due to camera and video re- 
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corded, and again no significant differences as 
a function of the experimental manipulations 
were observed. 


Discussion 


In the light of previous research the most 
striking aspect of the findings reported here 
is the total absence of an effect of perceived 
attitudinal similarity on either nonverbal or 
self-report measures of interpersonal attrac- 
tion. As the data for the manipulation checks 
revealed, the absence of such an effect cannot 
be attributed to a faulty or weak induction of 
perceived similarity. The lack of continuity 
between the present results and studies such 
as those of Stroebe et al. (1971) and Byrne 
et al. (1968) may be attributable to the fact 
that, while we used actual persons in face-to- 
face contact, they employed photographs as 
stimuli. Situations involving actual interaction 
may so increase the potency of the physical 
attractiveness cues that the effect observed for 
similarity in noninteraction studies is simply 
overwhelmed, 

The discontinuity between the present find- 
ings and those of Byrne et al. (1970) are not 
so easily dismissed, however, in that a sim- 
ilarity effect was obtained with actual stim- 
ulus persons, varying in physical attractive- 
ness, subsequent to a “coke date” interaction. 
The dissimilarities in the two designs need to 
be taken into account, however, in attempting 
to compare the results. While the present 
study focused on the effects of perceived sim- 
ilarity and manipulated physical attractive- 
ness, that of Byrne et al. involved actual 
similarity (both attitudes and personality) 
and rated physical attractiveness. Thus, while 
much of the effort of the current study was 
directed toward maintaining constancy in the 
behavior of the stimulus persons across ex- 
perimental conditions, the behavior of the 
stimulus persons in Byrne et al, was free to 
covary with these stimulus differences, Sec- 
ond, in that the previous authors did not se- 
lect their subjects on the basis of physical 
attractiveness, it appears likely that the stim- 
ulus persons in the present design represent 


more extreme points on the physical attrac- 
tiveness dimension. 


~ 
x 


E 


a ae 


INTERPERSONAL ATTRACTION IN 


It is important to note that evidence for the 
salience of appearance cues in social interac- 
tion, both in regard to stigmatizing cues such 
as physical deformities and in regard to less 
striking physical variations such as the attrac- 
tiveness manipulation employed here, derives 
from studies involving either little interaction 
(an evening dance, a coke date, a conversa- 
tional encounter) or no interaction at all 
(photographs of “persons you might date”). 
It has been suggested by several investigators 
(e.g, Richardson, 1969) that while appear- 
ance cues may be very important during the 
initial phases of an interaction, they probably 
decrease in importance as interaction time ac- 
cumulates. Evidence counter to this argument 
is provided in a recent study by Kleck, 
Richardson, and Ronald (1974), in which 
children’s friendship choices following two 
weeks of intense social interaction were 
strongly and positively related to the physical 
attractiveness of the stimulus children, Dion 
and Berscheid (Note 1) have obtained similar 
results in a nursery school setting. Such re- 
sults can be accounted for in at least two 
ways. Either attractiveness cues continue to 
have an impact on interpersonal attraction 
responses independent of the particular in- 
terpersonal exchanges which take place, or 
physical attractiveness tends to be highly cor- 
related with behaviors and interaction out- 
comes which others find reinforcing. 

One must be cautious in attempting to gen- 
eralize from laboratory manipulations of the 
sort represented here to the operation of phys- 
ical attractiveness and attitudinal similarity 
cues in naturalistic interactions, The extra- 
polation of laboratory findings is particularly 
problematic in regard to perceived similarity 
for a number of reasons. In the first place, the 
manipulation of this variable (via the presen- 
tation of relatively large amounts of attitu- 
dinal information at the start of an interac- 
tion) is at variance with the gradual exchange 
of such information which takes place in nat- 
ural settings (Altman & Taylor, 1973). Sec- 
ond, in order to reduce suspicion regarding 
the similarity manipulation, a fixed order of 
information exchange was imposed on the in- 
teraction such that the subject always gave 
his opinion on a given issue after he had 
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heard the confederate express hers. A possible 
implication of such a restriction is that the 
subject will be made uncomfortable by being 
placed in a position where the confederate can 
perceive his behavior as an attempt to in- 
gratiate her, particularly if she is physically 
attractive. The implications of such uncom- 
fortableness are not obvious, but this factor 
may help account for the absence of a sim- 
ilarity effect in the present data (see Figure 
1). Finally, in the present design the revela- 
tion of similarity was independent of the 
physical attractiveness of the person to whom 
the similarity was disclosed. There is reason 
to expect (e.g., Berscheid & Walster, 1973) 
that in naturalistic situations the physical ap- 
pearance of an interactant may itself affect 
whether we choose to reveal attitudinally sim- 
ilar or dissimilar information, 

It is important to keep in mind that the 


present results are based on male-female — 


dyads in which only females serve as stimulus 
persons, On the basis of previous research 
(Berscheid & Walster, 1973) one might well 
expect the effects of physical attractiveness to 
be greater under these conditions than would 
be the case in same-sex dyads or when fe 
males are responding to males, Clearly, this 
assumption needs to be investigated in that 
there is some evidence (eg, Byrne et al, 
1970) to the contrary. 


REFERENCE NOTE 


1. Dion, K., & Berscheid, E. Physical attractiveness 
and social perception of peers in preschool chil- 
dren. Unpublished manuscript, 1972, (Available 
from E. Berscheid, Department of Psychology, 
N309 Elliott Hall, University of Minnesota, Min- 
neapolis, Minnesota 55455.) 
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Evaluation of an Interviewer as a Function of Interviewer 
Gaze, Reinforcement of Subject Gaze, 
and Interviewer Attractiveness 
: Chris L. Kleinke 
Wheaton College, Norton, Massachusetts 
Dale E. Berger 
Claremont Graduate School 


Richard A. Staneski 
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Male subjects were interviewed by female interviewers who gazed constantly, 
intermittently, or not at all. Experimental subjects were reinforced with green 
light feedback whenever they gazed at the interviewers and were punished 
with red light feedback when they averted gaze for more than 6 seconds. 
Control subjects received noncontingent green and red light feedback. Although 
gaze of experimental subjects toward the interviewers was increased signifi- 
cantly, their attitudes toward the interviewers remained the same. This was 
probably because the subjects did not discriminate that their gazing behavior 
had changed. Subjects gave the most unfavorable reactions to the nongazing 
interviewers, rating them as least attractive, giving them the shortest answers, 
and sitting farthest from them during the debriefing session. Subjects did not 
discriminate between high and low attractive interviewers, except that the 
latter were rated disproportionately low on attentiveness if they did not gaze. 
Interviewers with high rates of talking were preferred over interviewers with 
low rates of talking, It was concluded that interpersonal attraction is related 
to gaze and physical attractiveness through a number of mediating variables 


which will have to be isolated more specifically in future research, 


Bem’s attitude theory (1965, 1967, 1970, 
1972) proposes that people infer their atti- 
tudes by observing their behaviors and mak- 
ing a discrimination as to whether the behav- 
iors are under their own “internal” control 
(tacted) or under the control of consequences 
in the outside environment (manded). Behay- 
iors which are perceived as manded are at- 
tributed to forces beyond the person’s imme- 
diate control. Behaviors which are perceived 
as tacted are interpreted as reflecting part of 
the person’s belief and attitude system. Stud- 
ies attempting to test Bem’s theory have 
manipulated subjects’ perceived causes of 


This research is based on a doctoral dissertation 
submitted by the first author to Claremont Graduate 
School and was supported by a research grant from 
that institution. The data were presented at the 
meeting of the Western Psychological Association, 
Portland, Oregon, April 1972. The authors are in- 
debted to Frederick B. Meeker, Pam Weaver, Rich- 
ard L. Ying, and Carl La Fong. j 

Requests for reprints should be sent to Chris L. 
Kleinke, Department of Psychology, Wheaton Col- 
lege, Norton, Massachusetts 02766. 


various behaviors as well as the actual be- 
haviors of the subjects themselves. 

Cognitive dissonance studies were reinter- 
preted by Bem as essentially manipulating 
subjects’ interpretations of causes of behaviors 
they were subtly induced to perform (see also 
Kelley, 1967, p. 227). In other studies Bem 
(1966) showed that attitudes of subjects 
could be influenced by varying discriminative 
stimuli which had previously been experimen- 
tally conditioned to lend different interpreta- 
tions to behaviors. A number of experiments 
have been conducted in which subjects’ atti- 
tudes were influenced by giving false feedback 
or misinformation about the causes of behav- 
iors they had emitted (Bem, 1972; Nisbett 
& Valins, 1971). 

Natural changes in behavior are described 
by Bem (1970) as affecting attitudes because 
the original external causes of these changes 
eventually become forgotten. Other research- 
ers have produced behavioral “changes” by 
misinforming subjects about various behaviors 
they had (ostensibly) emitted (Kleinke, 
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Bustos, Meeker, & Staneski, 1973; Kleinke, 
Gitlin, & Segal, 1973; Valins, 1966). Changes 
in subjects’ behaviors have also been pro- 
duced by reinforcement control. Bavelas, 
Hastorf, Gross, and Kite (1965) and Hastorf 
(1968) were able to increase the verbal out- 
put of a target person in a four man discus- 
sion group through the use of reinforcing and 
punishing feedback lights. Behavior change 
produced attitude change to the extent that 
target persons who increased in talking were 
later rated more favorably by themselves and 
other group members on sociometric measures, 

The present experiment was directed to- 
ward a further test of Bem’s attitude theory 
by attempting to increase the gaze of male 
subjects toward female interviewers with op- 
erant control. The incorporation of gaze re- 
search into social psychological theory has 
been discussed by Ellsworth and Ludwig 
(1972). If high levels of gaze are positively 
related to interpersonal attraction (Kleinke, 
1972), it was thought that subjects whose 
gaze toward interviewers was increased would 
be more favorable toward the interviewers 
than subjects whose gaze was not increased. 
Interviewer gaze and interviewer attractive- 
ness were manipulated in addition to rein- 
forcement of subject gaze in order to gain 
more information about how the effects of 
gaze interact with other behaviors and at- 
tributes of the gazing person (Ellsworth & 
Carlsmith, 1968; Kleinke & Pohlen, 1971; 
Scherwitz & Helmreich, 1973; Kleinke, Stan- 
eski, & Pipp, in press). It was predicted that 
interviewers would be favored more if they 
gazed at a high rather than low level. Equi- 
librium theory (Argyle & Dean, 1965; Gold- 
berg, Kiesler, & Collins, 1969) suggests that 
it would be more difficult to increase subjects’ 
gaze toward gazing interviewers rather than 
nongazing interviewers. High-attractive inter- 
viewers were expected to be generally pre- 
ferred over low-attractive interviewers (Ber- 
scheid & Walster, 1973), with amount of 
interviewer gaze making more of a difference 
in ratings of low-attractive interviewers 
(Scherwitz & Helmreich, 1973; Kleinke et al., 
in press). 

Success in changing a person’s behaviors 
and subsequent attitudes through reinforce- 
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ment control depends on the subje: 

tion that his behavior change wa 

ent of external pressures or indu 

was hoped that by concealing thc 
ment contingencies in the present si 
jects’ gaze could be increased in a way 
was perceived by them as tacted or from 
own volition. 


their 


METHOD 


Experimental Design 


Two treatments of reinforcement (cx 
subjects reinforced for gaze, control s 
noncontingent reinforcement), three levels o 
viewer gaze (constant, intermittent, no 
two levels of interviewer attractiveness (high « 
tiveness, low attractiveness) were manipulated in 
2X32 factorial design. Each control subject wa 
randomly matched with an experiment 
given the same interviewer, the same 
gaze treatment, and reinforced at exa 
time intervals in which the experime 
was reinforced. The only deviation 
yoked control procedure was that the 
perimental and control subjects werc 
exactly the same time of day. In the 
cases the experimental and control subjects 
in adjacent sessions, but in some cases th f: 
separated by a day or more. Experimental treat 
ments were carried out in counterbalanced order. 


subject, 
interviewer 


Subjects 


Fifty-four male introductory psycho! 
from California State Polytechnic 
Pomona; Claremont Men’s College; and Cha 
College served as subjects, Subjects ranged in age 
from 18 to 25 years. Assignment of subjects to 
interviewers and treatment conditions was random. 
Subjects were given course credit and paid $1 for 
their participation. All subjects were volunte: 


Interviewers 


Interviewers were nine female students 
Claremont Colleges. Interviewers ranged in 
18 to 21 years. Each interviewer interviewe 
jects, one in each gaze-reinforcement condition. In- 
terviewers were not previously acquainted with the 
subjects, were naive to the hypotheses of the study, 
and were always blind to the subjects’ experimental 
or control status. Some of the interviewers wore 
skirts and nylons, and others dressed quite informally 
with sweat shirts and blue jeans. Interviewers 
dressed in the same manner for all six interviews. 


1 The authors wish to thank Nancy Achorn, 
David, Mary Dicke, Jan Frazier, Valerie Gers 
Mary Jo McCormack, Sandra Pipp, : 
and Pam Weaver for their gracious assistance, 


EVALUATION oF 


Experimental Setting and Apparatus 


The experiment was conducted in a 93-foot square 
experimental room furnished with a curtained one- 
way mirror, carpeting, a 30 X 60 inch table with two 
chairs, a floor lamp, and a small 18 X 27 inch table 
holding a tape recorder and battery-operated pre- 
amplifiers for plethysmographic recording of heart 
rate. The participants sat across the corner of the 
larger table at a distance of approximately 3 feet. 
The curtain covering the one-way mirror was sepa- 
rated about 3 inches. The tape recorder was used to 
present experimental instructions to the participants 
and then, ostensibly, to record the conversation be- 
tween them. In reality, the tape recorder served to 


“record the sequences of green and red lights de- 


livered to the experimental subjects so that they 
could be played back in exact replication to the 
respective matched controls. This was accomplished 


. by feeding a 300-cycle tone directly into the tape 


recorder whenever an observer held down his gaze 
recording button for experimental subjects. The tone 
pulses were then played through a voice-operated 
relay to manipulate the green and red light sequences 
for the control subjects. 


Procedure 


To enhance credibility of the experiment, partic- 
ipants were first taken into the observation room 
and shown the physiological recording apparatus, 
clocks, timers, and an Esterline-Angus recorder. The 
apparatus was placed in a corner of the observation 
room away from the viewing screen and a shade 
was drawn over the screen to conceal the fact that 
the interview would be observed visually. A dem- 


‘onstration was given with an audio galvanic skin 


response device, and the participants were then taken 
into the experimental room and seated, always with 
the subject facing the observation screen. Plethysmo- 
graphs connected to the recording apparatus were 
attached to the subjects and interviewer’s left 
thumbs. Tape-recorded instructions introduced the 
experiment as a study of the interview method and 
its use as a way of getting to know more about 
people. It was explained that female volunteers had 
been designated to be interviewers and male volun- 
teers were serving as interviewees, that the interview 
would last 24 minutes, and that feedback would be 
given in an attempt to enhance or improve the 
process of the interview. Participants were told that 
the female interviewer had been given a list of inter- 
view questions ahead of time so that she would have 
a chance to become familiar with them. The inter- 
view was made up of questions about school matters, 
future goals, hobbies, interests, and opinions on issues 
such as ecology, welfare, and women’s liberation. In- 
timate and embarrassing questions were not included. 
When 24 minutes for the interview had elapsed, the 
experimenter interrupted the participants and led 
them into an unfurnished interview room with indi- 
vidual cubicles for completion of a brief question- 
naire. The subject was given the questionnaire for 
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affective ratings of the interviewer and believed that 
the interviewer, in turn, would be rating him, In 
actuality, the interviewers were never shown a copy 
of the questionnaire until after they had completed 
their total six sessions. It was always arranged that 
the interviewer would “finish” the questionnaire first 
and bring her chair into a designated area of the 
interview room to sit in a standard posture and posi- 
tion. When the subject emerged from his cubicle, the 
experimenter, ostensibly busy in an adjacent room, 
requested him to also bring his chair from his cubicle 
and be seated in the interview room. The experi- 
menter then brought a third chair and joined the 
participants to question them about the experiment 
and debrief them in the manner suggested by Orne 
(1962). The full nature of the experiment was ex- 
plained to the subjects, questions were answered, and 
subjects were paid, thanked for their time, and re- 
quested not to tell other prospective subjects about 
the experiment. When the participants had both left 
the interview room, the experimenter measured the 
distance between the left legs of the subject’s and 
interviewer’s chairs and replaced the chairs in the 
cubicles for the next session. 


Independent Variables 


Reinforcement of subjects’ gaze. Gaze reinforce- 
ment was delivered with feedback lights similar to 
those of Bavelas et al. (1965) and Hastorf (1968). 
Participants were separated by a vertical 7 X 15 inch 
white plywood partition with red and green 74-watt 
110-volt bulbs mounted on each side, Participants 
could only see their own lights, In the experimental 
instructions it was explained that the physiological 
responses of the participants would be monitored by 
two research psychologists who had made extensive 
study of the interview process. The research psychol- 
ogists would ostensibly provide feedback with the 
green light whenever the responses above were de- 
sirable for success in an interview and would signal 
with the red light whenever the responses above were 
undesirable for a successful interview. The absence of 
lights was a neutral sign. Participants were told that 
the determination of exactly which responses were 
desirable and undesirable for success in an interview 
had been well established in previous psychological 
research. The instructions stated that the ability to 
adapt to an interview situation, manifested by an 
increase in green light feedback, was a sign of flexi- 
bility and a positive personality characteristic. It was 
mentioned that the experimenter would score partic- 
ipants on the basis of green light duration minus 
red light duration to see how successful they could 
be and how well they could compare with other 
people taking part in the study. 

In reality, the feedback lights of experimental sub- 
jects were controlled by the observer’s gaze recording 
button, and the feedback lights of control subjects 
were activated by the previously described tape re- 
corder, voice-operated relay apparatus. The inter- 
viewer feedback lights were not used, except to cue 
gaze in the intermittent gaze condition. 
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The first 8 minutes of each interview constituted 
a baseline period, followed by 16 minutes of green 
and red light feedback. In the experimental sessions 
the green light came on immediately when the sub- 
ject gazed at the interviewer, remained on during the 
gaze, and went off at the end of the gaze with a 
14-second delay. The delay interval between end of 
gaze and green light termination was used to conceal 
the contingency between subject gaze and green light 
feedback. The red light was programmed to operate 
whenever the experimental subject averted his gaze 
for more than 6 seconds and to remain on until the 
subject again gazed. It was determined, in accord- 
ance with Gibson and Pick (1963), that the feedback 
lights were well within a subject’s peripheral vision 
no matter where he looked in the interview room. 
Manipulation of interviewer gaze. In the constant 
gaze condition the interviewer gazed at the subject 
continuously, except for a few prompting glances at 
the interview questions. Interviewers had familiarized 
themselves with the questions to enhance this treat- 
ment. Interviewer gaze was defined as direction of 
the interviewer’s head as well as her eyes toward the 
eyes of the subject. In the intermittent gaze condi- 
tion, the interviewer was trained to look at the sub- 
ject whenever her green light was on. The intermit- 
tent gaze treatment was the same for all intermittent 
sessions and involved random 1- to 8-second glances 
(mean glance was 4.8 seconds) which combined to a 
total of 12.2 minutes (51%) of gaze given to each 
subject. The program of glance duration and interval 
between glances, operated by a punched film reader, 
was based on normative data from Argyle and Dean 
(1965) and Kendon (1967). The no-gaze condition 
consisted of the interviewer looking down at the 
questions or off to one side of the subject but never 
directly at the subject. All interviewers were aware 
of the importance of being consistent in the amount 
of smiling, talking, and social approval they gave to 
the six subjects whom they interviewed. 
Attractiveness of interviewers. Interviewers were 
photographed at shoulder level with 35-millimeter 
color transparencies. A 20-second segment of super 
8-millimeter color film was also made of each inter- 
viewer walking to a chair and sitting down. Inter- 
viewers dressed for the photographs as they did for 
the experimental sessions. The stimulus materials 
above were rated by 27 male judges from the same 
population as the original subjects. Judges viewed 
the films first and then were presented with 36 pairs 
of slides representing all possible combinations of 
nine interviewers taken two at a time. The sequence 
of pairs was randomized. After each presentation, 
judges chose which of the two girls was most attrac- 
tive by scoring R for the girl on the right or L for 
the girl on the left. It was possible after the ratings 
to place four of the interviewers into a high-attrac- 
tiveness category (preference X =149) and five of 
the interviewers into a low-attractiveness category 
(preference X =46). The Spearman rank order cor- 
relation between attractiveness ratings made by inde- 
pendent judges and attractiveness ratings given by 
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subjects on the postexperimental questionnaire was 
153 (p < 01). The agreement of independent judges 
and subjects on the division of interviewers into the 
high- and Jow-attractiveness categories was perfect. 


Dependent Variables 


Talking during the interview. The experimenter 
(different from the observer) monitored the inter- 
view with headphones: and pressed one putton when- 
ever the interviewer talked and another button 
whenever the subject talked. These buttons were 
connected to the Esterline-Angus recorder and coun- 
ters to provide data for total talking of interviewers 
and subjects as well as the relationship of subject 
talking, interviewer talking, and subject gaze during 
the interview. 

Subject gaze. An observer behind the one-way 
mirror recorded subject gaze by activating an Ester- 
line-Angus pen and electric clocks whenever the sub- 
ject focused his eyes on the eyes or face of the 
interviewer. Six Claremont and Chaffey College stu- 
dents served as observers. All observers except one 
(who recorded 28 sessions) were blind to the hy- 
potheses of the experiment and to the experimental 
or control status of the subject. Analyses of variance 
at the end of the experiment found no difference 
between the data from subjects of the nonblind ob- 
server and subjects of the blind observers. Observers 
who recorded gaze did not hear the conversation 
between the interviewer and subject. Reliability in 
recording subject gaze was determined during 10 
random sessions after the method of Exline (1963). 
The range of agreement between pairs of observers 
was .71 to .88, with a mean of .81. 

Subjects’ attitudes toward the interviewers. The 
postexperimental questionnaire was made up of eight 
polar adjective rating scales and questions asking the 
subjects how much they liked the interviewer, how 
much they thought the interviewer liked them, and 
how attractive they thought the interviewer was. 
The polar adjectives included in the rating scale 
(friendly-unfriendly, incompetent-competent, warm- 
cold, sincere-insincere, unintelligent-intelligent, re- 
laxed-nervous, was attentive-ignored me, polite- 
impolite) had been previously shown by Lott, Lott, 
Reed, and Crow (1970) to discriminate between dif- 
ferentially liked persons. All questionnaire ratings 
were made on a 12-point scale, A behavioral measure 
of subjects’ attitudes toward the interviewers was the 
distance they left between themselves and the inter- 
viewers during the debriefing session. 


RESULTS 


Three of the 54 subjects noticed that the 
feedback lights were contingent upon their 
gaze toward the interviewer. The data from 
these subjects did not appear to differ signif- 
icantly from the data of other subjects and 
were included in the analysis. Unless other- 
wise specified, data were analyzed in a 2 X 3 
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Experimental Group 


Control Group 


Experimental Period 


Ficure 1. Mean gaze durations of experimental and control subjects over two-minute intervals. 


x 2 design, with reinforcement of subject 
gaze, interviewer gaze, and interviewer attrac- 
tiveness as independent variables. 


Reinforcement of Subjects’ Gaze 


Because people gaze more when listening 
than when talking (Exline, 1963; Exline, 
Gray, & Schuette, 1965; Kendon, 1967), an 
analysis of variance was conducted to show 
that amounts of talking by subjects and inter- 
viewers in this study did not change over time 
in any of the 12 treatment conditions. Any 

. change in subject gaze, then, could not be an 
artifact of changes in subject or interviewer 
talking. 

Gaze durations for experimental and con- 
trol subjects are plotted over 2-minute inter- 
vals in Figure 1. Comparison of experimental 
and control subjects’ gaze duration changes 
between baseline and experimental periods 
(subjects as their own control) demonstrated 
that experimental subjects increased their 
gaze duration per 2-minute interval with a 
mean of 6.9 seconds and control subjects de- 


creased their gaze duration per two-minute 
interval with a mean of 1.5 seconds. This dif- 
ference was highly significant, F (1,42) = 
10.4, p < .002. Gaze durations of experimen- 
tal subjects, per 2-minute interval, were sig- 
nificantly higher during the experimental pe- 
riod than during baseline, Ys = 67.4 versus 
60.5 seconds, £ (26) = 4.71, p < .001. There 
was no significant change in gaze duration be- 
tween baseline and experimental periods for 
control subjects, even when the 8-10 minute 
interval (which is probably an artifact of the 
feedback lights beginning) is excluded. The 
absence of interactions involving interviewer 
gaze and interviewer attractiveness signifies 
that reinforcement of subject gaze in the ex- 
perimental group was independent of these 
factors. 

Although the reinforcement treatment sig- 
nificantly increased experimental subjects’ 
gaze durations, it did not influence their 
verbal responses on the questionnaire. There 
was no correlation between subjects’ gaze fre- 
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TABLE 1 
Mean RATINGS OF INTERVIEWERS ON WAS 


ATTENTIVE—IGNoRED ME DIMENSION 
(was attentive scored 1) 
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rated as next most attentive, Y = 3.45, and 
nongazing interviewers were seen by subjects 
as being least attentive, ¥ = 4.95, F (2,42) 


Attractiveness oe interne No-gaze 
High* 2.88 3.88 3.24 
Low? 2.00 | 3.10 6.30 

er 


quencies or gaze durations and any of the 
measures on the questionnaire. 

An analysis of variance with reinforcement 
of subject gaze and interviewer gaze as inde- 
pendent variables and repeated measures for 
subject gaze/subject talk and subject gaze/ 
interviewer talk showed that subjects gazed 
significantly more when they were listening, 
X = 52.6%, than when they were talking, 
X = 46.8%, F (1,48) = 6.58, p < .05. No in- 
teractions were present in the analysis. 


Interviewer Gaze 


Subjects made significantly briefer state- 
ments in the no-gaze condition, ¥ = 10.7 sec- 
onds, compared with the intermittent, X = 
15.2 seconds, and constant gaze, ¥ = 15.6 
seconds, conditions, F (2,42) = 4.55, p < .05. 
Because of this, overall levels of talking were 
lower for subjects in the no-gaze condition, 
X = 67.2%, than they were for subjects in 
the intermittent, X = 72.5%, and constant 
gaze, X = 74.1%, conditions, F (2,42) = 
4.12, p < .05. Interviewers talked most in the 
no-gaze condition, ¥ = 29.5%, and less in the 
intermittent, XY = 22.4%, and constant gaze, 
X = 21.5%, conditions, F (2,42) = 4.39, p < 
.05. The fact that interviewer utterance 
lengths did not differ between the three gaze 
conditions (overall X = 4.1 seconds) implies 
that differential evaluations of the interview- 
ers in the three gaze conditions would have 
to be a function of interviewer gaze, inde- 
pendent of interviewer talking. Interviewers 
talked more in the no-gaze condition only be- 
cause subjects gave briefer answers and more 
questions had to be asked. 

Interviewers who emitted constant gaze 


were rated as most attentive, X = 2.39, 
—— with intermittent gaze were 


a 


= 4.76, p < .02. Table 1 points out the espe- 
cially low rating on attentiveness made by 
subjects in the no-gaze/low-attractiveness 
group. The Interviewer Gaze X Interviewer 
Attractiveness interaction in Table 1 was sig 
nificant, F (2,42) = 3.57, p< .05. A New 
man-Keuls test showed the nongazing low 
attractive interviewers to differ significantly 
p < .05 from the other five groups. No othe: 
rating form responses were related to tl 

interviewer gaze treatment. 

The distance subjects left between them 
selves and the interviewer during the debrief 
ing session was related with borderline signifi 
cance to the interviewer gaze treatment, F 
(2,42) = 3.01, p < .07. Subjects in the no- 
gaze condition maintained the greatest dis 
tance, X = 72.1 inches, subjects in the co: 
stant gaze condition sat closer, ¥ = 60.3 
inches, and subjects receiving intermitten 
gaze sat closest, X = 58.5 inches. The di 
tance measure was independent of reinforce 
ment and interviewer attractiveness. Subjects’ 
gaze and heart rates were not significantly 
affected by interviewer gaze. 


Interviewer Attractiveness 


Division of interviewers into the two attrac- 
tiveness groups were validated by the signifi- 
cantly higher attractiveness ratings (very at- 
tractive scored 1) given to the high-attractive 
interviewers, X = 2.83, compared with the 
low-attractive interviewers, X = 4.63, F (1.42) 
= 12.8, p< .001. High-attractive and low- 
attractive interviewers were not rated differ- 
ently by subjects on other verbal measures, 
except for the interaction in Table 1, 


Amount of Interviewer Talking 


A post hoc analysis was carried out com- 
paring interviews in which the interviewers 
talked more than 25% of the time, X talking 
= 33.9%, m= 29, and interviews in which 
the interviewers talked less than 25% of the 
time, X talking = 15.7%, n = 25. Chi-square 
analyses indicated that reinforcement, inter- 
viewer gaze, and interviewer attractiveness 
treatments were equally distributed between 
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these two groups. Analyses of variance with 
amount of interviewer talking, interviewer 
gaze, and interviewer attractiveness as inde- 
pendent variables showed that interviewers 
with high levels of talking were rated as 
significantly ‘more warm, polite, and friendly 
than interviewers with low levels of talking. 


Discussion 


, | Gaze duration of subjects toward interview- 

ers was clearly. increased as a result of the 
reinforcement procedure. The fact that in- 
creased gazing behavior of subjects did not 
lead to a measurable change in their attitudes 
is probably due to a tendency for people not 
to make discriminations about the amount of 
gaze they emit in a particular situation. If 
subjects in the present experiment were not 
aware that their gazing behavior had in- 
creased, their attitudes could not be expected 
to change. Ellsworth and Ludwig (1972) 
discuss research showing that subjects do not 
always make accurate discriminations about 
the amount of gaze they have received from 
others. In the present experiment, gaze of sub- 
jects did not correlate with their responses on 
the rating scale. Kleinke, Bustos, Meeker, 
and Staneski (1973) were able to influence 
attitudes in an experiment where the ostensi- 
ble gaze of subjects was made very salient 
with false feedback. 

Subjects gave the most unfavorable reac- 
tions to the nongazing interviewers, rating 
them as least attentive, giving them the 
shortest answers, and sitting farthest from 

_ them during the debriefing session. No direct 
relationship was found between interviewer 
gaze and subjects’ ratings of liking. Inter- 
viewers who gazed constantly were not dif- 
ferentiated from interviewers who gazed inter- 
mittently, except that the former were rated 
as slightly more attentive. 

The finding that subjects gave equal gaze, 
gave shorter answers, and had the farthest 
distance to nongazing interviewers does not 
appear consistent with equilibrium theory 
(Argyle & Dean, 1965; Goldberg et al., 1969). 
This theory would predict a combination of 
more gaze, longer answers, and less distance 
given to a nongazing interviewer. 
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The finding that subjects did not evaluate 
high-attractive interviewers more positively 
than low-attractive interviewers is probably 
best explained by the fact that interviewers in 
the present experiment appeared relatively 
inaccessible for personal interaction or poten- 
tial dating. An attempt was made to place 
interviewers and subjects on an equal basis by 
introducing them both as subjects. Because 
the interviewers were restricted to asking 
questions and the subjects were restricted to 
answering, however, it was not possible for 
the subjects to learn anything about the inter- 
viewers. Berscheid and Walster (1973) dis- 
cuss the notion that the influence of attrac- 
tiveness in choosing a partner of the opposite 
sex is greatest when the probability of suc- 
cess in meeting and being accepted by the 
chosen person is high. It is possible that gaze 
from an opposite-sex person also has more 
influence on interpersonal evaluations under 
conditions of high availability. 

The disproportionately unfavorable rating 
on attentiveness given by subjects to low- 
attractive nongazing interviewers is similar to 
results of studies by Kleinke et al. (in press) 
and Scherwitz and Helmreich (1973). These 
researchers also found that low-attractive fe- 
males suffered the most negative ratings from 
males if they gave low levels of gaze in an 
interaction. 

The general positive attitude shown by sub- 
jects toward interviewers with high rates of 
talking adds to limited data on the relation- 
ship between amount of talking and interper- 
sonal attraction. Stang (1973) recently ma- 
nipulated amount of talking in a parametric 
fashion. 

The present experiment shows that gaze, 
physical attractiveness, and liking are related 
in a more complex way than some previous 
investigators have assumed. Future studies 
should attempt to clarify the interrelationship 
between these variables by separating out 
some of the factors which lead to apparent 
inconsistencies in nonverbal research (Kleinke, 
1972). 
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Reanalysis of Vidmar’s Data on the Effects of Decision 
Alternatives on Verdicts of Simulated Jurors 


Kinley Larntz 
Department of Applied Statistics, University of Minnesota 


A simple probabilistic model is proposed for explaining the effects of limiting 
the number of decision alternatives available to a decision maker. The model 
holds that the possible decisions in the restricted case are distributed in the 
same proportions as in the unrestricted case. Vidmar’s data on juror verdicts 
are reanalyzed using this model, and the data appear to be well fit by the 


model. 


In a recent article, Vidmar (1972) pre- 
sented and analyzed data relating the effects 
of limiting the number of decision alterna- 
tives on the verdicts and social perceptions 
of simulated jurors. In this article, I propose 
a simple model for explaining the part of his 
data relating decision alternatives and ver- 
dicts. My conclusions differ somewhat from 
those of Vidmar in that my model offers a 
statistical rather than a psychological expla- 
nation for the observed data pattern. 


Vidmar’s Data 


Vidmar collected data aimed at discovering 
the possible effects on decision making of 
limiting the number of alternatives available 
to a decision maker. As an example he of- 
fered the trial arising from the Algiers Motel 
incident that occurred during the Detroit riots 
of 1967. The judge in that trial instructed 
the jury to choose between two possible ver- 
dicts: guilty of first-degree murder or not 
guilty; other verdicts were not allowed. Vid- 
mar set up and carried out an experiment to 
determine the possible effects of such a re- 
duction of alternatives. For a broader psycho- 
logical discussion, see Vidmar (1972). 

Vidmar’s experiment consisted of present- 
ing an abridged transcript of a murder trial 
(not the Algiers Motel trial) to students in an 
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introductory social psychology course. For this 
particular trial, four possible verdicts were 
presented: first-degree murder, second-degree 
murder, manslaughter, and not guilty. 
Twenty-four students were instructed to act 
as individual jurors making their decision as 
the judge instructed the jury in the Algiers 
Motel trial, that is, between first-degree mur- 
der and not guilty. Another 24 chose between 
second-degree murder and not guilty, etc. All 
seven combinations of guilty verdicts with not 
guilty were presented to 24 subjects. The 
data collected by Vidmar are presented in 
Table 1. 

Vidmar’s conclusion was “that under con- 
ditions of restricted decision alternatives, the 
more severe the degree of guilt associated 
with the least severe guilt alternative, the 
greater were the chances of obtaining a not 
guilty verdict.” The model I present in the 
next section differs from this in that the 
chances of obtaining a not guilty verdict de- 
pend not on the severity of the guilt alterna- 
tive, but on the likelihood (probability) of 


TABLE 1 


Decision ALTERNATIVES AND FREQUENCY 
or VERDICTS 
(Vidmar, 1972) 


Condition 


Alternative — = 
1 2 3 4 | 5 | 6 |7 
First degree 11 |—]|—] 2 T| = 2 
Second degree | — | 20 | — 2 | — |} 11 | 15 
Manslaughter | — | — | 22 | — | 16 | 13 5 
134A | =o G R fie 2 

| 


Not guilty 


Nole. Blank cells indicate that the decision alternative was 
not allowed for subjects under the condition. 
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TABLE 2 
EXPECTED PROPORTIONS FOR VERDICTS UNDER THE CONDITIONAL PROBABILITY MODEL 
Condition 
Alternative rl hR 
1 2 3 4 5 6 
| 
pr Pr pr 
First degree — pr 
Pr + py Pr +Ps +py | pr +pu + py 
| 
ps Ps | Ps 
Second degree — | - ps 
Ps + py pr + ps + py Ps +pu 4 
Pm Pu | Pu 
Manslaughter = 5 ; 
Pu + py Pr +pu + py | ps + pu 4 
PN PN PN PN PN PN 
Not guilty 
Pr + py | ps + pn | pa + py Pr + ps + pw | pr + pw + py | Pat pa +; 
Total 1.0 1.0 1.0 1.0 Lo | 1.0 o 
Note. be Fpipapues first-degree murder, ps = probability second-degree murder, py = Probability matilaus hter 
ability not guilty. 


the guilt alternative in the unrestricted deci- 
sion alternative case. 


A Conditional Probability Model 


Consider first the unrestricted decision al- 
ternative case. Let Pr, ps, pm, and Py be the 
probabilities of the four possible verdicts in 
the unrestricted case where 


Pr = prob(first-degree verdict) 

Ps = prob(second-degree verdict) 

Pa = prob (manslaughter verdict) 

Px = prob(not guilty verdict), 
with pp + ps + pu + py = 1. If a large num- 
ber of jurors were presented the transcript, we 
would expect the proportion of first-degree 
verdicts to be pp, second-degree yerdicts Ps; 
etc. 


TABLE 3 


EXPECTED CELL FREQUENCIES USING THE 
CONDITIONAL PROBABILITY MODEL 


Condition 
Alternative 


1 PEA E ES 
First degree |140| — | —]| 2.6| 3.6] — | 17 
Second degree | — | 21.9] — |19.5| — | 13.8] 12.8 
Manslaughter | — | — |21.0| — |17.8] 9.0] 83 
Not guilty 10.0; 2.1) 3.0} 1.8] 2.6} 1.3] 1.2 
Total 24 |24 |24 |24 |24 |24 |24 


Nolte. Totals may not add to exactly 24 due to rounding. 


Now for a restricted decision a ative, 
we can hypothesize how jurors mighi act. 
Consider the case of forced choice n 
first-degree murder and not guilty, | € 
unrestricted case, we would expect propi 
tions pp and py of the decisions to be first- 
degree murder and not guilty, respectively. 
However, when we do not allow ot 
dicts, a reasonable hypothesis for the cx 
proportions in the restricted case is that 


th 
Proporitions will be in the ratio Pr 


iPx. Thus 


prob (rst degree 


only first degree and 
not guilty allowed 


pi 
and 


only first degree and 


pip (not guilty not guilty allowed } 


PN 
Pr + px 
r , just the usual condi- 
tional probabilities of F and N given only F 
and N occur. Table 2 gives the expected pro- 
portions for the seven conditions in the Vid- 
mar experiment. These are the expected pro- 
Portions for the case of independence in a 
two-way contingency table given that some 
cells are necessarily zero, 


These are, of course 


REANALYSIS oF Vipmar’s Data 


Estimating the Probabilities 


This conditional probability model is a 
case of the quasi-independence model for an 
incomplete contingency table (see Bishop & 
Fienberg, 1969; Fienberg, 1972; Goodman, 
1968). The maximum likelihood estimates for 
Pr, Ps, Pa, and py can be found using a com- 
puter program for fitting log-linear models in 
contingency tables (see Haberman, 1972). 
Discussion of the fit for the Vidmar data is 
given below. 


Reanalyzing Vidmar’s Data 

The data given in Table 1 were analyzed 
using the conditional probability model. Maxi- 
mum likelihood estimates of the unrestricted 
probabilities are given by 


pr =. 
Ds = 


py =. 


Thus, in the unrestricted case, the best esti- 
mate is that 53.2% of the verdicts would be 
for second-degree murder, 34.7% for man- 
slaughter, 7.1% for first-degree murder, and 
5.0% for not guilty. Applying these proba- 
bilities to the restricted case of first-degree 
murder versus not guilty we find that we 
would expect the proportion of not guilty 
verdicts to be 
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In the case of second-degree murder versus 
not guilty, the expected proportion of not 
guilty verdicts is found to be .086; and for 
manslaughter versus not guilty the corre- 
sponding expected proportion is .126. Thus, 
from a strictly statistical point of view, the 
large proportion of not guilty verdicts in 
Condition 1 can be explained by the fact that 
first-degree murder and not guilty have rela- 
tively close unrestricted probabilities (.071 
versus 050). When not guilty is compared 
with more probable events, its relative fre- 
quency decreases. 

A check on the goodness of fit of the con- 
ditional probability model can be made by 
calculating expected cell frequencies to be 
compared with the actual cell frequencies 
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given in Table 1. The expected cell frequen- 
cies for the conditional probability model are 
given in Table 3. 

To judge the goodness of fit for the model 
the usual chi-square statistic 


es (observed — expected)? 


expected À 


is computed. The degrees of freedom for this 
chi-square equal the usual number for inde- 
pendence (3 X 6= 18) minus one for each 
blank cell (9). For this table the conditional 
probability model fits the data (x? = 16.5, df 
= 9, p> .05). 


CONCLUSION AND DISCUSSION 


A model has been proposed essentially say- 
ing that jurors acting under restricted alterna- 
tives distribute their verdicts in proportion to 
the probabilities of the verdicts in the un- 
restricted case. Thus, it is not surprising that 
a relatively large percentage of not guilty 
verdicts are given when not guilty is com- 
pared with another unlikely event. 

Further studies using other cases may 
show that the conditional probability model 
does not hold in all restricted decision-making 
cases; but it represents a baseline model 
against which any other theory should be 
tested. The essential point of the conditional 
probability model is that the effects seen in 
the data are due necessarily to the constric- 
tion of response available to the decision 
maker and not to any psychological accommo- 
dation to that restriction. 
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On Inferring One’s Attitudes From One’s Behavior: 
Some Delimiting Conditions 


Shelley E. Taylor 
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A study was designed to 
their attitudes directly 


tude, In addition, subjects either expected or did not expect the attitude to 
have important Consequences for their future behavior. 
reassessed, type of feedback was only weakly related to 
basis for one’s attitude. However, it was clear that subjects used 
when they expected no future 
When future consequences were 
subjects engaged in a critical, time consuming reevaluation of their attitudes 
in which feedback played a minimal role, It was concluded that 
outlined by Bem’s self-perception theory is most likely to operate when the 
attitudinal issues involved are relatively unimportant and inconsequential, 


as a basis for their attitude only 
to result from their attitudes, 


Recent research suggests that people often 
form attitudes or reach decisions based on a 
relatively superficial assessment of available 
information. Specifically, Jones and Davis 
(1965) stated that rather than engaging in a 
scientificlike analysis of cause and effect, per- 
ceivers typically look only for sufficient ex- 
planations for behavior. Kanouse (1972) 
added that which sufficient explanation the 
person settles on may be determined largely 
by what information is salient, Bem (1965; 
1972) proposed that if internal cues are weak 
or unclear, a person will infer his attitudes 
partially from salient external cues which 
consist of his own behavior and the context in 
which it occurred. In many situations, then, 
the person or actor is functionally in the same 
position as any other observer of his be- 
havior; both are relying on the behavior and 
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or inconsistent with a prior atti- 


When attitudes were 
use of feedback as a 
the feedback 
consequences 
anticipated, 


the process 


the conditions under which it occurred to infer 
the actor’s attitude. 

Other evidence, however, 
people form attitudes and 
with considerable care. Janis and Mann ! 
(1968) suggested that when people must 
decide between two or more courses of action, 
they engage in an appraisal of alternatives: 
they assess the probable utilitarian gains or 
losses to the self and to significant others and oy 
the probable approval or disapproval from 
self or others associated with each alternative, 
Although Janis and Mann point out that the 
decision-making Process may be marked by s 
distortions and omissions, their account sug- 
gests that people use more sophisticated cog- 
nitive processes than those emphasized by 


Bem (1972), Jones and Davis (1965), or 
Kanouse (1972). 


Suggests that A 
make decisions 


man, 


q e may engage in a more 
thoughtful” information search in 
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decision or attitude has no clear implication 
for one’s future behavior, a person may base 
his attitudes on his observations of his own 
behavior as Bem suggests. But when one must 
act upon one’s attitude as in the case of 
decision making, the person may engage in a 
more time-consuming appraisal of many 
sources of information. 

A second limitation on the conditions when 
attitudes are inferred directly from behavior 
may also be warranted. Although some stud- 
ies support Bem’s theory (Bandler, Madaras, 
& Bem, 1968; Kiesler, Nisbett, & Zanna, 
1969; Valins, 1966), others do not. In the 
studies which support Bem’s theory, the feed- 
back subjects were given about their behavior 
can be viewed as consistent with prior atti- 
tudes the subjects held on the same topic. 
For example, Valins (1966) gave college 
males false heartrate feedback leading them 
to believe they liked particular nude women. 


_ Kiesler, Nisbett, and Zanna (1969) led sub- 


jects who were initially opposed to pollution 
to believe that they were even more opposed 
to pollution. In these studies in which feed- 
back is consistent with initial attitude, sub- 
jects appear to accept their behavior as an 
indication of their attitude. 

The studies which have not supported 
Bem’s theory all provide subjects with feed- 
back which is inconsistent with prior attitudes 
on the same subject. For example, Valins and 
Ray (1967) gave subjects who feared snakes 
feedback intended to convince them they 
were not afraid of snakes. The subjects later 
approached a snake more closely but did not 
rate themselves as less afraid than a group 
which did not receive the false feedback. 
These subjects, then, exhibited behavior 
which implied acceptance of a new attitude, 
that is, approaching the snake, but they 
failed to change their attitude about snakes. 
In several other studies, a similar discrepancy 
between attitudes and behavior was found 
(eg., Davison & Valins, 1969; Nisbett & 
Schachter, 1966; Storms & Nisbett, 1970); 
in all of these studies, feedback provided to 
the subjects was inconsistent with prior atti- 
tude and subjects gave only behavioral evi- 


dence of accepting the attitude. 
The discrepancy between behavioral and 
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attitudinal dependent measures in these self- 
perception studies was noted by Nisbett and 
Valins (1972). They suggested that subjects 
receiving false feedback may engage in hy- 
pothesis testing in which they test out the 
implications of the false feedback to see if a 
change in attitude is warranted. The present 
analysis suggests that hypothesis testing oc- 
curs only when subjects perceive inconsistency 
between initial attitude and feedback and 
not when initial attitude and feedback are 
consistent. 

In the present study, subjects received 
feedback about their behavior which was 
either consistent or inconsistent with initial 
attitude expressed on a topic; a third group 
received no feedback. In addition, half of the 
subjects expected to have to act upon their 
attitude in the future, whereas the other half 
did not. Subjects were then given unlimited 
time to consider and comment upon informa- 
tion relevant to their attitudes, following 
which they expressed their final attitudes. To 
recapitulate, the following predictions were 
made. Inconsistency between feedback and 
initial attitude results in a longer and more 
complete information search, that is, hypothe- 
sis testing, than does consistency between 
feedback and initial attitude. The expecta- 
tion of future consequences of one’s attitude 
results in less reliance on feedback about 
one’s attitudes and prompts a longer and 
more complete information search than does 
no expectation of future consequences. 


METHOD 
Subjects and Materials 


Subjects were 72 undergraduate women from 
Albertus Magnus College. No subjects were dis- 
carded on the basis of suspicion. Materials in the 
study were 20 full-face color portrait photographs of 
clothed male graduate students who were judged as 
covering the range from highly attractive to highly 
unattractive. The 20 pictures were then divided into 
two groups of 10 pictures each, matched for at- 
tractiveness. 


Preexperimental Procedures 


Each subject was approached individually in her 
dormitory room and asked to participate in a study 
on what women find attractive about men. The 
subject rated 10 of the photographs of men on the 
questions “How attractive do you personally fnd 
this man to be?” and “How attractive do you think 
most women would find this man to be?” using a 
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15-point, bipolar continuum with labeled endpoints. 
When the subject had completed the task, she was 
told that her ratings would be averaged with other 
people’s ratings to determine which pictures would 
be used in the main part of the study. She was then 
invited to participate in it, All subjects agreed to 
take part in the later study. Subjects received $1.25. 

From the initial ratings, the experimenter jso- 
lated 1 picture that the subject had rated as high 
in attractiveness, 1 rated as medium in attractive- 
ness, and 1 rated as low in attractiveness. The 
highly preferred picture was always the 1 rated 
highest of the 10 pictures; the average rating across 
subjects was 2.92, SD =1.42. The medium-preferred 
picture was selected on the basis that it fell in the 
upper or “attractive” half of the 15-point scale, that 
it was at least 2 scale points away from the highly 
preferred picture, and that it was ranked no higher 
than third and no lower than fifth in overall at- 
tractiveness; the average rating was 5.60, SD= 
1.48. The low-preferred picture was the 1 ranked 
tenth in attractiveness by the subject, = 13.07, 
SD = 1.34, Subjects were then randomly assigned to 
an experimental condition. 


Procedure 


Each subject participated in the experiment one 
week after completing the initial ratings. The experi- 
ment was conducted in a small room which contained 
comfortable armchairs, a projector, a screen, and a 
table covered with scientific-looking equipment. A 
tape recorder was concealed beneath the table. Prior 
to receiving full instructions, the subject was asked 
if she would mind if some physiological data were 
collected during the experiment. The experimenter 
explained that a colleague of hers was using galvanic 
skin response (GSR) in some of his studies and 
that he had asked if he could collect some data in 
this study to see how GSR varies as a fuaction of 
the task a subject is performing. The e Menter 
made it clear that she would not have access to this 
data, The experimenter then explained that GSR is 
particularly useful in studies of Person perception 
because it is the only physiological measure that 
registers big responses to positive reactions and small 
responses to negative reactions. To make this point 
clear, the subject was shown a sample protocol of a 
girl having reacted strongly to a slide of her boy- 
friend and not at all to several neutral stimuli. An 
electrode was then attached to the subject’s non- 
preferred hand. 

The subject was told that the colleague was col- 
lecting his data in an adjoining room. She was in- 
structed to put on head phones, so that he could 
communicate with her. In fact, the confederate’s 
voice was tape-recorded. The tape-recorded confed- 
erate then ran the subject through some tests, os- 
tensibly to establish GSR baselines. 

When the tests were completed, the subject was 
given instructions for the first task. She was told 
that she would see three slides for 20 seconds each 
and that after seeing each slide she should write a 
brief nonevaluative description of the man pictured 
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in the slide. Each slide was paired with a man’s 
name, “Michael,” “Peter,” or “David.” The subject 
was asked to remember the names of the men in the 
slides for purposes of identification later. The sub- 
ject then saw slide versions of her highly preferred, 
medium-preferred, and least-preferred pictures, Order 
of the three slides was counterbalanced within each 
experimental treatment. The experimenter did not 
know the subject’s picture preferences at this point 
or which name had been paired with which slide. 


Feedback Manipulation 


When the subject had completed describing the 
slides, the experimenter asked her to keep the head- 
phones on until the confederate said. that he had 
finished taking his readings. The experim>nter busied 
herself with Paperwork in the other side of the room. 
The second tape recording then began. The subject 
was told to relax her hand so that her GSR would 
return to baseline. At this point, the subject “over- 
heard” a conversation between the colleague and a 
friend of his. In all conditions, the subject heard 
the confederate describe her GSR data. One-third 
of the subjects heard only that their data had 
Produced clear results (no feedback). Two-thirds of 
the subjects heard the confederate say that they had 
reacted noticeably 
of David than to any other slide; this feedback was 


Shortly thereafter, the confederate told the subject 
that her GSR had returned to baseline. He thanked 
her for helping him with his study, and asked her to 
tell the experimenter that he was leaving. 


Future Consequences Manipulation 


After the headphones and 
removed, the experimenter 
consequences manipulation. § 
sheet of Paper, 
the second task, she wanted 
the details of the study. Hai 
that when part two Was over, 
questionnaire assessing her 
chology experiments (no futu: 


d electrode had been 
Introduced the 


following the second Part, 
Jects in the study would be 
Some of the men i 


blind to this manipulation, 
The subject was then given instr 


second task. She was told. that Uctions for the 


she Would see the 


~ 


r 


same three slides in the same order and that this 
time she would see each slide for as long as she 
liked. This time, she was to evaluate the men in 
the slides, listing wlat she liked or did not like 
about them. She was told that she could attend to 
anything she noticed; she could put down all posi- 
tive remarks, all negative remarks, or a mixture of 
both. Again, the name of each man was indicated 
on the evaluation form. The experimenter then 
showed the subject how to run the projector so 
that she could work at her own pace. During the 
second task, the amount of time the subject spent 
evaluating each slide was unobtrusively measured. 
‘When the evaluations were completed, the subject 
filled out a series of dependent measures and ma- 
nipulation checks. 


Dependent Measures 
_ Subjects answered the questions “How attractive 
do you personally find this man to be?” and “How 
attractive do you think most women would find 
this man to be?” on 15-point bipolar scales with 
labeled endpoints; these were the same two ques- 
tions that had been answered about all 10 men one 
„ week earlier. A second dependent measure was the 
time spent by subjects evaluating each slide. A 
third set of dependent measures was derived from 
Ž the subject’s list of comments on the evaluation 
sheet; each list was scored for total number of 
© comments, number of positive comments, number of 
negative comments, and number of references to 
personality attributes. Subjects were also asked if 
= they felt they had reacted to any particular man 
when GSR was being measured and if so, to which 
man, Finally, they were asked how much they 
thought about meeting the men in the slides. 


RESULTS AND DISCUSSION 


Analyses of the dependent measures in- 
„cluded only subjects’ responses to the highly 
preferred and medium-preferred pictures; 
ratings of the least-preferred picture were 
not included, as these ratings should not 
have been differentially affected by the ex- 
perimental manipulations. In addition, to 
highlight the major contrasts, only the con- 
sistent and inconsistent feedback conditions 
were included in the analysis. Data for all 
experimental conditions are, however, in- 
cluded in the mean tables. All analyses of 
variance used two between-subjects factors 
of feedback and future consequences and one 
within-subjects factor, picture preference. 


Inconsistency between Feedback and Initial 
Attitude and Extent of Information Search 
On the basis of studies which failed to 


support Bem’s theory, it was predicted that 
| inconsistency between feedback and initial 
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TABLE 1 
Mean AMOUNT or TIME SPENT LOOKING 
AT PICTURES 
Consequences 
Feedback 
No future Future 
Consistent 
High 2.14 2.26 
Medium 2.04 2.20 
Low 2.06 2.16 
Inconsistent 
High 2.03 2.23 
Medium 2.03 2.27 
Low 2.03 2.24 
No 
High 2.11 2.24 
Medium 2.07 2.24 
Low 2.05 2.19 
Note. Time data transformed to base logarithm scale. 
Between-subjects error term: df = 44, MS = 042. Within- 
subjects error term: df = 44, MS = .013. 


attitude leads to more hypothesis testing than 
does consistency between feedback and initial 
attitude. The feedback manipulation was 
clearly successful. All the subjects correctly 
reported which picture they had reacted to 
and all subjects reported believing they had 
received accurate feedback. 

The prediction that subjects receiving in- 
consistent feedback would spend more time 
evaluating the pictures was not supported, 
F < 1. Instead, there was a significant inter- 
action between feedback and pictures, F 
(1,44) = 4.60, p< .05; as the means in 
Table 1 indicate, subjects spent more time 
vjewing the picture to which they believed 
they had reacted. Subjects who received in- 
consistent feedback also did not write longer 
evaluations than subjects receiving consistent 
feedback or no feedback (F < 1). The evalu- 
ations did not differ among the groups on one 
dimension: Subjects receiving inconsistent 
feedback were more likely to list personality 
attributes in their evaluations of the men 
than were subjects in the other feedback con- 
ditions, F(1,44) = 4.20, p < .05. 

Given these findings, is there any þasis for 
concluding that inconsistency between initial 
attitude and feedback leads to an information 
search? Certainly the data on personality 
attributes is suggestive. The absence of more 
conclusive evidence may be due to the nature 
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TABLE 2 
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MEAN FINAL RATINGS OF ATTRACTIVENESS 


Feedback 


Consequences 


No future Future 
Consistent 
High 13.42% 12.08" 
Medium 8.2 10.33%" 
Low Sis 4.25 
| 
Inconsistent | 
High 11,429b° 11.508¢ 
Medium | diotena 9. 
Low 4.83 6. 
No | 
High | 12.33 | 
Medium 8.42 
Low | 3.50 


Note. Means not sharing a common superscri 
icantly from each other at the .05 level or e 
Keuls procedures (Winer, 1962). Means without superscripts 
are presented for comparison but were not included in the 
analysis of variance reported in the text. Between-subjects 
error term: df = 44, MS = 5.36. Within-subjects error term: 
df = 44, MS = 8.02. 


of the feedback manipulation. Past studies 
which provide evidence of an information 
search are those in which the degree of in- 
consistency between initial attitude and feed- 
back is great. In the present study, the 
inconsistency was relatively weak; finding 
out that one has reacted to a moderately 
attractive picture instead of a highly attrac- 
tive picture is not as inconsistent as, for ex- 
ample, finding out that one has not reacted 
to snakes when one is terrified of them. To 
make the point that feedback is accepted as 
a basis for inferring one’s attitudes only when 
initial attitude and feedback are consistent, 
feedback which is truly counterattitudinal 
must be contrasted with feedback which is 
consistent with initial attitude. In conclusion, 
then, the hypothesis that feedback inconsist- 
ent with initial attitude leads to hypothesis 
testing received only minimal support. 


Expectation of Future Consequences and 
Determinants of Final Attitude 


When one reasons from Bem’s theory of 
self-perception and Janis and Mann’s model 
of decision making, it is predicted that sub- 
jects use the observation of their own behav- 
ior as a basis for their attitudes only if they 
expect no future consequences to result from 


their attitude. The future consequences ma- 
nipulation was clearly successful; the ques- 
tion, “When you were doing your evaluations, 
did you think about possibly meeting any Of 
the men in the pictures,” discriminated sig- 
nificantly between subjects in the no future 
consequences condition and subjects in the 
future consequences condition, F(1,66) = 
28.59, p < .001. 

Bem’s theory predicts that consistent feed- 
back should maximize the difference in ratings 
between the highly preferred and medium- 
preferred pictures. Inconsistent feedback 
should reduce preference for the highly pre- 
ferred picture and increase preference for the 
medium-preferred picture; thus the ratings € 
the two pictures should become more similar 
if not reverse. As predicted, the interactio 
between pictures and feedback is significant, 
F (1,44) = 4.37, p < 05. 

However, an examination of the means in} 


Table 2 reveals that the predicted pattern of% 
scores occurs only when the subjects expect} 9 
no future consequences to result from their " 


attitudes. In the no future consequences con- 
dition, the difference between the highly pre- 
ferred and medium-preferred pictures is 5.175 
following consistent feedback, and only .2 
following inconsistent feedback. In the future 
consequences condition, however, the differ- 
ences between the two pictures were approxi- 
mately the same (1.76 versus 1.83), regard- 
less of feedback. The three-way interaction 
among feedback, pictures, and future conse- 
quences, F(1,44) = 4.68, p < .05, supports 
this observation. In sum, there is clear evi- 
dence that feedback markedly affected ratings 
of the pictures only if no future interaction 
was anticipated. When subjects expected to 
have to act upon their attitudes, they appar- 
ently used information other than feedback to 
assess their attitudes. 

What was this additional information? It 
was predicted that subjects who expected 
future consequences to result from their atti- 
tudes would engage in a time-consuming and 
more complete information search. As pre- 


resulted in increased time spent looking at 
the pictures, F(1,44) = 18.64, p < .001. How- ~ 
ever, the written evaluations made by sub- 
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jects in the future consequences condition 
were no longer than those made by the sub- 
jects who anticipated no future consequences. 
s Analyses of the content of the evaluations 
revealed two trends. First, subjects who re- 
ceived inconsistent feedback and expected 
future consequences tended to infer more per- 
sonality qualities about the men than did 
subjects in the other experimental groups. 
These qualities included such things as 
“probably is musical,” “looks like a good 
lover,” and “probably has a good sense of 
humor,” suggesting that the subjects were 
beginning to anticipate qualities that would 
bécome important when they interacted with 
he men. Second, the numbers of negative 
comments made about the pictures differed 
ignificantly, depending upon whether or not 
future consequences were expected. When no 
future consequences were expected, subjects 
made few critical comments, F(1,44) = 5.82, 
p< .05. However, the three-way interaction 
among pictures, future consequences, and 
$ feedback, F(1,44) = 5.92, p < .05, revealed 
that when both expectations of future conse- 
quences and inconsistent feedback were pres- 
ent, subjects became highly critical of the 
@medium-preferred picture; these results sug- 
gest that subjects were actively refuting the 
mplications of inconsistent feedback. Taken 
together, these data provide the clearest in- 
formation as to what differentiated future 
' consequences subjects from no future conse- 
quences subjects. Apparently, “upping the 
stakes” by having subjects act upon their 
attitudes resulted in a more critical and 
„broader information search. 


CONCLUSIONS 


To summarize, the results suggest quite 
clearly a major qualification to Bem’s hypoth- 
esis that people infer their attitudes from the 
observation of their behavior. Under condi- 
tions where one is asked one’s attitude about 
an inconsequential issue, one’s behavior, or 
more properly feedback about one’s behavior, 
may quite markedly affect the attitude one 
_ expresses. However, when one’s future behav- 
iot is affected by the expression of an atti- 
tude, that attitude seems to be weighted more 
carefully, and more available information is 
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brought to bear on deciding what one be- 
lieves. These results suggest that a complete 
picture of self-perception must take into 
account the conditions under which a person 
infers and subsequently acts on his attitude. 
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Influence of Locus of Control on Interpersonal Attraction 


and Affective Reactions 


in Situations Involving Reward and Punishment 


David S. Holmes and Thomas H. Jackson 
University of Kansas 


Ninety-six subjects with either high or low I-E Scale scores participated in 
individual discussions with an experimenter who either (a) rewarded and then 
punished, (b) punished and then rewarded, or (c) gave no evaluative feed- 
back after the subject’s responses. Because previous research has suggested 
that internally controlled subjects attend primarily to rewards while externally 
controlled subjects attend primarily to punishments, it was hypothesized (a) 
that the internally controlled subjects, as compared to externally controlled 
subjects, would be more attracted to the experimenter in the conditions in- 
volving rewards and punishments regardless of the sequencing and (b) that 
the reverse would be true in the nonevaluative control condition. The predic- 
tion was consistently supported and subjects’ feelings of anger and anxiety 
during the discussions generally complemented the attention and attraction 
hypotheses. The results indicate that individual differences influence attraction 
in situations involving rewards and punishments and they offer a prescription 
for developing interpersonal attraction and minimizing anger: Offer rewards 
to internally controlled persons and remain neutral with externally controlled 


persons, 


A large amount of research has been con- 
ducted to determine whether the interpersonal 
attraction of Person A to Person B can be 
increased more by having Person B increase 
the amount of reward he gives to Person A 
(i.e., the reinforcement position) or by having 
Person B first punish and then reward 
Person A (i.e., the gain-loss position). With 
regard to the gain-loss position, it has been 
suggested that reward which follows punish- 
ment may be more powerful than consistent 
reward because (a) of the contrast effect, 
(b) the reward serves to reduce the negative 
drive state induced by the preceding punish- 
ment, and (c) the change in the evaluator’s 
responses suggests that his rewarding is not 
just a meaningless stylistic response and 
therefore the evaluator and his evaluations 
increase in credibility. Similar arguments can 
be made suggesting that punishment which 
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follows reward reduces attraction more than 

consistent punishment. Unfortunately, at the 

present time no clear answer is available con-s 
cerning the differential effectiveness of amount » 
of reward or punishment versus the se- 

quencing of reward and punishment on inter- 

personal attraction (see reviews by Aronson, 

1969; Byrne, 1969; Byrne & Griffitt, 1973; 

and recent research by Hewitt, 1972; Mettee, 

1971; Tognoli & Keisner, 1972). Overall, it 

appears that the problem is more complex 

than originally thought. In view of this, it 

seemed necessary to consider other variables 

which may interact with reward and punish- 

ment in determining attraction. Therefore, the 

goal of this investigation was to determine 

whether a personality factor might influence 

the patterning of attraction in the gain-loss 

paradigm. 

With regard to the influence of individual 
differences on attraction in the gain-—loss 
paradigm, it could be suggested that if a per- 
son focused his attention more on rewards 
than on punishments, he would be more at- 
tracted to a person who gave rewards and 
punishments regardless of their sequencing 
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than would a person who received the same 
_ feedback but who focused his attention more 
on punishments than on rewards. Further- 
more, it might be expected that a person who 
focused more attention on rewards than on 
punishments would be more attracted to a 
person who gave rewards and punishments 
regardless of their sequencing than he would 
be attracted to a person who gave no rewards 
or punishments. The reverse would probably 
be true for persons who attended to punish- 
ments more than rewards. In other words, in- 
» dividual differences with regard to persons’ 
” attention to rewards and punishments could 
play a very important role in determining the 
pattern of attraction found in the gain-loss 
paradégm. 

The personality measure employed in this 
investigation was Rotter’s (1966) Internal— 
External Scale (I-E) which measures 
differences in subjects’ perceived locus of con- 
trol (i.e., internal versus external). This mea- 
| sure was used because previous research indi- 
/ cated that internally controlled subjects were 
more likely to attend to successes while exter- 
nally controlled subjects were more likely to 
attend to failures and view them as threats 
(Efran, 1963; Lefcourt, 1967; Phares, 
WRitchie, & Davis, 1968). In reviewing this 
research and elaborating on the theory associ- 
| ated with locus of control, Lefcourt (1972) 
pointed out: 


The internal may dwell less upon his deficits, for- 
getting and perhaps become inattentive to cues 
related to them as he assumes an active stance 
toward his problems. The external, on the other 
hand, who has repeatedly described himself as being 
more anxious, lower in self-esteem, and as being 
higher in neuroticism and maladjustment than inter- 
nals would be more likely to be ruminative about 
his failures which, in turn, helps him to maintain 
his own perception as an inactive pawn of fate. 
(p. 22) 


In view of the different ways in which 
internally and externally controlled subjects 
view and deal with successes (rewards) and 
failures (punishments), it was predicted that 
in conditions involving rewards and punish- 
ments, internally controlled subjects would be 
more attracted to another person who served 
as a rewarding and punishing evaluator than 
would externally oriented subjects and that 
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this effect would be independent of the se- 
quencing of the rewards and punishments. 
This prediction was based on the assumption 
that internally controlled subjects would gen- 
erally respond positively to the rewards and 
ignore the punishments while the externally 
controlled subjects would generally respond 
negatively to the punishments and ignore the 
rewards. It was also predicted that in a 
condition in which no rewards or punishments 
were given, externally controlled subjects 
would be more attracted to the nonevaluating 
individual than would internally controlled 
subjects. This latter pattern of attraction 
was predicted because it was expected that 
the externally controlled subjects would feel 
safe with the nonevaluating individual while 
the internally controlled subjects would be 
disappointed in not receiving any rewards. 

In this investigation, subjects participated 
in individual discussions with an experi- 
menter. During the discussion the experi- 
menter followed one of three feedback sched- 
ules in responding to the comments of a 
subject: (a) rewarded and then punished the 
subject, (b) punished and then rewarded the 
subject, or (c) gave no rewards or punish- 
ments. After the discussion, the subject 
worked for approximately five minutes on a 
neutral filler task and then was asked to 
complete a questionnaire supposedly provided 
by the “Research Committee” to determine 
subjects’ responses to and evaluations of ex- 
periments. In fact, responses on this question- 
naire were used to measure subjects’ inter- 
personal attraction to the experimenter and 
the subjects’ feelings of anger and anxiety 
during the experiment. As indicated above, 
it was predicted that subjects’ orientations 
with regard to locus of control would interact 
with the nature of the feedback they received 
to determine the attraction they felt for the 
experimenter. It was also expected that locus 
of control would interact with nature of feed- 
back to determine subjects’ feelings of anger 
and anxiety during the experimental period, 
(Although sequencing of reward and punish- 
ment was not of direct interest in this investi- 
gation, it was manipulated because in the past 
there was a controversy over whether one 
sequence was more effective or had different 


134 


effects than the other; see the first para- 
graph of this article and the previously cited 
reviews. ) 


METHOD 
Subjects 


Forty-eight male and 48 female introductory 
psychology students served as subjects in this experi- 
ment. Since I-E Scale scorers were available for all 
subjects, the subjects were initially divided into 
groups with high and low scores and then subjects 
from each of these groups were randomly assigned 
to one of three treatment control conditions 
(reward-punishment, punishment-reward, control), 


Experimental Manipulation 


Upon arriving for his or her appointment, the 
subject was seated at a desk across from a male 
experimenter who explained that the experiment was 
being conducted to determine why people hold cer- 
tain opinions regarding important social issues and 


topics. In the punishment-reward and reward-pun- 
ishment conditions the experimenter went on to 
point out: 


poor answer and plus five represents an excellent 
answer, The department requires me to tell you 
how I have rated each of your answers. | Me 
Each of my ratings will be based on my own 
subjective opinion of your answers. . . . Each of 
my ratings will be final and will not be subject 
to argument or change, 


The experimenter then asked the subject to indi- 
cate his opinion and reason for holding that opinion 
on each of the following five topics: (a) what 
should the U.S. do about the Vietnam situation; 
(b) should everything be racially integrated; (c) 
should the college student be eligible for the draft; 
(d) is peaceful Coexistence between East and West 


Stating the topic, the experimenter merely listened 
to the subject's response. 


Punishment—reward condition. In this condition the 
feedback was initially negative but became proges- 
sively positive. The Sequence of feedback scores and 
accompanying comments which subjects received 
after their five responses were: (a) —5, Not very 
well thought out; (b) —3, Not too clear; (c) 1, 
Adequate. Better than the first two; (d) 4, Very 
good. Clear; and (e) 5, Much better. Well 
thought out, 
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Reward—punishment condition. The procedure 
this condition was identical to that of the puni 
ment-reward condition except for the fact that 
feedback was initially positive but became progr 
sively negative. The sequence of feedback scores 
accompanying comments to subjects in this condit 
after their five Tesponses were: (a) 5, Very ck 
Well thought out; (b) 3, Good. Clear and to t 
point; (c) —1, Adequate, but much worse than the 
first two; (d) —4, Poor. Missed the point; and 
—S, Very poor answer, Illogical reasoning. 

Control condition. The procedure in this conditi 
differed from those of the other conditions only 
that in the initial instructions the subjects were nc 
told that their responses would be evaluated and 
evaluation scores or comments were given after t! 
subjects’ responses. 

After a brief filler task was completed, all subje 
were asked to fill out a questionnaire supposed] 
provided by the Research Committee to determi: 
subjects’ responses to and evaluations of experiment 
This questionnaire contained 10-point rating scale 
on which the subjects rated the degree to which they 
liked the experimenter “as a person,” the friendline 
of the experimenter, and the degree to which th 
subjects felt angry and anxious during the experi 
ment. (In addition to these experiment-relevan 
items, a number of other filler items were include 
to give the questionnaire face validity for the sub 
ject; specifically, the subject was asked to indicat: 
whether (a) the experimenter was on time, (b) thi 
experimenter explained the purpose of the experi 
ment, (c) the experiment increased interest in psy- 
chology, (d) ‘the subject had heard anything about 
the experiment before, and (e) what the subject 
thought the purpose of the experiment was.) To en 
courage honest replies on this questionnaire, the sub 
ject was instructed to place the anonymous question- 
naire in a box ostensibly provided by the Research 
Committee. When this task was completed, the sub- 
ject was completely debriefed concerning the experi- 
mental manipulation and the purpose of the experi- 
ment, 


RESULTS AND Discussion 


Subjects’ scores on each of the dependent 
variables were analyzed through the use 
of separate 3 (Conditions) x 2 (Personality 
Types) x 2 (Sex) analyses of variance, j 


Attraction to and Perceived Friendliness 
of the Experimenter 


cance, F(2,84) = 2.74, p< 10. 
of the means indicated that, as Predicted, in 
the Punishment—reward and rewa 
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ment conditions the internal subjects felt 
more attracted to the experimenter than did 
the external subjects, while in the control 
condition the external subjects felt more at- 
tracted to the experimenter than did the inter- 
nal subjects. These findings offer support for 
the predicted influence of locus of control in 
determining the effect of reward and punish- 
‘ment on interpersonal attraction. Although it 
is of no particular theoretical interest in this 
investigation, it should be mentioned that the 
analysis of variance indicated that, in general, 
. female subjects indicated more attraction to 
the male experimenter than did male subjects, 
F (1,84) = 4.94, p < 05. 

Parallel to the finding with the attraction 
data, the analysis of the friendliness data re- 
vealed a significant Conditions x Personality 
Types interaction, F(1,84) = 5.43, $ < 01. 
As predicted, the interaction was due to the 
fact that in the punishment—reward and re- 
ward—punishment conditions the internal sub- 
jects saw the experimenter as more friendly 
than did external subjects while in the con- 
trol condition the external subjects saw the 
experimenter as more friendly than did the in- 
ternal subjects. As was the case with the 
attraction measure, female subjects perceived 
the experimenter as significantly more friendly 
than did male subjects F(1,84) = 5.42, 
p < .025. 


Feelings of Anger and Anxiety During the 
Experiment 


The analysis of variance carried out on the 
anger scores revealed a Personality Types ef- 
fect, F(1,84) = 4.10, $ < .05, reflecting the 
fact that, in general, external subjects were 
more angry than internal subjects. This find- 
ing is consistent with the conclusion drawn 
from other research that external subjects are 
more sensitive to punishment than internal 
subjects. This is important because it is on 
this conclusion that the predictions of the 
present experiment were based. The analysis 
of the anger score also revealed a significant 
conditions effect, F(2,84) = 3.80, p < 05. 
When the Tukey B test was applied to the 
condition means it was found that subjects 
in the reward-punishment condition were sig- 


nificantly, p < -05, more angry than subjects 
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in either the punishment-reward or control 
conditions and that subjects in these latter 
conditions did not differ in their reported 
anger. After initially rewarding the subjects 
and then punishing them, the experimenter 
apparently elicited a considerable amount of 
anger. The greater anger in the reward—pun- 
ishment than the punishment-reward condi- 
tion is consistent with what would be expected 
on the basis of the gain—loss position. More 
importantly, however, the analysis of the 
anger scores also revealed a Conditions X Per- 
sonality Types interaction, F (2,84) = 2.25, 
p< .10, which approached significance. As 
would be expected, this interaction was due 
to the fact that in the punishment—reward 
and reward-punishment conditions the exter- 
nal subjects were more angry than the in- 
ternal subjects while this patterning was re- 
versed in the nonevaluative control condition. 
This finding complements the findings con- 
cerning attraction and perceived friendliness; 
that is, in those cases in which subjects re- 
sponded with most anger they were least 
attracted to the experimenter and perceived 
him as least friendly, and vice versa. 

The analysis of the subjects’ ratings of the 
degree to which they felt anxious during the 
experiment revealed a significant conditions 
effect, F(2,84) =7.17, P< 01. Subsequent 
analyses using the Tukey B test indicated 
that control subjects reported significantly, 
p < .01, less anxiety than subjects in either 
the punishment—reward or reward—punishment 
conditions while subjects in the latter two 
conditions did not differ in reported anxiety. 
This patterning of differences in reported anx- 
iety undoubtedly reflects the fact that sub- 
jects in the punishment-reward and reward- 
punishment conditions were in evaluative situ- 
ations while control subjects were not. The 
analyses did not reveal a significant per- 
sonality types effect, F(1,84) = 47, oF 
a significant Conditions X Personality Type 
interaction, F (2,84) = -02. 


CoNCLUSIONS 


Summarizing the differences between the 
responses of internally and externally con- 
trolled subjects it can be said that when inter- 
acting with a person who gave both rewarding 
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and punishing feedback, internally controlled 
subjects felt less angry, saw the other person 
as more friendly, and were more attracted to 
the other person than were externally con- 
trolled subjects. On the other hand, when 
interacting with an individual who gave no 
evaluative feedback, internally controlled sub- 
jects felt more angry, saw the other person 
as less friendly, and were less attracted to the 
other person than were externally controlled 
persons. This patterning of feelings, percep- 
tions, and attraction is consistent with the 
assumptions concerning the different ways 
internally and externally controlled subjects 
respond to evaluations. 
The findings generated by this investigation 
are of interest and possible importance in a 
number of ways: First and most generally, 
they provide more data concerning the influ- 
ence of persons’ perceived locus of control. 
Second, from these findings it appears that 
contrary to what was concluded earlier 
(Byrne & Griffitt, 1973) differences in per- 
sonality do play a role in the attraction para- 
digm and it may be that attention to the 
variable of locus of control accounts for 
inconsistencies in previous research and 
strengthens the results of future research, 
Last, these results offer the following pre- 
scription for developing relationships in which 
there will be high interpersonal attraction, 
high perceived friendliness, and low anger on 
the part of a target person: Offer rewards to 
internally controlled individuals and remain 
neutral with externally controlled individuals. 


Davip S. Hotmes AnD THomas H. Jackson 
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Conditioned Autonomic Responses as Bidirectional 
y Indicators of Racial Attitude 


Louis N. Tognacci and Stuart W. Cook 
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Present physiological techniques of attitude measurement tap attitude intensity 
but not attitude direction. An alternative physiological technique, based upon 
the phenomenon of semantic generalization of conditioned autonomic responses, 
was tested to determine whether it would provide a bidirectional indicator of 
attitude. Subjects with favorable or unfavorable self-reported attitudes toward 
blacks were conditioned to statements having no racial content which they had 
previously evaluated as “bad.” Subsequently, they were presented with state- 
ments with racial content; the latter were sentences which would be differen- 
tially evaluated as “good” or “bad” by persons with dissimilar racial attitudes. 
It was hypothesized that racial attitude would predict conditioned autonomic 
responses elicited by the generalization statements. It was found that groups 
differing in racial attitude could be distinguished on the basis of their condi- 
tioned responses to the race-related generalization statements. Unresolved 
problems of interpretation are discussed. 


While vigorously disputing the nature of 
attitude, theorists universally accord evalua- 
tion or affect a central role in their definitions 
of the concept. Viewing affect as a primary 
or, as some do, the only dimension of atti- 


“tude naturally suggests the possibility of 


assessing attitude by measuring changes in 
autonomic functioning which accompany af- 
fective arousal. With few exceptions investi- 


gators adopting this approach have employed 


measures of response magnitude which are 
assumed to be indicators of the intensity, but 
not the direction (favorable versus unfavor- 
able), of the attitude. The difficulty with 
such nondirectional attitude measures is evi- 
dent in the findings of several investigators 
(Cooper, 1969; Cooper & Singer, 1956; Katz, 
Cadoret, Hughes, & Abbey, 1965; Vidulich & 
Krevanick, 1966) which reveal that greater 
autonomic responses accompany both positive 
and negative affective states. Obviously, a 
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physiological indicator of attitudinal direc- 
tion as well as intensity would constitute a 
methodological advance. 

Pupillary dilation and constriction showed 
some promise of providing such a directional 
index of attitude (Hess, 1965; Hess & Polt, 
1960). However, subsequent research (e.g. 
Paivio & Simpson, 1966; Peavler & McLaugh- 
lin, 1967; Woodmansee, 1965, 1970) has 
shown this tack to be unfruitful. 

Cook and Selltiz (1964) have suggested 
another approach based on the considerable 
body of work with semantic conditioning and 
generalization of autonomic responses (see, 
e.g., Creelman, 1966; Feather, 1965; Razran, 
1961). For example, a Russian investigator, 
Volkova (1953) reported conditioning the 
salivary responses of an 11-year-old boy to 
the spoken word good. Subsequently, the 
conditioned response generalized to sentences 
whose meaning to the subject was good. In a 
successful replication and extension of Vol- 
kova’s study, Acker and Edwards (1964) 
conditioned a vasomotor response to the word 
good or bad and then obtained appropriate 
generalization to words previously rated on the 
good-bad scale of the semantic differential. 

In both the Volkova and the Acker and 
Edwards studies, it is possible to interpret the 
autonomic responses during generalization as 
reflecting attitude direction as well as inten- 
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sity. Thus, these investigations support the 
Cook and Selltiz suggestion that the phe- 
nomena of semantic generalization of condi- 
tioned autonomic responses may yield a bi- 
directional indicator of attitude. However, 
considerable work along the lines initiated by 
these experimenters is required for an ade- 
quate assessment of the Cook and Selltiz sug- 
gestion. The Volkova report is entirely anec- 
dotal, lacking in relevant information, and 
replete with methodological flaws. The Acker 
and Edwards study, while methodologically 
sound, is limited by the fact that it employed 
single words as stimuli; in contrast, in nearly 
all instances stimuli more complex than single 
words are necessary for an adequate assess- 
ment of attitude strength. 

Unlike either of the preceding studies, the 
present investigation was expressly designed 
to explore the possibility that the semantic 
generalization of conditioned autonomic re- 
sponses might provide a bidirectional indi- 
cator of attitude. Such an indicator should 
discriminate between groups of subjects whose 
attitudes differ in polarity. 


METHOD 


Design and Hypotheses 


A known-groups design was employed, with white 
research participants equally divided between those 
with favorable and those with unfavorable attitudes 
toward blacks as evidenced by scores on a self-report 
measure of racial attitudes. Subjects within each at- 
titudinal group were assigned to experimental and 
control conditions, thus establishing four treatment 
groups: (a) prejudiced-experimental; (b) prejudiced- 
control; (c) equalitarian-experimental ; and (d) 
equalitarian-control. 

Stimuli consisted of visually presented declarative 
sentences, either with or without racial content. The 
sentences with racial content were of two types. One 
type consisted of prointegration statements which 
would be positively evaluated by persons with 
equalitarian attitudes, but negatively evaluated by 
prejudiced individuals. The other type, anti-inte- 
gration statements, was the converse of the first. 
Three sets of trials (pretest, training, and post- 
test) were presented. Differential treatment of the 
experimental and control groups was initiated with 
the training trials. A differential classical conditioning 
paradigm using electric shock as the unconditioned 
stimulus (UCS) was employed. The nonrace-related 
statements previously evaluated by the subject as 
bad served as the conditioned stimulus (CS+) while 
those evaluated as good served as the unreinforced 
stimulus (CS—). Presentations of the pro- and 


anti-integration statements during the posttest con- 
stituted generalization test trials. 

The dependent variable was the galvanic skin 
response (GSR). In addition, respiration, digital 
vasoconstriction, and heart rate were recorded,! 

It was anticipated that the posttest race-related 
stimuli would elicit dissimilar autonomic responses 
from the prejudiced-experimental and equalitarian- 
experimental treatment groups (with no such dif- 
ference being anticipated between the prejudiced and 
equalitarian control groups). Let us consider, for 
example, the case of the individual who holds ex- 
tremely favorable attitudes toward blacks. Such an 
individual would uniformly classify all anti-integra- 
tion sentences as bad and all prointegration sen- 
tences as good. Therefore, assuming generalization to 
be mediated by the evaluative (good-bad) meaning 
of the race-related stimuli, this person would be 
expected to exhibit maximal generalization to anti- 
integration statements and minimal generalization 
to prointegration statements. By the same token, 
the prejudiced individual who uniformly classifies 
all prointegration stimuli as bad and all anti-inte- 
gration stimuli as good would be expected to ex- 
hibit maximal generalization to prointegration state- 
ments and minimal generalization to anti-integration 
statements. 

It is, of course, uncertain how nearly the groups 
of subjects in the present investigation approximate 
these ideal cases in terms of attitude strength, To 
the extent they fall short, the posttest behavior of 
the prejudiced-experimental and the equalitarian- 
experimental groups should be less different than 
might be expected from strongly prejudiced and 
strongly equalitarian groups. Nevertheless, it was 
anticipated that it should be possible to distinguish 
between the two groups even if this were the case. 


Subjects 


Selected as research participants were 24 white 
University of Colorado undergraduates evenly di- 
vided between those who had scored more than one 
standard deviation above the mean (in the equali- 
tarian direction) and one standard deviation below 
the mean (in the prejudiced direction) on the 
Multifactor Racial Attitude Inventory (Woodmansee 
& Cook, 1967). This scale was one component of a 
battery of tests administered in university dormi- 
tories 1 to 6 months prior to the present experiment. 
Random assignment of equalitarian and prejudiced 
subjects to the experimental and control groups 
yielded four treatment groups, each composed of 3 
males and 3 females. 


Apparatus 


Located in a monitoring room adjoining the 
subject’s room were the polygraph, the stimulator 


1 We are indebted to Dr. W. W. Hahn and Dr, 
W. F. Hodges for consultation and assistance with 
apparatus, scoring, and procedure and to Dr. Mary 
Schlesinger who conducted a pilot study. 
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project the visual stimuli through a small sound- 
deadéned window onto a screen in the subject’s 
»toom. The window enabled the experimenter to keep 
the subject under constant surveillance, as did an 
intercom system set for continual one-way trans- 
mission from the subject’s room to the monitoring 
room. ‘The hardware and techniques employed to 
record the GSR, digital vasoconstriction, respiration, 
and heart rate were those in common use. A de- 
tailed description is available elsewhere.? 


Visual Stimuli 


Serving as visual stimuli were a pool of race- 
related and nonrace-related declarative sentences 
which had been pretested for lack of ambiguity. 
The content of the race-related sentences included 
such topics as interracial marriage and dating, busing 
of school children to achieve desegregation, and 
valuing private rights over civil rights. These sen- 
tences were equally divided between those which 
were prointegration and those which were anti-inte- 
gration. An example of a prointegration sentence 
was, “The Negro family moved into a house in a 
fine all-white neighborhood.” Evidence is available 
from other studies (e.g., Woodmansee & Cook, 
1967) that the content of such a sentence is viewed 
favorably by equalitarian individuals and unfavor- 
ably by prejudiced individuals. Thus the former 
would categorize the statement as good and the 
latter as bad. An example of an anti-integration 
sentence was, “The real estate agent showed the 
Negro buyer houses only in Negro neighborhoods.” 
Such statements would be evaluated as good by 
‘prejudiced individuals and bad by equalitarians. 

The sentences lacking racial content consisted of 
two subgroups: those readily classifiable as good or 
bad and those which were controver: One of the 
bad sentences was, “The old Irishman poisoned the 
little boy’s dog.” An example of a good sentence 
was, “The young Japanese boy helped the old woman 
up when she fell on the ice.” An example of a 
controversial sentence was, “The political candidate 
urged the use of small nuclear weapons to end the 
war.” 


Procedure 


After the subject was informed that the purpose 
of the investigation was to study physiological re- 
sponses to a variety of “neutral and emotion- 
arousing stimuli,” the physiological monitoring de- 
vices were attached. The subject was handed a 
listing of the complete pool of sentences employed in 
the study and instructed to indicate next to each 
sentence whether it described, in his opinion, some- 
thing good or something bad. If he was uncertain 


2 Space limitations require an abbreviated presenta- 
tion of many technical details and additional sys- 
tems. A copy of a more detailed report can be 
obtained by writing the second author. 
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he could so indicate. There were a number of rea- 
sons for making this request. First, by becoming 
familiar with the content of each sentence, the 
probability that later presentations of these sen- 
tences would elicit autonomic responses contaminated 
by such extraneous influences as item novelty, the 
subject’s misunderstanding of stimulus content, etc., 
would be appreciably reduced. Second, obtaining 
the subject’s evaluations permitted an individualized 
check of the assumption that the nonrace-related 
sentences intended by the experimenter to evoke 
reactions of good or bad were indeed viewed as good 
or bad by the particular research subject. In the 
event the subject did not rate one of the good 
Sentences as good or one of the bad sentences as 
bad, an alternate of the same type that he had rated 
appropriately was substituted. 

Within each of the subsequent phases (pretest, 
training, and posttest), the stimuli were presented 
in a fixed random order with no more than two 
good or two bad sentences or one race-related sen- 
tence presented in succession. No sentence appeared 
twice within any one set of the three sets of trials. 

Each trial consisted of the presentation of a stimu- 
lus slide for 5 seconds, followed immediately by a 
blank slide during the remainder of the intertrial 
interval. On non-UCS trials, the intertrial interval 
varied from 20 to 30 seconds. The administration of 
the UCS was followed by a delay of approximately 
30 seconds prior to the initiation of the succeeding 
trial. Shock intensity was set at the point the 
subject defined as “as high as I can take.” 


Pretest Trials 


All treatment groups were administered 30 pre- 
test trials consisting of 5 prointegration and 5 anti- 
integration sentences intermixed with 20 nonrace- 
related sentences, half of which the subject had 
classified as good and the remainder, as bad. No 
conditioning trials were administered at this time, 
and all four subject groups were treated in identical 
fashion, 


Training Trials 

Only nonrace-related sentences, equally divided 
between those the subject had classified as bad 
(CS+) and as good (CS—), were presented during 
training. None of these had been employed in the 
pretest trials. For the equalitarian-experimental and 
the prejudiced-experimental groups, the administra- 
tion of UCS immediately followed the termination 
of CS+. Each of the first 5 CS+ trials during con- 
ditioning were reinforced; thereafter partial rein- 
forcement was employed, with shock following 60% 
of the CS+ trials. The criterion adopted for condi- 
tioning was a sequence of 5 nonreinforced trials in 
which the presentation of CS+ elicited a conditioned 
response (CR) on each of 3 trials while the presen- 
tation of CS— on 2 trials was not accompanied by 
a CR (for the definition of CR see Response Mea- 
sures below). The equalitarian-control and the 
prejudiced-control groups received no differential 
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TABLE 1 


MEAN GALVANIC SKIN RESPONSE FREQUENCY 
DIFFERENCE SCORES 


Equalitarian Prejudiced 
Test Control Experi- Control 
(not mental (not 
(condi- condi- (condi- condi- 
tioned) tioned) tioned) tioned) 
Pre” —.039 2025") oa -019 
Post -783 097 | 50a —.056 


Note. Mean responses per trial to prointegration sentences 
subtracted from mean responses per trial to anti-integration 
sentences range from —1 to 1. 

a The correlation between pretest and posttest for all sub- 
jects was —.03. For the subjects in the two experimental groups 
it was —.25. For those in the two control groups it was .28. 


reinforcement of good and bad sentences. Half of the 
shocks followed CS+ trials and half CS— trials. 
Subjects in the two experimental groups achieved the 
conditioning criterion within a minimum of 22 and a 
maximum of 56 trials. The mean numbers of train- 
ing trials for the two groups were almost identical, 
that is, 27.0 for the equalitarian-experimental group 
and 27.5 for the prejudiced-experimental group. Sub- 
jects in the control groups received all 56 training 
trials. 


Posttest Trials 


Without interruption, the posttest followed the 
last training trial. At this time the 30 stimuli pre- 
sented during the pretest were reintroduced in a 
different random order. Strength of the CR was 
maintained in the two experimental groups by rein- 
forcing 6 of the 10 trials on which nonrace-related 
bad sentences were presented; the reinforced trials 
were dispersed throughout the posttest period. Con- 
trol subjects were administered the UCS following 3 
bad and 3 good nonrace-related sentences. None of 
the 10 stimuli with racial content were ever rein- 
forced. 


Response Measures 


The GSR was analyzed in terms of response fre- 
quency. A response (including a conditioned re- 
sponse) was defined as a 500 ohm or greater decrease 
in skin resistance which had its onset between 4 and 
10 seconds following initiation of the stimulus. This 
interval was selected to avoid measuring the orient- 
ing reflex associated with stimulus onset and to 
allow sufficient time to comprehend the sentences. 


RESULTS 


Analysis of the GSR data involves the 
hazard that the results may be biased by the 
operation of the law of initial values (Lacey, 
1956; Wilder, 1962). According to this law, 
the magnitude of a physiological response to 
a stimulus is related to the level of autonomic 
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functioning immediately prior to stimulus 
presentation. Should the prestimulus level of 
autonomic functioning differ between groups 
or individuals there then exists a problem in 
comparing the responses of the groups or in- 
dividuals. Reassurance is needed either that 
there are no prestimulus differences or that 
such prestimulus differences that do exist are 
not correlated with the magnitude of post- 
stimulus changes. A series of independent 
checks revealed that this problem was negli- 
gible within the data of this study (see Foot- 
note 2). 


Analysis of Frequency of GSRs 


A single posttest difference score was cal- 
culated for each subject by subtracting the 
mean number of responses of 500 ohms or 
greater to the prointegration sentences from 
the mean number of such responses to the 
anti-integration sentences. Mean response per 
trial rather than total responses over all trials 
was employed because occasionally a trial 


could not be scored due to artifacts on the i 


polygraph records. The resulting difference 
scores could range from —1, if the subject 
responded to each of the five prointegration 
stimuli and none of the five anti-integration 
stimuli, to 1, if the subject responded to each 
of the anti-integration stimuli and none of 
the prointegration stimuli. Thus the strongly 
prejudiced individual would be expected to 
obtain a posttest score of —1, while a post- 
test score of 1 would be anticipated from the 
strongly equalitarian individual. In the same 
manner a single difference score was also de- 
rived from each subject’s pretest record. Pre- 
test and posttest mean difference scores for 
the four treatment groups are presented in 
Table 1. 

Inspection of the pretest means reveals that 
all cluster around zero, indicating little differ- 
ential responding within each group to the 
prointegration and anti-integration stimuli. 
Inspection of the posttest means shows that 
the equalitarian-experimental group ap- 
proached a mean difference score of 1, as had 
been expected, and that the two control 
groups continued, as expected, to show differ- 
ence scores near zero. However, the preju- 
diced-experimental group which had been ex- 


Autonomic RESPONSES AS INDICATORS OF RACIAL ATTITUDE 


pected to approach a difference score of —1, 
approximates zero instead. 

The difference scores were entered into a 
three-way analysis of variance with attitude 
(equalitarian or prejudiced) and condition 
(experimental or control) as between-sub- 
jects variables and time of testing (pretest 
or posttest) as a within-subjects variable. A 
significant main effect was obtained for each 
of the three variables, F (1,20) = 7.71, p< 
025; F (1,20) = 5.72, p < .05; and F (1,20) 
= 9.71, p < .01, respectively. The Attitude 
x Time of Testing and the Condition X 
Time of Testing interactions were significant, 
F (1,20) = 13.75, p < .005, and F (1,20) = 
9.83, p < 01, respectively, as was the Atti- 
tude X Condition x Time of Testing interac- 
tion, F (1,20) = 6.69, p < .025. Comparisons 
of pairs of treatment means confirm what was 
evident by inspection of Table 1, namely that 
only those comparisons involving the large 
posttest mean difference score for the equali- 
tarian-experimental group are significant. 

A within-group comparison of the equali- 
tarian-experimental pretest and posttest means 
supports the hypothesis that the conditioning 
procedure would result in an increase in the 
difference scores for this group, F (1,20) = 
39.35, p < .001. Contrary to expectations, 
however, no such increase was observed in the 
prejudiced-experimental group. As expected, 
neither of the control group posttest mean 
difference scores differed significantly from the 
corresponding pretest means. 

The comparison of the equalitarian-experi- 
mental and prejudiced-experimental posttest 
difference scores provides the test of the hy- 
pothesis that groups differing in racial atti- 
tudes would be separated on the basis of their 
autonomic responses to the posttest race- 
related stimuli. As anticipated, the equali- 
tarian-experimental mean was reliably differ- 
ent from the prejudiced-experimental mean, 
F (1,20) = 28.34, p < .001. Additional com- 
parisons revealed a significant difference be- 
tween posttest means for the equalitarian- 
experimental and control groups, F (1,20) = 
23.55, p < .001, but none between the preju- 
diced-experimental and control groups.” An 
analysis paralleling that above was carried out 
on the GSR magnitude data. The results 


141 


duplicated those obtained for the GSR fre- 
quency scores (see Footnote 2). 


Discussion 


Analyses of both frequency and magnitude 
of GSR scores support the expectation that 
equalitarian-experimental subjects, after ac- 
quiring a conditioned response to the concept 
bad, would generalize the response to anti- 
integration sentences. However, the parallel 
expectation that prejudiced-experimental sub- 
jects would generalize to prointegration sen- 
tences (to a greater extent than to anti-inte- 
gration sentences) was not supported, De- 
spite this, equalitarian and prejudiced sub- 
jects differed significantly on a posttest score 
based on the difference between responses to 
anti-integration and prointegration sentences. 
The two groups had not differed on this score 
during the pretest. 

The major question about the results is 
whether they reflect the racial attitudes of 
the two groups of subjects. The answer is 
complicated by the failure to find the expected 
differential stimulus generalization among the 
prejudiced subjects. One plausible explanation 
for this involves calling into question the as- 
sumed level of prejudice in the prejudiced- 
experimental group. The generalization from 
the conditioned stimulus, bad, to prointegra- 
tion sentences in this group was predicated 
upon the assumption that the level of preju- 
dice in the group was high. It is quite possible 
that this assumption was incorrect since these 
subjects were selected from a college popula- 
tion in which extreme prejudice is relatively 
uncommon. If their racial attitudes were in- 
termediate rather than extreme, they may 
have reacted with indifference to race-related 


- sentences, classifying neither anti-integration 


nor prointegration sentences as bad. Another 
possibility is that they may have classified 
some of the race-related sentences as would 
strongly prejudiced individuals and others as 


3 Because of the possibility that the posttest re- 
sponses of the control subjects reflected a greater 
degree of response habituation (due to receiving 
more training trials), the foregoing analysis was 
supplemented by a two-way analysis of variance in- 
volving only the two experimental groups. This 
analysis confirmed the findings above. 
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would strongly equalitarian persons. Provided 
their classifications were approximately 
equally divided between these two orienta- 
tions, the observed zero posttest difference 
scores would result. 

We would be led by this possibility to the 
interpretation that the responses of the 
prejudiced-experimental group, as well as 
those of the equalitarian-experimental group, 
did reflect their racial attitudes. As a corol- 
lary to this position we would expect that 
greater generalization to prointegration sen- 
tences would be obtained only from persons 
who were more prejudiced than any group in 
this study. 

There are, however, other interpretations of 
the results that would place the emphasis on 
factors other than racial attitude as determi- 
nants of the subjects’ responses. One such 
interpretation is that the subjects’ responses 
reflected their assumptions regarding the ex- 
perimenter’s classifications of good and bad 
sentences. According to this view the stimulus 
generalization that occurred in the posttest 
was to those race-related sentences which the 
subjects believed the experimenter would 
classify as bad. We hypothesize that this may 
have come about as follows: A partial rein- 
forcement schedule was used throughout the 
later training trials. It is possible these sub- 
jects interpreted the partial reinforcement as 
evidence that reinforcement was independent 
of their sentence classifications. They may 
have hypothesized that the experimenter was 
administering reinforcement following the 
presentation of those sentences the experi- 
menter viewed as bad, but not following 
presentations of sentences the experimenter 
viewed as good. Consequently, they may have 
anticipated reinforcement to follow the post- 
test presentations of only those race-related 
sentences they assumed the experimenter 
would classify as bad. If we assume, as is 
likely, that subjects would believe university 
scientists to be prointegration, it follows that 
they should have anticipated shock after anti- 
integration statements. This could account for 
the behavior of the equalitarian-experimental 
subjects who responded almost entirely to 
anti-integration sentences. It could not, how- 
ever, account for the behavior of the preju- 
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diced-experimental subjects who, as indicated 
by their zero difference score, gave responses 
with equal frequency to anti- and prointegra- 
tion statements. 

There remains the possibility that the 
prejudiced-experimental group was made up of 
two subgroups, one of whom responded to the 
race-related sentences in terms of anticipated 
experimenter classification (obtaining differ- 
ence scores with a plus sign) and: the other 
in terms of their own racial attitudes (ob- 
taining differences scores with a minus sign). 
The scores of two such subgroups might very’ 
well balance out to the zero score obtained. 

An examination of the individual posttest 
difference scores of the six prejudiced-experi- 
mental subjects provides little support for 
this suggestion. In this group there were four 
subjects with zero or near zero GSR frequency 
difference scores and three subjects with zero 
or near zero GSR magnitude difference scores. 
Thus at least half of these subjects exhibited 
behavior which was unlike either the behavior 
expected of a person oriented to the classifica- 
tion of sentences of a prointegration experi- 
menter or the behavior expected from a 
prejudiced person responding in terms of his 
own racial attitude. 

A second nonattitudinal interpretation of 
the results is that subjects responded in 
terms of the good-bad classifications they 
assigned to sentences at the beginning of the 
experiment. Following a number of condition- 
ing trials the experimental subjects may have 
become aware that shock was contingent upon 
presentations of sentences they had earlier 
classified as bad. If they surmised that their 
classifications were used by the experimenter 
in determining when shock would be admin- 
istered, they may have anticipated reinforce- 
ment following presentation of all sentences, 
both race-related and nonrace-related, that 
they had publicly classified as bad. Their 
posttest responses would then reflect their 
public classifications of race-related sentences, 
regardless of the resemblance between their 
public and private (attitudinal) classifica- 
tions.* 


*The expression “private classifications” 
intended to suggest either that the subject is 
or unaware of such classifications. 


is not 
aware 
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An examination of the public classifications 
of the 10 race-related sentences by the equali- 
tarian-experimental subjects revealed that, 
with rare exceptions, they were identical 
with classifications to be expected from a 
strongly equalitarian individual. If scores of 1 
are assigned to equalitarian-indicating classi- 
fications and —1 to prejudice-indicating clas- 
sifications, the scores of these subjects range 
from 6 to 10, with a group mean of 9. These 
subjects presumably saw the university-based 
experimenter as equalitarian and hence expe- 
rienced no social pressure to modify the pub- 
lic expression of their attitudinally-determined 
reactions. Hence, the pattern of their posttest 
responses to race-related sentences is con- 
sistent either with their own racial attitudes 
or with their public classifications of these 
sentences. Consequently, their responses are of 
no value in distinguishing between the two 
alternatives. 

The situation is different, however, for the 
public classification of sentences by the preju- 
diced-experimental subjects. One subject, as 
expected, classified all sentences in the man- 
ner anticipated from a prejudiced person, 
while at the other extreme, another subject 
classified 9 of the 10 sentences as one would 
expect from an equalitarian individual; the 
remaining subjects varied between these two 
extremes. Scoring the sentence classifications 
as described in the paragraph above, the 
scores of these subjects range from —10 to 8 
with a group mean of .50. It should be possi- 
ble to examine the responses of those sub- 
jects whose public classifications varied 
widely from their assumed private classifica- 
tions and discover whether public or private 
classifications were more influential determi- 
nants of posttest response patterns. To do this 
we computed within-subject point-biserial 
correlations between public classifications of 
the race-related sentences (where bad equals 1 
and good equals zero) and GSR magnitude 
during the posttest presentations of those sen- 
tences. To illustrate with the prejudiced sub- 
ject who gave 9 out of 10 equalitarian public 
classifications, it follows that, contrary to ex- 
pectations, he classified at least 4 of the 5 
anti-integration sentences as bad and at least 
4 of the 5 prointegration sentences as good. If 
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he gave responses of higher magnitude to the 
sentences he publicly classified as bad and 
lower ones to those he publicly classified as 
good, a high positive correlation should re- 
sult. This would indicate that he had gen- 
eralized the CS+ in a direction consistent 
with his public classifications and opposite to 
his racial attitude. 

We found little support for this possibility. 
For the two subjects with the highest number 
of attitudinally inappropriate public classifica- 
tions we obtained the following point-biserial 
correlations: —.05 (ms) and .15 (ns). None 
of the remaining subjects show a high enough 
proportion of unexpected to expected public 
classifications to make the obtained correla- 
tions relevant. 

A third interpretation of the basis for the 
subjects’ posttest responses to the race-related 
sentences is, in part, attitudinal and, in part, 
nonattitudinal. As indicated in the preceding 
sections of the discussion, the assumed atti- 
tudinal classifications of the race-related sen- 
tences by the equalitarian subjects were very 
similar to their own public classifications and 
to the experimenter’s classifications. For 
prejudiced-experimental subjects, however, the 
three-way congruence of own-attitudinal, pub- 
lic and experimenter classification of sentences 
does not hold. We can only speculate on the 
effect of this when the prejudiced subject 
viewed a race-related sentence in the post- 
test. One possibility is that the lack of con- 
gruence among the three bases of sentence 
classification may have aroused the subject 
and led to an autonomic response for this 
reason alone, The contrasting possibility is 
that it presented the subject with a situation 
which he interpreted as threatening (e.g., the 
subject may have feared experimenter disap- 
proval of the expression of undesirable atti- 
tudes). As a consequence the strength of his 
autonomic responses may have been attenu- 
ated (see, e.g., Shapiro & Crider, 1969, pp- 
35-37). To minimize this problem by con- 
vincing the subject that his own classifications 
constitute the basis for occurrence of the un- 
conditioned stimulus, would seem to be the 
primary challenge facing future research on 


this question. 


Sipo 
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SUMMARY 


Equalitarian subjects who had been condi- 
tioned to respond to nonrace-related sen- 
tences they had classified as bad—and to in- 
hibit responding to sentences they had classi- 
fied as good—later generalized the response, 
as expected, to anti-integration sentences. 
Prejudiced subjects who had experienced the 
same conditioning procedure failed to show 
the expected generalization to prointegration 
sentences. Equalitarian and prejudiced con- 
trol subjects, neither of whom experienced 
differential conditioning, responded equally to 
anti-integration and prointegration sentences 
in both pretest and posttest sessions. 

We considered four interpretations of these 
results, The one which seems most acceptable 
is that the difference in stimuli to which the 
subjects generalized was determined by their 
racial attitude. This interpretation involved a 
redesignation of our prejudiced subjects as 
intermediate in attitude. The basis for this is 
that, although these subjects were at least one 
standard deviation below the mean of stu- 
dents at the University of Colorado, they 
were not sufficiently prejudiced to view many 
of our prointegration sentences as bad and, 
hence, failed to show the expected generali- 
zation to these sentences. We assume that 
more extreme subjects would show such gen- 
eralization. 

A much sought after physiological indi- 
cator that differentiates attitude direction as 
well as intensity seems clearly to be within 
reach. Because such an indicator would be 
immune to direct conscious distortion by the 
respondent, it would be an invaluable asset 
to behavioral scientists now dependent upon 
easily distortable self-description in their 
attitude research. 
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Effects of Stress and Commonality of Fate 
on Helping Behavior 


John F. Dovidio and William N. Morris 
Dartmouth College 


An experiment was performed in order to examine the joint effects of stress 
and commonality of fate on helping behavior, In a 2X2 factorial design, 

‘ subjects were randomly assigned to one of four conditions in which (a) the 
subject and a confederate are both awaiting an experiment involving electric 
shock (high stress-common fate); (b) the subject is awaiting an experiment 
involving ¢lectric shock while the confederate is awaiting an innocuous word 
association experiment (high stress~dissimilar fate); (c) the subject is awaiting 
the word association experiment while the confederate awaits shock (low 
stress-dissimilar fate); and (d) the subject and the confederate are both 
awaiting an innocuous word association experiment (low stress-common fate). 
As predicted, a significant interaction, p< .01, was obtained, indicating that 
high stress, as compared to low stress, facilitates helping if the potential 
recipient is in the same stressful situation but inhibits helping if the potential 
recipient is in a dissimilar and less stressful situation. 


Several investigators studying collective 
ior have found significant increases in 
ve, group-oriented behaviors during 
oi disaster and mutual crisis. These 
were observed by Prince (1920) in 
y of the 1917 munitions ship explo- 
Halifax, by Kutak (1938) in his 
nudy of the 1937 Louisville flood, and in 
Stouffer’s (1949) study of the American sol- 
i l: ring the crisis of battle. Similar obser- 
have been made in historical and 
j ic accounts of large-scale disasters 
the Chicago fire and the San Fran- 
; irthquake, 
| uccases in cooperative and helping be- 
i 


Y times 


havior occurring among individuals interde- 
pendent in coping with a common threat 
have also been demonstrated experimentally. 
Sherif (Sherif, Harvey, White, Hood, & 
| Sherif, 1961), in his classic study of this 
: phenomenon, showed that hostility and ob- 
structionist behavior induced by competitive 
contact between two camp groups were re- 
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placed with liking and cooperation when the 
two groups were confronted with a common 
problem or “superordinate goal.” Increased 
cohesiveness among individuals experiencing 
a common stress has also been demonstrated 
by Lanzetta (1955). Lanzetta, using four- 
person groups, found that stress, induced by 
a time limit and the experimenter’s interfer- 
ence with task completion, resulted in more 
positive, group-oriented behaviors, Similar re- 
sults have been obtained by Mulder and 
Stemerding (1963). 

It appears, however, that high external 
threat does not always increase an individu- 
al’s group-oriented behaviors but may, in fact, 
make them less likely, Janis (1958), in a — 
study of natural disasters, found two pre- 
dominant incompatible manifestations of indi- 
vidual reaction to stress. The individual may 
react: (a) with an increase in positive, group- 
oriented behaviors such as cooperation, at- 
traction, or helping or (b) with “vigilance,” 
whereby the individual becomes preoccupied 
with his own problems and increases atten- 
tion toward threat-relevant cues to the ex- 
clusion of concern for others’ plights. 

Barton (1969) has conceptualized a model 
of behavior under stress that offers insight 
into those conditions that induce either vigi- 
lance or cohesiveness. Barton argues that to 
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the extent that “sympathetic identification” 
exists among stressed individuals, helping is 
more likely. Campbell and Levine (1972) and 
Collins (1970) have made the reciprocal ob- 
servation, namely, that stress, at least in the 
form of external threat, increases awareness 
of ingroup identity. The foregoing implies 
that stress acts to magnify positive behaviors 
which normally follow from identification 
with a group by making that group identifica- 
tion more salient. On the other hand, to the 
extent that stress or threat is not shared with- 
in a group, feelings of differentiation and re- 
ductions of group identification should occur, 
leading to the state of “vigilance” and low- 
ered helping. 

The present experiment tests these notions 
in a 2 X 2 factorial design where subjects 
are awaiting either: (a) electric shock or (b) 
an innocuous word association experiment 
along with another subject (actually a con- 
federate) who is awaiting either (a) electric 
shock or (b) the word association experiment. 
The resulting conditions are (a) high stress— 
common fate, (b) high stress—dissimilar fate, 
(c) low stress-dissimilar fate, and (d) low 
stress-common fate. It is predicted that stress 
and commonality of fate interact such that 
high stress-common fate subjects are more 
helpful to the confederate than low stress— 
common fate controls, whereas high stress- 
dissimilar fate subjects are less helpful than 
low stress—dissimilar fate controls. 


METHOD 
Subjects 


The subject population for this experiment con- 
sisted of 63 undergraduate students enrolled in an 
introductory psychology course. All participants were 
males between the ages of 17 and 21. Subjects volun- 
teered to participate in an experiment entitled 
Psychophysical Cognitions, and all received extra 
credit toward their grade. 


Design 


In a 2X2 factorial design, subjects were ran- 
domly assigned to one of four conditions: (a) high 
stress-common fate, (b) high stress—dissimilar fate, 
(c) low stress-common fate, and (d) low stress- 
dissimilar fate. For the present experiment helping 
behavior was operationally defined as helping a con- 
federate by picking up pencils which the confederate 
“accidentally” knocked to the floor. 
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Procedure 


Upon arrival the subject was instructed to sit at 
a table which was littered with materials irrelevant 
to the experiment except for a plastic container 
holding 100 pencils. Behind the table on the door to 
an adjacent room hung a sign, Experimental Room, 
and the subject was led to believe that the experi- 
ment in which he was to participate would be con 
ducted in that room. As the subject was being 
seated, the confederate knocked on the door, and 
subsequently he was introduced either as another 
subject who was (a) in the same experiment as th 
subject and would be run at a different time, bu 
since “the experiment was running a little behind 
in order to save time, it would be introduced t 
both at the same time” (common fate) or (b) in 
different experiment, but since “the other experi 
menter (Experimenter 2) was running a little be- 
hind,” Experimenter 1 would administer the intro 
duction (dissimilar fate). When the subject and 
confederate were in “different experiments,” the 
confederate first received a brief verbal description 
of his experiment and also a written form, then t 
subject similarly received a description of his « 
periment. When in the same experiment, both 
ject and confederate were given the same verbal a 
written instructions. 

Stress was manipulated by virtue of redunde 
information contained in both the verbal and wr 
descriptions of the experiment. High stress was in 
duced by informing the subject that the experiment 
was one which involved an assessment of reactions 
to electric shock under various levels of uncertaint 
He was advised that there would be 11 trials, 
which 8 were “shock” trials, and he was to rec 
no information as to which trials they were. 
shocks were described as varying from “mildly to 
intermediately painful.” Low-stress conditions were 
induced by informing the subject that he was to be 
in a fairly brief and simple study in which his 
associations to various stimulus words were to be 
elicited. 

After administering the introductions, the experi- 
menter left the room. During this time the confed. 
erate avoided any interaction, including eye con 
tact, with the subject. When the subject had com 
pleted reading the written description of the experi- 
ment and was signing his name on the extra-credit 
receipt on the second page, the confederate wa 
instructed to reach for a pencil while “accidental: 
knocking the container to the floor. He proceeded, 
maintaining no eye contact, to walk around the table 
to the scattered pencils and to pick them up at a 
constant rate which he had been pretrained to 
achieve. 

One trained confederate was used for all subjects. 
The confederate did not find out what condition 
was in progress until he was seated beside the sub- 
ject, where eye contact and interpersonal interaction 
could be easily avoided. Throughout the experiment, 
the confederate responded to any attempts by the 
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subject to initiate interaction with noncommittal 
“mm-hmm’s” and shoulder shrugs. 


Postexperimental Questionnaire and 
Debriefing 

Three minutes after the pencils were knocked to 
the floor, the experimenter returned and asked both 
subject and confederate to complete “a standard 
preexperimental survey.” Embedded among several 
irrelevant items were two questions directed at 
making sure the subject fully understood the instruc- 
tions up to that point. Another question measured 
the subject’s anxiety level by self-report. Subjects 
were then asked to report their reactions to the 
experiment so far (a check for suspicion), and then 
the true purpose of the experiment was explained. 
They were thanked, sworn to secrecy, and given 
credit for their participation. 


RESULTS 


The results of three subjects who had 
previously participated in a similar experi- 
ment and expressed suspicion concerning the 
confederate during the debriefing were ex- 
cluded from the analysis of the data. 

All subjects, according to their responses 
on the postexperimental questionnaire, cor- 
rectly interpreted the instructions. In addi- 
tion, the manipulation of stress as reflected in 
fear ratings was highly successful. There was 
no difference in rated fear as a function of 
the commonality of fate manipulation, nor 
did that manipulation interact with stress, 
and so the data were collapsed across the 
commonality of fate dimension. High-stress 
subjects rated themselves as significantly more 
fearful than low-stress subjects, £= 5.003, 
p < .001. 

The amount of helping behavior by sub- 
jects in each condition was measured by the 
number of pencils picked up and by the num- 
ber of subjects helping in each condition. 
These data are shown in Table 1. The dis- 
tribution of pencils picked up was markedly 
nonnormal due to the presence of zeroes in 
the data. In order to permit analysis, the 
data were transformed by a square-root trans- 
formation. All parametric analyses reported 
below were performed on the data as trans- 
formed. à 

As hypothesized, the analysis of variance 
on the transformed data yielded a highly 
significant Stress X Commonality of Fate in- 
teraction, F (1,56) = 8.80, p < .01. In addi- 
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TABLE 1 


PROPORTION OF SUBJECTS HELPING AND MEAN 
NuMBER OF PexciLs PICKED UP AS A 
FUNCTION OF CONDITION 


Condition aa M 
High stress-common fate 13:15 | 34.13 
Low stress-common fate 9:15 | 16.40 
High stress—dissimilar fate 4:15 | 10.73 
Low stress—dissimilar fate 9:15 | 22.86 


tion, the analysis of variance revealed a main 
effect for commonality of fate, F (1,56) = 
4.39, p < .05, but not for stress, F = .07, ns. 
Planned comparisons showed that high-stress— 
common-fate subjects did indeed assist the 
confederate more than low-stress-common- 
fate subjects, p< .02, one tailed, whereas 
high-stress-dissimilar-fate subjects were less 
helpful to the confederate than subjects in 
the low-stress-dissimilar-fate condition, p < 
.05, one-tailed. Similar significance levels were 
achieved when the proportion of subjects 
helping in each condition was used as a 
dependent measure, both ps < .05, one-tailed, 

Before turning to a discussion of the re- 
sults, there is a source of unease about the 
reliability and interpretation of the data 
which should be addressed. While care was 
taken to avoid the possibility of confederate 
bias, it could be argued that subtle cues such 
as variation in the latency with which the 
confederate started to pick up the pencils 
might have contributed to the pattern of re- 
sults. This concern, while real, is somewhat 
lessened by virtue of the fact that this experi- 
ment was, in fact, a replication with minor 
modifications of an earlier pilot study by 
Morris and Dovidio (Note 1). That study 
used a team of confederates and produced 
virtually identical results in terms of the sig- 
nificance levels of the theoretically relevant 
contrasts; for example, the proportions of 
subjects helping were 8:9, 5:9, 1:9, and 6:9, 
respectively (the order corresponding to that 
in Table 1). Fisher Exact tests yielded one- 
tailed ps less than .07 and .01 for the high- 
stress-common-fate versus low-stress—-common- 
fate and high-stress-dissimilar fate versus 
low-stress—dissimilar fate comparisons, respec- 
tively. Thus, it may at least be said that these 
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results hold up under variation in confeder- 
ates (the confederate here was a paid research 
assistant; those in the pilot study were mem- 
bers of an undergraduate research team). 


Discussion 


The results support the hypothesis and are, 
therefore, supportive of the notion that an 
individual’s group-oriented responses to stress 
are mediated by the extent to which that 
stress is shared with other group members. 
The main effect of commonality of fate indi- 
cates that within-group helping behavior is a 
function of the similarity of circumstance of 
the group members. The significant inter- 
action with stress indicates that the effects of 
similarity of circumstance are heightened by 
stress; facing stress together increases the 
likelihood of positive behaviors such as help- 
ing, presumably, by making the ingroup—out- 
group distinction more salient and increasing 
“we feelings.” Contrariwise, facing stress alone 
makes an ingroup cleavage more likely with a 
resultant decrease in we feelings, which leads 
to a heightened concern for self as compared 
to others. By extrapolation, these results 
would suggest that an individual’s response 
to disasters such as tornadoes, floods, and 
earthquakes is very much a function of the 
similarity of one’s losses to those of others in 
the community. It would follow that cohesive- 
ness would be greatly increased among those 
people who suffered losses in property and/or 
family that were mutually perceived as 
roughly equivalent. Those who suffered rela- 
tively unique losses may be more likely to 
become preoccupied with their own misfor- 
tune to the exclusion of others seen as more 
fortunate. It should be noted, however, that 
such reasoning generalizes from a situation in 
which a misfortune has not yet occurred to 
one in which it has. Research on this point is 
clearly needed. 

While the results support hypotheses based 
upon conceptions of changes in group cohe- 
siveness, there is an alternative interpretation 
which can be developed from social compari- 
son theory (Festinger, 1954) as it has 
been applied to fear-inducing situations by 
Schachter (1959). 3 

According to Schachter (1959), fear in- 
duces a desire to affiliate, since affiliation 
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allows for the possibility of defining the social 
situation and an appropriate emotional re- 
sponse to it. It stands to reason from this con- 
ception that fearful individuals would seek 
out those in similar circumstances, since only 
they can provide a meaningful social com- 
parison. Schachter’s data as well as that 
of other investigators, for example, Rabbi 
(1964) and Darley and Aronson (1966), bea 
out these predictions. Using a paradigm i 
which groups of from five to eight females a: 
told that they are shortly to experience pai 
ful electric shocks, Schachter (1959) found 
two successive experiments that 63% of su 
jects in one experiment and 60% of subject 
in the other chose to wait with others awaii 
ing the same experience as opposed to waiting 
alone. In contrast, in a condition where the 
option was between waiting with a group of 
girls “waiting to see their professors” 
waiting alone, mone of the subjects chose 
wait with others, This prompted Schachtei 
well-known observation that “misery does: 
love just any kind of company, it loves o! 
miserable company.” Perhaps, but the da 
also can be taken to suggest that mis 
actively avoids nonmiserable company in view 
of the fact that further data showed ti 
33% of low-fear subjects, expecting very mil 
shocks that would only “tickle” or “tingle 
chose to wait together with others in the 
some circumstance as opposed to waiting 
alone. While the above data come from two 
separate experiments, which makes compari- 
sons hazardous and, in addition, there was no 
condition in which low-fear subjects were 
given the option of waiting with others in a 
dissimilar circumstance, it is nonetheless true 
that Schachter’s data bear a striking resem- 
blance to those found in the current study 
insofar as roughly 60% of subjects chose to 
affiliate in Schachter’s analogue of our high- 
stress-common-fate condition, 0% in his ana- 
logue of our high-stress—dissimilar-fate con- 
dition, and 33% in his analogue of our 
low-stress-common-fate condition. 

Is it appropriate, though, to use a helping 
behavior as an index of affiliation? Normally 
this would be highly suspect, but certain 
aspects of the present experiment contrive to 
make this a reasonable supposition. A subject 
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did not discover the nature of the experiment, 
that is, shock or word association, until he 
and the confederate were seated at either end 
of a long table, both facing a wall. This as- 
pect of the situation, together with the fact 
that the confederate has been instructed to 
avoid interaction with the subject (avoidance 
of interaction was also implicit in the experi- 
menter’s instructions upon leaving the room), 
posed a problem to a subject wishing to affili- 
ate in the service of social comparison. When 
the pencils were knocked “accidentally” from 
the table by the confederate, a perfect, low- 
cost opportunity to initiate interaction was 
provided. 

Further research is needed to disentangle 
these two interpretations, cohesiveness and 
social comparison, and show which is correct. 
All that seems necessary is a situation in 
which helping behavior is possible but where 
affiliation is precluded. 


REFERENCE NOTE 
1, Morris, William N., & Dovidio, John F. Unpub- 
lished data, 1971. 
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An experiment was carried out to determine whether the relationship between 
frequency of stimulus exposure and affective ratings depends on the uncertainty 
of the stimuli involved. On the basis of previous research, it was assumed 


that subjects prefer stimuli of an ini 


termediate uncertainty (or complexity) 


level. It was further assumed that increasing the frequency: of exposure of a 
stimulus reduces the uncertainty elicited by the stimulus. These assumptions 


led to the prediction of an interaction 


between exposure frequency and stimu- 


lus uncertainty, and the interaction was supported, p < .01. For low-uncer- 
tainty stimuli, liking decreased as frequency increased, For high-uncertainty 


stimuli, liking increased as frequency 


increased. And for stimuli of an inter- 


mediate uncertainty level, liking first increased and then decreased as a 
function of exposure. The effects of stimulus uncertainty were interpreted jn 


information-theoretic terms. 


In his recent monograph Zajonc (1968) 
examined the hypothesis that familiarity 
leads to liking. According to Zajonc, “Mere 
repeated exposure of an individual to a 
stimulus is a sufficient condition for the en- 
hancement of his attitude toward it.” (p. 1) 
By “mere exposure” Zajonc has referred to 
conditions which make a given stimulus ac- 
cessible to a person’s perception. The mono- 
graph has served to revitalize research inter- 
est into the nature of the exposure-liking 
relationship. The bulk of the research has 
tended to support Zajonc (Harrison, 1968; 
Matlin, 1970; Zajonc, 1968), although there 
has been a large amount of research which 
has shown that the exposure-liking rela- 
tionship is not as simple as the monograph 
would imply. 

While Zajonc (1968) consistently found 
that increasing the number of exposures pro- 
duces increased liking for the exposed stimu- 
lus, Cantor (1968) and Cantor and Kubose 
(1969) found opposite results in studies with 
children. Collecting data in procedures simi- 
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lar to those of Zajonc, their data are strikingly 
different from those of Zajonc. Like the pre- 
vious research, these studies are aimed at 
documenting the existence of the exposure- 
attitude relationship, with little attention de- 
voted to an account of the psychological 
mechanisms underlying the effect. 

Studies suported by Cantor and Cantor, 
(1964a, 1964b, 1966) had revealed that 
kindergarten-aged children given the opportu- 
nity to project familiarized and nonfamiliar- 
ized visual stimuli projected the nonfamil- 
iarized material for longer periods of time. 
The findings obtained in these experiments 
led to the conclusion that longer projection 
times of the nonfamiliarized stimuli reflected 
a preference for these stimuli. Because some 
investigators (Berlyne, 1960; Harrison, 1968) 
contend that nonfamiliarized stimuli might 
be projected for longer periods of time to 
reduce the conflict aroused by novelty, Cantor 
(1968) ran an experiment in which the pro- 
cedure closely approximated that of the 
Zajonc experiments in order to assess liking 
as a function of exposure, 

Using as stimuli 20 black-and-white figures 
taken from the Welsh Figure Preference Test 
(Welsh, 1959), Cantor (1968) established a 
familiarization set of 10 stimuli and a non- 
familiarization set of 10 stimuli. The stimuli 
from the familiarization set were each pre- 

150 


à 
} 
} 
$ 


STIMULUS UNCERTAINTY AND FREQUENCY oF EXPOSURE 


sented seven times in random order. Then the 
fifth- and sixth-grade subjects were shown 
each of the familiarized and nonfamiliarized 
stimuli once and asked to indicate how much 
they liked the stimulus on a 12.8 cm scale. 
The results obtained in the study indicated 
that nonfamiliarized stimuli were liked sig- 
nificantly more than familiarized stimuli. 

The contradiction between the findings of 
Zajonc (1968) and Cantor (1968) provides 
evidence that neither investigator’s finding is 
universally true. One of the first attempts to 
reconcile the conflicting results of the above 
investigators was made by Berlyne (1970). 
Berlyne contended that Zajonc and Cantor 
had obtained conflicting results because of the 
use of stimuli of different complexity values. 
Berlyne argued that Zajonc had found a posi- 
tive relationship between exposure frequency 
and liking using complex stimuli, whereas 
Cantor and his associates had found a nega- 
tive relationship using simple stimuli. Using 
wo different sets of stimuli, Berlyne found 
a significant interaction between complexity 
level and tests, which indicated that liking for 
complex figures tended to rise with increasing 
exposure, while liking for simple figures 
ended to decline. One set of stimuli was a set 
of abstract figures that had been previously 
scaled for complexity by Berlyne, Ogilvie, and 
Parham (1968), and the second was a set of 
lack-and-white reproductions of representa- 
ional paintings intuitively chosen for differ- 
ences in complexity. 

Berlyne (1970) interpreted his results by 
assuming that organisms prefer stimuli of 
an intermediate level of arousal potential. 
Arousal potential is the property of stimuli 
that entails a rise in arousal for the viewing 
organism. Changes in arousal potential are 
presumed to result from repeated exposure. 
Changes in hedonic value also result from 
repeated exposure, with the changes said to 
depend on the interaction of two antagonistic 
factors. One factor is a “positive-habituation” 
factor which is said to preponderate when a 
complex stimulus is encountered repeatedly. 
The second factor is a “tedium factor” which 
is presumed to preponderate during repetition 
of a simple stimulus. Positive habituation 
occurs with complex stimuli because these 
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stimuli initially have high arousal potential. 
Repeated presentation reduces their arousal 
potential so that it is nearer the optimally 
preferred level. Tedium occurs when stimuli 
that have too little arousal potential are pre- 
sented repeatedly. Repeated presentation of 
simple stimuli reduces their arousal potential 
so that it is even further from the point of 
maximum liking. 

Other investigators have also presented evi- 
dence which implies that exposure produces 
liking only for complex stimuli. Saegert and 
Jellison (1970) and Munsinger and Kessen 
(1964) reported results which indicate that 
liking for simple stimuli reaches asymptote 
after a small number of exposures. The simple 
stimuli used by Saegert and Jellison were 
composed by reducing Chinese characters to 
one to three of their constituent brush 
strokes. Liking for these simple stimuli first 
increased then declined as a function of the 
number of exposures. For complex stimuli 
(complete Chinese characters), liking in- 
creased monotonically as a function of the 
number of exposures. Munsinger and Kessen 
(1964) varied stimulus complexity by ma- 
nipulating the number of independent turns 
contained in stimuli. Their results were simi- 
lar to those of Saegert and Jellison. A major 
problem with the manipulation of complexity 
is that the term is not unambiguous, and the 
definition of complexity varies with the inves- 
tigator who uses it. Munsinger and Kessen 
have suggested a potential solution to this 
problem by proposing that concepts like com- 
plexity (which they call variability) be de- 
fined in terms of a neutral metric derived 
from information theory. They manipulated 
variability by varying the number of turns or 
contour sides of a polygon. The use of infor- 
mation theory to derive a metric requires 
specification of the grain of the stimulus pat- 
tern. In fact, a stimulus pattern may not be 
said to contain any determinate number of 
bits of information until its grain is specified 
(Attneave, 1954, p. 83). Unfortunately, ma- 
nipulation of the number of turns holds grain 
constant, and hence is not a procedure for 
obtaining an information metric in the strict 
sense. Dorfman (1965), and Dorfman and 
McKenna (1966) have manipulated stimulus 
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preference, which Suggests that it may aid in 
the elucidation of the relation between com- 
plexity and frequency of exposure. 

The major goal of this study is to investi- 
gate the relationship between frequency of 
exposure and complexity with respect to 
stimulus likability, where complexity is de- 
fined in the information-theoretic sense as 
uncertainty (see Attneave, 1959; Garner, 
1962). 


THEORETICAL Expectations 


Dorfman and McKenna (1966) found an 
inverse-U relation between stimulus prefer- 
ence and stimulus uncertainty. If exposing a 
stimulus reduces its uncertainty with respect 
to the observer, then there should be a trade- 
off relation between stimulus uncertainty and 
frequency of exposure. In particular, let us 
assume the following: (a) Axiom 1, L= 
G[H], where L is likability, G is some 
inverse-U function, and H is stimulus un- 
certainty and (b) Axiom 2, ish (sheep 
O'< @ <1, where Hn is the posterior uncer. 
tainty of the stimulus, H,, is the prior 
uncertainty of the stimulus, » is the number 
of exposures, and 1 — @ js the proportion of 
uncertainty reduced by a stimulus exposure 
(see Hays, 1963, pp. 610-612 for a clear 
discussion of prior and Posterior uncertainty). 

The above difference equation has the 
solution 


H, = 6"H,, [1] 


where H, is the uncertainty on the nth trial, 
and H, is the initial uncertainty (see Gold- 
berg, 1958, pp. 63-64, for the solution of this 
simple difference equation). Clearly Equa- 
tion 1 implies that frequency of exposure has 
the effect of reducing stimulus uncertainty, 
or complexity. It follows from this result that 
if a stimulus has an initial uncertainty H, 
to the left of the uncertainty value H associ- 
ated with the maximum of the inverse-U 
function, likability decreases with frequency 
of exposure, in contradition to Zajonc’s expo- 
sure hypothesis. If a stimulus is moderately 
to the right of the uncertainty value of maxi- 
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mum likability on initial uncertainty, then an 
inverse-U relation should be expected between 
likability and frequency of exposure after a 
sufficient number of exposures. Finally, if a 
stimulus has a level of H, far to the right of. 
the point of maximum likability, then the 
Zajonc’s exposure effect would be predicted 
for at least some number of exposures. Hence, 
Equation 1 predicts a family of functions 
relating likability to frequency of exposure, 
with the parameter of the family being initial 
uncertainty H,. The following experiment 
evaluates this prediction under bree levels of 
initial uncertainty. It should be ‘noted that 
Zajonc, Swap, Harrison, and Roberts (1971) 
have proposed an analogous family of likabil- 
ity-frequency functions with discriminability 
as the parameter, 


METHOD 


Experimental Design and Subjects 


The experiment consisted 


dents at two midwestern universities. One hundred 
and forty-four subjects were recruited at the Univer- 
sity of Iowa during 
were recruited by telephone from names taken from 
a direc 


were recruited from the i 
course at Western Illinois University. These 
received course credit for their Participation, 


Stimuli 


The stimuli were selected from the stimuli that 


had been used in the Previous study þ Dorf: 
and McKenna (1966), hese iat 


pattern. Each cell was either green or white, 
probability of a green or white cel] set at .5 


‘ seemed more 
pleasing and appeared to be more readily evaluated. 


Three classes of Nx V matrix grains were used, 
with N equal to 2, 4, and 6. These values were 
chosen on the basis of some pilot research by the 
senior investigator. At each of the three Matrix graj 
levels, four patterns were used, resulting in 12 ea 
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terns for the study. Adopting the Shannon-Wiener 
measure of information, this results in three classes 
of patterns, each class having an uncertainty of 4, 
16, and 36 bits of information, respectively. The 
pattern space of each matrix grain consisted of the 
set of all possible patterns derivable from the matrix, 
with all patterns being equiprobable. The stimuli 
generated by Dorfman and McKenna and used in 
this experiment are shown in Figure 1. Note that 
uncertainty is a class or set property, whereas sur- 
brisal value would be the term applied to a particu- 
lar pattern (Attneave, 1959, pp. 6-7). 


Procedure 


The stimuli were projected onto a beaded screen 
by a Kodak Carousel projector. Each subject was 
presented with stimuli at one level of uncertainty. 
One hundred subjects were randomly assigned to 
each of the three stimulus uncertainty levels accord- 
ing to a prearranged schedule. Forty-eight subjects 
in each condition were from the University of Iowa 
sample, while the remaining 52 subjects in each 
condition were from the Western Illinois University 
sample. The stimuli were projected in two phases: 
(a) the frequency manipulation phase and (b) the 
testing phase. Each subject was run individually. 

During the frequency manipulation phase, subjects 
were told that the experimenter would project a 
series of slides and that each one should be examined 
carefully. The slides were then projected, with each 
stimulus presentation consisting of a 3-second expo- 
sure. The intertrial interval was approximately 2 
seconds. Each subject was presented only the four 
stimuli from the assigned uncertainty level. One of 
the four stimuli was projected 1 time, one was pro- 
jected 5 times, another 10 times, and another 20 
times. 

Since there were four stimuli and four exposure 
frequencies, there are 4!=24 distinct assignments 
of the stimuli to exposure frequencies. Twenty of 
the assignments were employed on four subjects, and 
the remaining 4 assignments were employed on five 
subjects. 

There were 36 trials in the stimulus presentation 
phase. The positions of the stimuli in the trial se- 
quence were determined at random. Each subject 
received a different random order of presentation, 
with the random sequences being generated by a 
computer program. 

Following the frequency manipulation phase, the 
experimenter read the instructions for the testing 
phase. Each of the four stimuli from the assigned 
uncertainty level was presented one more time, with 
subjects being instructed to indicate on 7-point like- 
dislike scales whether they liked each pattern and to 
what extent. The subject was given a single response 
sheet which contained four like-dislike scales, one 
for each stimulus. On one half of the sheets of 
scales, the like extreme was on the left; on the 
other half the like extreme was on the right. The 
order of presentation of the stimuli in the test phase 
was randomly varied over subjects. 


Ficure 1. The patterns used in this study. 


RESULTS 


Performance in the various conditions was 
compared on the dependent measure of mean 
judged likability. An analysis of variance was 
done on the rating data supplied by the 
7-point dislikelike semantic differential scale. 
The ratings were scored on a O to 6 scale in 
which 0 represented the extreme dislike posi- 
tion. The analysis was done with level of com- 
plexity of the stimuli as a between-subjects 
dimension and number of exposures as a 
within-subjects dimension. There were no sig- 
nificant interactions with university, and this 
factor was collapsed in all analyses. 

The analysis revealed the following: There 
were no significant main effects. Neither 
stimulus uncertainty nor number of exposures 
was a significant source of variance. There 
was, however, a significant Stimulus Uncer- 
tainty X Number of Exposures interaction, 
F (6,891) = 3.03, p< .01. The means in- 
volved in this interaction are shown in 
Figure 2. 

Orthogonal polynomial analysis of the trend 
components of the interaction revealed that 
the curves shown in Figure 2 differed in 
their linear trends, F (2,297) = 3.79, p < .05, 
differed in their quadratic trends, F (2,297) 
= 4.43, p < .05, and did not differ in their 
cubic trends, F < 1. Separate trend analyses 
of the curves shown in Figure 2 were con- 
ducted. For the 2 X 2 stimuli there was a 
decreasing linear trend over exposure fre- 
quency, F (1,297) = 3.63, p < .07. For the 
2x2 stimuli there was also a quadratic 
trend, F (1,297) = 4.42, p < .05. The linear 
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Ficure 2. Mean likability ratings at each level of 
uncertainty as a function of number of prior expo- 
sures of the stimulus. 


trend accounted for 43% of the variance due 
to exposure frequency, while the quadratic 
trend accounted for 52% of the variance. The 
cubic trend was not significant. For the 4 X 4 
stimuli there was a quadratic trend, F (1,297) 
= 3.71, p<.07. The quadratic trend ac- 
counted for 96% of the variance due to expo- 
sure frequency. Neither the linear nor cubic 
trend was significant. For the 6 X 6 stimuli 
there was an increasing linear trend, F (1,297) 
= 5.77, p < .05. The linear trend accounted 
for 93% of the variance due to number of 
exposures, Neither the quadratic nor the cubic 
trend was significant. 


Discussion 


The most important result was the Number 
of Exposures X Stimulus Uncertainty inter- 
action. This result agrees with the finding of 
Saegert and Jellison (1970) with complex 
versus simple Chinese ideographs, although 
their interaction was not statistically reliable, 
p< 11. The results also agree with the find- 
ings of Berlyne (1970), where the stimuli 
were psychologically scaled for complexity. 
Berlyne and Harrison and Crandall (1972) 
have also found that homogeneity versus 
heterogeneity of exposure sequence is an im- 
portant manipulation. In particular, hetero- 
geneous sequences are more likely to yield 
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the Zajonc exposure effect, whereas homo- 
geneous sequences are more likely to yield the 
inverse-U relation. This result is quite con- 
sistent with the present information-theoretic 
viewpoint. Heterogeneous sequences have 
more prior uncertainty than homogeneous se- 
quences and, hence, should require more 
exposures to produce the inverse-U relation. 
Some unpublished data on liking for Turkish 
words provide support for this view (Cottrell, 
Fryrear, & Dorfman, Note 1): 

The model presented above relating likabil- 
ity to stimulus uncertainty is at a quite de- 
scriptive level and is not incompatible with 
more fundamental theories concerned with 
process such as the response-competition 
theory of Harrison (1968) or the physiological 
arousal theory of Berlyne (1967). It has an 
advantage, however, over these more funda- 
mental theories in that it can generate useful 
predictions without recourse to process assump- 
tions for which empirical support is not de- 
finitive. In addition, the information-theoretic 
concept of uncertainty is relatively un- 
ambiguous, whereas such is not the case for 
complexity. 
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Moral Judgment of Theft, Charity, and Third-Party Transfers 
That Increase or Decrease Equality 


Philip Brickman and James H. Bryan 
Northwestern University 


Theft and charity may be thought of as special cases of transfers within a 
distribution of goods in which the agent of transfer is directly affected by 
the transfer, either gaining (in theft) or losing (in charity). In a third-party 
transfer the person effecting the change is one whose own resources are neither 
increased or decreased. While charity may be approved and theft disapproved 
regardless of their distributional context, it was hypothesized that the evalua- 
tion of third-party transfers would depend heavily upon their impact on 
equality. Sixty fifth graders rated a model who surreptitiously modified the 
distribution of rewards in a four-person group such that the model's outcome 
was either augmented (by theft), reduced (by charity), or unaffected, and 
the entire distribution was thereby either made more equal or less equal. 
Third-party transfers were viewed more favorably if they increased equality 
or represented a “Robin Hood” transfer. Charitable transfers were also viewed 
more favorably if they increased equality, while thefts were not. The study 
of transfers within distributions is suggested as a new paradigm for research 


on moral judgment. 


In previous studies both theft and charity 
have been established as simple dyadic rela- 
tionships between a thief and a victim or a 
donor and a beneficiary. On occasion the 
victim or the beneficiary may be a collectiv- 
ity (the March of Dimes) rather than a 
single person, but the essential character of a 
two-party relationship is retained in that 
there are visible and distinct resources for 
only two parties, one of whom initiates the 
transaction and either gains or loses thereby. 
From this past research we know that chari- 
table models are generally judged favorably 
while models who steal are generally judged 
unfavorably (Bryan, 1972). 

The introduction of third parties into our 
analysis makes it clear that theft and charity 
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may be thought of as special cases of the 
more general act of transfer of resources in 
a distribution. The transfer may also be ac- 
complished by a party whose own resources 
are not directly involved, as in the case of a 
parent who transfers resources from a more 
fortunate child to a less fortunate one. Such 
transfers will be called third-party transfers. 
It would seem important to understand how 
people react to such transfers and to illumi- 
nate the conditions under which they are 
judged favorably (assimilated to charity) and 
the conditions under which they are judged 
unfavorably (assimilated to theft). 

In addition to judging transfer agents by 
how their acts affect their own outcomes, 
people can also judge them by how their acts 
affect the general distribution of outcomes 
(Brickman & Campbell, 1971). If the initial 
distribution of resources is perceived to be 
unfair and the transfer produces a distribu- 
tion that is perceived to be fair, we can expect 
the transfer agent to be judged more favor- 
ably than if he or she transforms an initially 
fair distribution of resources into an unfair 
one. From this point of view a theft by a 
person unjustly deprived from a person re- 
ceiving extra reward would be less severely 
judged than one that gave the thief an extra 
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THEFT, CHARITY, AND THIRD-PARTY TRANSFERS 


reward and subjected someone else to arbi- 
trary deprivation. A donation by a person 
arbitrarily receiving an excess of reward to a 
person arbitrarily receiving too little reward 
should be more favorably judged than a dona- 
tion that disrupts an initial distribution that 
is equal and fair. Finally, a third-party trans- 
fer that reduces arbitrary disparities in 
wealth by taking from the rich and giving to 
the poor should be more favorably viewed 
than a transfer that produces arbitrary dis- 
parities in wealth. 

Although we might expect transfers that 
increase equity in a distribution to be judged 
more favorably than transfers that decrease 
equity regardless of whether the transfers are 
direct (theft or charity) or third-party, it 
seems reasonable to expect equity to be most 
crucial for third-party transfers. Theft and 
charity, taking and giving, are acts with a 
fairly unambiguous moral quality which we 
might expect to elicit disapproval and ap- 
proval, respectively, regardless of their effects 
on the distribution. Third-party transfers, 
however, have a more ambiguous moral tone 
which might depend upon whether they 
are perceived as creating or correcting an 
injustice. 

The present study assumes that in a situa- 
tion where no obvious differences in merit 
exist people consider an equal distribution of 
rewards more fair than an unequal one, either 
because they believe rewards should be as- 
signed in direct proportion to merits (Adams, 
1965) or because they have a general prefer- 
ence for equality per se (Morgan & Sawyer, 
1967; Rawls, 1971). Subjects observed an 
actor who surreptitiously modified the dis- 
tribution of resources in a four-person group 
such that the actors own outcomes were 
either increased (by theft), decreased (by 
charity), or unchanged (by a third-party 
transfer), with the resultant distribution 
being either more equal and more equitable 
or less equal and less equitable. Thus the 
design of the study was a 2 x 3 factorial 
involving two kinds of initial and final distri- 
butions and three kinds of acts by the model. 

The design and the manipulations of the 
study are depicted graphically in Figure 1. 


EFFECT OF TRANSFER 
ON RESOURCES OF 
TRANSFER AGENT 


EFFECT OF TRANSFER ON EQUALITY OF 
DISTRIBUTION 


INCREASES EQUALITY DECREASES EQUALITY 


E 5 y 
n 


trnerT 
v4 


Decreases Resources fi 


NO CHANGE IN 
resources 
(THIRD PARTY 

TRANSFER) 


EZZ = acent oF tHaNsren O tance oF reansyee 

Ficure 1. Transfers that increase or decrease the 
equality of a distribution while increasing, decreasing, 
or leaving unchanged the resources of the transfer 
agent. 


The initial distribution of resources in each 
of the six conditions is shown to the left of 
the arrow in that condition. The final distri- 
bution, after the transfer, is shown to the 
right of the arrow. The resources of the model 
or the transfer agent (in this study a girl 
named Mary) are represented by the striped 
bar at the left of each distribution. The re- 
sources of people affected by Mary’s acts are 
represented by black bars. Thus theft is 
illustrated in row one as the striped bars in- 
crease and the black bars decrease. Charity 
is illustrated in row two as the striped bars 
decrease and the black bars increase. Third- 
party transfers are illustrated in row three as 
the striped bars remain the same while one 
black bar increases and one black bar de- 
creases. White bars represent the resources of 
individuals who are not affected by the trans- 
fer and hence always remain the same. 
Equality-producing transfers are illustrated in 
the first column where bars begin at unequal 
heights and finish at equal height, while in- 
equality-producing transfers are illustrated in 
the second column where bars begin equal in 
height and finish unequal in height. 


METHOD 
Subjects 


Subjects were 60 female fifth graders. Ten sub- 
jects were randomly assigned to each of the six 
experimental conditions. 


Procedure 


Each subject was taken individually to a trailer 
parked on the school parking lot. The experimenter 
(a 22-year-old female) said: 


We want to show you a movie about a girl named 
Mary, and then ask you what you think about 
Mary and several other girls mentioned in the 
movie, Alice, Jane, and Susan, In the movie, Mary 
is just finishing taking part in an experiment. The 
cameras that took this movie were hidden behind 
pictures in the walls of the experimental room, 
so that Mary, the girl in the movie, did not know 
that she was being observed. 


The experimenter monitored the subject’s viewing 
through a one-way mirror and returned when the 
film was done with a questionnaire which she asked 
the subject to complete. Finally, the experimenter 
asked the subject not to talk about the experiment 
to any of her friends. 


Videotape Script 


The tape began with the experimenter (a female 
college student) thanking the model, Mary (a sey- 
enth grader), for having participated in her experi- 
ment. The experimenter then said that she liked to 
reward the children who helped her in her experi- 
ments. Explaining that people have more fun when 
they can choose their own prize, the experimenter had 
Mary select a grab bag tied shut with a ribbon 
from a set of a half-dozen such bags on a table. 
Mary was told that each bag contains tokens worth 
10¢ apiece, but that the number of tokens varies 
from bag to bag. The purpose of this selection pro- 
cedure was to establish that differences in reward 
were clearly a matter of chance and unrelated to 
previous work. 

After Mary chose her bag, the experimenter had 
her carry it, unopened, into an adjacent “Prize 
Room” and stack up her tokens next to a tag bear- 
ing her name. Alongside Mary’s pile were stacks 
representing prizes earned by three previous sub- 
jects, Alice, Jane, and Susan. When Mary returned, 
the experimenter noted that Mary was the last per- 
son in the experiment today and asked her to help 
in recording the children’s prizes. The experimenter 
indicated that she had not yet been in the Prize 
Room and hence did not know if the piles were neat 
or how many chips anyone had. This statement and 
the earlier mention that the cameras were hidden 
from view were intended to establish a plausible 
belief by Mary that her transfer of the chips would 
go unnoticed by the experimenter so that her be- 
havior would not appear foolish. 


PHILIP BRICKMAN AND JAMES H. BRYAN 


While Mary was alone in the Prize Room, she 
counted the chips in each pile and then counted out 
the transfer appropriate to the experimental condi- 
tion. After she announced she was ready, the ex- 
perimenter entered and recorded the totals Mary 
reported for each child. Finally, Mary and the 
experimenter were shown leaving together on 
friendly terms. 


Manipulation of Independent Variables 


The initial and final distributions were presented 
graphically to subjects by a display of four stacks of 
poker chips that looked very much like the displays 
shown in Figure 1. In the equal conditions each girl 
had 24 chips. In the unequal conditions the advan- 
taged girl had 36 chips and the disadvantaged girl 
had 12 chips, while the other two girls each had `24 
chips. All transfers thus involved moving 12 chips 
from one girl’s pile to another girl’s pile. 


Measurement of Dependent Variables 


The first four questions on the questionnaire were 
designed to assess the subject's evaluation of Mary. 
They were: Do you approve of what Mary did? Do 
you think Mary is a good person? Do you like 
Mary? Do you think what Mary did was fair? On 
these and the other opinion questions the subject 
was asked to indicate her feeling by checking a » 
blank next to one of the five alternatives, not at all, 
a little, some, pretty much, and very much. The 
scales were scored from 1 for not at all to 5 for 
very much, Two other questions were designed to 
assess the subject’s evaluation of the final distribu- 
tion produced by Mary’s act. They were: Do you 
think the amounts of chips people had after Mary 
moved the chips were fair? Do you think the 
amounts of chips people had after Mary moved the 
chips would make people happy? Additional ques- 
tions asked subjects to remember how many chips 
each person had both after and before Mary moved 
the chips using a scale that was divided into units 
of 12 from 0 to 48. 


RESULTS 


The manipulation check items indicated 
that the allocations of chips were correctly 
perceived by subjects. Appropriate differences 
in recall of the number of chips held by Mary, 
Alice, Jane, and Susan were all statistically 
significant at beyond the .001 level. 

An index of subjects’ attitudes toward 
Mary was composed by summing their rat- 
ings on the first four questions. Since the 
patterns of means for all of these items were 
virtually identical, only the results for the 
index are presented. The mean favorability 
of subjects’ judgments of Mary and her action 
are shown in Table 1. 


THEFT, CHARITY, AND THIRD-PARTY TRANSFERS 


An analysis of variance of these judgments 
using effect of transfer on Mary’s outcome 
and effect of transfer on distributional equal- 
ity as independent variables indicated that 
both main effects and the interaction were 
significant. As can be seen from Table 1, 
transfers in which Mary decreased her out- 
come were judged more favorably than trans- 
fers in which Mary shifted other people’s out- 
comes without touching her own or transfers 
in which Mary increased her outcome, F 
(2,54) = 40.57, p < .001. Transfers that in- 
creased equality were judged more favorably 
than transfers that decreased equality, F 
(1,54) = 9.63, p < .01. This preference for 
equality-increasing transfers, however, only 
held for charitable and third-party transfers 
and not for thefts, a difference indicated by 
the significant interaction, F (2,54) = 4.10, 
p < .05. Third-party transfers that increased 
equality were judged more favorably than 
thefts that increased equality, whereas third- 
party transfers that decreased equality were 

. actually judged somewhat less favorably than 
thefts that decreased equality. 

Analyses of variance on ratings of fairness 
and happiness in the final distribution both 
indicated two significant main effects and no 
interaction. Final distributions produced by 
an equality-increasing transfer were rated as 
more fair, F (1, 54) = 50.29, p < .001, and 
more likely to make people happy, F (1, 54) 
= 7.55, p < .01. Distributions produced by a 
charitable transfer were rated as most fair, 
F (2, 54) = 4.08, p < .05, and most likely 
to make people happy, F (2, 54) = 13.71, p 
< .001. 


Discussion 


The most novel contribution of this re- 
search was the discovery of a Robin Hood 
effect, whereby a child who stole from the 
rich to give to the poor was more approved 
of by female fifth graders than a model who 
stole from peers for any reason or agents who 
stole from the rich to give to themselves. Pre- 
vious research could not have found such an 
effect because previous research considered 
theft and charity as two-party relationships 
differing only in whether the transfer was 
unilaterally enacted by the gainer (theft) or 
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TABLE 1 
ATTITUDE TOWARD TRANSFER AGENT AS A 
Function OF EFFECT OF TRANSFER ON 
Resources OF AGENT AND ON 
EQUALITY or DISTRIBUTION 


Effect of transfer on equality 


Effect of transfer on Sh aiseribution 
resources of transfer agent 


pind 
Increases resources (theft) 5.60 6.30 5.95 
No change in resources 
(third-party transfer) 9,20 5.60 7.40 
Decreases resources 
(charity) 15.80 11.70 | 13.75 
M 10.20 7.87 9.03 


_ Scale range is 4-20, high scores indicating positive 
evaluation. Error mean square = 8.485, 


by the loser (charity). In Robin Hood trans- 
fers, the transaction is initiated neither by 
the gainer nor the loser, but rather by a third 
party who is thus simultaneously engaged in 
theft and charity. These third-party transfers 
are evaluated more favorably, or more like 
charity, when they work to increase equity 
and equality in a distribution and more un- 
favorably, or more like theft, when they work 
to decrease equity and equality. Theft on be- 
half of another to redress injustice is evalu- 
ated more favorably than theft on one’s own 
behalf. There is no doubt an important les- 
son here for would be change agents. 

In the present study, the money taken from 
a single rich person was given to a single 
poor person. The Robin Hood of popular fic- 
tion presumably divided up the money taken 
from his rich victims in order to pass on 
smaller amounts to larger numbers of the 
poor. Thus a natural extrapolation of the 
present Robin Hood effect would suggest that 
an agent who donated or transferred smaller 
amounts to a larger number of beneficiaries 
would be judged more favorably than an 
agent who transferred an equivalent total in 
larger amounts to fewer beneficiaries. (On the 
other hand, the present results give no indi- 
cation that a thief who stole much from a 
single person would be judged any differently 
than a thief who stole little from many peo- 
ple.) We might also ask about the Robin 
Hood who takes a good percentage off the top 
for his trouble, as most real-life Robin Hoods 
do (Hobsbawm, 1959). Here we begin re- 
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search on the less romantic phenomenon em- 
bodied by various criminal or political ma- 
chines. 

We have no immediate reasons to believe 
that older subjects or males would respond 
differently than our sample of fifth-grade 
females, although there is evidence that fe- 
males may have a greater preference for equal- 
ity than males when equality and equity con- 
flict (Leventhal, Popp, & Sawyer, 1973). The 
boundary conditions for our present effects, 
however, can of course only be established by 
further research. 


Equality, Equity, and Wealth 


The acts in the present study which in- 
crease equality may also be seen as increasing 
equity among group members, since rich and 
poor persons gained their status essentially by 
lot. The preference for equality-increasing 
transfers might not apply to a situation in 
which the rich were seen as having earned 
their extra wealth by their own efforts. Al- 
ternatively, it is possible that third-party 
transfers may be enhanced only by their con- 
tribution to equity and not by their effect on 
equality, while charity may be enhanced by 
its contribution to equality regardless of 
equity. These issues can only be tested in a 
study that includes cases in which transfers 
that decrease equality also increase equity 
and cases in which transfers that increase 
equality also decrease equity. 

It might also be noted that in establishing 
conditions in which agents either increased or 
decreased their own resources and simultane- 
ously either increased or decreased the equal- 
ity of a distribution, we necessarily estab- 
lished these agents as relatively wealthier to 
begin with in some conditions than in others. 
Tt is possible that individuals with greater 
initial wealth are judged more favorably than 
individuals with less initial wealth, regardless 
of the chance origin of these disparities (cf. 
Lerner & Simmons, 1966). These differences 
in the initial wealth of the agent do corre- 
spond to differences in how favorably the 
agent was judged in the equality-increasing 
conditions, but they fail to predict the differ- 
ences which occur in the equality-decreasing 
conditions or in the third-party transfer con- 
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ditions. Future research is required to estab- 
lish what may be the effects of the initial 
wealth of the agents and the amounts trans- 
ferred by them on the evaluation of different 
kinds of transfers. 


New Directions for Research on Moral 
Judgment 


It seems to us that the present study offers 
a new paradigm for the study of moral judg- 
ment which supplies a valuable corrective to 
the overly “psychological” analyses of Piaget 
(1932) and Kohlberg (1964). The funda- 
mental question of nature, and of evolution, is 
how the resources of the environment are dis- 
tributed across species and among members of 
a given species. Correspondingly, in our view, 
the fundamental question of ethics, and of 
moral judgment, is how people should inter- 
vene to affect this distribution of resources 
across species and among members of the 
human race (Rawls, 1971). Thus there is no 
more important task for the psychology of 
moral judgment than the study of how people 
judge various distributions of valued goods 
(Brickman & Campbell, 1971). 

The elements of moral judgment and the 
developmental stages discovered by Piaget 
and Kohlberg may be expected to hold also 
for judgments of distributions. If we per- 
formed an experiment in which the distribu- 
tion the actor intends to create was dis- 
tinguished from the distribution the actor 
actually created, for example, we might find 
that older judges weighted more heavily the 
intentions of the transfer agent while younger 
judges weighted more heavily the conse- 
quences. On the other hand, it seems possible 
to us that work in the Piaget-Kohlberg tradi- 
tion has overemphasized the importance of 
intentions relative to the importance of the 
objective consequences of behavior as a de- 
terminant of mature moral judgment. When 
distributions of real resources are concerned, 
we suspect that good intentions would not, 
and should not, make up for unsatisfactory 
outcomes. 

Our use of the word transfer draws upon 
the special meaning attached to the idea of 
transfers in economics and suggests a number 
of connections between the present research 
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and problems in welfare economics. “Trans- 
fer payments” in economics refers to pay- 
ments made by one party to another solely 
for the purpose of redistributing the available 
income (e.g., gifts or welfare), as opposed to 
payments for goods or services. The idea of 
transfers is generally linked to the notion of 
creating an optimal distribution, one which 
maximizes the general welfare (Mishan, 
1960). It is only a short step from the present 
research to quantifying the extent to which 
various distributions are subjectively optimal 
for the people involved (Brickman, in press) 
and to studying how subjects distribute or 
redistribute resources in conflict situations 
(Brickman, 1974). 

Future research might also look at the 
effects of identical transfers when they are 
carried out by a legitimate transfer agent, an 
official elected or chosen for this purpose, as 
opposed to being carried out surreptitiously 
by a self-nominated individual as in the pres- 
ent study. We might then compare the evalu- 
ation of redistributions identical to the ones 
“of this study when they are labeled as pay- 
ments for goods and services, or earnings, 
rather than transfers. Finally, future work 
might study whether the evaluation of trans- 
fers depends upon the specific dimensions or 
goods involved—for example, whether egali- 
tarian transfers of health insurance are more 
strongly supported than egalitarian transfers 
of business profits. More generally, we sus- 
pect that when the future consequences of 
resources for the welfare of their owners are 
salient, egalitarian transfers will be more 
strongly approved than when such future 
consequences are not salient or when the re- 
sources are seen simply as rewards for past 
achievements. The present research offers a 
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framework and a simple, graphic method for 
pursuing these inquiries. 
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Escalation of Aggression: Experimental Studies 


Jeffrey H. Goldstein, Roger W. Davis, and Dennis Herman 
Temple University 


A finding commonly obtained in research using the Buss “aggression machine” 

is a main effect for trial blocks, indicating an escalation in shock intensity 

over trials. Theoretical explanations for this effect were tested ina modified 

verbal operant-conditioning situation. In Experiment 1, subjects could admin- EA 
ister any of 10 levels of positive reinforcement to a learner” for correct 

verbal responses or any of 10 levels of negative reinforcement to a learner 

for incorrect responses, Half of the subjects were required to begin with weak, 

and half with strong, reinforcements. Results indicated that, regardless of r 
condition, subjects gave more intense reinforcements as the learning trials 

progressed. Those who administered negative reinforcements devalued the 

learner relative to those who administered positive reinforcements, In Experi- 

ment 2, a role-playing procedure was used in which subjects administered 

either positive or negative reinforcements to a learner whose performance 

either did or did not improve over trials. Again, in all experimental groups, 

cts administered increasingly intense reinforcements over trials. The results 


subje l 
u and prosocial 


are interpreted as supporting a disinhibition theory of anti- 


behavior, 


the subject was to shock him for each error, 
The subject could vary the intensity, number, y 
or duration of shocks, which then served as 
the measure of his aggressiveness, 

One finding commonly obtained with the 
aggression machine “has been a main effect 
for trial blocks over which the mean intensity 
of shock was found to increase. Even for sub- 
jects in control groups, a trials effect has 
often been obtained in which subjects gave 
increasing numbers (intensity or duration) of 
shocks to the learner (Berkowitz, Lepinsky, & ^ 
Angulo, 1969; Brock & Buss, 1962, 1964; 
Buss, 1963, 1966a, 1966b; Geen & Stonner, 
1973; Zillmann, 1971), For example, a recent 
study by Buss, Booker, and Buss (1972) 
found that control subjects, who simply began 
the experiment as “teachers” on the aggres- 
sion machine, gave increasingly intense shocks 
to the learner, despite the fact that they were 
not previously frustrated, angered, or sub- 
jected to any other aggression- or emotion- 
inducing treatment. In Experiment 2 of the J 
Buss, Booker, and Buss series, the authors 
noted that “The trials effect was highly sig- 
nificant; the upward drift of shock intensity 


Much current experimental work on ag- 
gression employs what has been referred to 
as the “aggression machine” (Buss, 1961) as 
a means of generating the dependent measure 
(e.g., Bennett, Buss, & Carpenter, 1969; 
Berkowitz, 1965; Berkowitz & Geen, 1966, 
1967; Buss, 1963, 1966a, 1966b; Geen & 
Pigg, 1970; Geen & Stonner, 1973; Hart- 
mann, 1969; Lefcourt, Barnes, Parke, & 
Schwartz, 1966; Loew, 1967; Walters & 
Llewellyn-Thomas, 1963; Wolfe & Baron, 
1971). In most such studies the subject was 
informed that his role was to teach some task 
to a learner, actually a confederate of the 
experimenter. The learner, through a prear- 
ranged schedule, made a number of errors and 

i iment 1 constituted the senior 
Rani cea author. We are indebted 
to the Biokinetics Research Laboratory, Depart- 
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the design and conduct of this research, and Joseph 
Kremer assisted with data collection. David Kipnis 
kindly commented on an earlier version of the manu- 
script. D. H. is now at the Thomas Jefferson Col- 


lege of Medicine. 
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Jeffrey H. Goldstein, Department of PON 
519 Weiss Hall, Temple University, Philadelphia, 
Pennsylvania 19122. 
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has occurred in virtually every study with the 
aggression machine. Thus, it appears to be a 
constant of research with this instrument, and 


tu 


w 


‘ 


x» it need not be discussed further [p. 298].” 


In explaining the significant trials effect, 
Buss (1966b) has noted: 


The explanation of this upward drift . . . is spec- 
ulative but straightforward. There are 10 shock but- 
tons; most subjects start near the bottom end and 
remain in the 1-5 range. As the learning series 
progresses, subjects tend to try out different shock 
levels. Since the subjects are initially at the low 
end, increased variability must yield an upward 
drift in mean shock levels. . . . (p. 160) 


There are theoretical reasons for suspecting 
that the trials effect is not a simple instru- 
mentation artifact, however. There are at least 
three explanations for a reliable increase in 
aggression (shock) over trials, frustration- 
aggression theory (Dollard, Doob, Miller, 
Mowrer, & Sears, 1939), the theory of cogni- 
tive dissonance (Festinger, 1957), and the no- 
tion of disinhibition of aggression (Wheeler, 
1966; Wheeler & Caggiula, 1966; Zimbardo, 
1969). 


Frustration-A ggression 


Buss (1961, p. 50) has argued that mild 
shocks are instrumental to the teacher’s task 
while more intense shocks are likely to be 
aggressive in character. But as the trials ef- 
fect demonstrates, subjects tend to give more 
intense shock as the learning trials progress. 
This could reflect the frustration subjects feel 
when mild shocks fail to prevent the learner 
from giving incorrect answers. If giving a 


+, shock fails to bring about the teacher’s de- 


sired aim then he is likely to feel some frus- 
tration and shock at a higher level on the next 
trial. If this also fails to eliminate learner 
errors, the subject will feel frustrated because 
of his failure to teach the learner and because 
his previous shock failed to result in a change 
in behavior. Thus, with each shock that fails 
to eliminate errors, the teacher becomes more 
frustrated and to reduce the frustration con- 
tinually raises the intensity of administered 


shock. 


Disinhibition of Aggression 
One interpretation of the trials effect is that 


subjects gradually “warm up” to punishing | 


the learner; their aggression becomes increas- 
ingly disinhibited over trials.* Even in studies 
which do not preexpose subjects to aggressive 


ESCALATION or AGGRESSION: EXPERIMENTAL STUDIES 


163 


stimuli, administering shock without aversive 
consequences to the teacher may gradually 
disinhibit his aggressive “impulses” and result 
in escalation. 


Cognitive Dissonance 


Aronson (1972) offers yet another possible 
explanation for the phenomenon in terms of 
devaluation of the “victim” (learner). Accord- 
ing to dissonance theory (Festinger, 1957), 
the teacher who harms a learner for insuffi- 
ciently motivating reasons might tend to de- 
value him. This devaluation would justify 
further increases in shock intensity, which 
would lead to further devaluation, and so on, 
resulting in escalation. Support for the notion 
of victim devaluation has been obtained by 
Brock and Buss (1962, 1964) and Lerner 
and Simmons (1966). 

The present studies were designed to de- 
termine whether the trials effect so often 
reported in the literature is a reliable phe- 
nomenon, rather than an instrumentation 
artifact, and, if so, which of the several theo- 
retical explanations proposed above offers the 
best fit in explaining the data. In order to 
determine whether the escalation of aggression 
obtained by others was a function of the ag- 
gression machine, an experimental situation 
was needed which contained the essential 
properties of the aggression machine, but 
which did not rely upon the machine itself. 
In addition, in order to determine which theo- 


1 This explanation is particularly appropriate to 
studies which first expose subjects to aggressive cues, 
such as aggressive films or models (e.g., Bandura, 
Ross, & Ross, 1963; Goldstein & Arms, 1971; Gold- 
stein, Rosnow, Raday, Silverman, & Gaskell, in press; 
Liebert & Baron, 1972; Zillmann, 1971). Such studies 
typically report that viewing violence leads to an 
increase in aggression. Two mechanisms by which 
this increase can come about are through a change 
in the observer’s level of aggressive drive or arousal, 
or through a reduction in restraints against aggres- 
sing. If the observation-aggression dynamic is an in- 
crease in the observer’s aggressive drive, then we 
might expect drive to be highest immediately after 
viewing violence, and this should lead to greater 
shock intensities on initial, rather than later, trials. 
On the other hand, if viewing violence lowers ob- 
servers’ inhibitions against aggressing, then as they 
shock the learner they ought to become increasingly 
disinhibited, the result being an escalation of ag- 
gression over trials. 
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TABLE 1 


POSITIVE AND NEGATIVE VERBAL 
REINFORCING STATEMENTS 


Scale | Inten- 
Statement valie ani 
Rewarding 
Okay. 2.1 1 
That’s good. 4.3 2 
T like that. 4.6 3 
‘That was a wise thing to say. 48 4 
Yes. 5.2 5 
Very good. 5.3 6 
Wonderful! 5.8 7 
Great! 6.5 8 
Excellent! 8.3 9 
That’s fantastic! 8.4 10 
Punishing 
I don’t like that. 4.1 1 
That’s no good. 4.6 2 
That’s stupid. 5.3 3 
Never! 6.5 4 
You're a jerk. 7.4 me 
That’s a dumb thing to say. 7.6 6 
How idiotic can you be? 8.4 7 
Idiot! 8.5 8 
I never met anyone as dumb as you. | 8.6 9 
Stupid son of a bitch. 9.8 10 


retical explanation affords the best fit with the 
trials effect, a situation was needed which 
could reliably distinguish between these sev- 
eral alternatives. The situation chosen was a 
modified version of the verbal operant-condi- 
tioning paradigm (Taffel, 1955), in which the 
subject (teacher) reinforced verbal responses 
made by another (learner) in order to influ- 
ence the frequency of usage of a class of re- 
sponses (first-person pronouns). Teachers 
could administer verbal reinforcements which 
varied in intensity from weak to strong. 

To determine whether escalation is due to 
trying out different available levels of shock, 
subjects in Experiment 1 were required to 
begin at either the lowest or the highest avail- 
able level of reinforcement intensity. If Buss 
is correct, those who begin at the lowest level 
ought to gradually try higher levels, resulting 
in escalation; however, those who begin at a 
high level ought to try lower levels, resulting 
in a gradual deescalation. 

In order to test the frustration-aggression 
explanation, some subjects were to reward 
correct responses of the learner rather than 
punish incorrect responses. If escalation of 
punishment is due to frustration, the effect 
should be obtained only for those who punish 


&Z 
the learner. That is, for subjects who reward %4 >» ‘ 
correct responses there should be no escalation F 
of reward intensity over trials; if anything, j 


failure to produce the desired response class f 
should result in a decrease in reward intensity 
over trials. 

According to the dissonance interpretation, 
subjects who punish the learner should de- 
value him, and this devaluation should lead to 
further punishment. In order to test this hy- 
pothesis, subjects were asked to evaluate the 
learner on dimensions of likability, coopera- 
tiveness, and intelligence after rewarding or 
punishing him. In addition, since dissonance 
might be reduced by devaluing oneself, sub- 
jects were also asked to rate their own effec- 
tiveness as teachers. Dissonance theory pre- 
dicts that those who punish the learner for 
errors should devalue him relative to those 
who reward correct responses. k 

The disinhibition of aggression interpreta- ~ 
tion predicts an escalation of punishment over 
trials. For subjects who reward correct re- 
sponses there might also be an escalation ofy 
reward if we assume that some prosocial mo- 
tivation also becomes gradually disinhibited. 
This latter prediction assumes that not only 
are there inhibitions about aggressing against 
others but that there are also inhibitions 
about expressing rewarding behavior toward 
others. 


EXPERIMENT 1 


Method 
Experimental Task 


The standard verbal operant-conditioning para- 
digm was modified so that subjects would teach a 
learner (confederate) to use first-person pronouns 
in a sentence-construction task. There were 40 trials, 
and on 20 of them the learner made up sentences 
with I or WE, while on the remaining 20 he used 
second- and third-person pronouns, The 20 correct 
and 20 incorrect sentences were in random order, 
with the exception that the first sentence was cor- 
rect and the second incorrect. Subjects could de- 
liver any of 10 verbal reinforcements which varied in ¥ 
intensity (see Table 1). These statements were ob- 
tained from 30 male student judges by the method 
of equal-appearing intervals. Half of the subjects in 
each condition were told to give a Level 1 rein- 
forcement on the first trial to be reinforced, and 
half were told to begin with a Level 10 reinforce- 
ment. After this initial positive or negative rein- 
forcement was administered, subjects were free to 
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use any level they thought appropriate for condi- 
tioning I or WE responses. 


Subjects and Procedure 


The subjects in the experiment were 48 male 
undergraduate students at Temple University who 
volunteered to participate in an experiment on “ver- 
bal behavior.” Data from 4 subjects were dis- 
carded, 2 for incomplete data sheets, and 2 because 
of suspiciousness. Subjects were run individually. 
When the subject arrived at the laboratory, he was 
met by a male experimenter and briefed on the 
basic verbal-conditioning procedures. In many cases 
subjects had already been exposed to this material 
in their coursework. Each subject was told that he 
would be the teacher and another subject the 
learner, and that the teacher’s task was to condition 
I or WE responses in the learner. He was to hand 
the learner a 3 inch X 5 inch index card on which 
was printed a verb and the pronouns I, HE, YOU, 
WE, SHE, THEY in random order. The learner’s 
task was to make up a sentence with the verb and 
one of the six pronouns. The teacher’s task was to 
get the learner to use I and WE as often as possible. 
At this point subjects were randomly assigned to 
one of four experimental conditions in a 2 X 2 fac- 
torial design. The two independent variables were 
whether the subject would reward correct responses 


„ Or punish incorrect responses and whether the initial 


reinforcement would be at a low or high level. Sub- 
jects were given a list of the appropriate verbal re- 
inforcements in each condition, along with their 
rank order intensity, as in Table 1. 


Dependent Measures 


For each trial, the subject recorded the learner’s 
response as well as the level of reinforcement he 
administered. The learner, a Temple undergraduate 
who was an experimental accomplice, “made up” 20 
I or WE sentences and 20 non-I or non-WE sen- 
tences on the 40 trials. The learner’s sentences were 
actually prepared in advance and he gave the same 
sentences in the same order for all teachers. 

A wooden partition was placed between the teacher 
and learner, purportedly to “eliminate nonverbal 
cues and gestures” from influencing the Jearner’s 
responses. The partition had a slot cut through the 
bottom so that the index cards could be passed back 
and forth. Following the 40 learning trials, teachers 
completed a questionnaire which contained items for 
evaluation of the learner and the teacher's self- 
rating of his performance. These items were 7-point 
rating scales of the learner’s likability, cooperative- 
ness, and intelligence, and an item on the subject’s 


“ evaluation of his own effectiveness as a teacher. Fol- 


the experiment, subjects were interviewed 


orug n the study and were 


about their performance i 
thoroughly debriefed. 


Results 


The 40 learning trials contained 20 correct 
and 20 incorrect sentences SO each subject 
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administered 20 reinforcements. Since the in- 
tensity of the initial reinforcement was pro- 
grammed by the experimenter, there were 
actually 19 freely given reinforcements for 
each subject. The first of these 19 reinforce- 
ments was counted twice in order to yield 
equal numbers of trials per block and the 20 
levels of reinforcement were divided into four 
blocks of five trials each. A 2 X 2X 4 re- 
peated measures analysis of variance was 
computed on intensity of reinforcement. The 
factors in the analysis were reinforcement 
(positive or negative), initial level (1 or 10), 
and trial blocks. Cell means are presented in 
Table 2. It can be seen that an escalation 
effect was present in all four experimental 
groups, the main effect for trial blocks being 
significant, F (3, 120) = 5.83, p < .01. There 
were no interaction effects involving trial 
blocks which approached statistical signifi- 
cance. Thus, escalation, as indicated by the 
significant trial blocks effect occurred uni- 
formly across the four experimental condi- 
tions. Trend analysis using orthogonal poly- 
nomials was computed for trial blocks effects 
and the linear main effect for trials was 
highly significant, Æ (1, 40) = 12.86, p< 
.001. None of the interaction trends was sta- 
tistically significant. 

The analysis of variance also yielded a 
significant main effect for reinforcement, F 
(1, 40) = 17.83, p < 001, a main effect for 
initial level, F (1,40) = 7.96, p < .01, and a 
significant interaction effect, F (1,40) = 8.24, 
p<.01. The main effect for reinforcement 
indicated that those in the positive reinforce- 
ment conditions gave more intense reinforce- 
ments than those in the negative-reinforce- 
ment conditions. Those who began at Initial 
Level 10 gave more intense reinforcements 
throughout the trials than those who began at 
Level 1. The interaction indicated that those 
in the Negative Reinforcement-Level 1 condi- 
tion gave lower intensity reinforcements than 
subjects in any other group. If the scale val- 
ues of the reinforcements are analyzed rather 
than their ordinal values, the same statistical 
effects are obtained. 

A 2 X 2 analysis of variance was computed 
on the subjects’ evaluations of the learner and 
on self-ratings of competence as a teacher. 
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TABLE 2 
MEAN INTENSITY OF REINFORCEMENT OVER Four BLOCKS OF TRIALS 
Reinforcement 
Negative Positive 
Experiment 
Trial block Trial block 
1 2 3 4 1 2 3 4 
1 
Initial intensity 
1 12.55 17.27 17.00 19.18 27.18 29,36 29.36 32.10 
10 25.36 25.56 25.45 31.36 28.64 28.00 29.91 31.10 
2 Ż 
Series 
Random 12.88 20.13 17.88 23.50 27.78 34.44 38.67 38.89 
Improvement 9.29 14.57 17.29 20.57 27.00 34.33 33.67 37.11 


Note. Each entry represents th 
Experiment 1 and from 7 to 9 in 
ment 2, 


The three scales designed to measure evalua- 
tion of the learner were combined to yield a 
global evaluation score. The means for other- 
and self-evaluation are presented in Table 3. 
Analysis of subjects’ evaluations of the learner 
yielded only one statistically significant effect, 
the main effect for reinforcement, F (1, 40) 
= 4.64, p < .05. Subjects who administered 
negative reinforcements devalued the victim 
relative to those who gave positive reinforce- 
ments. There were no significant main effects 
between groups on self-evaluations, ps > .10, 
although the’ interaction effect approached 
Statistical significance, F (1, 40) = 2.86, p < 
10, 


Discussion 


The results of Experiment 1 indicated that 
the main effect for trial blocks obtained in 
previous research using the aggression ma- 
chine is a reliable phenomenon and not a sim- 
ple function of the instrument itself. Further- 
more, the escalation of intensity of reinforce- 
ment over trials occurred with positive as 
well as with negative reinforcement. Thus, it 
does not appear to be an aggression-specific 
phenomenon. 

On the basis of these findings we can rule 
out several theoretical explanations for the 
trials effect. Buss’s notion that escalation oc- 
curs because subjects want to try out differ- 
ent available responses is rejected because 


sum of five trials, The higher the score, the more intense the reinforcement. Cell ns are 11 in 
xperiment 2. Within-subjects mean square error is 32.87 in Experiment 1 and 31.45 in Experi- 


when subjects began at an initially high level 
of reinforcement, they tended to remain at a 
high level, rather than gradually trying the 
lower intensity reinforcements. The finding 
that there was also a trials effect for those 
who administered positive reinforcements 
casts doubt on the frustration-aggression ex- 
planation, since frustration due to the learn- 
er’s failure to perform his task correctly 
should not encourage more intense positive 
reinforcements but less. There was some sup- 
port for the cognitive dissonance explanation 
in our finding that those in the negative-rein- 
forcement conditions devalued the victim 


r 


compared to those in the positive reinforce- *~ 


ment conditions, and there was also support 
for the disinhibition explanation. Actually, 
given the present data we do not know 
whether learners were devalued in the punish- 
ment conditions or overvalued in the reward 
conditions, and Experiment 2 was designed, 
in part, to answer this question. Subjects 
tended to give more intense punishment over 
trials in accordance with the assumption of a 
disinhibition of an aggressive drive and more 
intense rewards over trials in accordance with y 
the assumption of a disinhibition of a “pro- 
social drive.” 

There is also a reasonable alternative ex- 
planation for our findings, which Experiment 
2 was also designed to explore. It is possible 
that teachers simply felt that low levels of 
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t TABLE 3 
EVALUATION OF LEARNER AND SELF 
Reinforcement ie. 
Experiment Negative | Positive Control 
Other Self Other Self Other Self 
1 (Initial intensity) 
‘te ieee 3.91 16.82 3.55 
- 4 A 3.00 15.18 A 
2 (Series) x ae 
Random 11.75 3.75 10.78 3.78 14.17 4.33 
Improvement 13.29 4.43 13.67 4,33 14.00 4.80 


Note. Cell ns = 11 in Experiment 1 and from 6 to 9 in Experi E an square = 4.79 in Experi 
e D>, 2xperiment 2, Error mean square = 4.79 in Experiment 1 a 1 
Experiment 2, for other and error mean square = 1.53 in Experiment 1 and 1,34 in Experiment 2 for self. periment 1 and 1.42 Ñ 


reward or punishment were not effective in 
shaping the behavior of the learner, and so 
they continued to raise the intensity of rein- 
forcement. Subjects, if this was the case, 
would perceive the reinforcement they admin- 
istered as noninstrumental, since the learner 
continued to make errors over the 40 trials. 


» Perhaps if the learner’s performance im- 


proved, subjects would be content to stick 
with weak levels of reward and punishment, 
so long as they were instrumental in changing 
the learner’s behavior. In Experiment 2, sub- 
jects administered either positive or negative 
reinforcement to the learner, and for half of 
the subjects, the learner’s performance stead- 
ily improved over trials, while for the remain- 
ing half his performance was random across 
40 trials, as in Experiment 1. 


EXPERIMENT 2 


The second experiment was designed to 
determine whether the performance of the 
learner influenced the intensity of reinforce- 
ment over trials delivered by the teacher. 
Specifically, the study sought to test the hy- 
pothesis that increasing reinforcement inten- 
sity is due to the teacher’s inability to alter 
the learner’s behavior. In addition, the study 
was designed to determine whether those who 
punished the Jearner’s errors devalued the 
learner or whether those who rewarded cor- 
rect responses enhanced evaluation of the 


learner. } f 
Given the results of Experiment 1, which 


Wfhnd support to the disinhibition hypothesis 


and the cognitive dissonance hypothesis, it is 
interesting to speculate what effect learner 
performance might have vis-à-vis these two 
theories. According to dissonance theory, sub- 
jects should devalue the learner when they 
administer punishments to him, regardless of 
the learner’s performance. If we can assume 
that improving performance (i.e., increased 
use of I and WE over trials) provides less 
justification for teachers to punish, then there 
should be more devaluation in the improve- 
ment than in the random-performance condi- 
tion, since these subjects should experience, 
the most dissonance. The disinhibition hy- 
pothesis predicts that, once punishment is 
administered, it becomes increasingly easy to 
administer more intense punishment, regard- 
less of the behavior of the learner. That is, 
disinhibition is an autonomous process which 
should operate independently of feedback 
from the victim and depends only upon pro- 
prioceptive feedback (cf., Goldstein, in press; 
Zimbardo, 1969). Thus, regardless of whether 
the learner’s performance improves over trials, 
an escalation effect should occur. 

In Experiment 2 subjects rewarded correct 
learner responses, or punished incorrect re- 
sponses, or neither. For half of the subjects 
in each reinforcement condition, the learner’s 
performance improved continually over trials, 
while for the remaining subjects it was ran- 
dom, as in the first experiment. The same 
postexperimental questionnaire used in Ex- 
periment 1 was administered to all subjects 


following the 40 learning trials. 


168 


Method 


The procedure was modified from Experiment 1 
to make data collection more efficient. Rather than 
running subjects individually, the learner’s sentence 
construction responses were tape recorded. Subjects 
heard a tape of the learner’s 40 sentences, with 7 
seconds between successive sentences. After each 
sentence, subjects were to indicate what reinforce- 
ment intensity they would administer in order to 
influence the learner’s performance. The study was 
presented to subjects as a role-playing experiment in 
which they were to imagine themselves as “behavior 
modifiers” whose task it was to encourage I and WE 
sentences. 

The subjects were 47 students at Temple Uni- 
versity enrolled in two undergraduate courses. Three 
subjects were dropped from the analysis due to in- 
complete response sheets. After being briefed on the 
nature of verbal operant conditioning and behavior 
modification techniques, subjects were given one of 
three booklets. One-third of the subjects were told 
to indicate which of 10 positive reinforcements they 
would administer after each I or WE sentence; 
those assigned to the negative-reinforcement condi- 
tion were told to indicate which of 10 punishments 
they would administer after each non-I or non-WE 
sentence; and the remaining third were assigned to 
the no-reinforcement condition and were told simply 
to listen to the tape of the learner’s performance. 
Following the tape, all subjects rated the learner’s 
performance, intelligence, and likability. The study 
was conducted in two classes, one class hearing a 
tape of the learner’s sentences with I and WE used 
20 times randomly throughout the 40 sentences, and 
one class hearing I and WE used increasingly as 
the 40 trials progressed. In this latter condition, I 
or WE sentences were used two times in the first 
10 sentences, four times in the second 10, six times 
in the third 10, and eight times in the last 10. 


RESULTS 


Evidence that the role-playing technique 
used in Experiment 2 was successful can be 
seen in the “Random” row of Table 2. The 
mean reinforcement intensity for the four 
blocks of trials in the negative and positive 
conditions replicates the findings of Experi- 
ment 1. 

A 2X2 4 repeated measures analysis of 
variance was computed on reinforcement in- 
tensity, the factors being learner’s perform- 
ance (random or improving), reinforcement 
(positive or negative), and trial block. 

Only two effects reach statistical signifi- 
cance, the trial blocks main effect, F (3, 87) 
= 21.57, p< .001, and the reinforcement 
main effect, F (1,29) = 29.14, p< .001. 
The trials effect, as in Experiment 1, indi- 
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cated a general escalation of intensity over 
trials, regardless of whether the learner’s per- 
formance improved or whether subjects ad- 
ministered positive or negative reinforcement. 
The reinforcement main effect, as in the 
first experiment, indicated that those in the 
positive-reinforcement conditions administered 
more intense reinforcements than those in the 
negative-reinforcement conditions. 

Three 7-point scales were included in the 
postexperimental questionnaire designed to 
assess evaluation of the learner. These three 
items were summed and analyzed in a 2 X 3 
analysis of variance. Unlike the analysis of 
these same items in Experiment 1, there were 
no statistically significant differences between 
groups in evaluation of the learner, nor were 
there any differences between groups in self- 
evaluations (see Table 3). 


GENERAL Discussion 


Experiment 2 replicated the escalation re- 
sults of the first study and also found an 
escalation of both punishment and reward” 
even when the learner’s performance steadily 
improved over trials. This last finding sug- 
gests that the escalation is not dependent 
upon feedback from the learner. Failure to 
find devaluation of the learner indicates that 
cognitive dissonance does not always play a 
major role in the escalation process. It is pos- 
sible that devaluation of the victim occurs in 
face-to-face situations (Experiment 1) but 
not in less personal settings (Experiment 2). 

Most consistent with the results of both 
experiments is the disinhibition hypothesis. 
Once subjects engage in a particular type of 
response (either rewarding or punishing) by 
administering a reinforcement, they tend to 
become more extreme in their behavior over 
time. 

The results of the two experiments reported 
here indicate that the escalation effect ob- 
tained in laboratory research on aggression is 
a reliable phenomenon and is not restricted to 
situations in which the aggression machine is 
used. Rather, the effect seems to be a general 
one occurring for rewarding as well as for 
punitive responses, regardless of the learner’s 
performance. The theoretical explanation 
which seems best able to account for this 
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phenomenon is the notion of a general dis- 
inhibition of anti- and prosocial behaviors. 

The concept of disinhibition is theoretically 
and conceptually vague, but we can offer some 
speculations about its operation. In most 
complex societies individuals learn norms ap- 
propriate to both the display as well as the 
inhibition of aggressive behaviors, and quite 
often these norms are ill-defined or contra- 
dictory. In a potential aggression situation, 
where there are cues both to express and to 
inhibit aggressive behavior, the individual ex- 
periences some conflict. That is, situations, 
by their complexity or ambiguity may call 
into play both aggressive and antiaggressive 
norms. This conflict must be cognitively re- 
solved before the individual acts, and the par- 
ticular resolution depends on the relative 
strength and number of competing norms 
aroused. Two processes are likely to occur 
once the individual decides to act aggres- 
sively; first, there is a spreading apart of the 
two competing norm systems such that pro- 
aggressive norms become psychologically more 
potent while antiaggressive norms become less 
important; and second, in face-to-face situa- 
tions, additional cognitions are apt to be 
invoked by the aggressor to justify his actions. 
Among these latter cognitions are additional 
reasons for his behavior, such as devaluation 
of the victim as found in Experiment 1. With 
regard to escalation in the reward conditions, 
a similar interpretation can be proposed. In 
Western society we learn norms for the ex- 
pression of prosocial, rewarding behaviors, 
some of which are contradictory or ambigu- 
ous. Although space precludes an extensive 
discussion of the development of inhibitory 
systems, Pavlov (1927, p. 109) has discussed 
the association of stimuli with the failure to 
act as a means of developing what Scott 
(1958, p. 11) has called “passive inhibition.” 

The present studies have fairly broad im- 
plications for the experimental study of ag- 
gression. First, since aggression tends to in- 
crease over trials among control subjects, 
even in the absence of exposure to aggression- 
inducing stimuli, increases in aggression 
among experimental groups must be inter- 
preted with caution. On a broader level, the 
general procedure of the present studies, par- 
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ticularly Experiment 1, may provide an al- 
ternative method of measuring punitiveness 
to the aggression machine. In view of the 
ethical concerns which are now becoming pre- 
eminent in experimental research on human 
subjects, alternatives to the use of sham 
shock procedures will eventually have to be 
developed (cf. Goldstein, 1974; Holper, Gold- 
stein, & Snyderman, 1973). The verbal-condi- 
tioning paradigm of Experiment 1, although 
still containing issues of ethical concern (pri- 
marily a deceptive experimental guise), may 
be a more satisfactory procedure than the 
aggression machine. 

One additional theoretical system which 
may be applicable to the present studies 
should be mentioned, Helson’s (1964) adap- 
tation level theory. According to adaptation 
level theory, a given level of reinforcement 
may, after repeated use, become psychologi- 
cally less potent, that is, the subject may 
adapt to a particular reinforcement, thus 
requiring a more intense level on subsequent 
trials to maintain the same subjective inten- 
sity. Adaptation level theory predicts the same 
results as the disinhibition hypothesis, but 
for dynamically different reasons. 

It is interesting to note that studies of 
animal aggression also report what we might 
call an escalation of aggression, either in the 
form of increased intensity of attack over time 
or of decreased latencies of attack over suc- 
cessive aggressive encounters (e.g., Frederic- 
son, Story, Gurney, & Butterworth, 1955). 
The escalation effects found in the present 
studies may be similar in nature to those 
found in several extralaboratory situations, 
such as in the U.S. aerial bombardment of 
North Vietnam during the 1966-67, 1971, 
and 1972 periods and in what has come to be 
called the battered child syndrome, in which 
simple physical punishment escalates to a 
level of brutalization of a child. 
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Childbirth as Crisis: A Test of Janis’s Theory of 
Communication and Stress Resolution 


Judith M. Levy and Richard K. McGee 
University of Florida 


The relationship between expectation and subjective outcome in childbirth 
was investigated. Sixty primigravidae responded to a questionnaire before and 
after labor and delivery. The evaluation of childbirth as favorable or un- 
favorable was related to whether the experience was rated better or worse 
than expectation. Anticipation of labor and delivery was positively related to a 
woman’s perception of her mother’s experience in childbirth, but was not 
related to the woman’s evaluation of her actual delivery experience. Women 
receiving extreme communications or no information from their mother reported 
poorer outcomes than women who received moderate communications. Anxiety, 
as measured by the Taylor Manifest Anxiety Scale, and denial, as measured 
by the Self-Criticism Scale of the Tennessee Self-Concept Scale, were not 
related to anticipation or subjective outcome measures. Results were interpreted 


if 


fi 


within Janis’s theory of psychological stre: 


Childbearing, as a significant maturational 
period in a woman’s life, provides a natural 
experiment in crisis resolution. Janis’s theory 
of psychological stress is particularly relevant 
to the childbirth situation since it specifies 
the kinds of prestress-impact attitudes associ- 
ated with resolution of the experience. 

Janis (1958, 1965) defines psychological 
stress as the reaction to a physically danger- 
ous event in which pain, bodily injury, or 
death is anticipated. The way an individual 
psychologically handles the impending crisis 
during the preimpact period is an important 
determinant for the outcome. Janis postulates 
that a “work of worry” is essential for suc- 
cessful outcome. This anticipatory rehearsal 
or imaginative construction of future events 
serves to bind anxiety and later functions to 
reduce uncertainty at crisis impact. In his 
studies of surgical patients, Janis found a 
relationship between magnitude of preopera- 
tive anxiety, the way in which the patient 
handled his anxiety, and the postoperative 
course of recovery. A curvilinear relation- 
ship was postulated between level of anticipa- 
tory fear and crisis outcome, with both ex- 
treme worry and denial of danger resulting in 
poor outcomes. Those persons indicating no 
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fear of the impending surgery, who con- 
fronted the event by joking or asserting its 
triviality, as well as those highly anxious pa- 
tients who coped with distracting games and 
fantasies, had a higher probability of a longer 
postoperative course of recovery than did in- 
dividuals with moderate amounts of anxiety. 
Fear arousal then appears to have “psycho- 
logical inoculation” effects such that the in- 
dividual is better able to understand and 
control the adversities of the stress episode by 
his or her rehearsal of events. The person who 
remains unconcerned about impending danger 
fails to prepare for it by worrying. At impact, 
efforts at intellectual denial are no longer 
successful and the stress is augmented since 
more suffering is experienced than had been 
expected. 

It is generally agreed that the attitudes a 
woman holds toward childbearing are those 
which have been communicated directly or in- 
directly by her mother. Klein, Potter, and 
Dyk (1950) found that family experiences 
rather than current knowledge influenced a 
woman’s expectations of childbirth as favor- 
able or unfavorable. Ritson (1966) tested the 
hypothesis that a woman in labor identifies 
with her mother’s experience of pregnancy 
and childbirth. In a retrospective design, 
women having had normal and prolonged 
labors were asked about their mother’s experi- 
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ence. In the normal group 70% indicated 
heir mothers had had difficult labors; in the 
prolonged group only half recollected mother’s 
difficulty. Ritson concludes from this data 
hat the mother’s childbearing experience is 
not significant. in predicting the daughter's 
performance in labor. Ritson was surprised, 
however, by the finding that women who had 
experienced long, arduous labors “did not in 
‘act find it prolonged beyond their expecta- 
ion.” Watson (1959) similarly in his inter- 
views with women whose labors were at least 
23 hours found that most indicated the ex- 
perience was not as bad as they thought it 
would be. 

The relationship between expectancy, 
identification with mother’s experience, and 
evaluation of one’s own labor and delivery is 
indicated by a comparison of childbirth ex- 
pectations of Gravid I and IT women (Levy, 
1966). It was found that for primigravidae 
(first pregnancies), expectations of difficulty 
in delivery were similar to their perceptions 
of their mother’s evaluation of childbirth. 
For women anticipating their second de- 
liveries, expectations were in the reverse di- 
rection from the way they perceived their 
mother’s evaluation. Thus, if a primigravida 
recalled her mother’s experience as easy, she 
was likely to imagine her anticipated delivery 
would be easy. On the other hand, if a multi- 
gravida recalled her mother experiencing easy 
deliveries, she imagined her anticipated sec- 
ond delivery would be a difficult one. Since 
the major difference between the groups was 
the experience of childbirth, the difference in 
anticipatory attitudes was likely to derive 
from the presence or absence of that experi- 
ence. This was confirmed in the significant 
negative relationship between the multi- 
gravida’s evaluation of her first delivery with 
her mother’s evaluation. 

The following sequence of events is sug- 
gested. A mother’s evaluation of delivery is 
the most influential information a woman has 
about. childbirth. In preparation for her first 
delivery, a woman imagines that her experi- 
ence recapitulates her mother’s experience. 
Her own experience then becomes the most 
salient source of knowledge about childbear- 
ing. After delivery, the woman no longer 
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organizes her ideas of childbirth around her 
mother’s experiences. The aspects of indepen- 
dence and growth are reflected in this process. 

If a mothers communication concerning 
labor and delivery is the prime source of in- 
formation for the primigravida, then the kind 
of information commmunicated is related to 
the level of anticipatory fear. It was found 
(Levy, 1966) that when a woman reported 
that her mother had communicated an un- 
favorable impression of childbirth, that the 
woman’s own experience in delivery tended to 
be better than she expected. Moreover, she 
anticipated her second delivery would be 
easier in comparison with the first. Con- 
versely, when the mother communicated fa- 
vorably about labor and delivery, the subject 
tended to view her own experience as worse 
than she anticipated and further to imagine 
that her second delivery would be harder than 
her first. It was also found that extremes in 
either direction, that is, recalling mother’s 
experience as extremely positive or extremely 
negative resulted in an unfavorable childbirth 
experience for the woman. While sample size 
precluded analysis, these preliminary results 
appear consistent with Janis’s theory that 
extreme attitudes reflecting denial and high 
anxiety impede necessary anticipatory work. 
Empirical studies have shown that both anx- 
iety and denial have been found to be associ- 
ated with complications of labor and delivery 
(Cramond, 1954; Kapp, 1963; McDonald & 
Christakos, 1963). 

The purpose of this investigation is to test 
hypotheses derived from Janis’s theory in a 
prospective design employing the childbearing 
experience as the situation of stress impact. 
Specifically, the study focuses upon expecta- 
tion of the stressful situation, identification of 
the parturient with her mother and levels of 
anxiety and denial with relation to crisis 
resolution. 


Hypothesis 1 


A woman’s evaluation of childbirth as fa- 
vorable or unfavorable is related to the per- 
ceived amount of stress which she has experi- 
enced relative to her expectation of stress. 
According to Janis, at the end of the stress 
episode the affective status of the person is 
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dependent upon whether the amount of per- 
ceived suffering is more or less than was con- 
sciously expected before stress impact. If the 
amount of victimization is more than ex- 
pected, the mood will be dysphoric (reactive 
depression) ; if less than expected, the mood 
will tend to be euphoric (reactive elation). 
The hypothesis above assumes that emotional 
reactions will be reflected in the woman’s 
evaluation of her experience. 


Hypothesis 2 


Anticipation of the delivery experience as 
measured during pregnancy is positively re- 
lated to the woman’s perception of her 


mother’s experience in childbearing. This hy-- 


pothesis is based on earlier findings concern- 
ing identification patterns of women antici- 
pating labor and delivery (Levy, 1966). For 
primigravidae, a relationship was found be- 
tween the anticipated labor and delivery and 
the woman’s perception of her mother’s evalu- 
ation of childbirth, Thus, if a woman per- 
ceives her mother as having had a positive 
“experience, she will tend to anticipate a posi- 
tive experience; conversely, if the mother has 
conveyed an unfavorable impression of her 
labor and delivery, anticipation will be un- 
favorable. 
Hypothesis 3 
Women who receive information from their 
mothers concerning childbirth tend to evalu- 
ate their own deliveries more favorably than 
women who receive no information from their 
mothers. The assumption here is that a 
woman’s mother is the most important infor- 
mation source regarding the childbirth experi- 
ence. If there is a failure to convey informa- 
tion concerning stress impact, the amount of 
victimization is likely to be greater than ex- 
pected and hence childbirth is likely to be 
negatively evaluated. 


Hypothesis 4 

Women who receive extreme evaluations 
from their mothers, either extremely nega- 
tive or positive, perceive their own experience 
more negatively than women _who receive 
moderate evaluations, either positive or nega- 
tive, from their mothers. This hypothesis de- 
rives from Janis’s theory that extreme com- 
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munications arouse either anxiety or denial 
mechanisms preventing adequate anticipatory 
work and hence stress impact is perceived as 
unfavorable. 


Hypothesis 5 


Women with high anxiety tend to perceive 
their childbearing experience more negatively 
than women with low anxiety. This hypothe- 
sis is based on the premise that highly anx- 
ious persons are unable to cope adequately 
during the prestress-impact period and are 
more likely to encounter greater difficulty at 
stress impact. 


Hypothesis 6 


Women demonstrating high defensiveness 
tend to perceive their childbearing experience 
more negatively than women demonstrating 
low defensiveness. As in Hypothesis 5, it is 
proposed that poststress-impact difficulty 
occurs when denial mechanisms prevent 
anticipatory work. 


METHOD 
Subjects 


Seventy women were selected with the following 
specifications to control for variations associated with 
race, marital status, and parity: All subjects were 
white, married, primigravidae with a negative his- 
tory of reproductive casualty, that is, none of the 
women had ever experienced abortion, miscarriage, 
or stillbirth. In addition, all the subjects were sus- 
taining normal pregnancies as defined by their phy- 
sicians. The women ranged in age from 17 to 34 
years old with a median age of 23. Years of school- 
ing ranged from 10 to 18 years with 90% of the 
sample having at least a high school education and 
15% having completed college. 

To control for variations in measurement found 
to be associated with “time of pregnancy,” all sub- 
jects received a Pre-Delivery Schedule in what they 
and their physicians considered to be the ninth 
month of pregnancy. Hence, all subjects completed 
this schedule, at the most, four weeks prior to their 
calculated due date. Post-Delivery Schedules were 
completed between 24 and 72 hours following 
delivery. 

The women were private patients of six physicians 
specializing in obstetrics and gynecology who were 
affiliated with the same hospital, so that hospital 
experience was kept relatively constant. While physi- 
cian differences doubtlessly existed, the obstetrical 
approach of these doctors was quite similar. None of 
the physicians was a natural childbirth enthusiast 
and patients were not instructed in any prenatal 
psychophysical preparations. The approach of these 
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physicians was traditional prenatal care of their pa- tion, “How did your mother describe childbirth?” g if 
tients. The kind of anesthesia and analgesia em- Prenatal Anxiety (PNA): Anxiety was measured 
ployed in labor and delivery was decided by the by the Taylor Manifest Anxiety Scale (Taylor, 1953) 
physician unless the patient specified a preference. which has been found useful in assessing psycho- 
The 70 women who filled the specifications for the logical tension in a pregnant population (Davids, 
study were drawn consecutively from the physicians’ DeVault, & Talmadge, 1961). 
rolls. Only one woman refused to partake in the Prenatal Denial (PND): Denial was measured by 
study and one refused to complete the Post-Delivery the 10-item Self-Criticism Scale of the Tennessee 
Schedule. Seven patients delivered their babies prior Self-Concept Scale (Fitts, Note 1). This subscale 
to completion of Pre-Delivery Schedules and one derived from the Lie Scale of the Minnesota Multi- 
patient failed to receive the postdelivery question- phasic Personality Inventory provides a measure of 
naire in the hospital. ‘Therefore, predelivery data are defensiveness. 
available for 62 subjects; postdelivery data for 67 Anticipated Hours (AH): AH is the number of 
subjects and complete data for 60 subjects. Loss of hours indicated by the subjects in response to the 
data occurred because of the time limitations jim- question, “How many hours do you think you'll 
posed by the nature of the investigation. The actual be in labor and delivery?” 
refusal rate of 2%, extremely low in field investiga- 
tions, was not felt to place limiting conditions on Postpartum Measures 


the findings. 
Evaluation of Labor and Delivery (ELD): The 


ELD is the average scale score across the five adjec- 
Predelivery questionnaires were distributed in the aa se ees Eee Cae tee con 
obstetric setting by the physician or his nurse. The EE) AN iae Wsi Sei 
subjects were asked if they wished to participate Experience of Deer Reratinelto Expected Stress 
voluntarily in a study of women who were expecting i X a Ta i H a a E 
first babies. The patients were instructed to com- Aspect a E: a ak tae act! cae ee 
plete the material at home and to return the ques- Si al ne ee th ee coe id ss 
tionnaires on their next prenatal visit in a sealed Er ae a ‘hoi sadel s p 4 buaayould: yor 
envelope. The subjects were assured of confidential- Tate; yOu Danos 5 DEL EEY i y 
ity in the ‘written instructions, that is, only the Evaluation of Delivery Relative to Other Women 
investigator would have access to the information. CO ee Me TPA on a qestep sa 
At the end of the questionnaire the subjects were ron ee fe ee an ore Vormen ? mys 
asked if they would be willing to complete & similar woe Chk ronen in response to the item, 
type of schedule following delivery, but it was not eg a to what you heey a ae onen 
specified that a Post-Delivery Schedule would be ow would you rate Joura toan delivery ? 
distributed. All women indicated favorably. Postpartum Anxiety (PPA): PPA is the score on 
Postdelivery questionnaires were distributed in the the RS Manifest Armisty Scale following 
hospital one to three days following delivery by the F i3 g 
nursing staff or the ward receptionist. Patients com- Postpartum Denial (PPD): PPD is the score of 
self-criticism derived from the Tennessee Self- 


Procedure 


leted th i ires in the hospital. 

de ie duesh nna CO Concept Scale administered after delivery. 

Prenatal Measures Hours in Labor and Delivery (H): This is the 
response to the question, “How long was your 


Anticipated Evaluation of Labor and Delivery labor and delivery ?” 
(ALD): The semantic differential technique was 


employed to determine the expectation of labor and RESULTS 
delivery as a positive or negative experience. Five 
adjective pairs were selected which have moderate Pearson product moment correlations were 


to high loadings on the evaluative dimension €x- com uted for the 12 variables un ee 
tracted by Osgood, Suci, and Tannenbaum (1957). ae In Table 1, the © 1 der ane 
A 6-step scale with descriptive qualifiers at each gins , the corre ation etween 

Evaluation of Labor and Delivery (ELD) 


scale point was employed to measure both direction j 2 
and intensity of the response. The five attributes and Experience of Delivery Relative to Ex- 


selected were: good-bad; pleasant-painful; happy- pected Stress (EXS) is .78 which is signifi- 
unhappy ; comfortable-uncomfortable ; and healthy- cant beyond the .001 level. Thus, the predic- 


sick. The subjects completed these items for the x : 2 % 
concept “What my labor and delivery will be Tike” on of Hypothesis 1 is confirmed: Women 


Mother as Information Source (MIS): The MIS who rate their deliveries as favorable also 
was a dichotomous choice score indicating whether tend to indicate that their experience was 
the woman received or did not receive information etter than they expected, while women 
concerning childbirth from her mother who indicate that childbirth was wore than 


Mother's Evaluation of Delivery (MED): MED 5 5 EA TA 
was the score on a 6-step scale item ranging from expected, tend to rate their experience un- 
very easy to very difficult in response to the ques- favorably. a 
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TABLE 1 
INTERCORRELATIONS AMONG MAJOR VARIABLES 

Predelivery measures | Postdelivery measures 5 
Mea- 
sire | ALD| Mis | MED | PNA Px | AH ELD | EXS | EO | PPA | PPD | H 
ALD |) — 01 ATH Oden 07 25% -21 -02 
MIS E Sones Sens See On 
MED eee) ES meer ane fi —110 
PNA — 09 —.12 -06 AS —.08 
PND — 20 -08 04 08 
AH — —.16 16 16 
ELD == 784" Siu 
EXS se ag 
EO 50s 
PPA —15 
PPD — =01 
H > 


Note. ALD = Anticipated Evaluation of Labor and Deliver, MIS = Mother as Information Source, ME) 
VD = Prenatal Denial, AH 


tion of Delivery, PNA = Prenatal Anxiet 
and Delivery, EXS = Experience of Deliver 


*b <05. 
mo <01 


wet p < (001. 


e y Relative to Expected Str = 
Women, PPA = Postpartum Anxiety, PPD = Posteunan, Denial, H = Hours in Labor and Delivery, 


Mother's Evalua 
valuation of Labor 
y Relative to Other 


Anticipated Hours, ELD 


aluation of Deli 


« The correlation between MIS and MED is unity since the subjects indicating ‘'no information from mother” were excluded 


thereby from the MED calculations, 


It was predicted in Hypothesis 2 that 
anticipation of childbirth would be positively 
related to the woman’s perception of her 
mother’s experience in childbearing. The cor- 
relation between Anticipation of Labor and 
Delivery (ALD) and Mother’s Evaluation of 
Labor and Delivery (MED) is .47 which is 
significant beyond the .001 level. 

A woman’s evaluation of childbirth as posi- 
tive or negative was found, however, to be 
unrelated to her perception of her mother’s 
experience. The correlation obtained between 
ELD and MED is .15 which is not signifi- 
cant. Moreover, the relationship between 
Anticipation of Labor and Delivery (ALD) 
and the actual experience (ELD) is not sig- 
nificant. The lack of these relationships indi- 
cate that outcome is not determined simply 
by favorable or unfavorable anticipation or 
by an isomorphic identification with mother’s 
experience. 

The complex interrelationships between 
evaluation of childbearing, the anticipation of 
the experience and the mother’s description of 
the event are posited in Hypotheses 3 and 4. 
It was postulated that women receiving in- 
formation from their mothers would tend to 
evaluate their own deliveries more positively 
than women who indicated that their mother 
did not communicate regarding childbirth. 


The point biserial correlation between Mother 
as Information Source (MIS) and Evaluation 
of Labor and Delivery (ELD) was found to 
be positive and significant, r = 52, < 001. 
A positive and significant relationship was 
also found between MIS and the woman’s 
evaluation of her labor and delivery relative 
to stress expectancy, r = .34, p < .01, Thus, 
women who receive information from their 
mothers tend to evaluate their experience 
more positively than women who have not 
been informed about childbirth by their 
mothers. 

It was predicted further than an extreme 
evaluation from a mother, either negative or 
positive, would effect a more negative out- 
come than would a moderate evaluation, 
Three groups were differentiated for analysis: 
(a) those reporting their mother to have had 
a moderately easy or moderately difficult ex- 
perience in childbirth (moderate communica- 
tion group); (b) those reporting their mother 
to have had an extremely easy or extremely 
difficult experience (extreme communication 
group); and (c) those indicating that their 
mother had never discussed labor and delivery 
(no communication group). Table 2 presents 
the normative data for the three groups for 
two-outcome measures, Evaluation (ELD) 
and Experience Relative to Stress Expectancy 
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TABLE 2 
MEANS AND STANDARD DEVIATIONS FOR 
MODERATE-, EXTREME- AND NO- 
COMMUNICATION GROUPS ON 
EvaLuation (ELD) AND 
EXPECTATION (EXS) MEASURES 
ELD EXS 
Group n ee 
M SD M SD 
Moderate 
communication 19 | 2.42 84 | 2.95 | 1.31 
Extreme 
communication |25 | 4.00 | 1.50 | 5.52 | 1.58 
No communication | 16 | 5.31 | 1.54 | 5.94 .94 


(EXS). Two one-way analyses of variance 
were performed to determine the relationship 
between type of communication and outcome. 
Using ELD as the dependent variable, a sig- 
nificant difference was found between the 
three communication groups, F (2,57) = 20.5, 
p < .001. Newman-Keuls post hoc analyses 
(Winer, 1962) showed significant differences 
between each paired comparison beyond the 
01 level. Significant between group differ- 
ences were also obtained for the EXS mea- 
sure, F (2,57) = 19.8, p< .001. Multiple 
comparisons between the groups showed sig- 
nificant differences beyond the .01 level for all 
comparisons except for that between the 
extreme-communication and no-communication 
groups. 

The relationship between outcome and type 
of communication is illustrated in Figure 1. 
Positive and negative outcome groups were 
established on the basis of response on the 
ELD measure. Women who evaluated labor 
and delivery with degrees of favorableness 
were assigned to the positive outcome group 
(n = 27); while women who evaluated child- 
birth with degrees of unfavorableness were 
assigned to the negative outcome group 
(n = 33). It may be seen that there is a pre- 
ponderance of extreme and no communica- 
tions for the negative evaluation group, while 
women who rated their deliveries positively 
tended to have received moderate communica- 
tions from their mothers. 

Figure 2 depicts the relationship between 
type of communication and Evaluation of 
Labor and Delivery Relative to Stress Ex- 
pectancy (EXS). It can be noted that in the 
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moderate group, 74% of the subjects indi- 
cated an experience that was better than they 
expected; 21% indicated an experience that 
was exactly as expected and 5% responded 
that childbirth was worse than expectation. 
For the extreme group, however, 70% felt 
that labor and delivery was worse than they 
had expected; while 8% indicated an ex- 
perience exactly as expected and 12% of the 
group reported that labor and delivery was 
better than expectation. For women who had 
received no information from their mothers, 
88% reported a worse experience than they 
had expected. 

The relationship between the direction and 
intensity of the mother’s communication and 
the anticipation and outcome of labor and 
delivery was further demonstrated when sub- 
jects were assigned to one of four groups re- 
flecting the possible combinations of anticipa- 
tion and outcome (favorable or unfavorable 
anticipation/favorable or unfavorable out- 
come). For the favorable anticipation/favor- 
able outcome group, 54% indicated that they 
had received a moderately negative communi- 
cation from their mothers; 18% reported a 
moderate positive communication; and the 
remaining 9% indicated an extreme posi- 
tive communication. For the group antici- 
pating an unfavorable experience but effecting 


—— POSITIVE EVALUATION (N=27) 
== NEGATIVE EVALUATION (N=33) 


PROPORTION OF Ss 


T T 
EXTREME 
COMMUNICATION 


T 
MODERATE 


NO 
COMMUNICATION COMMUNICATION 


TYPE OF COMMUNICATION 


Ficure 1. Relationship between positive and nega- 
tive outcome as a function of mother’s communica- 
tion. 
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a favorable outcome, 80% reported that they 
had received a moderately negative communi- 
cation from their mothers while 20% indi- 
cated no information. 

For the group in which the prophecy of 
negative outcome was fulfilled (unfavorable 
anticipation/unfavorable outcome), 64% re- 
orted an extreme negative communication 
from their mothers; 27% reported no com- 
munication and 9% indicated an extreme 
ositive communication. For the subjects an- 
ticipating a favorable outcome but experi- 
encing an unfavorable one (favorable antici- 
pation/unfavorable outcome), 50% indicated 
that they had received no communication 
rom their mothers; 23% reported an extreme 
negative communication; another 23% re- 
ported an extreme positive communication; 
while the remaining 4% indicated a moderate 
ositive communication. Thus, women who 
favorably evaluate outcome in childbirth ap- 
pear to have received a moderately negative 
communication from their mothers. For 
women evaluating their outcome unfavorably, 
the predominant types of communication 
from mother are either extremely positive, 
extremely negative, or none at all. Modal 
communications for the groups are summarized 
in Table 3. 


*=+= NO COMMUNICATION (N=16) 
= EXTREME COMMUNICATION (N= 25) 
1004 === MODERATE COMMUNICATION (N=19) 
80 
| 
‘897 
i 
a d 
‘ 
© 40 
z 
= 4 
Bees 
5 
.00: es 
‘HAN EXACTLY AS WORSE THAN 
eaten EXPECTED EXPECTED 


EXPECTED 


EVALUATION OF LABOR AND DELIVERY RELATIVE TO 
EXPECTATION 


Ficure 2. Relationship between type of communi- 
cation and Evaluation of Labor and Delivery Rela- 


tive to Stress Expectancy (EXS). 
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TABLE 3 


SUMMARY OF ÅNTICIPATION-OUTCOME Groups 
WITH MODAL Communication FROM Moruer 


Modal communication 


Anticipa- 
tion Outcome 

Intensity 
Favorable | Favorable Mode: 
J rate 
Favorable | Unfavorable | a. Extreme 

b. 

Unfavorable | Unfavorable | Negat Extreme 
Unfavorable | Favorable Negati Moderate 


Hypotheses 5 and 6 predicted that anxiety 
and denial would be related to a woman’s 
evaluation of childbirth. There were no sig- 
nificant correlations between these variables 
(see Table 1). A significant decrease in anx- 
iety following delivery was found, ¢ = 3.41, 
Pp < .01. While there was a slight decrease in 
the measure of defensiveness following child- 
birth, this decrease was not found to be 
significant, £ = 1.10, p > .05. 

The correlation of women’s anticipation of 
the duration of labor (AH) with actual labor 
time (H) was found to be positive but not 
significant (see Table 1). A modest positive 
correlation between anticipated time (AH) 
and anticipated evaluation (ALD) was found, 
r = .25, p < .05. Thus, women who expect a 
favorable childbirth tend to estimate a shorter 
labor than women anticipating an unfavorable 
labor and delivery. However, the correlation 
between Mother as Information Source 
(MIS) and Estimated Duration of Labor was 
found to be —.25, p < .05, indicating a tend- 
ency of women who received no information 
from their mothers to anticipate a shorter 
labor than women who received information. 
In order to analyze the relationships between 
length of labor and anticipation of length of 
labor with the type of childbirth communica- 
tion received, a one-way analysis of variance 
of each of the first two measures was per- 
formed with quality of communication as the 
incept variable. While there was a tendency 
for the moderate communication group to 
overestimate its time and for the no communi- 
cation to underestimate in anticipating length 
of labor, the F ratios obtained were not 
significant. 


Discussion 


Conclusions can be drawn from the data 
obtained in this study in three areas: (a) the 
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importance of expectation in crisis resolution; 
(b) the relationship between prestress com- 
munication and crisis outcome; and (c) the 
role of identification with one’s mother in 
childbearing. 

When women are asked to evaluate the fa- 
vorableness of their labor and delivery on a 
5-attribute semantic differential scale and also 
to indicate whether the experience was better 
or worse than expected, there is a high prob- 
ability that the two descriptions—evaluation 
and evaluation relative to expectation—will 
be in the same direction. The prediction that 
evaluation of childbearing is related to expec- 
tations concerning the childbearing experience 
was confirmed: Women indicating that child- 
birth was favorable also indicate that child- 
birth was better than expectation; conversely, 
women evaluating labor and delivery as un- 
favorable tend to indicate that the experience 
was worse than they had anticipated. It 
should be noted that confirmation of this hy- 
pothesis is based on correlations obtained 
during the same session and may reflect 
merely a global attitude or mood at the time 
the questionnaire was completed. Moreover, 
if expectation is a salient factor in a woman’s 
evaluation of her experience, it becomes nec- 
essary for predictive purposes to obtain a 
measure of that expectancy prior to stress 
impact. In this study, primigravidae were 
asked to rate their impending labor and 
delivery, and the majority predicted a favor- 
able outcome. Thus, 73% of the women an- 
ticipated a favorable experience, but after 
childbearing less than one-half of the sample 
rated their actual experience positively. There 
was no relationship, then, between ratings of 
anticipation and actual delivery when mea- 
sured on the same semantic differential scales 
administered pre- and postnatally. 

The role of expectation is clarified when 
other relationships are considered. Hypothe- 
sis 2 predicted that anticipation of childbear- 
ing is positively related to the woman’s 
perception of her mother’s experience in 
childbearing; this was confirmed. Women who 
recall their mother as describing childbearing 
as favorable tend to anticipate a favorable ex- 
perience; conversely, if their mother’s experi- 
ence is recalled as unfavorable, the anticipa- 
tion of childbirth tends to be unfavorable. 

Identification with mother’s experience is 
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an important aspect in a primigravida’s an- 
ticipation of labor and delivery. But identifi- 
cation with mother’s experience does not de- 
termine the woman’s evaluation of her own 
childbirth experience. Her mother’s evaluation 
as positive or negative is predictive of the 
woman’s anticipation of her experience but 
not of her perception of the actual labor and 
delivery. 

Janis postulates in his theory of stress that 
inability to realistically anticipate the dangers 
of impending stress is a pathogenic condition 
which leads to negative outcome following 
stress impact. It was hypothesized, then, that 
women who received no information from 
their mothers concerning mother’s experience 
in childbearing would be unprepared and 
effect a poorer outcome in labor and delivery 
than women receiving information (Hypothe- 
sis 3); and, further, that women who received 
extreme evaluations from their mothers, 
whether extremely negative or extremely posi- 
tive, would effect a poorer outcome than 
women who received moderately negative or 
positive evaluations from their mothers (Hy- 
pothesis 4). Both hypotheses were confirmed. 

The intensity of the communication as well 
as the direction helps to clarify Ritson’s con- 
clusion that the mother’s childbearing experi- 
ence was not predictive of the daughter’s 
experience. While the clinical criterion (dura- 
tion of labor) in Ritson’s study is not compa- 
rable to the subjective criteria employed in 
the present investigation, it is interesting to 
note that over two-thirds of Ritson’s normal 
group received negative communications from 
their mothers, while over half the prolonged 
group received positive communications. These 
results are suggestive of those obtained in this 
study. What Ritson ignores in his analysis is 
the intensity of the communication. 

Anxiety as measured by the Taylor Mani- 
fest Anxiety Scale and denial as measured by 
the Self-Criticism Scale were not found to be 
related to subjective outcome. As Davids et 
al. (1961) found, anxiety decreased signifi- 
cantly following delivery. It may be more 
appropriate, for future research, in assessing 
anxiety and denial mechanisms to construct 
instruments specific to childbirth rather than 
employing more generalized _ trait-oriented 
instruments. 

Duration of labor was included because of 
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its wide usage in obstetrical research as a 
clinical evaluation measure. As employed in 
the present investigation, length of labor was 
a subjective response. Only a modest positive 
correlation was found between duration of 
labor and the woman’s evaluation of child- 
birth indicating that psychological differences 
in time estimation, rather than absolute dif- 
ferences, determine for the woman whether 
her labors are long or short. Here, too, the 
factor of expectation may well play a role in 
psychological time and subjective outcome. 
That over half the respondents found their 
experience unfavorable indicates the great 
need for prenatal counseling and preparation 
for childbirth. The findings in this investiga- 
tion indicate that preparatory communica- 
tions be tailored to fit the individualized 
needs of the subject. Many prenatal programs 
operate on the assumption that reassurance 
and alleviation of anxiety result in favorable 
childbearing experiences. While this type of 
communication may be successful with highly 


`. anxious women, it would defeat its purpose 


by strengthening the defenses of those who 
are unable to confront and anticipate the 
more unpleasant aspects of childbirth. There 
is some indication that women who avail 
themselves of existing voluntary prenatal 
programs are those with adequate motivation 
to worry—in Janis’s terminology the “infor- 
mation seekers” who tend to effect successful 
crisis resolution. The efficacy of these pro- 
grams must be evaluated with regard to this 
“volunteer” factor. 

The results of this study lend further sup- 
port for Janis’s theory of expectation, pre- 
paratory communications, and stress tolerance 
in crisis resolution. The communication of 
realistic information concerning childbirth or 
other anticipated stress situations permits re- 
hearsal of the negative aspects of the expected 
experience. This allows the individual to in- 
dulge in “the work of worry.” Where there is 
no information or extremely positive informa- 
tion, the negative aspects of the impending 
experience remain unspoken. It is these indi- 
viduals who anticipate a favorable experience 
and feel victimized at stress impact. These 
are the “deniers.” In cases where extremely 
negative attitudes are communicated, the 
person is again psychologically unprepared. In 
this case, the contemplation of the stress is so 
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shattering that realistic rehearsal is impeded. 
These are the highly anxious people. There- 
fore, merely asking a woman to rate her an- 
ticipated labor and delivery is not adequate 
for eliciting her real expectation, since the 
response may instead reflect avoidant or de- 
nial defenses. The perception of her mother’s 
childbearing experience, however, was found 
to be a significant indicator for assessing a 
woman’s feelings toward her own childbirth, 
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Effects of Viewed Violence and Aggression: 
Stimulation and Catharsis 


Sidney A. Manning and Dalmas A. Taylor 
University of Maryland 


Tests of the catharsis hypothesis involving viewed violence have led to con- 
flicting results. It was suggested that the inconsistencies were due in part to 
the use of two different types of response measures, aggressive responses and 
hostile responses. This study involved a direct comparison of the two types of 
responses under conditions of instigation versus no instigation and aggressive 
versus neutral film. It was hypothesized that viewing an aggressive film would 
lead to a decrease in hostile responses (catharsis) and an increase in aggressive 
responses (stimulation) when compared with a neutral film condition. It was 
also hypothesized that both sitmulation and catharsis effects would be enhanced 
under instigation conditions. A main effect of instigation was found with 
subjects in the instigation condition having higher scores than those in the 
noninstigated condition. A main effect of film was found with subjects having 
lower scores in the aggressive film condition than in the neutral film condition. 
A significant Film X Response Outlet interaction was found which offered 
support for the predicted catharsis effect on the hostility measure. There was 
no support for the predicted stimulation effect on the aggression measure. 
The results of the study offer support for the main thesis that in examining 
the effects of viewed violence, separation of hostile responses and aggressive 
responses is necessary for full understanding. 7 


lying emotion is crucial in the attempt to 
understand the effects of viewed violence. Pre- 
dictions from the catharsis hypothesis and 
from various learning theories have both been 
tested by measuring aggressive behavior, on 
the implicit assumption that there is a one-to- 
one relationship between hostile emotion and 
aggressive behavior. It is argued here that 
such an assumption is invalid and that differ- 
ent response measures must be used to test 
the different predictions. Aggression and hos- 


Current interest in causes of the increase 
in crime has led to a focus on the causes of 
violent behavior in general. One of the most 
frequently cited causal factors is television 
violence. In terms of predicting the effects of 
viewing violence, there are two theoretical 
approaches that are relevant. It can be ar- 
gued from a learning theory approach that 
television violence should increase violent 
behavior. Derivations from the catharsis hy- 
pothesis would predict that television would 


we 


reduce aggression by providing cathartic ex- 
pression in vicarious form, 

Early formulations of the catharsis hy- 
pothesis (e.g., Dollard, Doob, Miller, Mowrer, 
& Sears, 1939) stated that catharsis refers to 
a reduction in motivation, not behavior. This 
distinction between behavior and the under- 
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tility are different responses and therefore 
may be controlled by different eliciting and 
reinforcing stimuli. 

Aggression has been defined in two ways 
in the literature. The definition used by Dol- 
lard et al. (1939) involved intentional in- 
fliction of harm on some target. This defini- 
tion, requiring intention, has been widely used 


by others working in the area. However, Buss )- 


(1961) excluded intention from his definition 
of aggression by suggesting that the term 
aggression be used to refer to any response 
delivering noxious stimulation. 

The definition of aggression as inflicting 
harm without intention was adopted for this 
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study. It was felt that this definition is more 
straightforward and emphasizes the distinc- 
tion between aggression and hostility more 
clearly. 

Hostility, on a conceptual level, refers to 
the emotional response resulting from a situ- 
ation perceived by the individual as anger 
inducing and is most directly measured as a 
physiological response. In examining the lit- 
erature, few studies have measured physio- 
logical responses. Most researchers have used 
indirect measures of hostility, including a 
wide variety of responses to anger-inducing 
situations which are not aggressive in nature 
(i.e., responses that do not deliver noxious 
stimuli). If hostility is an emotion, it may be 
reflected in indirect ways; that is, ways that 
can be differentiated from the responses of 
nonhostile subjects and yet are not aggressive 
in nature. 

In attempting to demonstrate catharsis, 
many investigators have examined the effects 
of real aggressive responses by subjects on 
subsequent aggressive behavior. The results of 
such studies are conflicting. Feshbach (1956) 
found increased aggression as a result of prior 
aggression. Doob (1970) found decreased 
aggressive responses as a result of prior ag- 
gression. 

The results of studies examining the effect 
of real aggression on measures more closely 
related to hostility give a more consistent 
picture of catharsis effects. Several studies 
demonstrated that aggression groups showed 
less hostility than controls not allowed prior 
aggressive responses (Berkowitz, Green, & 
Macaulay, 1962; Thibaut & Coules, 1952). 

In an interesting set of studies, Hokanson 
(Hokanson, 1961; Hokanson & Burgess, 
1962) has examined physiological correlates 
of hostility. These studies demonstrate a 
reduction in the physiological correlates where 
prior aggressive behavior is permitted. In 
summary, prior aggression produces catharsis 
effects more consistently with subsequent hos- 
tility measures than with aggression measures. 

Of interest to the current study are experi- 
ments examining the effects of vicarious ag- 
gression on subsequent aggression and hostil- 
ity. A large number of studies have demon- 


strated that vicarious aggression can increase 
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aggressive behavior (Bandura, 1965; Berko- 
witz, 1965; Berkowitz & Geen, 1966; Geen 
& Berkowitz, 1967). The Feshbach and 
Singer (Note 1) findings are the only ones 
showing a decrease in aggressive behavior as 
a result of viewed aggression, Their work 
represents the most thorough field study to 
date. Residents in boys’ homes or private 
schools were exposed to either an aggressive 
or nonaggressive commercial television diet 
for a minimum of 6 hours per week for 12 
weeks. The results indicated that the aggres- 
sive television diet had no effect on private 
school boys but led to a reduction in aggres- 
sive behavior for residents in the boys’ home. 

Liebert, Sobol, and Davidson (1972) criti- 
cized these data for lack of reliability and 
possible bias in the dependent measure, pos- 
sibility of subject preference for aggressive 
over nonaggressive television programs, and 
lack of identical treatment of the experimental 
and control groups on all factors but the inde- 
pendent variable. Feshbach and Singer (Note 
1) have replied to these criticisms citing addi- 
tional data. Despite the provocative results 
of Feshbach and Singer, the weight of evi- 
dence strongly indicates that exposure to vi- 
carious aggression does not reduce aggressive 
behavior but rather increases it. 

Feshbach (1955; 1961) demonstrated ca- 
tharsis of hostility as a result of prior vicari- 
ous aggression. However, Hornberger (Note 
2), in an attempted replication of the Fesh- 
bach (1955) study, failed to find reduced 
hostility as a function of prior vicarious ag- 
gression. Buss (1961) has suggested that the 
failure of Hornberger to replicate the Fesh- 
bach (1955) study may have been due to a 
lack of sufficient instigation to aggression. The 
importance of instigation or angering of the 
subject is discussed more fully later. In a 
comparison of the effects of real and vicarious 
aggression (physical and verbal fantasy) on 
physiological measures, Hokanson and Burgess 
(1962) found that the direct aggression re- 
duced the blood pressure and heart rate of 
angered subjects but that fantasy aggression 
had no effect. 

Although the results of the studies using 
hostility measures are less consistent than 
studies using aggressive responses, they tend 
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Ficure 1. Sample stimuli and composite of violent 
and nonviolent object. 


to indicate that vicarious aggression can 
reduce hostility far more frequently than it 
reduces aggressive behavior. This is in line 
with the results obtained from studies on real 
aggression, in which catharsis was more con- 
sistently found when hostility measures were 
used than when aggression measures were 
used. In summary, then, it appears that the 
real or vicarious expression of aggression is 
more likely to lead to a decrease in hostility 
behavior than in aggressive behavior. In the 
case of vicarious aggression, increased aggres- 
sion typically occurs, although hostility typi- 
cally decreases. 


Past studies have emphasized common 
eliciting stimulus conditions for hostile and 
aggressive responses. That is, both responses 
are elicited by a stimulus situation described 
as anger producing by the subject. But there 
are noncommon controllers of the responses as 
well. Justification of the viewed violence, for 
example, deals with the aggressive response 
and its “rightness” (implying rewards). It 
therefore should have a powerful effect on 
aggressive responses (Meyer, 1972). It does 
not deal with the rightness of the individual’s 
feelings of hostility and therefore should have 
little, if any, effect on the hostility responses. 

This experiment was designed as a first 
step in obtaining support for this conceptual- 
ization. It involved a direct comparison be- 
tween hostile responses and aggressive re- 
sponses to determine if they operate differently 
under identical conditions. The aggressive re- 
sponse measure used was physical aggression 
against a peer. The “aggression machine” de- 
scribed by Buss (1961) was used in the con- 
text of what was described to the subject as 
a learning experiment in which the subject 
was the teacher and the confederate was the 
learner. The subjects were to administer shock 
to the confederate for each error the confed- 
erate made on the learning task. The subject 
could administer any one of 10 different shock 
levels for each error made, 

The hostility measure was a response given 
to the tachistoscopic presentation of ambigu- 
ous stimuli. Composite stimuli depicting vio- 
lent and nonviolent cues were presented to 
the subject (see Figure 1). The subject’s re- 
sponses required a perceptual resolution of 
the composite and were used as measures of 
hostility. This measure was based on the 
assumption and experimental evidence indi- 
cating that underlying motivation can affect 
perception (e.g., McClelland & Atkinson, 
1948). This particular technique was used by 
Taylor and Oberlander (1969) and was dem- 
onstrated to be sensitive to motivational dif- 
ferences among subjects. 

The experimental design was such that each 
subject was tested on only one dependent 
variable (aggression or hostility). Conversion 
of raw data to T scores permitted a single 
anaylsis of variance to be performed in which 
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each measure was included as a part of a 
single factor (Response Outlet). The combi- 
nation of the two response measures into a 
single analysis was done in order to permit 
more direct statements on the significance of 
differences between patterns of results for 
the two measures. Most previous studies 
have used only one of the response measures, 
which makes statements about the differential 
effect of any factor on the two measures 
highly inferential. Combination of the two 
responses in the same analysis permitted di- 
rect testing of the significance of such differ- 
ential effects. This strategy is of central im- 
portance in examining the thesis presented 
here. 

Tnstigation has been cited as an important 
variable in the study of aggression and hos- 
tility. Buss (1961) has argued that no cathar- 
sis is likely without prior instigation. Instiga- 
tion is probably not necessary in the sense 
that it can be assumed that hostility is always 
minimally present in subjects, and therefore 
some catharsis can occur. However, instigation 
was manipulated in this study in order to 
maximize the probability of obtaining a ca- 
tharsis effect by raising the level of hostility. 

For purposes of this article, catharsis effects 
refer to any significant reduction in either ag- 
gression or hostility. Stimulation effects refer 
to any significant increase in either aggres- 
sion or hostility. Subjects who view an ag- 
gressive film will exhibit a stimulation effect 
on aggressive responses as compared with sub- 
jects viewing a neutral film. Contrariwise, 
subjects who view an aggressive film will ex- 
hibit a catharsis effect on hostility responses 
as compared with subjects viewing a neutral 
film. Stimulation and catharsis effects will be 
enhanced under conditions of instigation. 


METHOD 


The subjects were 80 undergraduate male students 
enrolled in introductory psychology classes. Male 
subjects were used because much of the past re- 
search has used males, and it has been demonstrated 
that there are sex differences in aggressive respon- 
siveness (Feshbach, 1970). All subjects were volun- 
teers and received bonus points toward their final 
grades. Ten subjects were eliminated due to suspi- 
ciousness, and 1 was eliminated for failing to under- 
stand the instructions. ; 

Subjects were brought to an experimental room 
where they were paired with a confederate who 
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was introduced to another subject. Two subjects 
were scheduled simultaneously so that two subject- 
confederate pairs were tested at the same time, Sub- 
jects were randomly assigned to experimental condi- 
tions. However, in each experimental run, one sub- 
ject was always assigned to the Response Outlet 
condition permitting aggression, while the other was 
assigned to the condition which allowed the hostil- 
ity response. The initial instructions were as follows: 


You are here to take part in an experiment on 
what is known as response-patterning, It js 
hypothesized that each individual has a unique 
style of responding and that this style influences 
the way in which he will behave in various situ- 
ations. That is not to say that each individual 
behaves the same way in all situations, but that 
there are underlying similarities in his responses in 
diverse situations. 

Therefore, in this experiment you will take part 
in three different kinds of tasks. We want you to 
respond to them as naturally as possible so that 
when we analyze the data from the three separate 
tasks, we can discover if the response-patterning 
hypothesis is correct. 

The results of your performance will be kept 
strictly confidential. No one will have access to 
information on the performance of any specific 
individual taking part in this study. 

During this experiment you will work in pairs 
(indicating the members of each pair). The first 
task you will take is a cognitive skills task. Its 
purpose is to determine your response style on a 
task which requires a primarily cognitive type of 
response. In this task, one member of each pair 
will be designated as task recorder and one as task 
responder by means of a drawing we will hold 
now. 


Each member of each pair drew one of two cards, 
one pair at a time. Both cards said Man A. The 
confederate was instructed to say that his card indi- 
cated he was Man B, 

An anagram task was then given to the real sub- 
jects and the recording sheets to the confederates 
with the following instructions: 


As Man A, you [two] will be task responders. 
Here is a copy of the task. Please read the instruc- 
tions and complete the sample items. Do not turn 
ahead to the task. As Man B, your job is to 
record the answers of Man A and the time it takes 
him for each. Write his answers here (indicating) 
and the time here. Use these stopwatches in keep- 
ing the time. Please look over the recording sheet 
and try the stopwatch to be sure that you know 
what you are to do. (Time allowed for subjects 
to complete reading instructions). The time for 
this task is three minutes. You should be able to 
complete most or all of the items in that time. 
All of the items are words. As you complete each 
item, read it aloud. Man B, you should record 
his answer when he tells it to you, along with 
the time on your stopwatch. When the three min- 
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TABLE 1 
Srimurtt Usep to Ostarn Hostimity MEASURE 


Type 
of Violent Nonviolent 
stimulus 

Object | TNT (14) Apple (2) 
Knife (8) Flute (4) 
Atomic blast cloud (7) | Mushroom (16) 
Missile (10) Pen (5) 

Person | Marching soldiers (1) | Marching band (13) 


Riot scene (15) 
Gun fighter (12) 
Rock thrower (3) 


Beer party (9) 
Man giving diploma (6) 
Ice skater (11) 


Note, Numbers in parentheses refer to the order in which the 
stimuli were presented to the subjects. 


utes are up, Man B, you should say stop. Then 
Man A should stop work at once and close the 
booklet. I will come and pick up the tests. 


Each subject-confederate pair was then taken to 
a separate room and reminded that each item should 
be read aloud when completed and that the time 
for the task was 3 minutes. 

Instigation condition. Subjects in this condition 
were required to solve 20 high-difficulty anagrams. 
During the 3-minute test interval, the confederate 
made the following three remarks, each of which 
was made at approximately 1-minute intervals: 


[1] You're not doing too well. 

[2] You have only __ [number] done and 
time’s almost gone. She said most people did 
them all, You sure won’t. 

[3] You're terrible at this thing—only have 
— [number] finished. ’'d have done them all by 
now. 


No-instigation condition. Subjects in this condition 
were given 15 low-difficulty anagrams. The confed- 
erate recorded their responses and times with no 
comment. 

At the end of the time period, the experimenter 
went into each testing room and collected the forms 
without comment. At this point, both subject-con- 
federate pairs were taken back to the first room and 
were seated. They were then given the following 
instructions. 


The second task you will take is a visual mem- 
ory skill task. You will be shown a brief film at 
this time. Later in the experiment you will be 
given a memory test on the content of the film. 
This will test your response style in a task in 
which memory of visually presented stimuli is im- 
portant. 


The subjects were then shown one of two films. 
In the aggressive-film condition, subjects saw an 
83-sec film from the movie “The Chase” depicting a 
violent fist fight scene. In the neutral-film condi- 
tion, subjects saw a 77-sec soundless film depicting 
four track races. After viewing the film, the subjects 
were taken to different rooms and given instructions 
according to the Response Outlet (hostility or ag- 
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gression) condition to which they had been assigned. 
Hostility-Response-Outlet condition. The following 
instructions were first given to the subject: 


You will now take a task which evaluates 
perceptual response accuracy. In order to set the 
apparatus properly, I will show you a picture. It 
will be presented very briefly. Each time I will 
present it a little longer. When you recognize it, 
tell me, If you think you know what it is, say so. 


Recognition threshold was then tested by the ex- 
perimenter using a picture of the face of a young 
male. A Scientific Prototype two-channel tachisto- 
scope was used. The blank field was constantly 
illuminated in order to minimize change in retinal 
illumination level and the effects of rod-cone breaks 
during intertrial intervals. The first presentation of 
the stimulus was at a duration of 6 msec and was in- 
creased in 2-msec increments until the subject could 
recognize the face and give the sex and approximate 
age (young) of the individual pictured. 

The following instructions were then given: 


During this task Man B will present a series of 
slides to you in this apparatus. Before we actually 
start, I will show you each picture one at a time 
and tell you what the name is for each. This will 
enable you to know what pictures will be shown 
and will help you identify them when we ac- 
tually start. 


The 16 stimuli which were shown to the subjects 
are listed in Table 1. The cards were shown only 
long enough for their name to be given. The stimuli 
actually used in the tachistoscope during testing were 
representations of each violent and nonviolent ob- 
ject pair superimposed, so that there were in fact 
only eight different slides shown during testing. The 
order of the eight slides was randomly varied for 
each subject. During each testing session, the stimu- 
lus cards were all shown two times for a total of 16 
trials. After viewing the 16 nonsuperimposed cards 
and being given their names, the confederate was 
brought into the room, and the following instruc- 
tions were given: 


Your task as Man B is to present these slides 
one at a time in this apparatus. You simply slide 
them into the frame like this [indicating]. Then 
say “Ready” and press this red start button. Do 
not touch any other dials or buttons on this ap- 
paratus—only the red start button. After Man A 
has told you what he saw, write it here [indicat- 
ing data sheet]. Then take out the card and 
replace it with the next one. Be sure to say 
“Ready” before pressing the button so that Man 
A will be watching. Go through all these cards in 
the order they are here. When all the cards have 
been shown, tell Man A you are finished. Each 
of you should then complete these questionnaires 
[one placed beside the subject and one beside the 
confederate]. 
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The task was then started, and the experimenter 
left the room. The score used was the number of 
hostile or violent responses given. 

A geressive-Response-Outlet condition. Subjects in 
this condition initially were left alone for a few 
minutes to read written instructions pertaining to the 
Buss-Durke apparatus. After the instructions were 
read, the experimenter returned and answered any 
questions. The subject was allowed to experience two 
of the low-shock levels so that he would have an 
idea of how the shock felt. A finger electrode was 
attached to the subject’s left index finger, and the 
experimenter administered Shock Levels 1 and 3, 
The experimenter then noted that Level 3 was a 
stronger shock than Level 1 and that all the buttons 
represented increasing shock at about that rate up to 
10. The confederate was then brought into the room 
and given the following instructions: 


Your job is to solve this concept formation 
task, a task designed to see how quickly you can 
form complex concepts. Each time one of these 
numbered lights appears, press one of these two 
response switches, marked A or B. If your choice 
is correct, this green light labelled “Correct” will 
go on. If you are wrong, you will be given a 
shock through these electrodes [attach]. Your job 
is to figure out which response is correct for each 
light. [To subject] When you have given all of 
the stimuli on both pages, tell Man B you are 
finished. You should then each fill out these 
questionnaires. 


The confederate gave a preprogrammed series of 
responses that were arranged so that he made 16 
errors. He recorded the shock levels administered by 
the subject on a data-recording sheet. The dependent 
measure was the sum of shock levels administered. 

The postexperimental questionnaires contained 
questions about the movie content in order to main- 
tain the cover story of that portion of the experi- 
ment as a memory test. In addition, the subjects 
rated both the anagram task and the Response Out- 
let tasks on three scales (dull-interesting, difficult- 
easy, frustrating-satisfying). They also rated the 
confederate during each of these tasks on three 
scales (friendly—hostile, warm-cold, helpful-inter- 
fering). 

After the questionnaires were completed, the sub- 
jects were brought together for debriefing, at which 
time they were told the true purpose of the experi- 
ment. 


RESULTS 


The ratings of the first task (anagrams) 
and of the confederate during the first task 
were analyzed in six separate analyses of 
variance in order to provide a manipulation 
check on the anger manipulation. The six 
analyses of variance corresponded to the six 
rating scales used for evaluation (three for 
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TABLE 2 


ANALYSIS OF VARIANCE ON T SCORE CONVERSION 
FOR HOSTILITY AND AGGRESSION Scores 


Source df MS r 
Instigation (A) 1 624.56 | 7,234 
Film (B) 1 444.49 | 5.154% 
Response outlet (C) 1 .00 -00 
AXB 1 272.37 | 3.15* 

1 61.52 
1 362.85 | 4.20** 
1 18.46 

12 86.33 


ponse Outlet measures (aggression and 
€ y ed to T sı which resulted in the two 
ng identical and thus prevented any main effect for 


** p < 105. 
wD < OL, 


the task and three for the confederate). An- 
gered subjects, as compared to nonangered 
subjects, rated the anagram task as signifi- 
cantly harder, 6.30 versus 2.80; F = 153.12, 
p < .01, and more frustrating, 5.95 versus 
3.08; F = 108.76, p < .01. On the dull-in- 
teresting scale, there was a significant Film X 
Response Outlet interaction. However a Dun- 
can’s multiple-range test indicated no pairwise 
significance, 

The ratings of the confederate for the first 
task indicated that the angered subjects, as 
compared to the nonangered subjects, found 
him significantly colder, 4.25 versus 3.5250 
< .05, and more interfering, 4.05 versus 3.42; 
p < .05. No difference in rated friendliness of 
the confederate was found. These findings 
indicate that the anger manipulation was ef- 
fective and in the expected direction. 

In order to make direct comparisons be- 
tween the aggression and hostility scores, all 
scores were converted to T scores. A 2 X 2 X 
2 analysis of variance was conducted. The 
summary analysis of variance (see Table 2) 
indicated significant main effects for the in- 
stigation, F = 7.23, p < .01, and the film, F 
= 5.15, p < .05, conditions. (All cell means 
are shown in Table 3). The means associated 
with the instigation main effect indicated 
that instigated subjects scored higher than 
noninstigated subjects, 52.79 versus 47.21. 
Thus when subjects were angered, they 
showed a higher level of aggression and hos- 
tility expression than when they were not 
angered. This finding confirmed the effective- 
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TABLE 3 


MEANS FOR AGGRESSION AND HOSTILITY 
RESPONSE OUTLET 


Condition Aggressive film Neutral film 


Tnstigation 


Aggression 50.32 53.51 

Hostility 46.86 60.48 
No instigation 

Aggression 49.22 46.95 

Hostility 44.17 48.49 


ness of the instigation manipulation. The 
means associated with the film main effect 
indicated that the aggressive film condition 
subjects scored significantly lower than did 
subjects in the neutral film condition, 47.64 
versus 52.36. The means associated with the 
significant Film X Response Outlet interac- 
tion, F = 4.20, p < .05, indicated that the 
film main effect was due primarily to differ- 
ences on the hostility measure. This interac- 
tion reflected most directly on the first two 
hypotheses. The only pairwise significant dif- 
ference occurred between the aggressive film- 
hostility condition and the neutral film—hostil- 
ity condition (see Table 4). This mean dif- 
ference confirmed the second hypothesis; that 
is, that hostility scores would be significantly 
lower in the aggressive film condition than in 
the neutral film condition. The other two pre- 
dictions were not confirmed. No stimulation 
effect was found on aggressive responses, and 
the instigation condition did not enhance the 
stimulation or catharsis effects. 

The Instigation X Film interaction reached 
a marginal level of significance, F = 3.15, p 
<.10, Pairwise comparisons of the means 
indicated that the instigation-neutral film 
mean was significantly higher than the other 
three means, none of which differed from 
each other. 

A secondary analysis of the data also was 
conducted in which the hostility data were 
analyzed in order to determine if the person 
stimuli showed different response patterns 
than did object stimuli. Scores were calcu- 
lated for each subject to determine the per- 
centage of object responses which were hostile 
(as opposed to nonhostile) and the percentage 
of person responses which were hostile. 

As can be seen in Table 5, a significant 
main effect for instigation, F = 8.86, p < .01, 
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was obtained. Means associated with the in- 
stigation main effect indicated that instigated 
subjects scored higher on the hostility mea- 
sure than did noninstigated subjects, 54.76 
versus 43.48. Means associated with the film 
main effect demonstrated that subjects view- 
ing the aggressive film expressed less hostility 
than did subjects viewing the neutral film, 
42.76 versus 55.47. The means associated 
with both of these main effects are in the same 
direction as the film and instigation effects in 
the analysis of variance on the total Response 
Outlet measure. Means associated with a 
main effect of stimulus type (object versus 
person) indicated that more hostile responses 
were given to person stimuli than to object 
stimuli, 59.92 versus 38.32. 


Discussion 


This study found direct confirmation for 
the prediction that a catharsis effect would be 
found for a hostility response measure. Sup- 
port was not found for the predicted stimula- 
tion effect on the aggressive response measure 
nor for the predicted enhancement of both 
catharsis and stimulation effects by instiga- 
tion. In examining the possible reasons for 
failure to find a stimulation effect, it should 
be kept in mind that the basic conceptualiza- 
tion which led to the predictions received 
support from the data. This conceptualization 
was that aggression and hostility are different 
responses and therefore respond differently to 
identical situations. The means associated with 
the Film x Response Outlet interaction (see 
Table 4) clearly indicate significant differ- 
ences in the pattern of effect of film on the 
aggression measure and on the hostility mea- 
sure. Film differences produced catharsis on 
the hostility measure, yet had no effect on 
aggression. 


TABLE 4 


MEANS FoR AGGRESSION AND Hostitity 
SIGNIFICANT FiLM X RESPONSE OUTLET 


Condition Aggressive film Neutral film 
Aggression 49.77 50.23 
Hostility 45.52 54.48, 


Note. Means having subscripts i are signifies 
diferent ans having subscripts in common are significantly 
Cell ns = 20, 


the most 
o find a 
that the 
t contain 
to maxi- 
A number 


Sponses and increases the probability of ag- 
an aggression-eliciting 
cue (i.e., the instigator) is Present. However, 
since stimulation effects have been found 
without high cue similarity (Geen & Berko- 
witz, 1967), it would appear that this factor 
alone cannot account for the lack of stimu- 
‘tion effects in the Present study, although it 
be a contributing factor, 
A second possible reason for the failure to 
increased aggressive responses may have 
been the lack of justification for film violence. 
Meyer's (1972) work indicated that justifica- 
tion of film violence is a crucial variable in 
producing increased aggression. Justified ag- 
gression is rewarded by society and thus, 
according to Meyer, justification serves to 
increase the likelihood of aggressive responses 
occurring. Had this variable been included in 
the current experiment, the discovery of a 
stimulation effect would have been more 
likely. 

A final condition which may have been 
present in the current study operating to 
reduce the aggressive response level is guilt. 
Berkowitz has argued that when catharsislike 
phenomena occur, they may represent a re- 


á TABLE 5 


` ANALYSIS OF VARIANCE ON STIMULUS TYPE FOR 
tae HOSTILITY-RESPONSE-OUTLET MEASURE 


Gm df MS F 

igati A 1 2,543.64 8.86* 

eo S 1 3,229.61 | 11.25* 
AXB 1 979.30 

Error 36 287.05 ‘ 

Stimulus type (C) 1 9,337.68 | 21.80 
AXC 1 -60 
BXC 1 1,408.68 
AXBXC 1 22 
rhe 36 428.28 


*p <.01. 
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pression of the instigated responses due to 
guilt. The same argument could be applied to 
a failure to find a stimulation effect; that is, 
guilt held back the subjects’ tendencies to 
express more aggression. Mean comparisons 
associated with the Film X Response Outlet 
interaction, however, cast serious doubt on the 
guilt explanation in the present study. It is 
difficult to understand why guilt would be 
more effective in repressing the hostility mea- 
sure, which involved a nondirect anger re- 
sponse, than in the aggression measure, which 
involved a direct expression of aggression to 
another individual, 

The secondary analysis of the data in which 
the person and object stimuli were examined 
throws further light on the likelihood of the 
guilt explanation. A main effect of stimulus 
type .was found with more hostile responses 
being given to person stimuli than to object 
stimuli, Although this offers confirmation of 
Berkowitz’s (1962) cue similarity hypothe- 
sis, it does not Support a guilt interpretation. 
Tf guilt were operative, one would expect it to 
suppress hostile responses in terms of other 
individuals to a greater extent than hostile 
responses to objects. 

The results of this study demonstrated that 
aggression and hostility are different responses 
that may behave differently to the same elic- 
iting and reinforcing stimuli, while responding 
similarly to other stimuli. They both in- 
creased as a result of instigation. At the same 
time they reacted differently to film, with 
type of film having a differential effect on 
hostility but not on aggression. 

This study Suggests that viewed violence 
alone may have positive effects in terms of 
reducing hostile responses and has no effect on 
aggressive responses, In dealing on a practical 
level with how to Prevent television from 
increasing aggressive behavior, as the litera- 
ture and anecdotal evidence indicate can 
happen, one must deal with discovering and 
altering elements within the violent content 
which serve as eliciting or reinforcing stimuli 
for the aggressive responses. Removing all 
violent shows from television may be “throw- 
ing the baby out with the bath” by removing 
a valuable outlet for hostility in order to 
prevent any increase in aggressive behavior. 


188 


REFERENCE NOTES 


1. Feshbach, S., & Singer, R. D. Television and ag- 
gression: A reply to Liebert, Sobol and Davidson. 
Report submitted for publication to the National 
Institute of Mental Health, 1971. 

2. Hornberger, P, H. The differential reduction of 
aggressive responses as a function of interpolated 
activities. Paper presented at the meeting of the 
American Psychological Association, Cincinnati, 
Ohio, September 1959. 


REFERENCES 


Bandura, A. Influence of model’s reinforcement con- 
tingencies on the acquisition of imitative re- 
sponses. Journal of Personality and Social Psy- 
chology, 1965, 1, 589-595. 

Berkowitz, L. Aggression: A social psychological 
analysis. New York: McGraw-Hill, 1962. 

Berkowitz, L. Some aspects of observed aggression. 
Journal of Personality and Social Psychology, 1965, 
2, 359-369. 

Berkowitz, L., & Geen, R. G. Film violence and the 
cue properties of available targets. A itai of 
Personality and Social Psychology, 1966, 3, 
530. 

Berkowitz, L., Green, J. A., & Macaulay, J. Hostil- 
ity catharsis as the reduction of emotional tension. 
Psychiatry, 1962, 25, 23-31. 


525- 


Buss, A. H. The psychology of aggression. New 
York: Wiley, 1961. 
Dollard, J., Doob, L. Wt, Miller, N. E., Mowrer, 


O. H., & Sears, R. R. Frustration. and aggression. 
New Haven, Conn.: Yale University Press, 1939. 

Doob, A. N. Catharsis and aggression. The effect 
of hurting one’s enemy. Journal of Experimental 
Research in Personality, 1970, 4, 291-296. 

Feshbach, S. The drive-reducing function of fantasy 
behavior. Journal of Abnormal and Social Psy- 
chology, 1955, 50, 3-11. 

Feshbach, S. The catharsis hypothesis and some con- 
sequences of interaction with aggressive and neu- 


Smwney A. Manninc AnD Datmas A. TAYLOR 


tral play objects. Journal of Personality, 1956, 


24, 449-462. 


Feshbach, S. The stimulating versus cathartic effects 
of a vicarious aggressive activity. Journal of Ab- 
normal and Social Psychology, 1961, 63, 381-385. 


Feshbach, S. Aggression. In H. Mussen (Ed.), 
Charmichael’s manual of child psychology. Vol. 2. 
(3rd ed.) New York; Wiley, 1970. 

Feshbach, S., & Singer, R. D. Television and ag- 
gression. San Francisco: Jossey-Bass, 1971. 

Geen, R. G., & Berkowitz, L. Some conditions fa- 


w 
j 
$ 


fi 
I 


É 


cilitating the occurrence of aggression after the // 


observation of violence. 
1967, 35, 666-676. 
Hokanson, J. E. The effects of frustration and 
anxiety on overt aggression. Journal of Abnormal 
and Social Psychology, 1961, 62, 346-351. 
Hokanson, J. E., & Burgess, M. The effects of three 


Journal of Personality, 


types of aggression on vascular processes. Journal , 


of Abnormal and Social Psychology, 1962, 

~ 446-449, 

Liebert, R. M., Sobol, M. D., & Davidson, E. S. 
Catharsis of aggression among institutionalized 
boys: Fact or artifact? In G. A. Comstock, E. A. 
Rubenstein, & J. P. Murray (Eds.), Television 
and social behavior. Vol. 5. Washington, D.C.: 
U.S. Government Printing Office, 1972. 

McClelland, D. C., & Atkinson, J. W. The projec- 
tive expression of needs. I: The effect of different 


64, 


ie | 


intensities of hunger drive on perception. Journal; 7 


of Psychology, 1948, 25, 205-222. 

Meyer, T. P. Effect of justified and unjustified real 
film violence on aggressive behavior. Journal of 
Personality and Social Psychology, 1972, 23, 21-29. 

Taylor, D. A., & Oberlander, L. Person-perception 
and self disclosure: Motivational mechanisms in 
interpersonal processes. Journal of PPI TS 
Research in Personality, 1969, 4, 14-28. 

Thibaut, J. W., & Coules, J. The role of communi- 
cation in the reduction of interpersonal hostility. 
Journal of Abnormal and Social Psychology, 1952, 
47, 710-777. 


(Received June 25, 1973) 


“AT LAST... A FIRST! 


t 


Leonard Berkowitz combines theory and topical coverage in 


_A SURVEY OF SOCIAL PSYCHOLOGY 


An eminent researcher presents a comprehensive overview of his field 


Pest l found the treatment and use of such examples as Synanon extremely well done and very well 
` integrated into the discussion. This is true of other examples as well. , . . It is written with the ease 
and elegance of someone who is a master of his trade... .” 


Fred E. Fiedler 
University of Washington 


“Overal l, | judge it to be an excellent book! | think its strength is that itis much more close to the core 
of experimental psychology and psychology in general, than are its competitors.” 


Harry Triandis 
University of Illinois” 


—The authotapagsents a broad survey of social psychology, including both theory and topical 
coverage. y 

A —This’comprehensive coverage includes relatively new topics such as sex differences and social 
behavior, 

—Real world social implications of both theory and research are stressed. 

—Written in an informal style, the text is two-color, with abundant use of photos, cartoons and 
line-drawings. 


Coming: 3-15-75 586 pages Instructor's Manual 


° 
' An accompanying reader 


READINGS IN SOCIAL PSYCHOLOGY 


David Aderman and Leonard Berkowitz 
Of Related Interest: 
INFLUENCES ON HUMAN DEVELOPMENT, Second Edition 


Urie Bronfenbrenner and Maureen Mahoney 
1-10-75 576 pages (paper) 


DEVELOPMENTAL PSYCHOLOGY AND EARLY EDUCATION 


- Lawrence Kohlberg and Rochelle Mayer 


{0415-75 320 pages 


For further information please contact: 


The Dryden Press 
A division of Holt, Rinehart and Winston, Inc. 


GO 901 North Elm Street, Hinsdale, Illinois 60521 


Social 
Structure and 
Social 
Personality 


Jerold Starr 

University of Pennsylvania 

A collection of thirty-one contemporary 
readings, this book examines the impact 
of social structure on social personality. 

It covers all major socializing agents, 
including heredity, family, church, school, 
peers, and media, plus controversial 
areas such as the effects of the economic 
system on personality. While biological 
and cultural facts are given some atten- 
tion, the primary focus is on those basic 
aspects of personality which are most 
directly influenced by membership in 
major social aggregates, By way of intro- 
duction, Jerold Starr has provided a 
systemic diagram which outlines the 
structure and process of socialization and 
serves as a rationale for the organization 
of the book. The readings that follow are 
divided into thirteen sections, each of 
which is preceded by substantial editorial 
material that summarizes and integrates 
principal research in the field and 
discusses the social implications of 

the issues. 

Cloth 461 pages 1974 $12.95 


College Division 


Little 


(I 
Brown 
and Company 
34 Beacon St. 
Boston, Mass. 
02106 


Conceptions 
of Social Life 


A Text-Reader for Social 
Psychology 

William Gamson 

Andre Modigliani 

Both at the University of Michigan 
Dealing with twelve key issues, ranging 
broadly from the psychological (e.g., 
social perception and moral development) 
to the sociological (e.g., deviance and 
rebellion), this new text-reader deliber- 
ately juxtaposes conflicting explanations 
of social behavior, clearly exposing the 
controversies that characterize the field. 
Each problem area is introduced by a 
lengthy essay that outlines the assump- 
tions underlying each framework and 
attempts to place the diverse formulations 
within a broader perspective, This creates 
a meaningful forum in which viewpoints 
that are competitive with each other can 
be discussed simultaneously. The inter- 
disciplinary readings include excerpts 
from the work of distinguished thinkers 
and researchers, including Peter Blau, 
Kenneth Keniston, Stanley Milgram, 
Lionel Tiger, Philip Converse, and 
Charles Tilly, among many others. These 
readings are not intended to provide a 
history of the field or a survey of its 
methodologies, but to stimulate students 
to think more deeply about their own 
social experiences through a discussion 
of issues that are central to organized 
social life. 

Paper 592 pages 1974 $7.95 


(Contents continued) 


Influence of Locus of Control on Interpersonal Attraction and Affective Reactions in 
Situations Involving Reward and Punishment/David S. Holmes and Thomas 
ETRE RSOT tee iio eee ae A are otra ee N E S ponte nice TES 
Conditioned Autonomic Responses as Bidirectional Indicators of Racial Attitude/ 
Louis Ñ. Tognacci and Stuart W. Cook .....+++++++-+00+ 
Effects of Stress and Commonality of Fate on Helping Behavior/John F. Dovidio 
ond Wallhamn Mo MORES. aoaea cis tek vivid reels aeaa tae areas ee te 


The Effect of Stimulus Uncertainty on the Relationship Between Frequency of Ex- 
posure and Liking/Gene F. Smith and Donald D. Dorfman ...-+-++++++++0000+ 
Moral Judgment of Theft, Charity, and Third-Party Transfers That Increase or De- 
crease Equality/Philip Brickman and James H. Bryan .....c.seee sere ence ees 
Escalation of Aggression: Experimental Studies/Jefirey H. Goldstein, Roger W. Davis, 
PADENA EL EET tenes a EE aioe iene a anne test 


Childbirth as Crisis: A Test of Janis’s Theory of Communication an 
tion/Judith M. Levy and Richard K. McGee 

Effects of Viewed Violence and Aggression: Stimulation and Catha 
Manning and Dalmas A. Taylor 


“List of Manuscripts Accepted ..-...- 


132 


Readings - . . 


Psychology Is Social 

Readings and Conversations in Social Psychology 
Editor: Edward Krupat, Boston College 

A fresh approach to readings for social psychology courses, distinguished by a wide range 
of materials and original interviews with leading social psychologists. 


A Sampling... 
A conversation with Alvin Pouissant on black identity 


Matina S. Horner, Femininity and Successful Achievement: A Basic Inconsistency 
Philip Meyer, If Hitler Asked You to Electrocute a Stranger, Would You? 

George Orwell, Shooting an Elephant 

A conversation with Philip Zimbardo on attitudes and attitude change 

Leonard Berkowitz and Anthony LePage, Weapons As Aggression-Eliciting Stimuli 


Bibb Latané and John M. Darley, Social Determinants: of Bystander 
Intervention in Emergencies 


Stanley Milgram, The Experience of Living in Cities 
Available Now: 1975, 352 pages, Illustrated, paper, approx. $6.95 


Supplementary Text... 


Personality 


Julian B. Rotter / Dorothy J. Hochreich 3 

The University of Connecticut H 

Covers all aspects of personality study—theories, measurement problems, and empirical, 
data—for a complete overview of the field for introductory students. Available Now: 
1975, 160 pages, illustrated, paper, approx. $4.95 N 


Basic Text... 


Personality Development 


The Chronology of Experience 

Leon Rappoport, Kansas State University 

Subjective data, case material, research findings, and literary insights are woven EAT 
for an unusual approach to the study of personality. 1972, 448 pages, cloth $11.50 


For further information write to 

Jack Bevington, Advertising Department 

Scott, Foresman College Division 

1900 East Lake Avenue. Glenview, Illinois 60025 


d Xx 


oS Socal Psychology 


iunt 31, NUMBER 2 FEBRUARY 1975 


ates «and Albert Pepitone 
“Moral Judgment in Sociopathic and Normal Children/Anthony Frank Campagna and 
EAn arteri lnh eei o eStats, SAAR Ms ise A EON ENA 199 

Reciprocal Concessions Procedure for Inducing Compliance: The Door-in-the-Face 
Technique/Robert B. Cialdini, Joyce E. Vincent, Stephen K. Lewis, José Catalan, 


Diane Wheeler, and Betty Lee BDI ter PoE A A a 206 
Endorsement of Formal Leaders: An Integrative Model/H. Andrew Michener and 

dwari T AEON enna ile (teat ne Me eee a Ac ace 216 
Self-Discrimination, Responses in Black School Children/Andrew Sappington and 

eR Dope GHAEGP OM, A 5 ¢s8is Saeco CARTER cate whats ws 224 


, Performance and Satisfaction as Affected by Personal Control Over Salient Decisions/ 


= G. Ramsay Liem , 232 
. Cooperativeness and'Social Perspective Taking/David W. Johnson 241 
Physical Attractiveness, Social Relations, and Personality Style/Dennis Krebs and 
Allen “A, Adin RER as PS. a cc aaa claws tas. aa ats oem Diarsia | RAS 
Cognitive Appraisals and Transformations in Delay Behavior/Walter Mischel and 
Maney Baker oi ise ise ces estan’ See AI A Eh A N Rot) elie 3.9. ONT 254 
Some Effects of Time and Thought on Attitude Polarization/Abraham Tesser and 
262 


, Mary Charles Conlee 
Comparison of Social Judgments of Creativity 
LOREM Maas er ETU AET E EA A E E a: CAA oman hers ee. 271 


Some Determinants of Perceived Control/Camille B. Wortman . 282 
Do People Have Inflated Views of Their Own Ability?/Judith Weiner Rilan, Holly 
Gosselink, Janet Hubsch, and Elizabeth Ulsh ....0...cccccceeeereeseneneen 295 
effects of Intergroup Orientation on Group Attitudes and Proxemic Behavior/Allen 
PUT EMR ERLGROMAPMONE AGN: n M ice Rak, ele cielo e mercertays aie sR oD alee braigin Sinica nie 302 
‘Generality of Learned Helplessness in Man/Donald S. Hiroto and Martin E. P. 
OTE ARG A eid oc SB che) SRE Oe a rian bec e huts ceed auicied ken Mina ia. Ca 311 
Effects of Altered Responsibility, Cognitive Set, and Modeling on Physical Aggression 
. and Deindividuation/Edward Diener, John Dineen, Karen Endresen, Arthur L. 
PS BLY NAMNONUIIO SCOP DE! Ca! E. I 2, «5.28 E rales ere ah ase gedsrajs aie o Mattos Ow 328 
‘Reactions to Severity and Nature of Threat Among Children of Dissimilar Socio- * ; 
338 


economic g ea M. Dembroski and James W. Pennebaker 

' Leader Effectiveness and Leadership Conferral as Determinants of Helping in a 
Medical Emergency//ra J. Firestone, Cary M. Lichtman, and John V. Célamosca 343 
_Multimethod Analysis of Social Reinforcement Exchange between Maritally Dis- 
© - tressed and Nondistressed Spouse and Stranger Dyads/Gary R. Birchler, Robert 


7 A nA JON E. VIERE o a Is SSA EAE 0. amaia 9 Viti aa Aa Sl 349 
Name-Calling and Compliance/Claude M. Steele .. 361 
Crime, Commitment, and. the Responsive Bystander: Two Field Experiments/Thomas 

Moriarty oo 370 
270 


“List of Manuscripts Accepted 


PUBLISHED MONTHLY BY THE AMERICAN PSYCHOLOGICAL ASSOCIATION 


Life As Theater: 


A Dramaturgical Sourcebook 


edited by Dennis Brissett 
and Charles Edgley 


It is only in recent years that the 
vivid metaphor of life as theater—in 
which “all the world’s a stage and 
all the men and women merely play- 
ers’’—has been explored system- 
atically by social psychologists and 
used to develop a consistent and 
cumulative body of theory and re- 
search. This volume provides a gen- 
eral introduction to social psychol- 
ogy from the dramaturgical point of 
view. It is the first book to define 
and illustrate this perspective and 
its usefulness in exploring the rich- 
ness and variety of human expe- 
rience. The book is organized 
around seven substantive issues 
central to contemporary social psy- 
chology: the nature of social life, the 
nature of individuality, the nature of 
social roles, the nature of motiva- 
tion, the nature of instituted rela- 
tionships, the nature of deviance 
and mental disorders, and the na- 
ture of physical disorder and death. 


392 pp. Jan. 1975 
$15.00 cloth $4.95 paper* 


Spring Texts 
from Aldine 


Women, Wives, Mothers: 
Values and Options 
Jessie Bernard 


One of the most important series of events in 
modern times—the restructuring of sex roles 
to adapt them to modern life—is unfolding be- 
fore our eyes. In this lively, lucid book the au- 
thor examines the dramatic changes in values 
being experienced by women in all classes of 
society, and how these changes are affecting 
the options available to women today. 

300 pp. Feb.1975 $12.50cloth $3.95 paper 


Tearoom Trade: 
Impersonal Sex in Public Places 
Enlarged Edition, Including A 
Retrospect on Ethical Issues 

Laud Humphreys 


NOW FOR THE FIRST TIME IN PAPERBACK! 


In the new enlarged edition of this con- 
troversial study of impersonal sex in public 
restrooms the author adds a selection of ma- 
jor essays dealing with the difficult ethical is- 
sues involved in this research, plus his own 
retrospective commentary. 

256 pp. Jan. 1975 $12.50cloth $3.95 paper 


Psychopathology: 

The Science of Understanding 
Deviance 

SECOND EDITION 

James D. Page 


This is the best written, most up-to-date psy- 
chopathology textbook ever. It reflects the 
very latest thought on the full range of psy- 
chotherapies, sexual behavior, the nature and 
Causes of schzophrenia, suicidal behavior, 
the shift from institutional treatment of mental 
disorders to treatment within the community, 
drug and alcohol problems and many other 
topics. 

528 pp. Feb.1975 $14.75 cloth 


“text edition for classroom use only 


ALDINE PUBLISHING COMPANY 
529 South Wabash Avenue ° Chicago, Illinois 60605 


4) 


Volume 31 


February 1975 


mei re 


and 


EDITOR: 
Associate EDITORS: 


IRWIN ALTMAN: 


Wig University of Utah 
A Norman H. ANDERSON 
: University of California, San Diego 
Pas ALBERT BANDURA 


Stdnjord’ University 


Daxyt J., BEM 

y Stanford University 
Seymour M. BERGER 
„ University vf Massachusetts 


Jonn C, BrIGHAM 
Florida State University 


Uni BRONFENBRENNER 
Cornell University 


J. Merrmn CARLSMITH 
Stanford University 


} Istpor CHEIN 
New York University 


Ropert: B. CIALDINI 
Arizona State University 


| Tuomas Cook 


Northwestern University 


Water H. CROCKETT 
Univesity of Kansas 


` James’ E, DITTES 
rie Yale University 
`a Norman T, FEATHER 
a Flinders University of South 
iz ‘Australia, Bedford Park, Australia 


KENNETH GERGEN 
Swarthmore College 


Aran E. Gross 
University of Missouri 


ROBERT ABELSON 
Sh Yale University 

Natan Brony 

New School for Social Research 

i Ropert CAIRNS 
p > Indiana University 

DESMOND CARTWRIGHT 

University of Colorado 


Doucras P. CROWNE 


University of Waterloo, Waterloo, . 
Canada 
eh Haroww B. GERARD. 
A University of California, Los Angeles 
Jacon Gerwitz 


‘National Institute of Mental Health, 
Rockville, Maryland 


THE AMERICA 


Copyright © 1975 by the American Psy: 


Journal of Personality ; 
Social Psychology 


27° F° 78> 
pie 


Joun T. Lanzerta, Dartmouth College 
Auice H. Eacty, University of Massachusetts— 


Amherst 


ANTHONY G. GREENWALD, Ohio State University 
ALBERT Prprtone, University of Pennsylvania 
Wut1am Scorr, James Cook University, Townsville, 


Australia 


CONSULTING EDITORS 


Wittiam HAYTHORN 
Florida State University 
Crype HENDRICK 
Kent State University 
Davin S. Ho 
University of 
Lovise Kipper 
Temple University 
Rosert E. Kieck 
Dartmouth College 
Mitton E. Lirerz 
University of Colorado 
Ciirrorp E. LUNNEBORG 
University of Washington 
R. Crank McCaurey 
Bryn Mawr College 
ALBERT MEHRABIAN 
University of California, Berkeley 
Geratp A. MENDELSOHN 
University of California, Berkeley 
Davi Messick 
University of California, 
Santa Barbara 


Jupson Mirs 
University of Maryland 

Hervert L. MIRELS 
Ohio State University 

Wikram N., Morris 
Dartmouth College 


RıcHarp NISBETT 
University oj Michigan, Ann Arbor 


S 
Kansas 


JANE A. PILIAVIN à 
University of Wisconsin—Madison 


ISSUE CONSULTANTS 


Davm Grass 
University oj Texas at Austin 


Lors HOFFMAN 
University of Michigan 


Epwin P. HOLLANDER 
State University oj New 
Bufalo 


York at 


Doveras Jackson 4 
University of Western Ontario, 
London, Canada 


Wiseet McKeacute 
University of J 


Bernice NEUGARTE? 
University of Chicago 


PUBLISHED BY 


DEAN Pruitt 
State University of New York 
at Buffalo 


Muton E. ROSENBAUM 
University of Towa 
Davy J. SCHNEIDER 
Brandeis University 
Marvin SHAW 
University of Florida 
LAWRENCE STRICKER 
Educational Testing Service, 
Princeton, New Jersey 
Lrovp H. STRICKLAND - 
Carleton University, Ottawa, Ontario 
Frep L. StroptnecKk 
University of Chicago 
PETER SUEDFELD ; 
University of British Columbia, 
Vancouver, Canada 
James T., TEDESCHI 5 
‘State University of New York, 
‘Albany 
Harry Upsnaw 9 % 
University of IMlinois at Chicago 
Circle 
Evainr, WALSTER 4 
University of Wisconsin—Madison 
Lapp WHEELER 
University of Rochester 
Ronert $, Wyer, Je. 
University of Ilinois at 
Urbana-Champaign 
ROBERT ZILLER > 
University of Florida 
Puriip ZIMBARDO | 
Stanjord University 


Demetrios PAPAGEO 
University of 
Vancouver, Canada 
Jupitn RODIN | 
Yale University 
Purp SHAVER, 
Columbia University 
Cuartes P. Sut oo 
City College of the City University 
oj New Vork 
Ross STAGNER eet 
Wayne State University 
HENRI TAJFEL 4 
University of Bristol, England 
MıcHarL WERTHEIMER 
University of Colorado 


Columbia, 


N PSYCHOLOGICAL ASSOCIATION, Inc. 


chological Association, Inc. 


(Or. 


The Journal of Personality and Social Psychology publishes original research Tenor i ne areas 
of social psychology and personality dynamics. The Journal includes articles on the fo owing onia 
social motivation; attitudes and attitude change; social interaction; verbal and 2o a srami : 
cation processes; group behavior; person perception ; conformity and deviation ; and personal iy y structure 
and dynamics. Methodological articles, studies primarily concerned with the ropna E a 
instruments, replications, and reports of negative results or failures to replicate publist Z | wor! = 
acceptable if they are judged to make a substantial contribution to knowledge. Low priority is given o 
papers concerned with personality assessment or the development and validation of assessment instru- 
ments. Manuscripts in these latter categories should be brief, not more than nine typewritten pages. 
The Journal does not include studies of abnormal processes or populations. 


Manuscripts: Submit manuscripts in duplicate to John T. Lanzetta, Editor, Journal of Personality and Social Psy- 
chology, Department of Psychology, Dartmouth College, Hanover, New Hampshire 03755, according to instructions 
provided below. A . 

Manuscripts must conform in style to that described in the Publication Manual of the American Psychological 
Association (2nd ed.). (a) An abstract of 175 words or less, typed on a separate page, must accompany each manu- 
script. Instructions for preparation of the abstract are in the Publication Manual (p. 15). (b) The title should be 75 words 
or tess, If this length is exceeded, the Editors will use their judgment in shortening the titles of accepted reports, (e) 
Duplication of information in text, tables, and figures should be avoided. (d) One set of figures should be glossy prints 
and must not exceed 8} X 11 inches, Figures will generally be reduced to single-column width ymbols nd lette: 
should be large enough to be readable after reduction. Metric equivalents must be given for all nonmetric measurements, 
Authors should keep a copy of the manuscript to guard against loss. Accepted papers are normally published in order 
of receipt. The length of manuscripts should normally not exceed 25 typewritten pages. Longer papers will be considered 
for pub.ication if the length is judged necessary for effective communication. 2 

Because the reviewers have agreed to participate in a blind reviewing system, authors submitting manuscripts are 
requested to include with each copy of the manuscript a cover sheet stating the title of the manuscript, the name of 
the author or authors, the author's institutional affiliation, and the date the manuscript is submitted. Any footnotes 
identifying the author or his affiliation should also appear on the cover sheet, The first page of the manuscript should 
omit the author's name and affiliation but should include the title and the date submitted, Every effort should be 
made by the author to see that the manuscript itself contains no clues as to his identity. 


Copyright and Permissions: All rights reserved. Written permission must be obtained from the American Psycho- 
logical Association for copying or reprinting text of more than 500 words, tables, or figures. Per s are normally 
granted contingent upon like permission of the author, inclusion of the APA copyright notice on the first page of repro- 
duced material, and payment of a fee of $10 per page of APA material, Permission fees are waived for authors who wish 
to reproduce their own articles. Address requests for permission to the Permissions Office, American Psychological 
Association, 1200 Seventeenth Street, N.W., Washington, D.C. 20036. 


Subscriptions : Subscriptions are available on a calenda' r basis only (January through December). Nonmember 
rates for 1975; $48 domestic, $49 foreign, $5 single issue. APA member rate: $20. Write to Subscription Section, APA, 


Reprints: Authors may order reprints of their articles from the printer when they recelve galley proofs. 


pack, Lastteg and Back Volumes: For information regarding back issues or back volumes write to Order Depart- 
ment, APA, 


Microform E ditions : For information Tegarding microform editions write to any of the following: Johnson Associates, 


Inc., P.O. Box 1017, Greenwich, Connecticut 06830; University Microfilms, Ann Arbor, Michigan 48106; or Princeton 
Microfilms, Princeton, New Jersey 08540. 


Change of Address: Send change of address notice and a recent mailing label to the attention of the Subscription 
Section, APA, 30 days prior to the actual change of address. Undelivered copies resulting from address changes will not 


be replaced; second-class mail is not forwarded unless subscribers notify the local post office in writing that they will 
guarantee second-class forwarding postage. 


Published monthly in two volumes per year, six issues per volume, by the American Psychological Association, Inc,, 
1200 Seventeenth Street, N.W., Washington, D.C. 20036. Printed in the U.S.A. Second-class postage paid at Washing- 
ton, D.C., and additional mailing offices, 


Journal Staff 
Harotp P. Van Cort, Managing Editor ÅRTHUR GORDON AND RITA Jaskowirz, 
Anita DEVivo, Executive Editor and Technical Editors 
General Manager Lynn D. Sprer, Manager, Publications 
DEBORAH S. FELDER, Supervising Business Services 


Technical Editor ANNE REDMAN, Subscription Manager 


Journal of Personality and Social Psychol 
1975, Vol. 31, No. 2, 189-198 Sheer 


Helping to Reward Another Person: A Cross-Cultural Analysis 


Kathleen L’Armand and Albert Pepitone 
University of Pennsylvania 


A series of experimental studies was designed to confirm several motivational 
tendencies theoretically expected to determine how much an individual maxi- 
mizes reward for another peer (other-rewarding as distinguished from other- 
relieving “altruism”), for himself, and for himself or another. Since these 
hypothesized motives, for example, need for achievement, the norm of social 
responsibility, and the value of equity, are likely to be affected by the larger 
cultural value system, we compared samples of individuals from widely dif- 
ferent cultures—India and the United States—with respect to their disburse- 
ment of rewards to another and to self. The major results indicate that Amer- 
ican subjects reward another peer more than they reward themselves, while 
Indian subjects reward themselves more than they reward another peer. Such 
marked helping on the part of American subjects is limited to the case where 
nothing can be lost in doing so. When individuals can lose in proportion to 
how much the other gains, the altruistic tendency vanishes, presumably because 
of the competitiveness engendered in such a zero-sum reward structure. The 
low level of helping by Indian subjects suggested a competitive tendency 
(“quasi competitiveness, since objectively nothing could be lost in rewarding 
another) engendered by a world view of limited resources. Questionnaire data 
confirmed this conjecture by showing that the lower the income of the sub- 
jects’ families, the lower the level of maximizing reward for another. Finally, 
non-Brahmin subjects rewarded a Brahmin peer less than did Brahmin sub- 
jects, which supports the view that quasi competitiveness is attenuated when 
the other is an ingroup member and/or the norm of social responsibility 
(operating at a comparatively low level in Indian subjects) is restricted to 
ingroup members. 


the striking absence of general theories of 
helping behavior. It would also suggest that 
investigators begin to taxonomize their de- 
pendent variables so as to facilitate general- 
izing from one study to another. As a rough 


A review of the relatively recent but al- 
ready voluminous literature in the area of 
helping behavior or “altruism” indicates that 
the particular acts of helping observed in 
field and laboratory studies are highly varied. 


This fact would be sufficient to account for 
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first step in this direction, it may be proposed 
that a major class of human helping behav- 
iors deals with other protecting or other 
relieving. Thus, a victim is relieved of pain 
or another form of suffering or is protected 
from some psychological or physical danger. 
Familiar examples in the literature of social 
psychology include donating to the sick or 
needy through charity (Macaulay, 1970), 
coming to the aid of an accident victim or one 
suddenly struck by illness (Darley & Latané, 
1970), donating a kidney (Fellner & Mar- 
shall, 1970), and so on. There is another 
class of helping that is less a subject of re- 
search but fairly ubiquitous in everyday life. 
In this category, the beneficiary is not in 
real need or pain nor in any kind of danger. 
The helping consists of his or her being gra- 
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tultously rewarded in some fashion: spon- 
taneous acts of kindness and generosity, the 
giving of unsolicited gifts without the expecta- 
Won of reciprocity, and so on. In the litera- 
ture, the helping that resembles this type in- 
cludes doing work for another person (Berko- 
witz & Daniels, 1963), distributing rewards 
to others (Schopler & Bateson, 1965), and the 
“kindness” in the cross-cultural study of Bald- 
win, Baldwin, Castillo-Valles, and Seegmiller 
(1971), As might be expected given the 
paucity of research, there is even less theo- 


concerning several motiva- 
tonal determinants of such helping behavior 
in two widely different cultures, Before form- 
ulating the hypotheses, it is necessary to de- 
scribe the experimental task and the opera- 
tional measure of rewarding behaviors em- 


In the standard procedure, adapted irom 
Goldman (Note 1), the subject is provided 
with money with which to wager on a series 
of events each having an explicit objective 
Probability of successful outcome, The amount 
pald off on each successful event is equal to 
the amount wagered on that event, an “even 
money" wager, The sequential wagering is 
done under conditions of uncertainty: The 
subject knows neither the sequence of future 


and, at 
the same time, to allow for the intrusion of 
motivational dynamics into wagering deci- 
sions, Preliminary studies showed that despite 
the uncertainty, subjects did wager in a “ra- 
tional” way: The amount of money placed on 
each of a sequence of bets was roughly pro- 
portional to the probability of success on each 
bet. In other words, individuals wagered so 
as to maximize the expected values of these 
bets. However, although the pattern of wagers 
uniformly showed a linear relationship be- 
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tween the amount wagered and events ar- 
ranged in ascending order of probability of 
success, there were differences in the slope of 
this wagering function, apparently produced 
by our experimental manipulations, In gen- 
eral, the degree of maximizing appeared to be 
mainly a function of how much the wagerer 
wanted to win for another person or for him- 
self. The stronger such “altruistic” or “sel- 
fish" motivation, the more careful the indi- 
vidual becomes in the allocation of wagers, 
the more attentive he is to the probabilities 
of success, and so on. Of course, factors other 
than motivation are no doubt involved in 
determining the maximizing slope, for exam- 
ple, past experience with probability concepts, 
attitudes toward risk, strategies about optimal 
resource allocation, etc.; but since we have no 
reason to suppose that these interact with 
experimental conditions, they can be con- 
trolled like other individual differences by 
random assignment of subjects to conditions. 

The experimental design feature in the iso- 
lation of motives * underlying self- and other- 
rewarding behavior consisted in varying the 
beneficiary of the subject’s wagers, In one ex- 
perimental condition, the wagerer himself re- 
ceives the proceeds of his successful wagers 
(self-reward) ; in another, a peer is the bene- 
ficiary of these winnings (other reward); in 
a third basic condition, subjects are in com- 
petition for the same rewards (self- or other 
reward), 

‘The research literature of social psychology 
as well as our own studies Suggest several 
motivational factors that could determine the 
degree of maximizing for self or other in these 
experimental conditions. 

In the self-reward condition, the most ob- 
vious motive to maximize one’s own reward is 
simple acquisitiveness. The unadorned “sel- 
fish” tendency to maximize Profit is obvi- 
ously a major of noncooperative 
choices in non zero-sum games (Messick & 
McClintock, 1968). It is also likely that a 
need for achievement (McClelland, 1961) 
would be implicated in wagering decisions, 
since the amount of money won can often be 
interpreted as a sign of achievement. 

* By motives we simply mean 
needs. Cognitions of purposes, cube or eee 
also included in this loose working definition, 
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As to the other-reward condition, the moti- 
vo ional analysis can be guided by two prop- 
‘s therein, In the first place, the wagering 
ect cannot gain or lose anything himself, 
tendency to reward another person is thus 
ctively independent of any considerations 
cding his own benefit. Second, the person 
m he is given the opportunity to reward 
stranger. This fact rules out any kind of 
uism that is based on an affective bond, 
nder such circumstances there are several 
ivational factors that could lead to reward 
imizing. For instance, the work of Berko- 
and Daniels (1963) would suggest that 
internalized norm of social responsibility 
ates in situations where the subject can 
clit a dependent other, The sense of re- 
vnsibility may take on a special coloration 
where subjects are given money with which 
to wager for anothers benefit, The subject 
think of himself as a banker or invest- 
cat broker who has the responsibility to 
maximize the return for his client. 
losely related to if not part of the social 
nsibility dynamism is the need to enhance 
self-concept. If individuals picture them- 
s as kind, good, generous, etc., and can 
idate such a self-image at no cost, they 
will do so. Thus, “ethical” self-concept con- 
‘emation could account for other-reward 
imizing. 
iven that subjects are paid for their job 
wagering, the concept of justice can 
allect wagering for another. When committed 
to a norm of equity, the wagerer may be 
~nected to distribute the available funds so 
as to achieve a “just” distribution of rewards, 
Thus, he would maximize less for the other 
when the reward he receives is lower than the 
other person's total gain, that is, he is 
nid a small fee while being In a position to 
win a larger amount for the other person, 


distribution. 
Finally, it is obvious that an individual may 
as as relieve—be- 
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other observer such as the experimenter, In 
the present studies we have attempted in 
various ways to reduce if not to eliminate this 
instrumental basis of helping. 

As mentioned above, subjects lose nothing 
by benefiting another person in the basic 
other-reward condition, A third condition, 
sel{- or other-reward, was set up to observe 
the tendency to reward another person when 
doing so does result in loss to the self, In this 
condition, motives to reward the self and to 
reward the other are directly opposed, This 
introduces the possibility that a competitive 
motive—a desire to get more than the other 
person—influences wagering, Competitiveness 
—the tendency to maximize relative gain—ts 
clearly one of the major motives in decision- 
making games (Messick & McClintock, 1968), 

It may be said without equivocation that 
all of the foregoing motive tendencles—ae- 
quisitiveness, achievement, responsibility, 
ethical self-concept, equity, and competitive- 
ness—originate in and are controlled by the 
individual’s social environment, particularly 
the value system in the culture of which he 
is a part, Tt follows that to understand fully 
characteristics, and 


reviewing 
pological literature, he proposed t 
of the “Image of the limited good” 
the world view of the individual 
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societies and, to a lesser extent, of the indi- 
vidual in developing nations. To the holder 
of the image of the limited good, all material 
and nonmaterial rewards are regarded as fixed 
and severely limited, and any gain to another 
person necessarily implies a loss to oneself. 
Banfield (1958) has identified a similar orien- 
tation among the Neapolitan villagers in his 
field study. Hirschman (1958) and McClel- 
land and Winter (1969) have described the 
same assumptions about reward distribution 
among businessmen in South America and 
India, respectively. 

One supposes that altruistic motives of the 
sort we have specified would be weakened in 
proportion to the strength with which indi- 
viduals held to this world view. Clearly, if a 
person chronically believes that he is in a 
zero-sum situation, he is not often aroused to 
give generously to others. Indeed, regardless 
of the objective rules of reward distribution 
in a particular situation, he may be motivated 
to limit reward to others because of his com- 
petitive orientation. Thus, to the extent that 
the image of the limited good describes the 
reward-distribution assumptions of the urban 
university students included in our Indian 
sample, we would predict that they would 
show a tendency to limit delivery of rewards 
to others more than would the subjects in our 
sample of American students. Since the 
counteraltruistic motive generated by a lim- 
ited-good world view may operate in a situ- 
ation where there is no actual competition 
for the same rewards, it may be termed quasi 
competitive. 

Another cross-cultural proposition about 
self- and other-rewarding behavior derives 
from the empirical work of area specialists 
who attempt to specify the psychological 
prerequisites or concomitants of economic 
growth (see Smith & Inkeles, 1966, for a list 
of such variables), Probably the most exten- 
sive empirical investigations along these lines 

are those of McClelland and his associates 
(McClelland, 1961, 1966; McClelland & 
Winter, 1969). In these studies, McClelland 
(1961) has shown that need for achievement 
is high in both the United States and India. 
Tt has also been found on the basis of a 
content analysis of popular literature that 
“interest in the welfare of the generalized 
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other fellow” (McClelland, 1966, p. 36) 
coupled with the need for achievement, pre- 
dicts economic growth better than the achieve- 
ment motive alone. Such coded altruistic 
themes are relatively low in India and high in 
the United States. On the basis of these find- 
ings we would predict self-reward maximizing 
to be the same in the two countries but other- 
reward maximizing to be higher for American 
subjects. 

To summarize the issues and hypotheses 
under study, in the self-reward condition we 
would expect acquisitiveness and achievement 
drive to motivate reward maximizing and to 
do so in the same degree for American and 
Indians. In the other-reward conditions, 
available evidence leads us to expect internal- 
ized social responsibility pressure or a related 
need for an ethical self-concept to motivate 
American subjects to maximize rewards for 
the other, There is no basis for predicting 
which will be greater, self- or other maximiz- 
ing. Although there is no hard information on 
the extent to which responsibility norms or 
ethical self-concept needs would bring Indian 
subjects to maximize rewards for others, there 
is a plausible theoretical basis for expecting 
that a limited-good world view, prevalent in 
economically depressed cultures, will generate 
a quasi-competitive motive in the other- 
reward condition which will reduce the level 
of maximizing. Thus, it is predicted that 
Indian subjects will maximize to a greater 
extent in self-reward than in other-reward 
conditions. The degree of reward maximizing 
for another by American and Indian subjects 
can be lowered by a need for equity in the 
distribution of rewards. This is an empirical 
question which we shall investigate. In the 
self- or other-reward condition, American sub- 
jects can be expected to maximize their own 
rewards at the expense of the other. That is, 
in such a zero-sum situation, a competitive 
motive should lead them to more self- than 
other maximizing, Tt should be mentioned 
here that due to a lack of resources and time, 
we were unable to run a self- or other-reward 
condition with Indian subjects. However, to 
anticipate our findings, we had already veri- 
fied our prediction that a quasi-competitive 
motive in the other-reward condition severely 
suppressed the other-reward maximizing of 
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our Indian subjects. It would not have added 
much to show that competitiveness would be 
even greater in the zero-sum situation. 


METHOD 


Subjects were paid volunteers, male university 
students in Philadelphia, Pennsylvania, (n = 80) 
and in Madras, Tamil Nadu, India (n= 92). The 
two student samples were selected for comparability 
in age and level of education; for both, the mean 
age was around 20 and the mean level of education 
15 years. Instructions and questionnaires were given 
in English in both Philadelphia and Madras, with 
Tamil translations available in Madras. Experi- 
menters in both countries were male graduate stu- 
dents, Americans in the United States and Indians in 
India. 


Procedure 


Subjects were introduced to the experiment by 
being told that its ultimate purpose was to build a 
decision model, to find out how people make money 
decisions that affect themselves and others. 

Subjects were then arbitrarily assigned to roles 
by the experimenter. All subjects were told that a 
fixed amount of money ($5 in the United States, Rs 
5/- in India)? was to be provided and would be 
available for use by the subject in wagering; any 
money not wagered would go back to the experi- 
menter, Subjects were encouraged to use all of the 
money at their disposal. 

Each subject was then asked some practice ques- 
tions. These involved stating the chances of picking 
a blue card on a blind draw from decks with varying 
ratios of blue to red cards. If the subject could not 
answer these questions correctly, he was coached 
until he was able to do so. This procedure was 
designed to insure that all subjects were familiar 
with simple probability concepts. 

The wagering procedure was then explained. The 
experimenter produced a demonstration jar, a clear 
glass jar containing 10 marbles, 5 blue and 5 yellow. 
The experimenter pointed out a large label affixed to 
the jar giving the number of blue marbles over the 
total number of marbles, and asked each subject to 
count to verify the information on the label. The 
experimenter then pointed out a hole cut in the lid 
of the jar which was just large enough to allow one 
marble to fall out when the jar was turned over. 

The experimenter then took out the money avail- 
able for wagering, one hundred coins. Each subject 
was told that nine jars would be brought out, one 
at a time, from a box not visible to the subject. All 
of the jars would have 10 marbles, but the propor- 
tions of blue marbles out of the total of 10 would 


ple, BE 

2The official rate of exchange at the time was 
about Rs 7/50 to $1 U.S. However, the 1:1 con- 
version we adopted was more accurate in terms of 
purchasing power for common items of student 
expenditure. The coins used were 100 US. nickels 
and 100 coins of 5 naya paisa denomination in India. 


an i. re ee 
193 


vary, and a label giving this proportion would be 
on each jar. The jars would be produced in random 
order. Each time a jar was brought out, the subject 
was instructed, he could decide how many coins he 
wanted to wager on that jar, that is, on a marble of 
a given color falling out of the hole in the lid when 
the jar was overturned, The jars would be over- 
turned only after the subject had wagered on all 
nine jars. The only rule, the subject was told, was 
that at least one coin must be wagered on each jar. 

The payoff procedures, varying according to condi- 
tion, were then explained to the subject. After all 
nine jars had been brought out and all wagers 
placed, the experimenter would shake each jar and 
turn it over. 

Subjects in both self-reward and other-reward 
conditions were told that when a blue marble fell 
out of the jar, it was a successful or winning wager. 
When a yellow marble fell out, the money wagered 
on that jar was returned to the experimenter. Each 
subject in the self-reward condition was told that 
all the money wagered on “winning” (blue-marble) 
jars could be kept by him. Each subject in other- 
reward condition was told that all of the money 
won as a result of successful wagers would be given 
to another person. This other person was described 
to the subject as another student who had volun- 
teered to take part in the experiment and who had 
been assigned to the role of beneficiary of the sub- 
ject’s wagers, with no other part in the experiment 
and no other fee for participation, Subjects in the 
other-reward condition were also told that they 
would be paid a flat fee for participation in the 
experiment. The amount of the fee was either large 
(equal or nearly equal to the amount that could be 
won for the other) or small (one quarter of the 
amount that could be won for the other). In the 
other-reward condition it was arranged that the two 
“subjects” would see one another in the waiting 
room, so that the subject would have no doubts as 
to whether another person were really involved, but 
they were given no opportunity for conversation. 
Sometimes the “other subject” was a confederate of 
the experimenter; sometimes he was another naive 
subject, both subjects then being told that the other 
was the passive beneficiary. 

Subjects in the competitive self- or other-reward 
condition were given a description of a zero-sum 
game in which money wagered on jars yielding a 
blue marble would be paid to them, while money 
wagered on jars yielding a yellow marble would be 
paid to the other subject. 

After the instructions had been completed and 
questions had been answered, the wagering procedure 
was carried out as described. The nine jars in the 
series were presented, one at a time, with the proba- 
bility of a favorable outcome (blue marbles in the 
jar out of a total of 10 marbles in the jar) in this 
order: 5, 6, 4, 8, 5, 3, 7, 2, 5. 

After the subject had made all of his wagers, the 
experimenter turned the jars over and either gave 
the money won to the subject or carried it out of 
the room, ostensibly to be given to the other person. 
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TABLE 1 


MEAN PERCENTAGE OF Corns WAGERED BY AMERICAN SUBJECTS ON EVENTS ee PETORO, 
EVEN, AND FAVORABLE PROBABILITY OF Success; MAXIMIZING INDEX; AND PERCENTAGE 
AVAILABLE CoINs WAGERED UNDER SELF-REWARD, OTHER-REWARD, AND 
SELF- OR OTHER-REWARD CONDITIONS 


Maximizing % 
Eondon aE E ssaravorable “Index coinswagered | 
Self-reward 15 34 51 S “3 i 
Other reward-large fee 11 36 53 oa “4 - 
Other reward-small fee 10 34 56 Ay ae 
Self- or other reward 10 32 58 É 


Note. n = 20 for each of the reward conditions. The differences between those means with common letters are tested fi = 


nificance (two-tailed) as follows: 
at = 1.76, p < 07. 
Dt = 2.60, p < 01. 
og = 4,11, p <.01. 


In self- or other-reward condition, of course, both 
the wagerer and the passive beneficiary received some 
winnings. 


Controls 


Several measures were taken to reduce chances of 
social desirability effects in wagering due to perceived 
pressure from the experimenter or from the partner. 
First, the purpose of the experiment was described 
as being model building rather than a study of 
individual differences, Subjects were also told that 
there were no incorrect answers and that there was 
no “puzzle” for them to solve. The large number of 
subjects participating was stressed, and references to 
computers were freely thrown about. Experimenters 
adopted a detached manner and made no comments 
during the actual wagering period. Subjects were 
told that exits were arranged so that they would not 
meet their partners after the experiment. Both 
Universities (Madras and Pennsylvania) were large, 
and there was little chance of a later “chance” 
meeting. Finally, the anonymity of each subject was 
preserved as far as possible by using numbers rather 
than names as identification. 


Dependent Variable Measures 


As indicated earlier, maximizing behavior may be 
described by the slope of a curve representing the 
distribution of coins wagered over events arranged 
in ascending order of probability of successful out- 
come. For convenience in analysis, the nine events 
(jars of marbles) are divided into three groups: 
favorable (F) (p=.6, .7, and .8), even (E) (p= 
-5, .5, and .5) and unfavorable (U) (p=.2, .3, and 
-4). The mean percentage of those coins wagered 
which were placed on favorable, even, and unfavor- 
able events is reported for each condition to be 
discussed. The summary measure which represents 
the slope of the wagering curve, the “Maximizing 
Index” (MI) is as follows: 3 


3 The maximizing index excludes the wagers placed 
on the .5 jars. The location of these jars first and 
last in the series made them especially susceptible to 
haphazard wagering. 


= 
MI = —— 


The mean percentage of available coins act y 
wagered is also reported for each condition a 1 
additional measure of the wagerer’s motivation <o 
reward the beneficiary. Since any coins not wa l 
reverted to the experimenter, limiting the total n 
ber of coins wagered would lessen the benefici 
chances of winning money. 


RESULTS 
American subjects 


Table 1 summarizes patterns of wagering in 
the self-reward, other-reward, and self- or 
other-reward conditions for American sub- 
jects. The reader is reminded that there are 
two versions of the other-reward condition—a 
large-fee ($3.60) and a small-fee ($1.25) 
version—which enable us to check on the 
possible operation of a reward-equity moti- 
vation that could depress maximizing for 
another person. 

It is immediately evident that subjects in 
all conditions show a general “rational” ten- 
dency in wagering in that they use more 
money to wager as the probability of success 
rises. Further, it will be noted that a larger 
proportion of funds has been wagered on 
high-probability bets and a lower proportion 
on low-probability bets by subjects in the 
other-reward condition as compared to those 
in self-reward condition. This difference is 
reflected in the Maximizing Index: In both 
other-reward conditions these American sub- 
jects maximized to a greater degree, with 
Maximizing Indices 65.3 and 69.0, than did 
subjects in the self-reward condition, with a 
Maximizing Index of 54.6. 
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TABLE 2 


MEAN PERCENTAGE OF Corns WAGERED BY INDIAN SUBJECTS ON EVENTS OF UNFAVORABLE, 
EVEN, AND FAVORABLE PROBABILITY OF Success; MAXIMIZING INDEX; AND PERCENTAGE OF 
AVAILABLE COINS WAGERED UNDER SELF-REWARD AND OTHER-REWARD CONDITIONS 


Condition T % % Maximizing A 
unfavorable even favorable Index coins wagered 

Self-reward | 

(n = 37) 17 34 49 49,760 9,gbit 
Other PEE fee e 

n = 20) 24 33 43 2507 65.0" 
Other reward-small fee 2 7 

(n = 25) 23 34 43 29.6° 62.6" 


Table 1 also shows that there was practi- 
cally no difference in the degree of maximiz- 
ing for other between the small-fee and large- 
fee other-reward conditions. Thus, the poten- 
tial greater inequity involved in the condition 
where the wagerer was paid relatively little 
did not result in reduced maximizing for the 
other, This suggests that subjects were not 
responsive to perceived inequity when social 
responsibility and self-concept enhancement 
were involved or that they did not perceive a 
difference in potential reward between them- 
selves and another subject as inequitable. 

Thus far, then, it appears that the tendency 
to reward another person is strong for this 
group of subjects, but it must be remembered 
that they did not lose anything by their 
largesse. When an element of competition for 
the same rewards is introduced, as in the self- 
or other-reward condition, subjects are seen to 
tend very strongly to maximize their own 
rewards at the expense of those of the other 
person. Indeed, it will be noted in the bottom 
row of Table 1 that a competitive structure 
has produced reliably greater maximizing for 
the self than is found in the noncompetitive 
self-rewarding case.* 

To summarize, in a situation where Amer- 
ican subjects did not lose by wagering for the 
benefit of a dependent partner, they maxi- 
ere 

4 The difference between self-reward and self- or 
other-reward conditions has been replicated with a 
second sample of comparable undergraduate sub- 
jects, with 15 in each condition. Maximizing indices 


for the two conditions were, respectively, 58.1 and 
71.8, t=1.81, p< 410, based on a two-tailed test. 


Err differences between those means with common letters are tested for significance (two-tailed) as follows: 


mized probable gains for the other person to 
a greater extent than did subjects who 
wagered for their own benefit. This relatively 
high degree of maximizing for the other was 
not affected when the preexperimental re- 
wards promised to the wagerer created a 
potential inequity strongly favoring the pas- 
sive beneficiary. When, however, subjects 
were put into a situation of competition for 
the same rewards, they chose to benefit them- 
selves at the expense of the passive partner. 
In motivational terms, a motive for reward 
equity does not override motives for social 
responsibility or ethical self-image validation. 
On the other hand, a competitive structure 
does serve as a limiting condition for such 
altruistic motivations. 


Indian Subjects 


Tn contrast to the American subjects, it can 
be seen in Table 2 that Indian subjects 
showed a higher degree of maximizing when 
the wagerer himself was to benefit from his 
wagers than when the other person was to be 
the beneficiary. Reflecting this effect quanti- 
tatively, the Maximizing Index for the self- 
reward condition (49.7) is reliably higher 
than that for both large-fee (Rs 5/-) and 
small-fee (Rs 1/25) other-reward conditions, 
which showed maximizing indices of 25.5 and 
29.6, respectively. It will also be noted that 
the number of available coins actually 
wagered is considerably higher for self-reward 
than for other-reward conditions. Evidently 
the other person is getting short-changed by 
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FIGURE 1. Comparison of maximizing for self and 
maximizing for other by American and Indian sub- 
jects, 


reluctance to wager at all as well as by the 
indifferent maximizing. 

Again, as in the United States experiments, 
considerations of equity did not seem to af- 
fect other-rewarding behavior. It will be re- 
membered that this matter can be checked 
by comparing wagering for two other-reward 
conditions involving large fees paid to the 
wagerer (no inequity) and small fees paid to 
the wagerer (inequity favoring beneficiary), 
Tt can be seen that the Maximizing Indices 
for these two conditions are highly similar: 
therefore reduced maximizing for the other 
person cannot be explained as the result of 
reward-equity motivation, 

There is another form of equity, however, 
that could Possibly account for comparatively 
depressed maximizing for the other person. 
The wagering subject, it will be remembered, 
is acting for a completely passive beneficiary, 
unfair role 
. In order to eliminate such “role 
inequity,” we added a condition where the 
subject wagered for the benefit of a partner 
who would then, in his turn, wager for the 
benefit of the subject, In other words, the 
roles would alternate and be justly distrib- 
uted. As it happened, removing the element 
of task inequity in this way did not result 
in an appreciably higher degree of maxi- 


mizing for the other. The Maximizing Index 
A 

5 The independence of these two modes of limiting 
reward delivery within conditions is demonstrated by 
near zero, r= .020, obtained be- 
tween Maximizing Indices and coins Wagered for 
Subjects in the other-reward conditions, 
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To check on the existence 
petitiveness, we looked into 
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background of our Indian sample. Q 
experimental questionnaire we asked 
to estimate monthly family income. 
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HELPING TO REWARD ANOTHER PERSON 


As indicated earlier, since Indian society is 
more pluralized than American society, we 
would expect that the range of social respon- 
sibility would be smaller in India. To test the 
proposition that Indians would reward others 
selectively, the subjects in all three other- 
rewarding conditions® were divided along 
caste lines into Brahmins and non-Brahmins 
and compared as to maximizing for a 
Brahmin beneficiary (the confederate). 

As can be seen in Table 3, non-Brahmin 
subjects maximized rewards for a Brahmin 
peer to a reliably lesser degree than did 
Brahmin subjects. Also note that the maxi- 
mizing tendencies for Brahmin and non- 
Brahmin subjects in the self-reward condition 
did not differ. Taken together, these results 
suggest that a norm of social responsibility to 
help is not only relatively weak in these sub- 
jects in this wagering situation but tends to 
be restricted in application to the ingroup. It 
could also be supposed that the quasi com- 
petitiveness of Indian subjects is attenuated 
when a peer is a member of the ingroup.’ 

In conclusion, when American subjects have 
nothing to lose by being generous to peers, 
they do so, maximizing more for the other 
than for self. Such a finding is consistent with 
the assumption that this undergraduate sam- 
ple is motivated by the internal pressure of 
social responsibility or to maintain an ethical 
self-image. Their altruism, however, appears 
to be strictly limited. When placed in a com- 
petitive reward structure, American subjects 
maximize reward for themselves for more than 
they do for others. Indeed, competitiveness 
adds to the level of maximizing of self-reward 
that is motivated by acquisitiveness and the 
need for achievement. 

Our Indian subjects maximize reward for 
themselves to a much higher degree than they 
maximize reward for others. The evidence 
suggests that a strong competitive tendency 
~The three conditions involving wagering for a 
passive beneficiary (other reward-large fee, other 
reward-small fee, and other reward-reciprocal an- 


ticipated) were combined to compensate for small 
numbers of subjects of each caste group in indi- 
vidual conditions. 

7Mean income for Brahmin and non-Brahmin 
groups did not differ, indicating that family income 
and caste affiliation are independent of one another 
in producing effects on maximizing behavior. 
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TABLE 3 


Comparison OF MAXIMIZING ror A BRAHMIN 
OTHER BY BRAHMIN AND Non-BRAHMIN 
SUBJECTS UNDER SELF- AND THREE 
OTHER-REWARDING CONDITIONS 


Brahmins Non-Brahmins 

Condition 
| SEs | ean 
Other reward>© | 22 ee 35 23.6" 
Self-reward* 9 47.3 22 51.4 


at = 1,90, p < .05, two-tailed. 

b The three conditions were other reward-large fee, other 
reward-small and other reward-reciprocal anticipated. 

© Eight subjects did not report caste. 

4 Six subjects did not report caste. 


is responsible for this; competitiveness en- 
gendered by a view of the world as holding 
limited resources. 

If the economic status of the family and 
the outgroup status of the peer tend to reduce 
the level and range of helping on the part of 
Indian subjects, these variables should have 
the same effect on American subjects. Al- 
though this prediction is an empirical ques- 
tion for future study, it may be argued in 
advance that relatively poor American fam- 
ilies do not exist in the same societal context 
of scarcity as do Indian families. As for the 
outgroup implication, it is probably correct to 
assume that American minority group mem- 
bers are not as separate culturally from the 
majority as Indian castes are from each other. 


REFERENCE NOTE 


1. Goldman, A. Vested interests and. their influence 
upon risk-taking under conditions of social re- 
sponsibility. Paper presented at the meeting of the 
Eastern Psychological Association, Boston, April 
1967. 
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Moral Judgment in Sociopathic and Normal Children 


Anthony Frank Camp 
Yale U 


Mental age- and 1Q-matched normal 
tered Kohlberg’s moral development 


agna and Susan Harter 
niversity 


and sociopathic children were adminis- 
interview and the Wechsler Intelligence 


Scale for Children. The results revealed that level of moral reasoning was 
higher for normal than for sociopathic children at both mental age levels, 


Within each group, high-mental-age 


children tended to have higher moral 


judgment scores than low-mental-age children, suggesting the presence of a 


general cı 
of the sociopathic children wa: 


ognitive factor underlying moral development. The poorer performance 
s interpreted as supporting the formulation 


that sociopathy is related to an arrest in moral development. Discussion 


focused on the relative lack of opportuni 


ties for role-taking and identification 


in the families of sociopathic children. 


Research in sociopathy has largely focused 
on the role of hereditary and neurological 
determinants, as well as environmental or 
“ecological,” for example, social and cultural, 
forces (McCord & McCord, 1964). However, 
the cognitive aspects of the syndrome have 
been virtually ignored. Kohlberg’s (1964, 
1969) formulation of the development of 
moral reasoning, derived from Piagetian 
theory, would appear to be particularly ap- 
propriate to the analysis of cognitive-devel- 
opmental factors in sociopathy. The present 
study applied this approach in examining the 
level of moral judgment among sociopathic 
and nonsociopathic children. 

Central to Kohlberg’s cognitive-develop- 
mental analysis is the notion that moral de- 
velopment proceeds through an invariant 
sequence of six stages, each of which is quali- 
tatively different from the former stages and 
represents more highly differentiated cognitive 
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structures of moral reasoning. The “precon- 
ventional” levels focus on avoidance of nega- 
tive physical consequences (Stage 1) and on a 
system of exchange and need satisfaction 
(Stage 2). The “conventional” levels of moral 
reasoning emphasize conformity to stereo- 
types of “nice” behavior meeting with social 
approval (Stage 3) and the adherence to law 
and maintenance of social order (Stage 4). 
At the “post-conventional” levels, Stage 5 
focuses on changes in law to recognize both 
individual rights and social utility; in Stage 
6, individual decisions of conscience in accord 
with self-chosen ethical principles predomi- 
nate. (See Kohlberg, 1969, for a complete 
description of the stage characteristics.) 

Kohlberg has empirically demonstrated the 
existence of this six-stage sequence of moral 
reasoning across a variety of moral issues, for 
example, interpersonal obligation, value of 
human life, law and social order, punitive jus- 
tice, etc., and has validated this develop- 
mental progression cross-culturally. While 
cultural and environmental inputs may accel- 
erate, impede, or even arrest the rate of moral 
development (Blatt & Kohlberg, 1971; 
Kramer, 1969; Turiel, 1966), evidence indi- 
cates that they do not alter the sequence of 
stages or their basic form (Kohlberg & 
Kramer, 1969). 

Critical to the present study is Kohlberg’s 
(1958) finding that arrest at preconventional 
levels of moral development is related to social 
adjustment. He found that delinquent boys, 
all of whom had histories of antisocial be- 
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havior, consistently used preconventional 
moral reasoning, in contrast to the more ma- 
ture judgments of control subjects. Kohlberg 
attributed the delinquents’ developmental ar- 
rest to experiential factors, citing as evidence 
such factors as the parents’ tendency to be 
arbitrary, inconsistent, and frustrating in 
their child-rearing practices. Clinical evidence 
also indirectly supports the hypothesis that 
preconventional moral development is associ- 
ated with overt antisocial behavior, Socio- 
paths or psychopaths (the terms are used 
interchangeably here) have often been char- 
acterized as hedonistic, with strong self-in- 
dulgent drives (McCord & McCord, 1964), as 
lacking an internalized sense of guilt and re- 
sponsiveness to social criticism or evaluation, 
and as modifying their behavior only in re- 
sponse to brute force or credible threats of 
violence (Cleckley, 1959). Such hedonism 
and intractability are consistent with precon- 
ventional moral reasoning, 

Furthermore, the onset of sociopathy usu- 
ally occurs between the ages of 10 and 13 
(Cleckley, 1959). Kohlberg (1958) has 
shown this same period to be the time of 
transition from preconventional (Stages 1 
and 2) to conventional (Stages 3 and 4) 
levels of moral judgment. Thus, arrest at the 
hreconventional stages may be related to the 
emergence of sociopathic symptoms. While 
their normal peers develop a conventional 
“good-boy” morality, oriented toward con- 
forming to and maintaining the expectations 
of others, the arrested children remain at a 
preconventional “might makes right” level, 
Their consequent behavior becomes progres- 
sively more age inappropriate, thereby in- 
creasing the likelihood that society will label 
them as deviant, 

While there has been some evidence (Kohl- 
berg, 1958) demonstrating a relationship be- 
tween sociopathy and preconventional moral 
reasoning, general level of cognitive devel- 
opment has not been controlled in studies 

comparing the moral reasoning of sociopathic 
and nonsociopathic subjects. Since certain 
cognitive structures appear necessary, al- 
though presumably not sufficient, for the 
emergence of higher levels of moral judgment, 
one possible explanation for differences be- 
tween sociopathic and nonsociopathic sub- 
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jects is that the former are simply function 
ing at lower levels of general cognitive dev 

opment. Thus, the present study was directe: 
toward examining possible differences in mor: 
reasoning of sociopathic and nonsociopathi 
boys, matched on cognitive-development 

level. Mental age (MA), rather than calend: 
age (CA), was chosen as the measure of cos 
nitive-developmental level since MA repre 
sents a more precise index of those cognitis 
processes and reasoning abilities involve: 
across a variety of problem-solving situation 
The finding of group differences in mora 
judgments would suggest that certain life ex 
periences, over and above those required fo 
normal cognitive development, may be re 
sponsible for the attenuated levels of mora 
reasoning observed in sociopathic subjects, 

To determine whether the typical develop. 
mental changes in moral reasoning would be 
found among a normal, nonsociopathic sample 
as well as among sociopathic children, each 
diagnostic group included two MA levels, 10 
and 13. A behavioral definition of sociopathy 
was used, given the difficulties inherent in 
formulations which rely on intrapsychic proc- 
esses or inferred dynamics (see Cleckley, 
1955, 1959, for an excellent statement of this 
problem). The major dependent measure was 
level of moral judgment derived from Kohl- 


berg’s interview technique and method of 
analysis. 


METHOD 
Subjects 


Subjects were 44 boys, 21 were sociopathic and 23 
were normal. The sociopathic boys were residents at 
High Meadows, a state-operated treatment center for 
children manifesting different types of psychopa- 
thology. Their average length of participation in the 
High Meadows program was 18 months, The normal 
children were drawn from one elementary and one 
junior high school in the greater New Haven area, 
All subjects were from working-class families, ex- 
cept for two sociopaths who were from professional 
families. The normal and sociopathic subjects were 
matched on the Wechsler Intelligence Scale for Chil- 
dren Full-Scale IQ and MA, and each diagnostic 
group was then divided into high- and low-MA 
levels. The means and standard deviations for MA, 
IQ, and CA for all subgroups are given in Table E 


Operational Definition of Sociopathy 


Sociopathy was defined in terms of overt behav- 
iors and objective demographic characteristics, the 
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definition relying heavily on Robins’s (1966) exten- 
sive study of sociopathic children. Robins identified 26 
common antisocial behaviors and eight key demo- 
graphic characteristics or “predictors” which dis- 
criminated between the childhood behaviors of un- 
remitted sociopathic adults and both remitted socio- 
pathic and other diagnostic groups. Antisocial be- 
haviors included markedly impulsive behavior, van- 
dalism, and pathological lying; the predictors in- 
cluded truancy and poor school performance, time 
spent in a correctional institution, and destructive 
acts against businesses. Robins found that maximum 
discrimination resulted from labeling as presocio~ 
pathic all those children whose history showed (a) 
six or more antisocial behaviors and (b) seven or 
more demographic predictors. In the present study 
the antisocial behaviors and demographic predictors 
comprised the sociopathy checklist, and those chil- 
dren whose records revealed either of these two pat- 
terns comprised the sociopathic group. This pro- 
cedure had the general advantage of a precise op- 
erational definition and made possible highly reliable 
judgments about the presence of relevant behaviors 
in the case history material. 


Sociopathic Group 


Only boys were included, since very few girls 
resided at High Meadows (reflecting the relatively 
low incidence of sociopathy among females in the 
general population). Trained scorers reviewed the 
case history of each boy for evidence of behaviors 
on the sociopathy checklist, Twenty-one met Cri- 
terion A and were designated as the sociopathic 
group. The Wechsler Intelligence Scale for Children 
was administered to these subjects in order to obtain 
a current IQ and MA. 


Normal Group 


Preliminary selection of normal fifth- and seventh- 
grade boys was made on the basis of IQ data ob- 
tained from the school records (although Wechsler 
Intelligence Scales for Children were subsequently 
administered for the purpose of IQ and MA match- 
ing of subjects to the sociopathic group). To ensure 
that there was no hidden sociopathy among these 
boys, behavior reports were gathered from school 
records, psychological ‘examiner's files, school princi- 
pals, teachers, and guidance counselors. Children 
who had at least two antisocial behaviors and two 
predictors on the sociopathy checklist were elimi- 
nated from the pool. For the remaining children, the 
teachers completed a “classroom behavior checklist” 
devised for this study. The list comprised brief 
descriptions of 17 mildly troublesome behaviors, for 
example, lunchroom misconduct, lying, borrowing 
things without permission, fighting. For each child, 
the teacher rated the occurrence of each behavior on 
a 4-point scale: (1) never, (2) occasionally, but no 
more than average, (3) more so than average, (4) 
so frequently as to characterize much of the child’s 
behavior in school. Any child with four or more 
behaviors occurring more frequently than average 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS ror MA, 1Q, 
anp CA vor Each Grour 


M SD M SD M SD 


| 
han | 13.36 143 9.78) 12.61 | 144 
e 12 | 10,36 | <56 | 95.58 | 8.05 | 10.96 | 1.97 
Sociopathic | | 
High MA | 11 | 13.39) 1.14 | 101.18 | 14.09 | 13.46] 1.75 
Low MA | 10 | 10,00 117 | 93.40 | 14,01 | 10.81 | 1.05 


Nole. MA = mental age; CA ~ calendar age, 


was eliminated. The remaining children had no 
serious and few minor antisocial behaviors, and 
none had ever had any psychiatric treatment or 
counseling. Wechsler Intelligence Scales for Children 
were administered to 30 of these boys from which 
23 were selected on the basis of IQ and MA scores 
comparable to the sociopathic group. 


Procedure 


Kohlberg’s moral development interview has been 
fully described elsewhere (Kohlberg, 1958; Turiel, 
1966). The interview consists of 12 hypothetical 
situations of moral conflict and standard probe ques- 
tions for each story, A subset of four stories was 
selected for use in the present study, The central 
conflict in each story was as follows: (a) a boy 
laboriously earns money to finance a summer camp- 
ing trip; his father demands the money for his own 
use; should the boy refuse? (b) a man’s wife is 
dying of cancer; the only drug that may save her is 
priced beyond his means; should he steal it? (c) a 
woman is dying in pain from an incurable disease} 
she begs her doctor to end her life with a painless 
anesthetic; should the doctor respect her wishes? (d) 
an escaped convict becomes a wealthy industrialist- 
philanthropist and initiates programs of significant 
social benefit; a tailor discovers his true identity; 
should the tailor report him to the police? This 
group of stories has been found to be appropriate 
with children ages 8 to 16 (Blatt & Kohlberg, 1971). 

The children were interviewed by the first author 
who read each story aloud. After each story, he 
asked the probe questions and tape-recorded the 
child’s responses. Tapes were then transcribed, coded 
to preserve anonymity, and scored independently by 
two judges working blindly. 

The global scoring method (Kohlberg, 1958), 
which yields a summary measure of moral develop- 
ment, was employed. Scoring is based on themes 
which represent 25 fundamental moral issues which 
Kohlberg has identified, for example, interpersonal 
obligation, value of human life, law and social order, 
issues of affection, punitive justice. Each response 
was categorized according to the issue it represented 
and then given a stage score, representing the de- 
yelopmental level of the response. Both “pure-” and 
umixed-” stage scores were obtained, Pure stage 
scores indicate moral reasoning at one of the six 
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TABLE 2 
MEANS AND STANDARD DEVIATIONS FOR GRAND 
GLOBAL MORAL MATURITY MEASURE 


Group n M RERS 
Normal 
High MA 11 237.64 24.06 
Low MA 12 221.58 31,42 
Sociopathic 
High MA re 171.36 31.98 
Low MA 10 157.90 14.15 


stages of development and are represented by an 
integer, 1 through 6. Mixed-stage scores indicate a 
transitional level of development and are represented 
by two integers, for example, 1(2), 2(3). The domi- 
nant stage, outside the parentheses, is more heavily 
used than the stage inside the parentheses. 

Scores were then converted to moral maturity 
scores (Blatt & Kohlberg, 1971) as follows: transi- 
tional scores were weighted 2 for the major stage, 
1 for the less dominant Stage, and the sum divided 
by 3, the total number of weights; pure stage scores 
were unweighted. All scores were then multiplied 
by 100 and rounded to convert them to an integer 
scale from 100 to 600.1 Three different global moral 
maturity scores were computed for each subject: 
(a) a global story score by first averaging the moral 
maturity scores across the issues mentioned for each 
story and then averaging these means across stories; 
(b) a global issue score by first averaging the scores 
across stories for each issue and then averaging 
these means across issues; (c) a grand global score 
by simply averaging all scores without respect to 
Issue or story, 


RESULTS 
Moral Maturity Scores 


Preliminary analyses indicated high inter- 
correlations among the three moral develop- 
ment measures: grand global, global story, 
and global issue, all 7s > .95, Thus, for sub- 
sequent analyses the most stable of these 
measures, the grand global index, was em- 
ployed. The interjudge reliability of this 
score, based on rank order correlations, was 
-78 for the normal children and .73 for the 
sociopathic children.? (These values are 

1 Moral maturity score equivalents for Stages 1, 
1(2), 2(1), and 2 are 100, 133, 167, and 200, re- 
spectively; for Stages 2(3), 3(2), and 3, they are 
233, 267, 300, etc. up through Stage 6. 

2 Protocols were scored blind, and the interrater 
reliabilities were based on the judgments of two inde- 
Pendent raters, one of whom was not directly in- 
volved in the study. The mean difference between 
judges in moral maturity ratings across all sub- 
Jects was only 29.6, in which a difference of 100 in 
this scoring system represents one whole stage. 
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slightly higher than that, r = .71, reported by 
Rest, Turiel, & Kohlberg, 1969, for subjects 
in this same age range.) Table 2 presents the 
mean and standard deviation of the grand 
global score for each subgroup. 

A Group X MA analysis of variance of the 
grand global measure yielded a significant 
main effect for group, F (1,40) = 58.79, p < 
-001, and a main effect for MA which ap- 
proached significance, F (1,40) = 3.03, p < 
-10. The interaction was not significant. The 
group effect reflected the fact that the moral 
judgment scores were lower for sociopathic, 
X = 165.0, than for normal children, ¥ = 
229.3. These mean moral maturity scores 
convert into stage scores of 2(1) for the 
sociopathic and 2(3) for the normal children. 
As can be seen in Table 2, diagnostic group 
differences were evident at both MA levels. 
The overall MA difference obtained reflects 
the tendency for high-MA subjects to have 
higher moral maturity scores, Š = 204.5, 
than low-MA subjects, ¥ = 189.7. 


Group Differences on the Wechsler 
Intelligence Scale for Children 


Although the sociopathic and normal chil- 
dren were matched on Wechsler Intelligence 
Scale for Children Full-Scale IQ, separate 
analyses were performed to determine if there 
were differences in verbal IQ, performance 
IQ, and subtest scores, all of which are pre- 
sented in Table 3. The normal children had a 
significantly higher verbal IQ than the socio- 
pathic children, although the tendency for the 
latter to have higher performance IQs than 
the former did not reach significance (see 
Table 3). 

Further comparisons of the subtest scale 
Scores revealed that the normal children scored 
significantly higher than the sociopathic ones 
on all verbal subtests and on two perform- 
ance subtests, coding and mazes (see Table 
3). The tendency for sociopathic children to 
score higher than the normal children on 
three performance subtests, picture comple- 
tion, block design, and object assembly, ap- 
proached significance, 

To examine the possibility that the socio- 
pathic children’s lower moral maturity scores 
were related to their poorer verbal skills, an 
analysis of covariance of the moral maturity 
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scores, with Wechsler Intelligence Scale for 
Children Verbal IQ partialled out, was per- 
formed. Consistent with the original analysis, 
significant effects for group, F (1,39) = 
54.35, p < .001, and for MA, F (1,39) = 
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TABLE 3 


MEAN WECHSLER INTELLIGENCE SCALE FOR CHILDREN 
IQ AnD SCALE SCORES FOR EACH GROUP 


4.30, p < .05, were found with no significant 
interaction. These findings indicated that 
moral judgment scores of normal children 
were higher than sociopathic children even 
when the effects of differential verbal abilities 
were controlled. In addition, the high-MA 
children in both diagnostic groups had higher 
moral maturity scores than the low-MA chil- 
dren when scores were adjusted for verbal IQ. 


DI$CUSSION 


The major finding of the present study was 
that the moral reasoning of sociopathic chil- 
dren was less mature than that of MA- 
matched normal children, at two MA levels. 
While the sociopathic group demonstrated 
solidly preconyentional reasoning at stage 
2(1), the normal group reached a transitional 
stage, 2(3), between preconventional and con- 
ventional levels of moral development. These 
differences in level of moral judgment cannot 
be attributed to differences in general level of 
cognitive development (CA or MA), IQ, or 
social class since these variables were con- 
trolled across diagnostic groups. T hus, while 
moral judgment was found to be moderately 
related to developmental level, the more 
striking finding was that certain factors asso- 
ciated with sociopathy attenuated the per- 
formance of the sociopathic group to below 
that expected on the basis of their cognitive 
level. 

There is some evidence to suggest that 
such group differences may be related to dif- 
ferential opportunities for identification and 
role-taking. Developmentalists have empha- 
sized the general importance of these experi- 
ences to moral development (Kohlberg, 1958, 
1964; Kohlberg & Kramer, 1969; Piaget, 
1932), although their specific impact has not 
been fully elucidated. Kohlberg’s (1958) data 
suggest, however, that familial and environ- 
mental conditions which restrict role-taking 
opportunities and discourage identification are 
associated with lower terminal levels of moral 
development. It has also been found that 
identification and role-taking are less likely to 


Measure Normal | Sociopathic t 
Full scale IQ 100.87 97.48 -86* 
Verbal IQ 102.52 O23" ELTI 
Performance IQ 102.05 105.33 his 
Verbal subtests 
Information 9.52 8.05 | 1.92*** 
Comprehension 9.09 TST «\230"* 
Arithmetic 9.61 8.24 |2.00*** 
Similarities 11.31 9.57 | 2.02*** 
Vocabulary 10.26 8.76 |2.01*** 
Digit span 10.26 8.10 | 3.04" 
Performance subtests 
Picture completion 9.61 10.76 | 1.46** 
Picture arrangement 9.48 10.24 .88* 
Block design 9.74 POSH 1238" 
Object assembly 10.39 11.66) ia EK pha 
Coding 12.26 10.19 V (i yaa 
9.87 1190415 


occur under conditions of a distant, nonre- 
warding model (Bandura & Huston, 1961; 
Bandura & Walters, 1963) and lack of pa- 
rental affection (Sears, 1953). The records of 
the sociopathic children in the present study 
revealed many of these conditions. In dealing 
with their child, parents were frequently de- 
scribed as quite inconsistent, vacillating con- 
tinually between total permissiveness and 
harsh restrictiveness. In many cases, the his- 
tories indicated that the sociopathic child’s 
needs and desires were ignored, actively frus- 
trated, or arbitrarily overindulged. Finally 
there were indications that the sociopathic 
children frequently were the focus of parents’ 
displaced hostility and acting out of their 
own personal difficulties. These observations 
are consistent with those of Robins (1966) 
who found similar parental characteristics 
(discipline which was lax or inconsistent, no 
supervision, or overly restrictive) for children 
who met the criteria for sociopathy used in 
the present study. 

More systematic research is necessary, how- 
ever, to determine which of these possible 
antecedent conditions are related to attenu- 
ated moral development and to what extent 
differential opportunities for role-taking and 
identification are the mediating variables criti- 
cal to such development. Direct comparisons 
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of MA-matched sociopathic and normal chil- 
dren on specific measures of identification and 
role-taking should be performed to determine 
whether sociopathic children are, in fact, less 
able to identify with others or to assume roles 
other than their own. The relationship be- 
tween the hypothesized parental character- 
istics and child-rearing practices to moral de- 
velopment, as well as to identification, should 
also be examined more directly. 

The present study also revealed group dif- 
ferences on the Wechsler Intelligence Scale 
for Children subtests. Sociopathic children 
scored lower than normal children on all six 
verbal subtests, reflecting the former’s gen- 
erally weaker skills in such areas as verbal 
concept formation, social judgment, abstract 
number concepts, and fund of general infor- 
mation. It could be argued that such differ- 
ences accounted for the obtained group differ- 
ences in moral reasoning since the latter were 
based on verbal data. The findings do not 
support this interpretation, however, since 
the normal children demonstrated significantly 
more mature moral judgment than the socio- 
pathic children even when the contribution of 
verbal IQ was controlled, This suggests that 
factors other than differential verbal skills 
were involved, 

On the nonverbal subtests, the poorer per- 
formance of the sociopathic children on cod- 
ing and mazes is consistent with findings in- 
dicating their deficiency on tasks requiring 
impulse control, sustained effort and attention 
(Bender, 1947; Gibson, 1964), the anticipa- 
tion of consequences (Snortum, 1968), and 
the ability to plan accordingly (Wirt & 
Briggs, 1960). 


circumscribed test responses have little or no 
relation to behavior in the real world, While 
validation studies typically seek to predict 
patterns of behavior from test responses, the 
direction of prediction was reversed in the 
present study. Clear-cut, well-defined þe- 
haviors, labeled sociopathic, predicted overall 
level of performance on Kohlberg’s moral 
development interview for children at pre- 
conventional and transitional levels of moral 
reasoning, This finding suggests that person- 


ality constructs can be successfully opera- 
tionalized in terms of precise statements of 
related behaviors, providing additional ya- 
lidity for these constructs. 
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Reciprocal Concessions Procedure for Inducing Compliance: | 
The Door-in-the-Face Technique 
Robert B. Cialdini, Joyce E. Vincent, Stephen K. Lewis, José Catalan, ] 


Diane Wheeler, and Betty Lee Darby 
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for an extreme favor (which was refused to him) and then for a smaller 
favor. In each instance, 
smaller favor than a procedure in which the requester asked solely for the 
smaller favor. Additional i 

hypothesis that the effect i 


The foot-in-the-door technique has been in- 
vestigated by Freedman and Fraser (1966) 
as a procedure for inducing compliance with a 
request for a favor. They demonstrated that 
obtaining a person’s compliance with a small 
request substantially increases the likelihood 
of that person’s compliance with a subse- 
quent, larger request. Freedman and Fraser 
Suggest that the mediator of the foot-in-the- 
door effect is a shift in the self-perception of 
the benefactor, After performing or agreeing 
to perform an initial favor, a person “may 
become, in his own eyes, the kind of person 
who does this sort of thing, who agrees to 
requests made by strangers, who takes action 
on things he believes in, who cooperates with 
good causes. . . . The basic idea is that the 
change in attitude need not be toward any 
particular person or activity, but may be 
toward activity or compliance in general.” 
Thus, one effective way to obtain a favor is to 
begin by making a minimal first request which 
is sure to produce compliance and then to 
advance to a larger favor (the one which was 
desired from the outset). It may well be, how- 
ever, that an equally effective method for 
getting a favor done involves the exact oppo- 
site procedure. What would be the result of 
making an extreme first request which is sure 


Reprint requests should be sent to Robert Cialdini, 
Department of Psychology, Arizona State Univer- 
sity, Tempe, Arizona 85281. 
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to be rejected and then asking for a more 
moderate second favor (the one which was 
desired from the outset)? There are two lines 
of evidence suggesting that such a technique 
would be efficacious in producing compliance 
with the second request. 

The first sort of evidence comes from work 
investigating the concept of reciprocation. 
Gouldner (1960) maintains that a norm of 
reciprocity exists in all societies, Gouldner 
states the norm of reciprocity in its simple 
form as: “You should give benefits to those 
who give you benefits.” (p. 170) There is 
considerable experimental evidence attesting 
to the workings of such a rule in our culture 
(eg., Brehm & Cole, 1966; Goranson & 
Berkowitz, 1966; Pruitt, 1968; Regan, 1971; 
Wilke & Lanzetta, 1970). In each case, re- 
ceipt of a favor has been shown to increase 
the likelihood that the favor will be returned, 
although not necessarily in kind. While Gould- 
ner (1960) speaks of the norm of reciprocity 
almost exclusively in terms of the reciproca- 
tion of benefits and services, it seems likely 
that a norm for reciprocity governs other 
types of social exchange also. Specifically, we 
would like to postulate a reciprocal conces- 
sions corollary to the general norm of reci- 
procity: “You should make concessions to 
those who make concessions to you.” Such a 
rule can be seen as having an important soci- 
etal function. Very often in social interaction 
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participants begin with requirements and de- 
mands which are unacceptable to one an- 
other. In order for the interaction to continue 
and hence for common goals to be achieved, 
compromise must be struck. Mutual conces- 
sion is crucial, If there is no implicit prescrip- 
tion that retreat from an initial position by 
one participant should be reciprocated by the 
other participant, then it is unlikely that 
compromise attempts would be initiated and, 
consequently, that the interaction would con- 
tinue. However, given a principle for recipro- 
cation of concessions, an interaction partici- 
pant could instigate compromise attempts 
with little fear of exploitation by his partner. 

Evidence for the existence of a reciprocal 
concessions relationship in our society can be 
seen in numerous, terms and phrases of the 
language: “give and take,” “meeting the 
other fellow halfway,” etc. Much more com- 
pelling, however, are the data which come 
from a number of studies of negotiation be- 
havior. An experiment by Chertkoff and Con- 
ley (1967) demonstrated that the number of 
concessions a subject makes in a bargaining 
situation is significantly affected by the num- 
ber of his opponent’s concessions; more fre- 
quent concessions by the opponent elicited 
more frequent concessions from the subject. 
In a somewhat similar context, Komorita and 
Brenner (1968) had subjects bargain as buy- 
ers against opponent-sellers. In one condition, 
the opponent initially proposed what was a 
perfectly equitable selling price and refused 
to move from that price throughout the course 
of the negotiations; in other conditions, the 
opponent began with an extreme offer and 
then gradually retreated from that price as 
bargaining progressed. The consistent result 
was that the former condition elicited the 
least amount of yielding on the part of the 
subjects. Komorita and Brenner conclude that, 
“in a bargaining situation, if one party wishes 
to reach an agreement at a ‘fair’ price, clearly 
a strategy of making an initial offer at that 
level and remaining firm thereafter is not an 
effective means of reaching an agreement.” 
(p. 18) Finally, an experiment by Benton, 
Kelley, and Liebling (1972) had subjects 
negotiate the allocation of funds with a pre- 
programmed opponent ina mixed-motive game. 
One condition of the experiment saw sub- 
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jects faced with an opponent who repeatedly 
made an extreme demand during the first two 
minutes of the bargaining session and who 
then reduced this demand during the next two 
minutes. The number of subjects’ own extreme 
demands was drastically reduced by this 
strategy. In contrast, another condition, in 
which the opponent remained intransigently 
extreme, produced almost no reduction in the 
number of extreme subject demands during 
this second two-minute period. In sum, it 
seems that the likelihood of a concession by 
one party is positively related to the occur- 
rence of a concession by another party. 

Let us now return to the original question, 
“How might we enhance the probability 
that another will comply with our request for 
a favor?” The analysis above suggests that if 
we were to begin by asking for an extreme 
favor which was sure to be refused by the 
other, and then we were to move to a smaller 
request, the other would feel a normative 
strain to match our concession with one of 
his own. Since the situation is such that the 
other’s response to our request involves an 
essentially dichotomous choice—yes or no— 
the only available reciprocation route for him 
would be to move from his position of initial 
noncompliance to one of compliance. So, by 
means of an illusory retreat from our initial 
position, we should be able to obtain another’s 
agreement to the request that we desired from 
the outset. 

In line with the formulation we have pro- 
posed, two things are crucial to the success of 
such a procedure. First, our original request 
must be rejected by the target person; once 
this has occurred, the target will have taken 
a position and an apparent concession on our 
part will pressure him to meet us halfway 
and hence to yield to our smaller request. Sec- 
ond, the target must perceive that we have 
conceded in some way. Thus, the size of our 
second favor must be unambiguously smaller 
than that of the first; only then can the action 
of a reciprocal concessions norm come into 
play. 


EXPERIMENT 1 


In order to test the effectiveness of this 
procedure for inducing compliance, an experi- 
ment was conducted. It was expected that a 
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person who followed a refused initial request 
with a smaller request would obtain more 
agreement to the smaller request than a per- 
son who made only the smaller request. Such 
a result could be explained, however, in a way 
quite apart from the theoretical account we 
have proposed. Rather than through the ac- 
tion of a reciprocal concessions mechanism, 
the superiority of the technique we have 
described could be seen as occurring through 
the action of a contrast effect. Exposure to 
an initial, large request could cause subjects 
to perceive a subsequent, smaller request as 
less demanding than would subjects who had 
never been exposed to the large request; con- 
sequently, the former type of subject might be 
expected to comply more with the critical re- 
quest. It was necessary, therefore, to include 
in our experimental design a condition which 
differentiated these two theoretical explana- 
tions, 

One point of departure for the two accounts 
lies in the requirement of the reciprocal con- 
cessions explanation for the target’s refusal 
of and the requester’s moderation of the ini- 
tial, larger favor. The contrast effect expla- 
nation does not demand this sequence of 
refusal and moderation; rather, it requires 
only that the target person be previously 
exposed to the larger request. An experiment 
was performed, then, which included three 
conditions. In one condition, subjects were 
asked to perform a favor. In a second condi- 
tion, subjects were asked to perform the criti- 
cal favor after they had refused to perform a 
larger favor. In a final condition, subjects 
heard the larger favor described to them 
before they were asked to perform the critical 
one. 


Method 
Subjects 


Subjects were 72 people of both sexes who were 
moving along university walkways during daylight 
hours. Only those individuals who were walking 
alone were selected, and no subjects were selected 
during the 10-minute break period between classes. 


Procedure 


A subject meeting the conditions above was 
approached by a student-experimenter 1 who initiated 


*The experimenters were three college age stu- 
dents, one female and two male. Experimenters ap- 
proached only subjects of the same sex as themselves. 
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interaction by introducing him- o being 
with the County Youth Counseling \t this 
point, the experimenter made (for Soun- 
seling Program) either an extreme lowed 
by a smaller request or made maller 
request. 

The extreme request asked subjec ym as 
counselors to juvenile delinquents fc of a 


least two years. Specifically, the c 


We're currently recruiting univers to 
work as voluntary, nonpaid cour he 
County Juvenile Detention Center ion 
could require two hours of your 
for a minimum of two years. \ yC 
working more in the line of 

(Sister) to one of the boys (gir 

tention home. Would you be inter 

considered for one of these positi: 


The smaller request asked subjects 18 
chaperones for a group of juvenile d à 
two-hour trip to the zoo. Specific 
menter said: 


We're recruiting university student 

a group of boys (girls) from the 

nile Detention Center onoa trip t 
would be voluntary, nonpaid, and 

about two hours of one afternoon 
Would you be interested in being conside 
one of these positions? 


Subjects were randomly assigned { three 
conditions. 

Rejection-moderation condition. 
condition heard the experimenter 1 
extreme request. After subjects refu urge 
request, the experimenter said, “Weli, w: have 
another program you might be interested in then.” 
At this point the experimenter made the smaller 
request. 

Smaller request only control. Subjects in this con- 
dition were asked by the experimenter only to per- 
form the smaller request. 

Exposure control. In this condition the experi- 
menter first described the extreme and then the 
smaller favor and requested that the subjects per- 
form either one. Specifically, subjects in the exposure 
only control heard the experimenter give the standard 
introduction and then say: 


We're currently recruiting unive lents 
for two different programs. In t} 
looking for voluntary, nonpaid c 
work at the County Juvenile Dete 
The position would require two h 
time per week for a minimum of tw 
would be working more in the ji 
Brother (Sister) to one of the boy 
the detention center. In the other 
we're looking for university students } 
erone a group of boys (girls) from the dete 
tion center on a trip to the zoo. It 

be voluntary, nonpaid, and would 
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hours of one afternoon or evening. Would you 
be interested in being considered for either of 
these two programs? 


No subject during the course of the experiment 
ever agreed to perform the initial, large favor. How- 
ever, when a subject agreed to the smaller request, 
the experimenter took his or her name and phone 
number. The experimenter promised to call if the 
subject was needed but explained that “there is a 
chance that you won’t be called because of the large 
number of people who have already volunteered to 
help.” At this point, the experimenter thanked the 
subject and moved on. 


Predictions 


Two predictions derived from the reciprocal con- 
cessions model were made. First, it was expected 
that the subjects in the rejection-moderation condi- 
tion would comply with the smaller request more 
than would subjects in the two control conditions. 
Second, it was predicted that the amount of compli- 
ance with the smaller request would not differ be- 
tween the two controls. 


Results 


No subject in the present experiment agreed 
to perform the extreme favor. The percentage 
of subjects who complied with the smaller re- 
quest in each of the treatment conditions can 
be seen in Table 1. 

Planned orthogonal contrasts designed to 
test the two experimental predictions were 
performed on the data. The first contrast, 
comparing the compliance rates of the two 
control groups, found no difference, x? = .50, 
ns. The second contrast tested the combined 
control conditions against the rejection-mod- 
eration condition; this analysis produced a 
highly significant difference, xX = 6.42, p= 
O11. All tests in this and subsequent experi- 
ments are two-tailed. 

Additional analyses investigating the ex- 
tent to which the pattern of results above was 
affected by such factors as the sex of the sub- 
ject and the identity of the experimenter pro- 
vided no statistic which approached conven- 
tional levels of significance; the same pattern 
obtained for all three experimenters and for 
male and female subjects. In all, then, it 
seems that the only factor which enhanced the 
amount of agreement to the smaller request 
was the procedure of moving to the smaller 
request after the larger request had been re- 


fused. 
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TABLE 1 


PERCENTAGE OF SUBJECTS COMPLYING WITH 
THE SMALLER REQUEST 


Treatment % Compliance 
Rejection-moderation condition 50.0 
Exposure control 25.0 
Smaller request only control 16.7 


Noile. The n for each condition = 24, 


Discussion 


It is clear from the findings above that 
making an extreme initial request which is 
sure to be rejected and then moving to a 
smaller request significantly increases the 
probability of a target person’s agreement to 
the second request. Moreover, this phenom- 
enon does not seem mediated by a perceptual 
contrast effect; simply exposing the target to 
the extreme request beforehand does not af- 
fect compliance. 

While the results of this first experiment 
lend some support to the reciprocal conces- 
sions explanation, they do not, of course, 
necessarily confirm the validity of the inter- 
pretation. If we are to gain confidence in such 
a model, additional predictions derivable from 
it must be proposed and demonstrated. To 
this end, it was decided to replicate and ex- 
tend our findings in a second experiment. 


EXPERIMENT 2 


The reciprocal concessions formulation we 
have described suggests that a target person 
feels pressure to change from his initial posi- 
tion of noncompliance after it is seen that the 
requester has changed from his own initial 
position. It is not enough that the target has 
been asked to comply with a large then a 
smaller request, the target must perceive the 
request for the smaller favor as a concession 
by the requester. If this is in fact the case, a 
target person who is asked an extreme favor 
by one individual and a smaller favor by some 
other individual in a second interaction con- 
text should not experience a reciprocation- 
mediated tendency to agree to the smaller 
request. The second requester should not be 
perceived as conceding and thus, according to 
our model, the target should not be spurred 
to reciprocate via compliance. On the other 
hand, if, as in Experiment 1, the requests are 
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made by the same person, compliance with 
the smaller request should be enhanced. 

To test the importance of the perception of 
concession, an experiment was conducted 
which included three conditions. In one con- 
dition, subjects were asked to perform a favor 
by a single requester. In a second condition, 
subjects were asked by a single requester to 
perform the critical favor after they had 
refused to perform a larger favor for that 
requester. In the third condition, subjects 
were asked to perform the critical favor by 
one requester after they had refused to per- 
form a larger favor for a different requester. 
An additional benefit of this third condition 
was that it afforded another test of the per- 
ceptual contrast explanation for the obtained 
effect and thus provided a conceptual replica- 
tion of one aspect of Experiment 1. 


Method 
Subjects 


Subjects were 58 males who were selected for par- 
ticipation in a fashion identical to that of Experiment 
aS 


Procedure 


A subject meeting the conditions above was 
approached by two student-experimenters, one male 
and one female; we call them Experimenters A and 
B, respectively. Experimenter A initiated interaction 
by introducing both himself and Experimenter B to 
the subject. At this point, a second male experimenter 
(Experimenter C) who was apparently an acquain- 
tance of Experimenter B, approached the group and 
engaged Experimenter B in conversation about an 
upcoming exam they both would be taking. This 
procedure uniformly distracted the subject’s atten- 
tion for a second, so Experimenter A waited for the 
subject to turn back to him. Here the three treat- 
ment conditions of the study differed. 

Rejection-moderation condition. Subjects in this 
condition next heard Experimenter A ask for the 
extreme favor. The extreme favor was the same as 
that used in Experiment 1. After the subject had 
refused to comply, Experimenter A made the smaller 
request, which in this experiment asked subjects to 

chaperone a group of “low-income children” to the 
zoo. Specifically, he said: 


Oh. Well. I’m also with the Campus Volunteer 
Service Organization in another program that 
has nothing to do with the Juvenile Detention 
Center. It involves helping to chaperone a group 
of low-income children on a trip to the zoo. We 
can’t give you any money for it, but it would 
only involve about two hours of one afternoon 
or evening. Would you be willing to help us 
with this? 
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Two requester control. The procedures of this con- 
dition were similar to those of the rejection-mod 
eration condition except that, upon refusal of the 
extreme request, Experimenter A thanked the subject 
and walked away from the group with Experimenter 
B; this left Experimenter C alone with the subject. 
At this point, Experimenter C made the smaller 
request. He prefaced the request by saying, 


Excuse me, I couldn’t help overhearing you say 
that you would not be able to be a counselor 
to juvenile delinquents for two years. [If a 
subject had given a reason for refusing the ex- 
treme request, Experimenter C mentioned that 
he had overheard the stated reason as well.?] 
But maybe you can help me. My name is __, 
and I’m with the Campus Volunteer Service 
Organization in a program that has nothing to 
do with the Juvenile Detention Center. [The 
remainder of the request was identical to that 
made in the rejection-moderation condition.] 


Smaller request only control. The procedures of 
this condition were similar to those of the rejection- 
moderation condition except that the extreme re- 
quest was not made. The events in this condition 
were as follow: Experimenters A and B approached 
the subject; Experimenter A introduced himself and 
Experimenter B; Experimenter C joined the group 
and engaged Experimenter B in conversation; Ex- 
perimenter A made the smaller request. It should be 
noted that in this and both other conditions the 
roles of Experimenter A and Experimenter C were 
alternated between the two male experimenters of 
the study. 


Predictions 


The predictions of the present experiment were 
similar to those of Experiment 1. It was expected, 
first, that the two control conditions would not 
differ from one another in amount of compliance 
with the smaller request. Second, it was thought that 
the rejection-moderation condition would produce 
more compliance with the smaller request than 
would the controls. 

The experimenters in this instance were not 
aware of the nature of these predictions; in fact, 
they were led by the principal investigator to expect 
opposite results. As in Experiment 1, the experi- 
menters were undergraduate research assistants, Be- 
cause of evidence indicating that undergraduate ex- 
perimenters have in the past produced results con- 
sistent with prediction via experimenter expectancy 
effects (Rosenthal, 1966) or conscious data fixing 
(Azrin, Holz, Ulrich, & Goldiamond, 1961), a test 


2A replication of Experiment 2 was subsequently 
performed by the authors. The only difference be- 
tween the original and replicated versions was that 
in the replication Experimenter C’s performance in 
the two requester control did not include a claim 
that he had overheard the target’s conversation with 
Experimenter A. The data of the two versions of 
Experiment 2 were virtually identical. 


aa 
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of such explanations for the obtained effect in Ex- 
periment 1 seemed in order. Hence, the experimenters 
of Experiment 2 were told that the principal inves- 
tigator was predicting that the smaller request only 
control would produce the most compliance. This 
would supposedly be so because of an “irritation or 
reactance tendency in people who have been asked 
for favors twice in succession.” If the pattern of re- 
sults nonetheless appeared as predicted by the recip- 
rocal concession formulation, experimenter bias could 
no longer be offered as a possible explanation for 
the superiority of the rejection-moderation condition. 


Results 


Three subjects in Experiment 2 complied 
with the extreme request, two in the rejection- 
moderation condition and the other in the two 
requester control. These subjects were re- 
moved from the analysis and replaced by three 
other subjects. The percentage of subjects 
who complied with the smaller request in each 
of the treatment conditions of Experiment 2 
can be seen in Table 2. 

Again, a priori orthogonal contrasts were 
used to test the experimental predictions. One 
contrast compared the amounts of compliance 
with the smaller request within the two con- 
trol conditions; no conventionally significant 
difference occurred, x? = 2.53, P= 111. The 
other comparison, which tested the rejection- 
moderation condition against the combined 
control conditions, did produce a clearly sig- 
nificant difference at conventional levels, x” 
= 6.85, p = .009. 


Discussion 


It appears from the results of Experiment 
2 that the target’s perception of concession 
by the requester is a crucial factor in produc- 
ing compliance with the smaller request. Only 
when the extreme and the smaller favors were 
asked by the same requester was compliance 
enhanced. This finding provides further evi- 
dence for a reciprocal concessions mediator of 
the rejection-then-moderation effect. It seems 
that our subjects increased the frequency of 
assent to the smaller request only in response 
to what could be interpreted as concession 


3 Tt was necessary to discard the data of the origi- 
nal three subjects because of the likelihood that 
their responses to the second request would be medi- 
ated by a foot-in-the-door effect rather than a recip- 
rocal concessions effect; thus our results would have 
been artificially inflated in the direction of predic- 


tion. 


TABLE 2 
PERCENTAGE OF SUBJECTS COMPLYING WITH THE 
SMALLER REQUEST IN EXPERIMENT 2 


Treatment % Compliance 
Rejection-moderation condition 55.5 
Two requester control 10,5 
Smaller request only control 31.5 


Note. The n for the rejection-moderation condition = 20; the 
n for each of the two control conditions = 19. 


behavior on the part of the requester; such 
assent, then, would seem best viewed as re- 
ciprocal concession behavior. 

It might be noted that compliance in the 
two requester control was inhibited relative to 
that in the small request only control. This 
finding replicates quite closely a result ob- 
tained by Snyder and Cunningham (1975) 
and fits very well with evidence suggesting 
that in most cases, people are quite consistent 
in their responses to requests for favors 
(Freedman & Fraser, 1966; Snyder & Cun- 
ningham, 1975). Unless there was a pres- 
sure to reciprocate a concession, 89.5% of the 
subjects in our experiment who said, “No” to 
an initial request said, “No” to a subsequent 
one. 


EXPERIMENT 3 


While the data of Experiments 1 and 2 are 
wholly consistent with the reciprocal conces- 
sions formulation, an alternative explanation 
for these results is applicable as well. It may 
have been that the heightened compliance in 
our rejection-moderation conditions was due 
to the fact that only in these conditions did 
one requester persist in making a second re- 
quest after his first had been refused. Perhaps 
subjects in these conditions acquiesced to the 
critical, zoo trip request not because of pres- 
sure to reciprocate a concession but because 
they were dunned into accession by a tena- 
cious requester or because they wanted to 
avoid the requester’s perception of them as 
having a generally antisocial or unhelpful 
nature. 

In order to test this type of explanation, a 
third experiment was performed. Included in 
Experiment 3 was a procedure in which sub- 
jects were asked to perform an initial favor 
and then were asked by the same requester to 
perform a second favor (the critical request) 


to 
= 
w 


TABLE 3 


PERCENTAGE OF SUBJECTS COMPLYING WITH THE 
SMALLER REQUEST IN EXPERIMENT 3 


| % Complaince 


Treatment 
Rejection-moderation condition 54.1 
Equivalent request control 33.3 
Smaller request only control 33.3 


Note, The n for each condition = 24, 


of equivalent size. Since the proposal of an 
equivalent second favor does not constitute a 
concession on the part of the requester, the 
reciprocal concessions model would predict no 
increased compliance with the critical re- 
quest from this procedure. However, if the 
persistance of a single requester is the medi- 
ator of enhanced compliance, then such a 
procedure should produce heightened agree- 
ment to perform the critical request. A second 
function of Experiment 3 was to provide a 
conceptual replication of Experiment 2. As in 
Experiment 2, one group of subjects received 
two requests but should not have construed 
the second request as a concession on the 
part of the person who made it. In Experi- 
ment 2, the perception of concession was 
avoided by having a second requester make 
the smaller, critical request; in Experiment 
3, it was done by making the initial request 
equivalent in size to the critical one. For both 
procedures, the results should be similar—no 
enhancement of compliance. 


Method 
Subjects 
Subjects were 72 people of both sexes who were 


selected for participation in a fashion identical to 
that of Experiments 1 and 2. 


Procedure 


A subject meeting the conditions above was ap- 
proached by a student-experimenter in a fashion 
identical to that of Experiment 1.4 Subjects were 
randomly assigned to one of three conditions. 

Rejection-moderation condition, Subjects in this 
condition were treated identically to subjects in the 
comparable condition of Experiment 1; that is, after 
hearing and rejecting an extreme request (to per- 
form as a counselor to a juvenile delinquent for a 
minimum of two years), a subject heard the same 


4%n the present experiment there were four ex- 
perimenters, three female and one male. Experi- 
menters approached only subjects of the same sex as 
themselves. 
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requester make a smaller request (to perform as a 
chaperone for a group of juvenile delinquents on a 
two-hour trip to the zoo). 

Smaller request only control. Subjects in this con- 
dition were treated identically to subjects in the 
comparable condition of Experiment 1; that is, a 
subject heard the requester make only the smaller 
request to chaperone a group of juvenile delinquents 
on a trip to the zoo. 

Equivalent request control. Subjects in this condi- 
tion heard a requester initially request that they 
perform as chaperones for a group of juvenile de- 
linquents on a two-hour trip to the city museum; 
after the subjects responded to this first request, the 
experimenter then requested that they chaperone a 
group of juvenile delinquents on a two-hour trip to 
the zoo. 


Predictions 


As in the previous experiments, it was predicted on 
the basis of the reciprocal concessions model that, 
first, the two control conditions would not differ 
from one another in amount of compliance with the 
critical request (the zoo trip) and, second, that the 
rejection-moderation condition would produce more 
compliance with the critical request than would the 
controls. 


Results 


No subject in Experiment 3 complied with 
the extreme request in the rejection-modera- 
tion condition. However, eight subjects com- 
plied with the initial request in the equivalent 
request control. The percentage of subjects 
who complied with the cirtical request in each 
of the treatment conditions of Experiment 3 
can be seen in Table 3. 

As before, two planned orthogonal com- 
parisons were used to test the predictions. The 
first contrasted the two control conditions; no 
significant difference resulted. x? = 0.0, ns. 
The second tested the rejection-moderation 
condition against the combined controls; a 
marginally significant difference occurred, y? 
= 2.88, p = .091. Two features of the data 
from this experiment argue against the in- 
terpretation that a requester’s persistence in 
making requests accounts for the superiority 
of the rejection-moderation condition. First, 
the equivalent request control, which involved 
successive requests from the same requester, 
produced exactly the same amount of compli- 
ance as the smaller request only control. Sec- « 


- ond, of the eight subjects who agreed to per- 


form the critical request in the equivalent 
request control, only one had refused to per- 
form the similar-sized initial request. Clearly, 
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then, it is not the case that a persistent re- 
quester induces compliance to a second re- 
quest solely through the act of making a 
second request. Indeed, in the equivalent re- 
quest control, subjects were stoutly consistent 
in the nature of their responses to the two 
requests. Twenty-two of the 24 subjects in 
that group responded similarly to both re- 
quests. 


GENERAL DISCUSSION 


Taken together, the findings of Experiments 
1, 2, and 3 seem to support the reciprocal 
concessions model. Each experiment indicated 
that proposing an extreme request which is 
rejected and then moving to a smaller re- 
quest increases compliance with the smaller 
request. The results of Experiment 1 sug- 
gested that the target person’s rejection of the 
initial, extreme request is crucial to the effec- 
tiveness of this technique. Through his re- 
` fusal to perform the. large favor, the target 
puts himself in a position from which virtu- 
ally his only possible retreat is accession to 
the smaller request. Thus when the requester 
moves from his extreme proposal to a smaller 
“one, the target must agree to the second pro- 
. posal in order to relieve any felt pressure for 
$ reciprocation of concessions. As was shown in 
_Experiment 1, if movement to a smaller re- 
: quest occurs without -the target’s initial re- 
_ jection of the extreme request, compliance 
with the smaller request will not be signifi- 
cantly enhanced. Experiment 1 demonstrated 
further, as did Experiment 2, that merely 
exposing a target person to an extreme request 
does not increase the likelihood of his com- 
pliance with a subsequent smaller request; 
such results tend to disconfirm a perceptual 
contrast explanation of the phenomenon. Ex- 
periments 2 and 3 demonstrated the impor- 
tance of COncession. Simply presenting a tar- 
get person with a smaller request after he had 
rejected a larger one or simply presenting a 
target person with.a second request of equiva- 
lent size, does not increase agreement to the 
second request. Onl, when the proposal of 
the second favor can be considered a conces- 
sion on the part of the requester is compliance 


increased. 
í the phenomenon we have 


Several aspects 0 e 
investigated suggest that its use would be 
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highly functional for someone in need of a 
favor. First, it is clear that the effect is quite 
a powerful one for inducing compliance. 
Averaging over all three studies and compar- 
ing against the small request only control 
conditions, we were able to double the likeli- 
hood of compliance through the use of the 
rejection-then-moderation procedure. The 
strength of this procedure is further evidenced 
when it is realized that it is working in a 
direction counter to any tendency for the 
target to be consistent in his responses to re- 
quests for favors. It should be remembered 
that Freedman and Fraser (1966) found such 
a tendency for consistency to be a potent one 
in their foot-in-the-door study, and we found 
a similar tendency in the two requester con- 
trol of Experiment 2 and the equivalent re- 
quest control of Experiment 3. Seemingly, 
then, the size of the effect is such that it over- 
whelmed a strong propensity in our subjects 
for constancy in their reactions to compliance 
requests. 

Second, the technique does not limit a re- 
quester to the receipt of small favors. It is 
only necessary that the critical request be 
smaller than the initial one for a reciprocal 
concessions mechanism to come into play. 
Evidence that a requester can use this tech- 
nique to gain assent to a substantial request 
can be seen in the data of Experiment 1. The 
smaller request in that study might well be 
seen, objectively, as an extreme one in itself; 
it asked subjects to be responsible for an un- 
specified number of juvenile delinquents of 
unspecified age in a public place for a period 
of two hours outdoors in winter.’ Only 16.7% 
of our population was willing to agree to such 
a request when it was the only one made. Yet, 
the proposal of this request after the rejection 
of a still more extreme favor produced 50% 
compliance. 

Another benefit of the rejection-then-mod- 
eration procedure is that its force seems to 
derive from the existence of a social norm. 
Thus, a requester wishing to use the pro- 
cedure need have little reward or coercive 


5 Only Experiment 1 was conducted in the winter 
of the year. Experiments 2 and 3 were conducted in 
the spring or summer’ which, may account for the 
somewhat higher compliance rates in the small re- 
quest only controls of these experiments. 
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power over his target to be effective. Thibaut 
and Kelley (1959) speak of a norm in any 
two-person interaction as a third agent exer- 
cising power over each member but whose 
“influence appeal is to a supraindividual value 
(‘Do it for the group’ or ‘Do it because it’s 
good’) rather than to personal interests. ee 
(p. 129) A recognition of this kind of norma- 
tive influence in concession making may help 
explain some of the bargaining literature on 
the subject, in addition to the data of the 
present study. For instance, Pruitt and Drews 
(1969) report with some surprise their sub- 
jects’ failure to try to maximize their out- 
‘comes when faced with a bargaining opponent 
who made a large, constant concession on each 
game trial. Even though this sort of opponent 
was perceived as significantly weaker and less 
demanding than one who made constant but 
small concessions on each trial, no advantage 
was taken of the vulnerable opponent. Every- 
time an opponent made a standard concession, 
no matter what the size, a subject responded 
with a standard concession of his own. Pruitt 
and Drews admit to being mystified by the 
lack of “rationality” on the part of their sub- 
jects and describe “them as ‘automatons’ tun- 
ing out external stimuli and new ideas, and 
moving mechanically a standard distance from 
_ the position adopted on the first trial.” (p. 57) 
" Perhaps much of the mystery can be elimi- 
nated by assuming that the subjects were 
reacting to the pressures of a norm requiring 
that regular concessions be reciprocated. 

A final advantage of a compliance induc- 
tion procedure which uses concessions in- 
volves the feelings of the target person toward 
the outcome of the interaction. Benton, Kel- 
ley, and Liebling (1972) present evidence 
suggesting that not only will someone who 
applies such a procedure be quite effective in 
obtaining favorable payoffs for himself but 
that the person to whom it is applied will feel 
more responsible for and satisfied with the 
outcome. In an allocation of resources situa- 
tion, subjects faced a bargaining opponent 
who intransigently demanded the maximum 
payoff for himself, intransigently demanded a 
moderately favorably payoff for himself, or 
retreated from the maximum payoff demand 
to the moderate payoff demand. In each con- 
dition, failure to reach an allocation agree- 
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ment resulted in a loss of all money by both 
participants. It was found that the retreat 
strategy produced the highest average earning 
for the opponent. Moreover, not only did sub- 
jects concede the greatest payoffs to an op- 
ponent using this tactic, they felt significantly 
more responsible for and satisfied with the 
outcome than did subjects faced with an in- 
transigent opponent. The results of this study 
when coupled with. those of our experiments 
suggest some intriguing implications. One who 
feels responsible for the terms of an agree- 
ment should be more likely to meet his com- 
mitments concerning that agreement. Thus, 
someone who uses concession to produce com- 
pliance with a request for a favor is likely to 
see the favor actually performed. Second, one 
who feels fairly satisfied with the outcome of 
an interaction with another person should be 
willing to enter into interaction with that 
person again. Thus, the target person of a 
rejection-then-moderation moderation proce- 
dure may well be vulnerable to subsequent 
requests by the same requester. In all, then, 
it appears that the rejection-then-moderation 
procedure can be an extremely valuable tech- 
nique for the elicitation of compliance. 

A note of caution should probably be inter- 
jected at this point lest we make too much 
of the potential implications of the present 
findings. It is the case that the rejection-then- 
moderation procedure has been shown to work 
under a fairly limited set of conditions. The 
extent to which the effect is generalizable to 
other contexts and situations remains to be ` 
seen. For example, we have tested the effec- 
tiveness of the procedure only in situations in 
which the interaction was face-to-face, the 
interactants were of the same sex, and the 
requests were prosocial in nature. Moreover, it 
would be well to remember that, while the 
present research appears to support a recip- 
rocal concessions interpretation of the effect, 
it in no way ultimately confirms that inter- 
pretation. Other explanations may exist which 
account completely for the data of this study; 
and to the extent that they do exist they 
should be tested in subsequent work, A 
; Future research on the reciprocal conces- 
A procedure might also profitably investi- 
i tee a a ee 

cession by a 


requester was operationalized as moderation 
from a large request to a smaller one. In- 
-volved in such moderation, however, are two 
separate components: the target will no doubt 
‘perceive the move from ‘the large to the 
2 smaller request as more desirable ‘for himself 
-but less desirable for the requester and his 
a -ecause. While these two aspects of concession~ 
j D usually occur together, there is no good rea- 
2 \ fos fo assumé that both are necessary for the 


enhancenient of compliance. It may be the pro- 
` posal of a more desirable arrangement for the 
target—rather than the proposal of a less 
desirable arrangement for the requester—that 
is-the crucial, compliance-producing aspect of 
concession; or the opposite may be the case. 
Stated otherwise, a concession involves two 
normally correlated but conceptually separate 
., features: the granting of a more favorable 
.. situation to one’s interaction partner and the 
| surrendering of a more favorable position for 
= oneself. It remains for further investigation to 
determine whether the aspect of concession 
which induces compliance involves the grant- 
ing of something, the surrendering of some- 
thing, or both. 
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Endorsement of Formal Leaders: An Integrative Model 
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This experiment develops an integrative, path-analytic model for the endorse- 
ment accorded formal leaders. The model contains four independent variables 
reflecting aspects of group structure (i.e. group success-failure, the payoff 
distribution, the degree of support by others members for the leader, and the 
vulnerability of the leader). Also included are two intervening variables 
reflecting perceptual processes (attributed competence and attributed fair- 
ness), and one dependent variable (endorsement). The results indicate that 
endorsement is greater when the group’s success is high, when the payoff dis- 
tribution is flat rather than hierarchical, and when the leader is not vulnerable 
to removal from office. Other support had no significant impact on endorse- 
ment. Analyses further demonstate that the effect of success-failure on en- 
dorsement is mediated by attributed competence, while the effect of the 
payoff distributed is mediated by attributed fairness. These results suggest 


that moral and task evaluations are distinct bases of endorsement. 


Treatments of legitimacy (French & Raven, 
1959; Michener & Burt, 1974; Weber, 1947) 
have distinguished between the formal posi- 
tion occupied by a leader and the person 
occupying the leadership position. The atti- 
tudes of group members toward both position 
and person are important in establishing and 
maintaining regularized patterns of influence 
within a group. The present study investigates 
one of these attitudes, the endorsement ac- 
corded a formal leader. By definition, a lower- 
status member is said to endorse a high-status 

leader if he expresses satisfaction with the 
leader’s performance in directing the group, 
supports his use of control prerogatives, and 
wants him to continue in a position of leader- 
ship within the group. The goal of the present 
article is to develop an integrative model of 
factors determining the endorsement accorded 
formal leaders. 
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Factors Affecting Endorsement 


Previous research has identified various 
determinants of endorsement. These fall into 
two broad classes: social structural factors 
(i.e., properties of the group as a unit) and 
perceptual factors (properties of individuals 
in the group). Although prior research has in- 
vestigated both classes, no study has yet inte- 
grated them in a single theoretical framework. 

The present article simultaneously incorpo- 
rates six determinants of endorsement in an 
integrative causal model. Four of these fac- 
tors are social structural (i.e., the degree of 
collective success, the distribution of payoffs 
within the group, the degree of support for 
the leader from other members, and the 
leader’s vulnerability to removal from office), 
while the other two factors are perceptual 
(i.e., competence attributed to the leader and 
fairness attributed to the leader). In the 
present study, the perceptual variables medi- 
ate the impact of the structural variables on 
endorsement. 

Collective success—failure. Research has 
documented the impact of both success—fail- 
ure (a collective property) and attributed 
competence (a perceptual variable) on en- 
dorsement. Leaders receive greater endorse- 
ment when the group achieves its goals 
(Hollander & Julian, 1970; Suchner, 1972) 
and when they are perceived as competent on 
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group-related tasks (Hollander & Julian, 
1970; Julian, Hollander, & Regula, 1969). 

However, the interrelationship between 
these determinants is unclear. One might 
argue that collective success—failure affects 
endorsement merely because it serves as a 
basis for ascribing competence (or incompe- 
tence) to the high-status leader. To investi- 
_ Bate this issue, the present study incorporates 

both variables in a multicausal framework. 
It hypothesizes that the effect of collective 
success—failure on endorsement is mediated by 
attributed competence, such that if compe- 
tence were held statistically constant, success— 
failure would have little or no effect on 
endorsement. 

Payoff distribution. A second structural 
variable that may affect endorsement is the 
distribution of rewards within the group, This 
distribution can range from a skewed (hier- 
archical) pattern giving greater rewards to the 
leader than to the lower-status members, 
down to a “flat” distribution giving the 
members rewards equal to those of the leader. 
This variable should affect endorsement be- 
cause the leader’s reward disbursements pro- 
vide a means whereby members can infer his 
motives and his concern for the welfare of 
the group. 

Research by Hollander and Julian (1970) 
demonstrated that subjects in low-status roles 
perceive a leader as more fair and group ori- 
ented when he allocates a substantial portion 
of the group’s rewards to the members rather 
than to himself. This study also demon- 
strated that leaders interested in group mem- 
bers receive greater endorsement than those 
not interested in the members. However, pre- 
vious research has not determined whether 
the payoff distribution had a direct effect on 
endorsement, nor has it investigated the de- 
gree to which attributed fairness mediates any 
impact of payoff distribution on endorsement. 

The present study extends earlier work by 
investigating these issues. It hypothesizes that 
endorsement is lower under a highly skewed 
payoff distribution favoring the high-status 
person than under a flat distribution, and that 
this éffect (if it appears) is mediated by at- 
tributed fairness, such that payoff distribution 
has little or no effect if attributed fairness is 


held statistically constant. 
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Support for the leader from other members. 
Blau (1964) has suggested that the overall 
degree of collective support for a leader 
within a group may determine the extent to 
which a given low-status member endorses 
him. In other words, if a member believes 
other low-status members endorse the leader, 
he will endorse the leader to a greater extent 
than if he believes they oppose (or refuse to 
support) him. 

Some research substantiates this hypothe- 
sis. Michener and Tausig (1971) reported 
that low-status subjects endorsed a leader to 
a greater extent when they believed that 
another low-status member endorsed him. 
Michener and Lyons (1972) demonstrated 
that a high level of support for a formal 
leader from other members suppresses a sub- 
ject’s inclination to overturn the group’s 
status structure. 

Nevertheless, the processes that mediate the 
impact of other members’ support on endorse- 
ment remain unclear. Some research suggests 
that low levels of other support lead to dis- 
satisfaction with the group’s goal attainment 
(and possibly with the leader’s competence) 
and hence to low endorsement (Michener & 
Tausig, 1971). Other research suggests that 
low support leads to low levels of ascribed 
fairness and, in turn, to low endorsement 
(Michener & Lyons, 1972). Choosing between 
alternative explanations is difficult because 
these studies were run with different fixed 
parameters. 

The present study seeks a clearer picture 
of the effects of other support on endorsement 
by manipulating other support, collective 
success-failure, and payoff distribution in a 
single design. Simultaneous manipulation of 
all three structural variables provides a basis 
for assessing the effects of other support, 
independent of the effects of success—failure 
and payoff distribution. 

Vulnerability. The leader’s vulnerability is 
a fourth structural variable in the present 
design. Vulnerability reflects the extent to 
which a high-status person lacks security in 
office and can be ousted if low-status members 
no longer endorse his incumbency. Prior 
studies have investigated the impact of vul- 
nerability on the leader’s liking for the group 
and concern for his own status (Berkowitz & 
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Macaulay, 1961) and on his use of social in- 
fluence (Michener & Burt, Note 1). In con- 
trast, the present study treats vulnerability 
from the viewpoint of low-status members and 
investigates whether a leader’s vulnerability 
affects the endorsement accorded him by the 
group’s members. 

Although this issue has not been studied 
previously, vulnerability may have an impor- 
tant impact on authority-subordinate rela- 
tions. Members in a group with a vulnerable 
leader may experience a heightened awareness 
of their ability to induce change by deposing 
him. If this causes them to adopt a more 
vigilant and critical stance toward his policies, 
one would predict that members accord less 
endorsement to a leader under high vulner- 
ability than under low. Given the paucity of 
previous research on this issue, the present 
study constitutes an exploratory treatment of 
vulnerability’s impact on endorsement. 

Summary of predictions. Overall, the pres- 
ent study hypothesizes that endorsement is 
higher when collective success is high, when 
the payoff distribution is flat (as opposed to 
hierarchical), when other low-status persons 
support the leader, and when the leader is not 
vulnerable to status loss. Further, attributed 
competence is expected to mediate the effects 
of collective success-failure, while attributed 
fairness is expected to mediate the effects of 
the payoff distribution. To demonstrate medi- 
ation, the present research employs a simul- 
taneous-equation causal model, rather than a 
single-equation model such as those used in 
previous research on endorsement. 


METHOD 
Experimental Procedures 


One hundred forty male undergraduates at the 
Universitly of Wisconsin participated as paid, volun- 
teer subjects. Serving in three-person groups, they 
believed that their rate of pay would depend on the 
triad’s performance. 

Upon arriving, subjects were placed in separate 
rooms and given written instructions that character- 
ized the study as an investigation of problem solving 
under varying conditions of communication. The 
instructions indicated that the experiment was struc- 
tured into two parts during which subjects would 
work on a collective task. In part one, they would 
remain in separate rooms where they could neither 
hear nor see one another; in the subsequent part, 
they would work face-to-face in one room. In actu- 
ality, the experiment terminated after part one. The 


fiction concerning the second part served as a ra- 
tionale for isolating the subjects from one another 
and facilitated the manipulation of the independent 
variables. 

The written instructions also described the group's 
task. These indicated that on each of three trials 
comprising part one, every subject would receiv: 
several cards depicting black and white geometric 
patterns. The task was to judge what proportion of 
each card’s surface was black. These individual judg 
ments were to be combined (on each trial) to yield 
a collective decision by the group. As documented 
in earlier studies (Michener & Lawler, 1971; 
Michener & Lyons, 1972; Michener & Tausig, 1971), 
the complexity of these geometric patterns made it 
difficult for subjects to estimate their own perform 
ance and permitted the experimenter to manipulate 
bogus performance feedback. 

The instructions further explained that the triad 
would be structured into a two-level status hier 
archy, with one high-status (HS) person and tw 
lower-status (LS) persons. The ostensive purpos: 
of this status differentiation was to enhance the 
group’s problem-solving efficiency by placing the 
person with the greatest skill in the leader role. To 
this end, all the subjects took a “judgment test” 
consisting of geometric patterns similar to those the 
would later encounter during the regular judgment 
trials. This test was administered in a central room, 
with subjects sitting face-to-face. Upon completion 
of the test, subjects returned to their separate rooms. 

Shortly thereafter, the experimenter gave subjects 
fictitious feedback regarding the test. Each subject 
learned that his performance placed him significant!) 
below one of the other subjects and slightly ahea 
of the third. The high performer was to be installe 
in the HS role, the subject in one of the low-statu 
roles (LS1), and the third member in the other 
low-status role (LS2). In fact, all subjects were 
placed in identical LS1 roles; no one actually 
occupied the HS role or the LS2 role. 

After installation into role, subjects read another 
set of instructions. These indicated that HS had tw 
major prerogatives. First, he was responsible {or 
combining the subjects’ individual judgments intc 
collective decisions. He could combine these indi 
vidual judgments in any way he saw fit, so th 
judgments by LS1 and LS2 were advisory opinion 
Second, HS distributed the monetary winning 
achieved by the group's performance on the collec 
tive decisions. He had complete discretion in dis 
bursing the rewards among the group’s members. 


Experimental Manipulations 


The independent variables were manipulated during 
the three identical trials that comprised what sub- 
jects believed to be part one of the experiment. 
Subjects were randomly assigned to levels on the 
four manipulated variables (collective success-failure, 
payoff distribution, support from other members for 
HS, and vulnerability of HS). In contrast to most 
experimental research, these variables were conceptu- 
alized as continuous, not as categoric, variables 


Operationally, each independent variable had nine 
possible levels (i.e., 10, 20, 30, 40, 50, 60, 70, 80, 90), 
and a table of random numbers was used to assign 
Subjects to levels on each of the four manipulated 
variables. This procedure means that the independent 
variables are uncorrelated in the (hypothetical) 
infinite population of subjects, although not neces- 
‘sarily in a sample. In virtually all cases, of course, 
the subjects comprising a given triad received dif- 
ferent treatment combinations based on the random 
assignment. 

Precautions were taken to assure that the experi- 
menter remained blind to all experimental manipula- 
tions. Independent variables were manipulated via 
written messages prepared in advance of the experi- 
mental sessions and packaged so the experimenter 
would not see the contents. 

Collective success-failure. At the beginning of each 
of the three trials, subjects made individual judg- 
ments on five cards portraying geometric figures. 
_ The experimenter collected these and ostensibly 

transferred LS1’s and LS2’s judgments to HS, who 
was to make the final group decisions. After waiting 
several minutes, the experimenter distributed bogus 
feedback regarding the group’s performance. Each 
subject received a message (called the performance 
score) which stated the following: “On this trial 
your group won cents. Scoring is ad- 
justed so an average performance wins 50 cents; 
superior performances get higher scores.” Depending 
on the experimental condition, subjects Jearned the 
group won an amount ranging from 10 cents to 90 
cents (in 10-cent increments). Subjects knew that 
a group could win a maximum of $1 on each trial, 
and this information enabled them to gauge their 
group’s success. Collective performance feedback re- 
mained constant over three trials (although slight 
perturbations were introduced to preclude suspicion 
from excessively consistent scores). 

Payoff distribution. After receiving the bogus feed- 
back on collective performance, subjects received a 
“money distribution form” indicating how the group’s 
winnings were to be divided among its members for 
that trial. This written message, ostensibly from 
HS, was transferred to the subjects by the experi- 
menter. The message stated the following: “On this 

ey trial,. our group won cents. I have 
"| decided’ to divide the money as follows: HS gets 
per cent; LS1 gets per 
cent; LS2 gets —__—— per cent.” The percentage 
of the collective winnings that HS distributed to 
LS1 and LS2 constituted the payoff variables. This 
value ranged from 10% to 90% (in 10% incre- 
ments). In all cases, the money not kept by HS 
was divided equally between LSi and LS2. For 
example, if HS gave away 40% (retaining 60% for 
himself), each LS person received 20%. 

Other support for HS. Next, LS1 and LS2 ex- 
changed opinion messages expressing the extent to 
which they supported HS's leadership. The experi- 
menter intercepted the subject’s (LS1’s) messages 
and returned prefabricated ones that manipulated 
LS2’s apparent support for HS. This manipulation 
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consisted of LS2’s answers to the following three 
questions: “To what extent are you satisfied with 
HS’s leadership of the group? To what extent do 
you endorse and support HS as leader? How much 
confidence do you have in HS’s leadership?” LS2 
answered these questions by circling the appropriate 
numbers on a scale ranging from low (1) to high 
(9). LS2’s answers varied depending on the experi- 
mental condition, and this manipulation, like the 
others included in this study, had nine levels. 

Vulnerability of HS. At the end of the second 
trial, subjects completed a brief questionnaire mea- 
suring the intervening variables—that is, HS’s compe- 
tence and fairness. At the end of the third trial, 
subjects completed a longer questionnaire that in- 
cluded manipulation checks and the measure of 
endorsement. Immediately after this longer question- 
naire, each subject completed a written form to indi- 
cate whether he wanted to retain HS in office during 
part two or preferred to remove him from office. 
The prerogative of making this decision rested with 
LS1 rather than LS2, because LS1 (the subject) 
had ostensibly demonstrated greater task-competence 
on the test than LS2. 

A decision to remove HS, however, did not auto- 
matically guarantee the leader’s overthrow. The 
instructions informed subjects that attempts to re- 
move him carried a specific probability of success. 
These probabilities constituted the manipulation of 
vulnerability, and ranged from 10% to 90% (in 
increments of 10%). Higher probabilities of success- 
fully deposing HS designated higher levels of 
vulnerability. 

This probabilistic component simulated natural 
situations, in which attempts to oust authorities 
often fail. Subjects believed that if they opted for 
removal, the experimenter would in effect draw lots 
from a hat to determine whether the ouster was 
successful. In the event of HS’s removal, LS1 and 
LS2 would jointly decide which of them would 
assume the position of leadership during part two. 
Subjects believed that the leader knew he was vul- 
nerable, but that he had not been told the exact 
probability of losing his hegemony. 


Measurement of Intervening and 
Dependent Variables 


The questionnaire completed by subjects after trial 
two contained semantic differential measures of the 
two intervening variables—competence and fairness. 
Indices were constructed by summing the scores on 
the semantic differential items (9-point scales), 
weighting each item equally. Competence was mea- 
sured by the following bipolar items: unskillful- 
skillful, incompetent-competent, bungling-proficient, 
inefficient-efficient, low ability-high ability, inept- 
masterful, inexpert-expert, incapable-capable. Fairness 
was measured by the following items: unfair—fair, in- 
equitable-equitable, selfish-generous, self-interested— 
altruistic, biased-impartial, unjust-just. 

The measure of the dependent variable (endorse- 
ment) was on the second questionnaire, administered 
after trial three. Endorsement was measured by five 
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TABLE 1 
ZERO-ÖRDER CORRELATIONS 


Variables i | 2. l 3 l 4 | ial Sess | 7 
1, Success-failure = 
2. Payoff distribution 040 — 
3, Other support —.023 —.031 — 
4. Vulnerability 004 —.029 —.013 — 
5. Competence .720 076 .067 .036 — 
6. Fairness —.032 -785 —.024 —.018 121 — 
7. Endorsement 360 452 O18 —.137 -590 .530 
M 4.99 4.61 4.90 5.26 41.14 20.63 
ISD 2.60 2.59 2.63 2.64 13.84 10.15 
questions: Payof distribution. To check this manipu- 


Consider the person occupying the position of 
leadership. How legitimate is it for him to occupy 
this position? 

How satisfied are you with the leader’s use of his 
power in arriving at group decisions? 

How willing would you be to have the person who 
served as leader this time head the group again? 
How satisfied are you with the performance of 
the leader in directing the group? 

To what extent do you support or oppose the 
leader, 


Subjects responded on 9-point scales. To construct 
the index of endorsement, responses to the five ques- 
tions were summed, A similar measure has been used 
in other studies of endorsement (Michener & Burt, 
in press; Michener & Tausig, 1971; Michener & 
Burt, Note 1). Upon completion of the second ques- 
tionnaire, subjects were fully debriefed and paid $2 
for participation. 


REsULTS 
Checks on Experimental Manipulations 


Success—failure. An item on the second 
questionnaire checked the success-failure ma- 
nipulation. Subjects were asked the following: 
“Compared with the other groups, how suc- 
cessful or unsuccessful was your group at the 
problem-solving task?” Subjects’ interpreta- 
tion of their own group’s performance cor- 
relate highly with the actual manipulation of 
success—failure, 7 = .927, p < .001. 


SUCCESS- 
FAILURE A) 
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Ficure 1, Basic causal model. 


lation, an item on the second questionnaire 
asked: “In your opinion, how prone was HS 
to distribute the group’s winnings to LS1 and 
LS2?” Responses correlate highly with the 
manipulation, r = .907, p < .001. 

Other support. One item asked: “Overall, 
to what extent does LS2 support and endorse 
HS as leader?” Responses show a strong cor- 
relation with the manipulation, r= 930, 
p < 001. 

Vulnerability. An item on the second ques- 
tionnaire asked: “If LS1 wants to impeach 
HS, what are the chances that HS will actu- 
ally be removed from his position of leader- 
ship?” Subjects’ responses correlate highly 
with the actual manipulation of vulnerability, 
r = .939, p < .001. 

Overall, the checks indicate that all the 
manipulations proved salient to the subjects 
and that appropriate: conditions were estab- 
lished to test the research hypotheses. 

Reliability of measured variables. Coeff- 
cients of reliability based on internal-consist- 
ency criteria (Cronbach’s alpha) reveal high 
levels of reliability for each measured vari- 
able. The reliability of the endorsement mea- 
sure 1s .923, which compares with figures of 
924 and .884 for the same instrument in pre- 
vious research (Michener & Burt, in press; 
Michener & Tausig, 1971). The reliability 
coefficient for the index of competence is .958 
and that for the measure of fairness is .919, 


Basic Causal Model 


The technique of path analysis is used to 
analyze the experimental results for endorse- 
ment. This technique combines an explicit 
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Ficure 2. Expanded causal model with mediating variables. 


causal model with multiple regression estima- 
tion procedures (see discussions by Costner, 
1971; Duncan, 1970; Heise, 1969; Land, 
1969). Table 1 presents the zero-order correla- 
tions among the seven variables in this study, 
and Figure 1 depicts the basic causal model 
for these data. This model postulates that en- 
dorsement is caused by the four manipulated 
variables (with competence and fairness 
omitted, for the time being). The four inde- 
pendent variables are represented as uncor- 
related. Given the procedures for sampling 
experimental treatments, this condition should 
hold true in the underlying population. 

The predictive equation for endorsement, 
with the partial regression coefficients ex- 
pressed in standardized form, is: 


E = 360 S +.425P + .018 0 
— 137 V +.804 e,. (1) 


In this equation, Æ is endorsement, S is 
success-failure, P is payoff distribution, O is 
other support, V is vulnerability, and e, is the 
error term. The coefficients for success—failure 
and payoff distribution are significantly dif- 
ferent from zero, p < .0001, and the coeffi- 
cient for vulnerability achieves marginal sig- 
nificance, p=.07, and the coefficient for 
other support is nonsignificant. The propor- 
tion of variance in endorsement explained by 
this model (R?) is 3530. 

The importance of these results is clear. 
First, they replicate the earlier finding of 
Julian, Hollander, and Regula (1969) that 
low-status members accord more endorsement 


to a leader under collective success than 
under collective failure. Second, they extend 
prior research by demonstrating that endorse- 
ment varies as a function of the payoff dis- 
tribution; endorsement is lower when the 
leader usurps large payoffs for himself than 
when he allocates payoffs more evenly. Third, 
these results show that endorsement is a 
negative function of vulnerability, although 
this effect is small. 

Surprisingly, the data also indicate that 
other support does not affect endorsement. 
This result contrasts with the findings of 
earlier studies (Michener & Lyons, 1972; 
Michener & Tausig, 1971), where other sup- 
port heightened endorsement. Apparently, the 
difference occurs because success—failure and 
payoff distribution were varied over their 
entire range in the present study, while they 
were established as fixed parameters (at 
moderate-to-low levels) in the earlier research. 
Subjects in the present study based their 
judgments of the leader directly on informa- 
tion about success-failure and payoffs, rather 
than on the opinion of the other low-status 
member. In fact, high levels of other support 
did not lead to increased endorsement, but to 
derogation of the other low-status member. 
Semantic differential measures show that sub- 
jects rated the other member as less intel- 
ligent, less perceptive, and less influential 
under high support than under low support 
(p< .05 for each of these items). Evi- 
dently, the presence of another member sup- 
porting the leader is not sufficient to affect 
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endorsement when definitive information on 
performance and rewards is directly available. 


Expanded Causal Model 


Figure 2 portrays an expanded causal 
model incorporating attributed competence 
and fairness, This model treats competence 
and fairness as intervening between endorse- 
ment and the four independent variables. 
This causal sequence is established by the 
experimental design, with the measurements 
of competence and fairness coming after the 
manipulation of the independent variables, 
but before the measurement of endorsement. 
As in the previous model, the independent 
variables are assumed to be uncorrelated in 
the population, A direct causal relation is not 
postulated between competence and fairness, 
so the residual association between these 
variables is represented as a correlation be- 
tween their error terms (the curved arrow in 
Figure 2). 

This expanded causal model involves three 
simultaneous equations, one each for compe- 
tence (C), fairness (F), and endorsement 
(E). Expressed in terms of standardized 
partial regression coefficients, these equations 
are the following: 


C = .720 S + .076 P + .067 O + .036 V 


+ .687 e (2) 
F = — 0325 + .785 P — 0240 — 018 V 
+ .618 e (3) 


E 


ll 


— 058 S + .159 P — .013 O — 158 V 
+ 580 C + 318 F + .713 e. (4) 


Significance tests for Equation 2 indicate that 
only the effect of success—failure on compe- 
tence differs from zero, p < .001; no other 
effects are significant. Tests for Equation 3 
show that payoff distribution is the only sig- 
nificant determinant of fairness, ~ < .0001, 
while tests for Equation 4 indicate that vul- 
nerability, p < .01, competence, p < .0001, 
and fairness, p < .001, are all significant de- 
terminants of endorsement. Despite its ap- 
parent size, the direct effect of payoff dis- 
tribution on endorsement (Equation 4) is not 
significantly different from zero. The results 
indicate correlated error between competence 
and fairness, with rese, = .204. Interpretively, 
this means that some (unknown) variable not 
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explicitly entered in the model is cau 
competence and fairness; this v € 
taken to be uncorrelated with the four ind 
pendent variables. The coefficient of determi- 
nation (R?) for endorsement in ition 4 
is .4911, which means that the in 
this model are explaining abo: 

variance in endorsement. 

These results are important ve 
for they show that competence ne 
mediate the effects of the mani) ed vari 
ables on endorsement.’ As indic he 
pattern of regression coefficients € 
mediates the effect of success-fa 
dorsement (Equations 2 and 4), w 
mediates the impact of payoff dis 
endorsement (Equations 3 and 4). Mi 
mediation is almost complete, because 
direct effects of success—failure and payoff 
distribution on endorsement drop virtually to 
zero when competence and fairness are in- 
cluded in the model (Equation 4 ly the 
small effect of vulnerability on ez 
is not mediated to any appreciable degres by 
competence and fairness. 


Discussion 


The expanded causal model corroborate 
and extends prior research on en ment. 
The finding that competence det: ; en- 
dorsement accords with results re by 
Julian, Hollander, and Regula (196 7 
Hollander and Julian (1970). Ti 


1The causal model in Figure 2 
Tore =O and ress =0, A critic might : 
assumption because subjects were ob 
randomly assigned to “levels” of com 
fairness. If this assumption of uncorrela 
false, the coefficients for the effect of co 
endorsement (.580) and for the effect of r 
endorsement (.318) are biased and inconsistent 
check this Possibility, an alternative model was 
mated in which rej. and Teyeg Were not z 
equal zero; to achieve identification, the d pat 
from other support and Success-failure to endorse 
ment were subject to zero restrictions. Coefficients 
for this alternative model were estimated by means 
of two-stage least Squares (Johnston, 1963; Miller, 
1971). Results show that although some correlated 
error is present (with reco = 558 and Teres = .395), 
competence and fairness are still important mediating 
variables for endorsement, The regression coefficient 
for endorsement on competence is .381, while that 
for endorsement on fairness is .409. Thus, these re- 


sults teinforce the basic conclusion of the model jn 
Figure 2. 
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t that fairness affects endorsement is new. More 
important, this research reveals two distinct 
causal chains producing endorsement. Collec- 
tive. success-failure affected endorsement 
through attributed competence. And, con- 
‘sistent with the theoretical reasoning of 
Hollander and Julian (1970), the payoff dis- 
tribution affected endorsement through attrib- 
uted fairness. These patterns demonstrate that 
low-status subjects assimilated the group’s 


_® structure in terms of the personal attributes 


of the leader. Depending on the group’s struc- 
tural situation, the leader was viewed as com- 
petent (or incompetent) and as fair (unfair) , 
and these perceptions in turn led to differ- 
ences in endorsement. 

Overall, these causal chains suggest that 
group members apply two distinct criteria 
when assessing formal leaders. The task 
criterion concerns a leader’s ability to ad- 
vance the interests of the group and its mem- 
bers. Tke moral criterion concerns a leader’s 
commitment to higher social ideals transcend- 
ing personal interests (Kelley, 1971), and 
reflects his desire to act on behalf of the col- 
lective welfare. In the present study, the mea- 
sures of competence and fairness reflect the 
leader’s standing on the task and moral 
dimensions, respectively. 

One final note. A critic might argue that 
knowledge of endorsement is of little practical 
use, unless endorsement relates to some im- 
portant behavior or outcome. Data in the 
present study, however, show that highly en- 
dorsed leaders were less likely to be removed 
from office than poorly endorsed leaders. The 
zero-order correlation between the leader's 
endorsement and the subjects’ ouster attempts 
is substantial, r = —.742, p < .001. Knowl- 
edge of endorsement would forecast tenure 


in office. 


REFERENCE NOTE 
1. Michener, H. A., & Burt, M. R. The use of social 
influence under varying conditions of legitimacy. 
Unpublished manuscript, University of Wiscon- 
sin—Madison, 1972. 


REFERENCES 
i ffects of 
Berkowitz, L., & Macaulay, J. R. Some e: ts 
heee i status net and status stability. 


Human Relations, 1961, 14, 135-147. ake 
Blau, P. M. Exchange and power in social life. New 


York: Wiley, 1964. 


223 


Costner, H. L. Utilizing causal models to discover 
flaws in experiments. Sociometry, 1971, 34, 398- 
408. 

Duncan, O. D. Partials, partitions, and paths. In 
E. F. Borgatta & G. W. Bohrnstedt (Eds.), Socio- 
logical methodology 1970. San Francisco: Jossey- 
Bass, 1970. 

French, J. R. P., Jr., & Raven, B. R. The bases of 
social power. In D. Cartwright (Ed.), Studies in 
social power. Ann Arbor, Mich.: University of 
Michigan Press, 1959. 

Heise, D. R. Problems in path analysis and causal 
inference, In E. Borgatta (Ed.), Sociological 
methodology 1969. San Francisco; Jossey-Bass, 
1969. 

Hollander, E. P., & Julian, J. W. Studies in leader 
legitimacy, influence, and innovation. In L. Berko- 
witz (Ed.), Advances in experimental social psy- 
chology. Vol. 5. New York: Academic Press, 1970. 

Johnston, J. Econometric methods. New York: 
McGraw-Hill, 1963. 

Julian, J. W., Hollander, E. P., & Regula, C. R. 
Endorsement of the group spokesman as a func- 
tion of his source of authority, competence, and 
success. Journal of Personality and Social Psychol- 
ogy, 1969, 11, 42-49. 

Kelley, H. H. Moral evaluation. American Psycholo- 
gist, 1971, 26, 293-300. 

Land, K. C. Principles of path analysis. In E. 
Borgatta (Ed.), Sociological methodology 1969. 
San Francisco: Jossey-Bass, 1969. 

Michener, H. A., & Burt, M. R. Legitmacy as a base 
of social influence. In J. T. Tedeschi (Ed.), Per- 
spectives on social power. Chicago: Aldine- 
Atherton, 1974. 

Michener, H. A., & Burt, M. R. Components of 
“authority” as determinants of compliance. Journal 
of Personality and Social Psychology, in press. 

Michener, H. A., & Lawler, E. J. Revolutionary 
coalition strength and collective failure as deter- 
minants of status reallocation. Journal of Experi- 
mental Social Psychology, 1971, 7, 448-460. 

Michener, H. A., & Lyons, M. Perceived support 
and upward mobility as determinants of revolu- 
tionary coalitional behavior. Journal of Experi- 
mental Social Psychology, 1972, 8, 180-195. 

Michener, H. A., & Tausig, M. Usurpation and 
perceived support as determinants of the endorse- 
ment accorded formal leaders. Journal of Personal- 
ity and Social Psychology, 1971, 18, 364-372. 

Miller, A. D. Logic of causal analysis: From experi- 
mental to nonexperimental designs. In H. M. 
Blalock (Ed.), Causal models in the social sciences. 
Chicago: Aldine-Atherton, 1971. 

Suchner, R. W. Status endorsement as a function of 
perceived competence and conformity of leaders 
in successful and unsuccessful groups. Unpublished 
doctoral dissertation, University of Wisconsin— 
Madison, 1972. 

Weber, M. The theory of social and economic orga- 
nization. New York: Oxford University Press, 
1947, 


(Received July 2, 1973) 


Journal of Personality and Social Psychology 
1975, Vol. 31, No. 2, 224-231 


Self-Discrimination Responses in Black School Children 
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Previous studies found that the performance of blacks on intellectual tasks 
decreases in the presence of whites. This study attempted to determine if this 
performance decrement still exists by administering three trials of a digit 
symbol task to black junior high school students in the presence of either 
white or Negro counselors. The task was labeled as either intellectual or motor 


skill. In addition, measures of expecta 
siveness, and task satisfaction were ob 


ncy, incentive, anxiety, hostility, defen- 
tained, It was found that the students 


performed better in the presence of whites and that this was a function of 
task complexity and task labeling. Of several theories considered, Spence’s 
analysis of the relationship of drive level to performance can best explain the 


data. No source for the increased drivi 


This article is concerned with responses of 
blacks which tend to fulfill the expectations 
of biased whites. These responses, which are 
pooled under the general rubric of self- 
discrimination responses, may remain a prob- 
lem for society as a whole and especially for 
educators even in a hypothetical absence of 
white discrimination. 

Katz (1964) surveyed much of the litera- 
ture relating to predjudice and its effects upon 
black performance. Examining the studies 
covered in this article, several types of self- 
discrimination are evident: (a) Blacks may 
underestimate their own performance when 
they are being compared with whites. (b) 
Blacks may fail to utilize their full potential 
when working with whites. (c) The actual 
performance level of blacks on difficult tasks 
may decrease when they are told that they 
are to be compared with whites on difficult 
tasks. (d) The actual performance level of 
blacks may decrease when tasks are super- 
vised by whites. The last effect was found to 
be a function of task difficulty and also of 
description of a task as either intellectual or 
nonintellectual. 

The present study investigates this last 
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e in the presence of whites was found. 


type of self-discrimination response. Any of 
the self-discrimination responses might be ex- 
pected to cause difficulties, but this is espe- 
cially true for performance deficits in the 
presence of whites. In particular, these deficits 
might prove a problem in integrated academic 
situations, where tasks are often complex and 
are almost by definition intellectual. 

A variety of possible explanations for the 
obtained performance deficit have been pro- 
posed. Expectancy has been suggested as one 
important variable. Katz, Epps, and Axelson 
(1964) have suggested that performance 
varies directly with the subjects’ expectancy 
of success and have argued that expectation 
is less when black subjects believe they are 
to be compared with whites, They investi- 
gated the performace on a digit symbol task 
of black subjects who were given one of three 
sets of instructions: no test, scholastic apti- 
tude test with black college norms, or scho- 
lastic aptitude test with national (i.e., pre- 
dominantly white) college norms, The highest 
achievement was found in the black-norm 
condition, intermediate performance in the 
white-norm condition, and lowest perform- 
ance in the no-test condition. The differences 
tended to be greater on a hard version of the 
task than on an €asy version. The authors 
have argued that performance was lowest in 
the no-test condition because of low incentive 
and that the difference between the two test 
conditions was a result of a higher expectancy 
of success when the black subjects believed 


they were competing against blacks, No 
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direct measurement of expectancy or incentive 
was made. 

A second group of hypotheses, which might 
be called the intrapsychic interference theo- 
ries, proposes that blacks have internal emo- 
- tional reactions to whites that distract from 
the performance of the assigned task. One 


‘ „possibility is that blacks experience internal 


cues of automatic arousal in the presence of 
whites and attend to these instead of to the 
task. Katz (1964) has suggested that social 
threat, a class of social stimulus events that 
tend to elicit anxious expectations that others 
will inflict harm or pain, may produce such 
arousal cues and that blacks may experi- 
* ence social threat in the presence of whites. 
Arousal cues from failure threat, defined by 
Katz (1964) as a class of stimulus events in 
an achievement situation which elicit anxious 
expectations of harm or pain as a consequence 
of failure, may be the mechanism underlying 
any possible correlation between low expect- 
ancy and self-discrimination if black expecta- 
tions of success are indeed lower in the 
presence of whites. 

Another possibility is that intrapsychic 
interference results from defensive operations 
aimed at reducing unpleasant emotional reac- 
tions. Katz, Robinson, Epps, and Waly 
(1965) have suggested that fear of failure 
produces hostility toward the authority fig- 
ures demanding that the task be done. When 
the authority figures are white, it is argued 
that black subjects suppress the hostility, and 
this repression distracts from the assigned 
task. Katz et al. (1964) gave a test of aggres- 
sion disguised as a concept formation task to 
male students at a black high school and 
junior high school. The following day they 
gave an equivalent test to the same subjects 
but described it as either a research instru- 
ment or an IQ test. Under IQ instructions 
hostility increased if the experimenter was 
black but decreased if the experimenter was 
white. The authors interpreted this as indi- 
cating repression for subjects with white ex- 
perimenters, but no direct measure of repres- 
sion was made. Other defensive operations 
aimed at reducing anxiety are described in 
Katz (1964). 

A final,hypothesis is suggested by the con- 
cept that social threat or failure threat may 
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increase autonomic arousal in the presence 
of whites. If this is the case, then the Spence 
model of the effects of high drive upon per- 
formance may apply (Spence, 1958). The 
effect of drive level is hypothesized to be a 
function of the number of trials and task 
complexity as defined by the number of com- 
peting responses. While the intrapsychic theo- 
ries deal with competing responses in the 
sense of nontask-oriented internal defensive 
operations, the Spence model defines com- 
peting responses as overt operant responses 
which resemble one another and which are sug- 
gested as possible solutions by either the task 
itself or the subjects’ past learning history. If 
drive is high and the task is complex, then per- 
formance initially suffers until competing re- 
sponses are extinguished, but eventually per- 
formance is facilitated relative to low-drive 
conditions. If the task is simple, then per- 
formance in high-drive conditions may be 
facilitated from the very first relative to per- 
formance in low-drive conditions. If the 
Spence model applies, then self-discrimination 
responses may be characteristic of only cer- 
tain situations—early in the learning process 
when complex tasks are being mastered. 
Changes in the relationship between perform- 
ance in the presence of whites and perform- 
ance in the presence of blacks would be 
expected as a function of the number of trials. 

The Katz (1964) discussion of the effects 
of social and failure threat suggests one pos- 
sible reason for believing that blacks may be 
more aroused in the presence of whites than 
in the presence of other blacks, but there are 
other reasons as well. Berkowitz and Knurek 
(1959) found that hostility paired with a pre- 
viously neutral label by means of semantic 
conditioning generalized to the person signi- 
fied by that label. The authors suggest that 
white prejudice towards blacks may result 
from a similar pairing of negative words with 
the label Negro. If this line of reasoning is 
correct, then blacks may be even more likely 
to develop negative emotional responses to 
whites by hearing of negative experiences of 
others with whites. When the probability of 
at least some direct negative experience with 
whites is taken into account, the chances of 
conditioning an emotional arousal response 
seem high. Porier and Lott (1967) found that 
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white subjects had higher galvanic skin re- 
sponses in the presence of blacks than in the 
presence of whites. While direct evidence is 
Jacking, it would be surprising if the reverse 
were not true for black subjects. 

There is already some evidence that the 
Spence model may apply to the self-discrimi- 
nation area. Katz, Roberts, and Robinson 
(1965) found that black subjects performed 
better in a digit symbol task with white ex- 
perimenters under neutral task instructions 
but better with black experimenters when told 
that the task was a measure of IQ. The 
authors reasoned that the presence of whites 
alone increased drive to an optimal level, but 
description of the task as intellectual further 
increased drive to a level that interfered with 
performance. No differences were found on a 
less complex version of the task. The Spence 
model predicts changes in relative perform- 
ance as a function of trials, and it is par- 
ticularly important to look at changes early 
in the learning process. Katz et al. (1965) 
have presented learning curves but only for 
the second of two sessions. 

The present research examines the effects 
of the presence of a white experimenter on 
the performance of blacks and seeks to throw 
light on the usefulness of the various hypothe- 
ses in explaining the data. Toward that goal, 
the study looks at the effects of the presence 
of whites and the labeling of a task as intel- 
lectual upon (a) incentive, (b) expectancy, 
(c) defensiveness, (d) hostility, (e) anxiety, 
and (f) satisfaction during the task. Also, 
possible effects of task labeling and the pres- 
ence of whites upon performance are studied 
as a function of trials so that learning curves 
can be investigated. 

In the existing literature three main inde- 
pendent variables have been manipulated: 
race of the experimenter, task labeling, and 
race of peers. The present study varies the 
first two of these and also adds another 
variable, number of trials. 


METHOD 
Subjects 


Subjects were 40 black male students in the 
seventh and eighth grades at a junior high school 
in Lexington, Kentucky. The student body was over- 
whelmingly black; the faculty was racially mixed. 
Subjects were recruited from a large study hall and 
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included most of the male population 
hall. Most students in the school had one suc 
hall per day, so the subjects chosen were J 
fairly typical of the total male popula 

school. 


Experimenters 


The experimenters were two white and two b 
counselors of approximately the same a 
tional background. On the basis of | 
views with the authors, it was felt that both blach 
and white experimenters were similar terms of 
warmth, intelligence, and verbal ability hool 
administrative staff concurred with t 
students were not consulted. The experiment 
given a brief training session prior to the 
ment, which involved explaining the procedure tha 
would be used, going through the answer booklet 
with them, and answering any qustions aboul wh 
they were to do. The experimenters read all 
tions to the subjects. All experimenters referred 
a written time schedule so that they allowed the 
same amount of time for the various tasks, All 
experimenters spent the same amount time run- 
ning the study. The experimenters did not 
rehearse their roles, nor were they dir 
Subsequent questioning seemed to in 
behaved similarly during the actual s 
perimenters were not told the purpo 
ment until after the data was colle: 
tioning afterwards revealed that they 2 
the race of the experimenter was a \ ere 
not sure how it would affect the resu 


Measures 


The measure of performance w ital 
Symbol subscale of the Wechsler Adult Intellige: 
Scale (WAIS). This timed task involves copying 
symbols under numbers, matching symbol 
according to a given key. It could th d 
either as a measure of coordination < 1 test of 
intelligence. 
Attitudes of subjects thought to be relat 
performance were measured by 6-point 
specially devised for the experiment 
a scale whose extremes were labeled i 
and extremely important on which th 
indicate how important it was for him 
the experimental task; (b) expectancy 
extremes were labeled much below aver 
above average on which the subject c y 
how well he actually expected to do in relation to 
the “average” person on the experimental task: 
(c) satisfaction, a scale whose extremes were labeled 
thoroughly enjoyed the session and thoroughly dis- 
liked the session on which the subject could indicate 
how much he liked or disliked the experimental l 
session. i 
Various personality variables were also considered l 
to have a possible relation to performance. A test 
battery was assembled which consisted of 146 state 
ments with which subjects were asked to indicate 
agreement or disagreement on an answer sheet. This 
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booklet was labeled as a biographical inventory. The 
measures included in the biographical inventory are 
as follows: (a) the Manifest Anxiety Scale (Taylor, 
1953), used as a measure of generalized anxiety; 
(b) the Test Anxiety Scale (Sarason & Ganzer, 
1962), included as a more situation-specific anxiety 
measure; (c) the Manifest Hostility Scale (Siegel, 
1956), a measure of hostility; and (d) the Minne- 
sota Multiphasic Personality Inventory K scale 
(Dahlstrom & Welsh, 1960), used as a measure of 
defensiveness. 

In a final measure, subjects were asked to indicate 
their grade point averages on a 5-point scale ranging 
from failing to excellent. This was done to provide 
some indication of how comparable the two groups 
were in terms of ability. No objective check could 


be made, since subjects did not sign their names. 


Procedure 


All of the various measures were combined into an 
answer booklet numbered for identification. Subjects 
were asked not to give their names so that mea- 
sures could be scored without knowledge of the 
group to which a given subject belonged. On the 
front of the booklet a large distinctive mark in one 
of four colors was drawn. The booklets were mixed 
up using a table of random numbers. All subjects 
initially assembled in one large room, and as each 
subject came through the door, he was handed a 
booklet from the top of the stack. Later all subjects 
with marks of a given color assembled together and 
were led into a separate room by the experimenter 
to whom that color had been assigned using a table 
of random numbers. Each experimenter made sure 
that all subjects in a group belonged in that group 
and that the booklets were not opened until the 
appropriate time. 

At this point each experimenter asked the group 
to look at the first page of the answer book, read it, 
and fill out the incentive scale. The experimenters 
went around the room to make sure that the sub- 
jects understood the instructions; subjects were 
asked to raise their hands for help if needed. The 
importance of being honest was stressed. 

When the subjects were finished, they were asked 
to turn the page, read the instructions, and fill out 
the scale. The instructions were as follows for half 
of the subjects in each group: 

In a few minutes you will be asked to play a 
“same” which will consist of matching up certain 
marks with the numbers they belong with. This 
“game” has a serious purpose because it will give 
us some idea of how well you could do at work 
requiring the skilled use of hands (like mechanical 
drawing, for instance). This doesn’t necessarily 
mean how well you will do, or how well you have 
done, but just how well you could do under the 
right conditions. You'll be told more about how 
to play the “game” when the time comes. 

ceived identical instructions 
about the purpose of the 


except for the seintence 
the sentence read: 


game. For these subjects, 


This “game” has a serious purpose because it 
will give us some idea of how well you could do 
at work requiring academic skills (like school 
work, for instance)... . 


The subjects were then asked to fill out the expect- 
ancy measure. The importance of honesty was again 
stressed: the experimenters went around the room to 
make sure that the subjects knew how to use the 
scale. 

The biographical inventory was administered next. 
The experimenters explained how the IBM answer 
sheet was to be used, then read the instructions 
aloud, stressing that there were no right or wrong 
answers. Forty minutes were allowed for completion 
of the battery, a time period judged sufficient by 
educational personnel. After that interval, subjects 
were asked to turn to the instructions for the 
“game.” 

The game was the standard WAIS Digit Symbol 
subscale. The instructions read: 


This is the “game” we told you about, You 
remember that it is important because it will tell us 
how well you could do in academic work (or 
work requiring the skilled use of hands). Read the 
instructions carefully but please do not turn the 
page until you are asked to do so. 


The standard WAIS instructions were then given, 
The WAIS table of numbers and symbols was pre- 
sented, then the WAIS practice section was presented 
with instructions to try the task. Finally, the correct 
answers, were presented. The experimenters went 
around the room making sure that all subjects 
understood the task. After this the experimenter 
asked the subjects to turn the page and begin work. 
The subjects were told to work as quickly as pos- 
sible, since the task was timed. The subjects were 
given 90 seconds to complete the task, and then they 
were asked to turn the page and begin work again. 
The procedure was repeated so that all subjects 
took the Digit Symbol subtest three times each. 

Subjects were then asked to turn to another page 
and were asked to complete the satisfaction scale. 
As before, honesty was stressed, and the experi- 
menters were available for help. Finally, the subjects 
were asked to report their grade averages. 


RESULTS 


Figure 1 shows the mean number of items 
correctly completed on each of the three 
administrations of the digit symbol task by 
each of the different experimental groups. It 
can be seen that subjects with white experi- 
menters tended to do better than subjects 
with black experimenters, that subjects re- 
ceiving intellectual task-labeling instructions 
did better than subjects receiving motor-skill 
labeling instructions, and that all subjects 
tended to improve as the trials progressed. 
The results were evaluated using a three-way 
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Ficure 1. Mean correct number on digit symbol 
task. (E refers to experimenter.) 


analysis of variance with instructions, race of 
experimenter, and number of trials as factors. 
This analysis is summarized in Table 1. It can 
be seen that the effects for trials, Race x 
Trials, and Race X Trials X Instructions were 
significant, 

Looking again at Figure 1, it can be seen 
that the learning curves for subjects with 
white experimenters tend to be positviely ac- 
celerated, while the learning curves for sub- 
jects with black experimenters tend to be 
negatively accelerated. Put another way, sub- 


TABLE 1 


SUMMARY OF ANALYSIS OF VARIANCE FOR NUMBER 
Cor l ON DIGIT SYMBOL Tas 


Source SS df MS P 
Between subjects 
Race (A) 710.53 | 1 |710.53|  3,05** 
IQ (B) 410.70 | 1 |410.70| 1.73 
AXB 154.13 | 1 | 154.13 
Error 8,508.67 | 36 | 236.35 
Within subjects 
Trials (C) 1,956.64 | 2 | 978.32 | 103.61*** 
DENS 174.12 | 2 | 87.06] 9.22*+* 
BXC 38.70 | 2 | 19.35] 2.04* 
AXBXC 143,62 | 2 | 71.81 7.60*** 
Error 679.93 | 72 9.44 
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TABLE 2 


MEAN NUMBER OF BIOGRAPHICAL INVENTORY 
ITEMS COMPLETED 


Task label 
Race of uS CN 
experimenters* 
Intellectual | Motor Skill 
White | 143.10 146.00 
Negro 92.00 111.80 


^F (1, 36) = 17.88, p < .01. 


jects with white experimenters showed more 
improvement from Trial 2 to Trial 3 than 
from Trial 1 to Trial 2. The reverse was true 
for subjects with black experimenters. The 
significance of this was evaluated by means of 
one tailed ¢ tests for each of the experimental 
groups. For subjects with white experimenters 
who received IQ task-labeling instructions, 
the amount of improvement from Trial 2 to 
Trial 3 was larger than the amount of im- 
provement from Trial 1 to Trial 2, and this 
was significant at the .025 level, ¢ (18) = 2.15, 
For subjects with white experimenters receiv- 
ing motor-skill labeling instructions, the dif- 
ference was in the same direction but failed 
to reach significance. For subjects with black 
experimenters who received motor-skill label- 
ing instructions, the amount of improvement 
from Trial 1 to Trial 2 was greater than the 
amount of improvement from Trial 2 to 
Trial 3, and this difference was significant at 
the .025 level, # (18) = 2.54. For subjects 
with black experimenters receiving IQ label- 
ing instructions, the differences were in the 
same direction but failed to reach significance. 

Preliminary inspection of the data reveals 
that not all subjects completed the bio- 
graphical inventory. The number of items 
completed by each subject thus becomes an 
unexpected but no less real measure of per- 
formance. Table 2 shows the mean number of 
items completed by members of each group, 

It can be seen that subjects with white ex- 
perimenters completed more items than sub- 
jects with black experimenters, Subjects 
receiving motor-skill labeling instructions com- 
pleted more items than subjects receiving 1Q 
labeling instructions, These effects were eval- 
uated by means of a two-way analysis of 
variance with task labeling and race of ex- 
perimenter as factors. The effect for race was 
significant, 
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Differences among groups for all biographi- 
cal inventory subscales, attitude measures, 
and reported grade point averages were eval- 
uated using a two-way analysis of variance 
with instructions and race of experimenter as 
factors. No significant main effects or inter- 
actions were found. 


Discussion 


As in the earlier studies, black subjects 
responded differently depending on whether 
they were supervised by whites or blacks. 
However, rather than a performance decre- 
ment in the presence of white experimenters, 
a performance increment was found, even 
when the task was described as intellectual. 
This apparent contradiction of previous re- 
search results might be seen as a reflection of 
actual changes in the reactions of blacks to 
whites over the last several years. 

Before such a conclusion can be reached, 
however, it is necessary to take a closer look 
at the Spence model. Spence (1958), speaking 
of learning a complex task, has stated: 

In this instance the stronger the drive, the greater 
will be the percent choice of the wrong response, or, 
in other words, the poorer will be the performance 
at this initial stage. But, as training proceeds, sooner 
or later the habit strength of the correct, reinforced 
response will overtake that of the wrong, unrein- 
forced response and from this point on the percent 
choice of the correct response should in general be 
higher for the high drive group than for the low 
drive group. In other words, the performance curves 
would be expected to cross. . . . (P- 138) 

A special aside needs to be added for digit 
symbol tasks. Since the correct answers are 
always present at the top of the page, wrong 
choices do not appear on paper. However, 
when the subject checks his internal guess 
against the key and perceives he is wrong, he 
knows he cannot rely on his guesses alone but 
must continue to rely on the key until his 
guesses are right. This slows down his per- 
formance until these internal wrong responses 
are extinguished. 

The Spence model, then, does not predict 
that subjects with high drive perform better 
or worse than subjects with low drive but 
rather says that the relative performance of 
the two groups depends on the portion of the 
performance curves sampled. If performance 
is sampled before the curves Cross, then a 
performance decrement on the part of high- 


229 


drive subjects is found; if sampled after the 
curves cross, then performance facilitation for 
the high-drive group is found. If the Spence 
model applies to the present situation, then 
the performance curves obtained would be 
from shortly after the point of intersection to 
a point of clearer superiority for the high- 
drive group, It may be objected that the sup- 
posed high-drive groups, those with white ex- 
perimenters, were at least somewhat superior 
to the supposed low-drive groups, those with 
black experimenters, on the very first trial. 
Again the nature of the digit symbol task 
must be considered. Performance on Trial 1 
refers to the total number of correct responses 
achieved after 90 seconds of learning. This 
might very well be enough time for a high- 
drive group to overcome any initial perform- 
ance decrement on this particular task. Had 
the number of correct responses at the end of 
30 seconds been sampled, or had a more diffi- 
cult task been involved, then the relative 
performance of the two groups might have 
been reversed. It is recalled that the perform- 
ance curves for the groups with white experi- 
menters were positively accelerated, while 
those for the groups with black experimenters 
were negatively accelerated, as might be ex- 
pected on the basis of the Spence model. 

The digit symbol task used here can be 
viewed as moderately complex because, for 
any one digit, there are several symbols that 
could be matched, and none of these pairings 
have previously been in the subjects’ response 
hierarchy. Also, some, but not all, of the sym- 
bols resemble each other, resulting in some 
response competition. If the Spence model is 
correct, then a more complex task might make 
it more likely that the performance curves 
would be sampled before the point of inter- 
section, thus resulting in a response decre- 
ment. Katz et al. (1965) used a more com- 
plex task in which the symbols were small 
flaglike figures differing in subtle ways. Re- 
sponse competition was deliberately maxi- 
mized. They found that subjects with black 
experimenters performed as well or better than 
subjects with white experimenters on this task 
when it was described as a measure of IQ. On 
the other hand, if a task simpler than the 
digit symbol task of the present experiment 
were used, then the performance curves might 
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be sampled still further from the point of 
intersection resulting in a more clear-cut 
superiority for a high-drive group. It is pos- 
sible to consider the biographical inventory 
to be such a task, since there are only two 
possible responses (true or false), and the 
dominant response is by definition correct. In 
the present study, subjects with white experi- 
menters showed an overall superiority over 
subjects with black experimenters. On the 
digit symbol task the overall effect for race 
failed to reach significance. There are some 
dangers in interpreting these results in terms 
of task complexity, since the biographical in- 
ventory is not just a simpler version of the 
digit symbol task but involves a willingness 
to discuss information about one’s self. How- 
ever, unless the bulk of previous writing is 
wrong, blacks should be more defensive and 
hence slower with white experimenters (Katz, 
1964) unless some factor such as higher drive 
is compensating. It is conceivable that the 
recent stress on black pride has reversed the 
picture, and blacks are now more defensive 
around other blacks, but in that case it seems 
probable that significant differences would 
have been found on the K scale. 

The effects of labeling the digit symbol 
task as intellectual are not clear-cut, Overall, 
intellectual labeling seemed to facilitate per- 
formance in the presence of white experi- 
menters but not in the presence of black ex- 
perimenters. Intellectual labeling seemed to 
make the performance curve for subjects with 
black experimenters less negatively acceler- 
ated and the performance curve for subjects 
with white experimenters more positively ac- 
celerated. In the Katz et al, (1965) study, 
intellectual task labeling seemed to decrease 
performance in the presence of white experi- 
menters and increase it in the presence of 
black experimenters, One possible explanation 
is that there are different optimal levels of 
drive for tasks of differing complexity. The 
combination of white experimenter and IQ 
instructions may have produced an optimal 
level of drive for the task used in the present 

study but too much drive for the more com- 
plex task used in the Katz et al. study. The 
combination of intellectual task labeling with 
black experimenters may have produced a 
drive level too low to make much difference 
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in the present study but sufficient to improve 
performance in the Katz et al. study. 

The effects of labeling a task as intellectual 
might have been clearer if the variable had 
been manipulated in a more dramatic man- 
ner. It must be remembered that subjects 
knew about the alleged purpose of the tasks 
only by silently reading the instructions, 
while the race of the experimenter was clearly 
visible throughout the experiment. 

Of the theories considered, those that postu- 
late internal defense operations have the 
greatest difficulty in handling the present 
data. In addition to the failure to find differ- 
ences between groups on measure of anxiety, 
hostility, or defensiveness, the facilitation of 
performance in the presence of whites, par- 
ticularly on a self-revelation task, cannot 
easily be explained. Theories based on direct 
task interference as a result of internal cues 
resulting from anxiety, etc., also have trouble 
explaining facilitation in the presence of 
whites. Theories based on differential incen- 
tive or expectation were not helped by the 
present study but could conceivably still 
handle the data if one assumes that the mea- 
sures of these variables employed here are 
inadequate. 

The Spence theory does seem sufficient to 
handle data from the present study and earlier 
studies. Black subjects with white experi- 
menters do behave as though under higher 
drive than black subjects with black experi- 
menters. Black subjects who receive IQ in- 
structions may be under higher drive than 
those who receive motor-skill instructions, It 
is disappointing that the biographical inyen- 
tory subscale results do not permit a familiar 
label such as anxiety or hostility to be as- 
signed to the possible higher drive. However, 
this is not particularly surprising, even over- 
looking the possible inadequacies of the mea- 
sures used. Schachter and Singer (1962) have 
claimed that the specific emotion experienced 
is a function of both physiological arousal and 
cognitive labeling. Izard, Chappell, and 
Weaver (1970) have reported that the emo- 
tion consciously experienced by blacks in a 
black-white encounter characterized by white 
discrimination changes as a function of age. 
The initi $ 

e initial encounters (at ages 5 to 8) evoke 
responses of fear, distress, and shame. Later 
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encounters (at ages 18 and above) evoke 
responses of anger, disgust, and contempt. 
Since the subjects used in the present experi- 
ment were close to the middle of this age 
range, it may very well be that the cognitive 
experience of emotional response to discrimi- 
nation is mixed and hence unlikely to be 
clearly detected by the scales used in the 
present study. Regardless of cognitive label- 
ing, discrimination experiences might be ex- 
pected to condition physiological arousal 
which might generalize to black-white en- 
counters not characterized by discrimination. 

While the Spence model is sufficient to 
handle the present data, it is not necessary. 
One explanation which post hoc seems viable 
is a concept that different levels of legitimacy 
of work demands are perceived by blacks as 
a function of race and instructions. Thus, it 
may be seen as more legitimate to demand 
work in a school situation if the task is a 
measure of IQ. Whites may be perceived by 
blacks as having more of a right to demand 
work than other blacks. The results of Katz, 
Robinson, Epps, and Waly (1965) cited above 
are of some interest in this connection. The 
finding of greater hostility when black experi- 
menters demanded performance on a task de- 
scribed as an IQ test than when white experi- 
menters made the same demand may not 
reflect greater defensiveness in the presence 
of whites but simply anger that a black would 
overstep his perceived legitimacy. 

The problem with such an explanation is 
simply lack of evidence. While no firm conclu- 
sions can be based on the failure to find sig- 
nificant differences in expectancy, incentive, 
hostility, anxiety, or defensiveness in the 
present study, the data do point up the 
danger of assuming that manipulation of race 
of experimenter or task labeling produces the 
effects often logically attributed to them. 

While not demanded by the Spence model, 
a clear reversal from early inferiority on the 
part of black subjects with white experiment- 
ers to later superiority would have made the 
Spence explanation more compelling. A future 
study might use several tasks of differing 
complexity and measure performance earlier 
in the learning process. Most useful would 
be direct measurement of physiological 
arousal. Also, it would be interesting to de- 
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termine any possible differences between males 
and females in susceptibility to impairment of 
performance. Finally, a simple replication of 
the present study would be useful. Despite 
the attempts to equate both subjects and ex- 
perimenters for the different groups, there is 
always the problem of confounding race and 
specific personalities in a single study of this 
nature. 
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Performance and Satisfaction as Affected by Personal 
Control Over Salient Decisions 
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The effects of decision-making power on achievement in an introductory psy- 
chology course as well as attitudes toward the experience were investigated. 
Decision making was manipulated by (a) allowing some of the 171 participants 
to choose the type of recitation section they preferred (choice) while others 
received their preference by chance (no choice) and (b) manipulating the 
amount of student choice in the ongoing conduct of recitation sections (struc- 
tured and unstructured). Choice subjects did significantly better on one of 
three course exams than did no-choice subjects and reported greater satisfaction 
with their sections and leaders. Similarly, subjects in unstructured classes gave 
higher ratings of satisfaction with their sections than did subjects in struc- 
tured classes. The implications of the findings for structurally related, but 
broader, social problems are discussed. 


That participation in decision making may 
be important for effective behavior and per- 
sonal satisfaction has become an increasingly 
important theoretical contention (Haggstrom, 
1964; Riessman, 1967; Ryan, 1971; Bass, 
Note 1.). Self-determination has been stressed 
by Jahoda (1958) as a major criterion of 
positive mental health. She viewed the locus 
of the decision-making process as a funda- 
mental aspect of autonomy, emphasizing the 
importance of internal as opposed to external 
regulation of behavior. Further, proponents of 
community action and social advocacy in the 
area of community mental health place con- 
siderable importance on fostering self-deter- 
mination (Reiff, 1966; Levin, Note 2), Addi- 
tionally, the role of participation in decisions 
involving one’s fate is recognized to have been 
a significant element underlying recent social 
movements of major import in modern soci- 
ety; student unrest on campuses, revolution- 
ary Black Power movements, and community 
action to gain control of public education. 


This study is based on a doctoral dissertation sub- 
mitted to the Department of Psychology, University 
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The prominence of concern for decision-mak- 
ing power has created a pressing need to 
understand more fully how increased deci 
sion-making power, in various settings, affect 
behavior. A well-organized body of empirical 
data directly pertaining to this question is, as 
yet, unavailable. However, related studies in 
the areas of industrial management and edu- 
cation provide a limited basis for further 
investigation of the effects of decision-making 
power on behavior. 

Data pertaining to the relation between 
participation in decision making and task 
performance and attitudes are available from 
a series of studies carried out in management 
settings. Using self-perceptions of decision- 
making authority among supervisors in a de- 
livery company, Vroom (1960) found posi- 
tive relationships between perceived partici- 
pation in decision making and job-related 
attitudes and performance. Coch and French 
(1948) manipulated the decision-making rol 
of three groups of factory workers regarding 
introduction of a production change. Those 
allowed to decide how to accommodate to 
the new task were more productive and satis 
fied after its inception than those given rep- 
resentative participation in deciding on a 
work plan or those with no decision-making 
voice at all. Similar results were obtained by 
Morse and Reimer (1956), who increased 
the decision-making power of two groups of 
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clerks and decreased it for two others. Bass 
(Note 1) grouped managers-in-training in 
triads and asked them to develop plans for 
ordering and combining lists of numbers held 
by each participant. Higher productivity and 
greater satisfaction were found in all groups 
when they operated with their own plans 
rather than with formulae derived by others. 

While these studies suggest that perform- 
ance and satisfaction increase as a function of 
greater employee decision-making power, each 
contains a possible confounding of critical 
independent variables. Increase in perform- 
ance and satisfaction may result from moti- 
vational factors related to decision-making 
power, itself, or simply the opportunity to 
employ preferred work methods provided by 
self-selection. It is important that the effects 
of these two variables be separated. 

In the field of education the effects of deci- 
sion-making power on performance and satis- 
faction can be inferred from experiments com- 
paring different teaching methods as, for ex- 
ample, “student-centered” versus “instructor- 
centered” teaching. In a study by Faw (1949) 
student-centered discussion groups were char- 
acterized by greater student participation in 
determining activities, more discussion of per- 
sonal experience, a relative absence of instruc- 
tor direction, and minimal correction of in- 
accurate student statements, each of these 
elements in contrast to his instructor-centered 
sections. Scores on an objective, textbook- 
based exam showed small, but significant, 
differences favoring the student-centered 
classes which were also perceived by students 
to be of greater social and emotional value 
than instructor-centered classes. 

Other studies have compared student-cen- 
tered and instructor-centered methods, defined 
broadly as discussion among students versus 
discussion led by instructors (McKeachie, 
1951), instructor’s interpretations of student’s 
feelings versus no interpretations (Axelrod, 
1955), and student-determined evaluations 
versus traditional examinations (Ashmus & 
Haigh, Note 3). Although none of these stud- 
jes found differences between groups on ob- 
jective exams, McKeachie (1951) reported 
that student-centered subjects were superior 
on tests of problem-solving ability. Ashmus 
and Haigh (1952) reported greater student 
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satisfaction in student-centered classes in con- 
trast to instructor-centered ones. 

More recently Morgan (1972) studied the 
impact of broad, university curriculum 
changes eliminating all discipline-oriented 
graduation requirements and introducing con- 
siderable student choice on student achieve- 
ment and attitudes. Those exposed to the new 
curriculum performed at least as well as “old 
curriculum” students on traditional measures 
of achievement and reported greater satisfac- 
tion with their college experience. 

The foregoing data have some relevance for 
the present investigation, which was also con- 
ducted in an educational setting. Their fuller 
relevance is restricted, in part, by the con- 
founding of decision-making power, per se, 
with preferred activity afforded by self-selec- 
tion, much as noted earlier in studies of this 
type conducted in industrial settings. This 
confounding may be the result of the empha- 
sis, in educational studies, on evaluating 
teaching methods or classroom structure, per 
se, rather than the variable of decision-mak- 
ing power. 

The modal educational study evaluating 
teaching methods is susceptible to modifica- 
tion in ways that permit a more direct ap- 
proach to the pivotal question of the current 
research. By allowing some subjects their 
choice of method and withholding it from 
others, while keeping preference constant for 
all subjects, decision-making power, opera- 
tionally defined as choice of methods, is sepa- 
rated from preference. A pure test of the 
effects of the former is thus permitted. Under 
such circumstances different class conditions 
represent alternative settings for evaluating 
the initial choice-no choice manipulation. 

In addition, given classes with differences 
in amount of external structure, ongoing stu- 
dent determination of the nature of their 
learning experience would vary with the 
teaching method. However, the effects of this 
experimental manipulation, given the overall 
design of the study, are subject to certain 
confoundings. For one, the preference control 
implies that students in highly structured 
classes have opted to forego the choice of 
specific class activities. Their class perform- 
ance and attitudes may be largely determined 
by needs for a well-defined learning experi- 
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ence governed by instructor expertise rather 
than by concern for an ongoing, personal role 
in directing class procedure. For this reason 
observed performance or attitude differences 
between unstructured and structured groups, 
though relevant to the issue of decision- 
making power, would represent something 
less than a pure test of the effects of that 
variable. Prediction of effects related to class 
conditions based on the variable of decision- 
making power is thus tentative. 

The principal hypothesis of this investiga- 
tion is that students who are given decision- 
making power about the type of class they 
will have, regardless of the choice itself, 
should (a) perform better and (b) report 
greater satisfaction than those who are ac- 
corded their class preference through happen- 
stance. 

Student performance and satisfaction as 
related to class conditions alone are largely 
unpredictable due to the potential confound- 
ing of the independent variable. However, to 
the extent that ongoing determination of class 
activities can be demonstrated to be a salient 
issue for all students, those in unstructured 
classes with continuing self-determined choices 
should (a) perform better and (b) report 
greater satisfaction than those in structured 
classes. 


METHOD 
Subjects 


Subjects included 171 male and female students 
drawn from a spring semester introductory psychol- 
ogy course at the University of Rochester. Approxi- 
mately 74% were freshmen, 20% sophomores, and 
the remaining 6% juniors and seniors, For the major 
analyses, ns varied from 80 to 171, as further 
clarified below. 


Procedure 


The experimental manipulations were directed to 
50-minute recitation sections that characteristically 
served as a noncompulsory supplement to biweekly, 
75-minute lectures for the entire class of introductory 
students. These discussion sections averaged 15 
students each. 

Decision-making power. The entire class was pre- 
divided into two groups matched for sex, year level, 
and cumulative grade point average. Groups were 
matched at this point to increase the likelihood of 
being able to control for the same variables in subse- 
quent necessary subdivisions. One group, arbitrarily 
selected and hereafter referred to as the choice group, 
indicated which of two types of recitation sections 
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they preferred from written descriptions. The other 
half of the class was given neither a description of 
the two section types nor a choice; instead, it was 
assigned arbitrarily to sections, 

The experimenter met with both groups, sepa- 
rately, during the first and second halves of the 
second scheduled large lecture. Each group was told 
that the introductory psychology course was being 
evaluated for more effective learning approaches. The 
choice questionnaire eliciting preferences for type of 
recitation section was given only to the first half of 
the class, the choice group. It included a brief 
introductory paragraph describing current pressures 
from various groups in society for greater decision- 
making power and offered a choice of two types of 
recitation sections in recognition of student decision- 
making needs. Students were told that they had been 
arbitrarily selected to make this choice and that the 
other half of the group would not be allowed the 
same opportunity. Lack of availability of instructors 
for the two types of sections and difficulties in co- 
ordination of student, instructor, and room schedules 
were given as reasons for this decision. After making 
the choice rating, students indicated hours available 
for the discussion section. 

Students receiving no choice were given the same 
explanation and were also asked to fill out schedule 
sheets. At the first meeting of their randomly as- 
signed recitation section, before instruction began, 
instructors gave these students the same choice ques- 
tionnaire given to the choice group. They were told 
that even though they had already been assigned 
to a section, their responses would be useful in future 
course planning. They were assured that instructors 
would not look at their preferences. Classes follow- 
ing the predetermined method began immediately 
after submission of the completed questionnaires. 
Accordingly, there was no way these students could 
account for their actual assignments on the basis of 
their post hoc preferences. Only those students whose 
stated preference coincided with their actual assigned 
recitation section were included in the experimental 
groups. These subjects are henceforth referred to as 
the no-choice group. Thus, preference for a specific 
teaching method was controlled even though some 
students obtained their preference through choice 
and others through happenstance. 

Class conditions, Descriptions of the two types of 
recitation sections differed primarily in degree of 
student determination of class topics and procedures 
for _ investigating them, The structured recitation 
section consisted of 30-35 minutes of lecturing by 
the instructor sequentially following the topics out- 
lined in the course syllabus. The remaining 15-20 
abo Spe available for questions 

puti p lectures and discussion of the 
recitation leader’s lecture materials or currently as- 
signed readings. The unstructured sections had few 
formal guidelines. Five to 10 minutes were set asidi 
at the end of each class period during which Haa 
and their instructors decided both topics for th 
following class meeting and the method of eee TA 
tion. This procedure maximized the likelihood that 
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final decisions would reflect the interests of the ma- 
jority of class members. The only exception was the 
instructor’s theoretical right to veto a topic which, 
in his judgment, had no relevance to psychology. 
In fact, no instructor found it necessary to exercise 
this veto. Attendance for all recitation sections was 
noncompulsory. 

Sixty-two per cent of the choice subjects chose 
structured classes and 38% chose unstructured. 
Forty-seven per cent of the no-choice subjects indi- 
cated a preference for the former method and 53%, 
the latter. 

Instructors. Recitation leaders were psychology 
graduate students fulfilling a departmental teaching 
requirement by teaching two recitation sections. 
After having read a description of the two formats, 
each instructor opted for one of the approaches and 
was so assigned. As a result of discussions with the 
recitation leaders, it was evident that greater con- 
sistency in teaching across both sections would be 
achieved if the instructors employed one rather than 
both teaching methods. Four instructors chose the 
structured method and five, the unstructured method. 
All but one instructor was assigned one choice and 
one no-choice section to be conducted according to 
his preferred teaching method. One instructor pre- 
ferring the unstructured method was assigned two 
no-choice sections, as fewer no-choice students were 
expected to prefer the unstructured method on the 
basis of the distribution of structured and unstruc- 
tured choices among the choice subjects. Therefore, 
a larger number of no-choice-unstructured sections 
was needed to insure that a sufficient number of 
students in this condition also preferred the un- 
structured method. All instructors met with the ex- 
perimenter before the first scheduled recitation sec- 
tions to clarify the details of their teaching methods. 


Experimental Design 


The independent variables, structured—unstructured 
and choice-no choice, produced a 2 X 2 factorial de- 
sign with cell ms ranging from 21 to 58. The ms were 
smaller for no-choice conditions than for choice con- 
ditions because no-choice students whose preferences 
did not coincide with their actual assignments were 
not included in the study. They did, however, par- 
ticipate in recitation sections that included experi- 
mental subjects. The four conditions were matched 
for sex, year level, and cumulative grade point aver- 
age. Recitation sections met from 7 to 11 times 
during the semester. The differences in number of 
meetings were due to instructor illness and to the 
decision of several recitation leaders (structured and 
unstructured) to support a brief university-wide stu- 
dent boycott of classes affecting one class meeting. 
The average number of class meetings was the same 


for the four conditions. 


Measures 

Performance measures: course examinations. Nu- 
merical scores from two 50-item hourly examinations 
and a 100-item final examination, all multiple choice, 
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were used individually and as a sum to measure 
academic performance. All test items were drawn 
solely from the assigned readings by the recitation 
leaders and the course instructor. 

Performance measures: attendance. Student attend- 
ance at each recitation mecting was kept by instruc- 
tors. Since different numbers of meetings were held 
by different groups, percentage of meetings attended 
was tabulated for each student and used as a 
performance index. 

Attitude measures: presatisfaction. A single 7-point 
rating scale measuring overall satisfaction with 
recitation sections was administered during the large 
lecture period following the start of all recitation 
sections. Some sections had met twice by this time, 
while others completed only one class period. Anchor 
points were labeled at both extremes (very satisfied- 
very unsatisfied) with a score of 7 indicating greatest 
satisfaction. This item was administered as part of 
a general questionnaire by a faculty member who 
indicated that he was studying students’ early reac- 
tions to various courses, including introductory 
psychology. 

Attilude measures: course evaluation. This ques- 
tionnaire consisted of thirteen 7-point rating scales 
related to the student’s experience in introductory 
psychology drawn from two course evaluation sur- 
veys used at the university. Only four items pertain- 
ing to students’ satisfaction with recitation sections 
and ratings of recitation leaders’ performance were 
selected for analysis in the present study. These items 
were especially pertinent because they involved ques- 
tions equally relevant to both learning situations. 
The instrument was administered to students present 
at the last main class. Since attendance at that 
meeting was poor, evaluation sheets were left in an 
envelope tacked to the undergraduate psychology 
bulletin board, and a request was made for those 
who had not completed the form to do so. Forty- 
two additional questionnaires were received during 
the course of the following week. 

Attitude measures: importance rating. A single 
7-point rating scale attached to the bottom of the 
choice questionnaire asked students to judge the 
importance to them of the choice of recitation 
sections; a score of 7 indicated maximal importance. 

Process measure. This eight-item measure included 
two questions related to recitation topics and six 
dealing with their mode of presentation. After each 
section meeting, instructors indicated the degree to 
which class topics related to current material listed 
in the course syllabus and who determined the choice 
of topics. They also indicated the percentage of 
time spent in each of the six mode-of-presentation 
process categories. These data provided the basis for 
checking the actual conduct of the different teaching 
methods. 


RESULTS 


Since substantive analyses were dependent 
on the demonstration of actual differences in 
content and conduct of the structured and 
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TABLE 1 
COMPARISON OF STRUCTURED AND UNSTRUCTURED RECITATION SECTIONS ON THE 
Instructor-RAtep Process MEASURE 
Structured Unstructured 
Scale 1 
M SD M SD 
Material relatedness 5.20 2.12 3.19 2.12 6.07** 
Material determination 2.18 1.60 4.66 1.70 —9.61** 
Lecture by instructor (%) 48.86 37.36 24.95 29.61 E A 
Lecture by subjects (%) .00 -00 1.95 4.77 —3.68** 
Discussion: subjects and instructor (%) 17.14 24.43 26.93 22.59 —2.00"* 
Discussion: among subjects (%) 1.73 5.51 17.06 19.76 —6.62"" 
Questions: subjects ask instructor (%) 16.71 19.77 14.67 16.89 71 
Questions: instructor ask subjects (% T33 12.11 5.90 6.80 94 
Note, For structured and unstructured conditions ns are 86 and 79, respectively. So 
ED < OL. 


F 


unstructured sections, these data are presented 
initially. 


Process Measure 


Table 1 presents means and sigmas for 
structured and unstructured sections for eight 
process measures as well as ¢ ratios testing 
the significance of the differences between 
group means on these measures. Six of these 
eight measures yielded statistically significant 
ts. Thus, structured classes dealt more with 
course-related topics determined largely by 
the instructor, allowed a greater percentage 
of time for formal lectures by the instructor 
but less time for student lectures, and allowed 
less time for discussion among students or 
between students and instructor than did un- 
structured classes. No significant differences 
were found on percentage of time spent with 


TABLE 2 


CELL MEANS ror ANALYSIS OF VARIANCE FOR 
PERFORMANCE Measures: EXAM GRADES 
AND ATTENDANCE 


Cells 
Measure 

CH-ST CH-U_ | NCH-ST | NCH-U 
First exam* 36.08 | 34.86 33.95 35.04 
Second exam* | 37.43 | 36.43 | 34.67 35.17 
Final exam? 70.34 | 68.27 | 65.67 67.63 
Totale 143.86 | 140.56 | 134.24 | 137.85 
Attendance 52.09 | 55.06 50.52 50.98 


Nole. CH indicates choice conditions and NCH, no-choice 
conditions; ST indicates structured and U, unstructured 
conditions, For all analyses, n = 58 for the CH-ST group, 
n = 51 for the CH-U group, n = 21 for the NCH-ST group, 
and % = 41 for the NCH-U group. 

a 50 items, 

b 100 items. 

© 200 items, 


questions directed at either students or lead- 
ers. To determine whether or not choice 
and no-choice classes, both structured and 
unstructured, were conducted comparably, 
similar analyses were performed comparing 
choice-structured classes with no-choice— 
structured classes and choice—unstructured 
with no-choice-unstructured classes. Neither 


TABLE 3 


ANALYSIS OF VARIANCE FOR PERFORMANCE MEASURES: 
EXAM GRADES AND ATTENDANCE 


Source | df MS F 
First exam* 
ST-U (A) 1 37 OL 
CH-NCH (B) 1 34.90 719 
AXB 1 48.81 1.10 
Error 167 44.27 
Second exam* 
ST-U (A) 1 2.57 .07 
CH-NCH (B) 1 148.64 4.28* 
AXB 1 19.82 Si 
Error 167 34.71 
Final exam? 
ST-U (A) 1 37 05 
CH-NCH (B) 1 258.74 3.51 
AXB 1 146.80 1.99 
Error 167 73.63 
Totale 
ST-U (A) 1 YA) .002 
CH-NCH (B) 1 1,394.60 3.87 
AXB 1 438.98 1.27 
Error 167 360.09 
Attendance | 
ST-U (A) 1 107.90 20 
CH-NCH (B) 1 292.87 54 
AXB 1 57.62 -Ll 
Error 166 544.99 


Note.—ST indicates structured and U indicates nstructured ; 
CH indicates choice and NCH indicates no-choice conditions, 
For all analyses, the mean is the harmonic mean ña = 36.70. 

b < 05. 

^ 50 items. 

b 100 items. 

© 200 items, 
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of these sets of analyses yielded significant 
differences. 

The process data thus support the conclu- 
sions that (a) the two methods did, in fact, 
differ in ways that were consistent with their 
prior descriptions and (b) sections based on 
the same teaching method were identical 
regardless of their choice—-no-choice status. 


Performance Measures 


The first group of hypothesis-relevant 
analyses tests the effects of choice and class 
condition on actual performance. Tables 2 
and 3 present subgroup means and analyses 
of variance Fs, respectively, for each of the 
three exam measures, total exam performance, 
and percentage attendance. These analyses 
were based on a 2 X 2 factorial design em- 
ploying a harmonic mean (Winer, 1962) to 
take account of unequal cell frequencies. 

No significant differences between class 
conditions (structured versus unstructured) 
were found in any of these analyses. On the 
second exam, however, choice subjects scored 
significantly higher than did no-choice sub- 
jects. Directionally similar, but nonsignifi- 
cant, choice-no-choice differences also oc- 
curred in the final exam and for the course 
total, with both .10 > p > .05. This trend is 
particularly noteworthy in view of the fact 
that it conforms directionally with the major 
hypothesis of the study, and the F test used 
with directional hypothesis is conservative 


TABLE 4 


CELL MEANS FOR ANALYSIS OF VARIANCE FOR THE 
EARLY SEMESTER SATISFACTION AND THE 
COURSE EVALUATION MEASURES 


Cell 


Measure 
CH-ST | CH-U | NCH-ST | NCH-U 


Early semester: 


satisfaction 3.50 | 6.03 3.79 5.03 
rse evaluation: 
aeon 3.20 | 4.50 2,23 3.59 


Course evaluation: ii 
i r's overal 
Eii. 3.80 | 5.20 3.69 4.47 

Course evaluation: 
instructor's ability 


to stimulateinterest | 3.53 4.40 2.69 3.29 


indicates choice and NCH indicates no-choice 
P o indicates unstructured and ST indicates 
structured. A s A 6 
ster satisfaction analysis, n = 44 for the 
re an neo for the CH-U group, n = 19 for the 
NCELST group, and # = 34 for the NCH-U group. For all 
course evaluation analyses, » = 30 for the CH-ST group, 
n= 20 for the CH-U group, » = 13 for the NCH-ST group, 
and = 17 for the NCH-U group. 


TABLE 5 


ANALYSIS OF VARIANCE FOR THE EARLY SEMESTER 
SATISFACTION AND THE COURSE 
EVALUATION MEASURES 


Source df MS F 
Early semester: satisfaction 
ST-U (A) 1 | 103.20 | 46,47** 
CH-NCH (B) 1 3.77 | 1.70 
AXB 1 8.99} 4.05* 
Error 123 2.22 
Course evaluation : 
satisfaction 
ST-U (A) 1 | 33.32 | 11.07" 
CH-NCH (B) 1 | 16.07} 5.50* 
AXB 1 00} 00 
Error 76 2.92 
Course evaluation : 
instructor’s overall 
ability 
ST-U (A) 1 | 21.73 | 10.01** 
CH-NCH (B) 1 3.29.| 1.52 
AXB 1 2.92 | 1.35 
Error 76 2.17 
Course evaluation : 
instructor’s ability to 
stimulate interest 
ST-U (A) 1 | 10.04| 2.79 
CH-NCH (B) 1 | 17.35} 4.83* 
AXB 1 0.18 | .05 
Error 76 3,59 


Note. For all analyses the mean is the harmonic mean; 
ña = 28.99 for early semester satisfaction, and fa = 18.26 
for course evaluation measures. ST indicates structured and U 
indicates unstructured; CH indicates choice and NCH in- 
dicates no-choice conditions, 

5. 


(Hays, 1966). There were no significant dif- 
ferences among groups on the attendance 
measure, 


Attitude Measures 


A second set of outcome analyses uses stu- 
dent perceptions of their experience as the 
criteria. Tables 4 and 5 present subgroup 
means and Fs, respectively, for the early 
semester ratings of student satisfaction and 
three of the four end-of-semester course 
evaluations using the same analytic model 
described for the performance measures. For 
these analyses, however, ms were reduced, 
since some sutdents did not complete all 
forms. Ratings of the overall value of 
recitation meetings are not included in the 
tables, as they essentially duplicate the ratings 
of end-of-semester satisfaction (interitem 
r= +81, p < 01). 

Subjects in unstructured classes reported 
significantly greater satisfaction with their 
sections on the initial rating than did sub- 
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jects in structured classes. Students in the 
choice-unstructured condition also reported 
greater satisfaction than those in the no- 
choice-unstructured group. 

On the end-of-semester satisfaction ratings, 
subjects in unstructured classes reported sig- 
nificantly greater satisfaction than those in 
structured classes, as did choice subjects in 
comparison with no-choice subjects. This pat- 
tern of significant differences was replicated 
with students’ ratings of the overall value of 
their classes. Ratings of instructors’ overall 
teaching ability were higher for those in un- 
structured classes than those in structured 
sections, and choice subjects rated their in- 
structors’ ability to stimulate interest higher 
than did no-choice subjects. It is of interest 
to note, directionally, that the highest scoring 
subgroup for each of these five satisfaction 
measures was choice-unstructured. However, 
correlations among the four course evaluation 
scales ranged from +.51 to +.81, PS 01, 
which suggests item overlap. 

With respect to ratings of the importance 
of having a choice of recitation sections, 
subjects preferring unstructured classes at- 
tached significantly greater importance to 
self-determination of recitation sections than 
did subjects preferring the structured condi- 
tion, p < .01. No differences were found on 
this measure between subjects in the choice- 
no-choice conditions. 


Discussion 


The central hypothesis of the study was 
that students who were given decision-making 
power about the teaching mode of their dis- 
cussion sections would perform better and 
report more favorable attitudes than would 
9 those who were arbitrarily assigned to a 
teaching mode coinciding with their prefer- 
ence by chance. The main choice-no-choice 
effect was used to test this hypothesis. Sig- 
nificant differences between the groups on one 
of the four exam measures, and near signifi- 
cant differences on two others, offer at least 
partial confirmation of the hypothesis. Abso- 
lute differences between choice and no-choice 
were not, however, large. Considering the 
brevity of the experimental exposure (7-11 
weeks), control of preference patterns, com- 
mon attendance for 24 hours per week at 
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large lectures, and the use of objecti 
book examinations, the obtained result 
to be meaningful. 

That choice students tended to perform 
better than the no-choice students, particu- 
larly during the middle and latter portions of 
the semester, could not have been antici- 
pated; indeed, if anything, it might have been 
assumed that participation in a key decision 
such as choice of recitation section would 
have been most likely to produce more im- 
mediate effects (i.e., the first exam) on per- 
formance. On the other hand, all students did 
not actually start attending recitation sections 
until the third or fourth week of the semester. 
Moreover, it is conceivable that the demands 
of adjusting to a new course may have been 
more relevant to performance on the first 
exam than the effects of the choice-no-choice 
manipulation. The superior performance of 

the choice group on the later examinations is 
consistent with findings by Coch and French 
(1948) and Morse and Reimer (1956), who 
measured performance up to 2 years after re- 
distribution of decision-making power. Unlike 
these studies, however, the present investiga- 
tion controlled preference; consequently, the 
present data support the view that it is the 
way in which preference accrues (i.e., through 
acquisition of decision-making power rather 
than happenstance) that affects performance. 

Data on students’ satisfaction with their 
recitation sections, their judgment of the value 
of these classes, and their perception of the 
instructor’s ability to stimulate interest all 
support the basic hypothesis. That these find- 
ings are so strongly consistent across several 
scales may in part be a function of the non- 
independence of these measures. 

The absence of choice—no-choice differences 
for subjects in structured sections on the 
initial rating of satisfaction may reflect lesser 
importance attached to the initial choice by 
students in structured classes than by stu- 
dents in unstructured sections. On the other 
hand, the significant difference between re- 
ported satisfaction at the end of the semester 
for choice-structured and no-choice-struc- 
tured subjects may be a result of the freedom 
of students in the latter condition to report 
a decrease in satisfaction, having had no re- 
sponsibility for choosing their classes. This 


text- 


PERFORMANCE AND SATISFACTION 


explanation follows from the cognitive-disso- 
nance model as discussed by Brehm and 
Cohen (1962). A negative experience is less 
likely to be reported as favorable if partici- 
pation in the activity is perceived as occurring 
more through the efforts of others than 
through volition, No-choice subjects were, in 
fact, arbitrarily assigned to recitation sections, 
whereas choice subjects received a choice. 

While a major purpose of the two class 
conditions was to allow replication of the 
choice-no-choice manipulation, some concern 
for the direct effects of this independent vari- 
able was warranted. The absence of signifi- 
cant performance differences between struc- 
tured and unstructured groups suggests that 
potential effects related to differences in 
amount of student control over class activities 
in these sections were mitigated by unanimous 
student participation in recitation sections 
self-defined as best suited to their needs. At 
the level of objective textbook examinations, 
the two teaching methods were equally effec- 
tive for students whose learning styles were 
apparently complemented by them. This find- 
ing is consistent with the conclusion of other 
researchers (Fry, 1972; Noy & Hunt, 1972) 
that there may not be one “best” teaching 
method but, rather, different optimal learning 
conditions for different students. 

Attitude findings, however, indicate that 
the two class conditions had differential ef- 
fects on student satisfaction. That subjects 
in unstructured sections consistently reported 
more favorable attitudes about their recita- 
tion experience is somewhat perplexing in 
view of the apparent effect on performance of 
matching students’ needs and teaching meth- 
ods. One explanation is that instructors in 
unstructured classes might simply have been 
better teachers than those in structured sec- 
tions and that positive student perceptions of 
the unstructured teachers were generalized to 
the total recitation experience. Such an in- 
terpretation, while theoretically admissible, 
does not seem probable on several counts: 
(a) The early semester satisfaction difference 
between subjects in structured and unstruc- 
tured classes was based on ratings made 
essentially before students had a chance to 
evaluate their instructors; (b) attendance did 
not differentiate between class conditions as 
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it might have were there marked differences 
in performance of instructors; (c) exam per- 
formance for the two groups was comparable; 
and (d) all instructors were using their pre- 
ferred method of teaching and were partici- 
pating primarily because of a departmental 
requirement. 

An alternative explanation of these attitude 
differences is that subjects in structured sec- 
tions may have been more concerned than 
those in unstructured. sections about maxi- 
mizing their chances for a good grade in the 
course and less willing to risk perceived un- 
certainty in an unstructured class to obtain a 
more satisfying experience. Subjects in un- 
structured sections, by contrast, may have 
been stylistically more suited to unstructured 
classes and thus better able to take advantage 
of the personally satisfying experience of di- 
recting their own studies. The issue of risk 
is, thus, the core of this argument. ‘The 
risk of performing poorly may have deterred 
some students (those in structured classes) 
from a potentially more satisfying learning 
experience. 


Implications 


The scope of this study is sufficiently lim- 
ited to dampen unrestrained generalization of 
its findings to broader social problems. The 
major limitations concern the comparability 
of the subject population with other groups 
for whom decision-making power may be a 
salient issue and the narrow specificity of the 
experimental setting. The focal issue ad- 
dressed in the present research, however, is 
the phenomenon of decision-making control 
which, in the view of Rubenstein and Lass- 
well (1966), constitutes one common concern 
of students, blacks, women, and psychiatric 
patients. 

In the spirit of cautious speculation, the 
present findings may have implications for 
broader social problems. Decision-making 
power in salient areas may, of itself, consti- 
tute a valuable experience independent of the 
opportunity it permits to select preferred 
alternatives. Recalling the importance of au- 
tonomy for positive mental health as dis- 
cussed by Jahoda (1965), it can be inferred 
that decision-making power enhances the in- 
dividual’s sense of competency and feeling of 
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mastery. Improved performance and increased 
satisfaction may therefore result from in- 
creased self-esteem. The present findings thus 
support theoretical discussions of poverty 
groups leading to proposals for intervention 
based on social action models (Haggstrom, 
1964; Reiff, 1966; Riessman, 1967). 
Future investigations are clearly needed to 
test these inferences. For example, the view 
that decision-making power and subsequent 
performance and satisfaction are mediated by 
a heightened sense of importance or adequacy 
must be measured in a variety of situations of 
increased decision-making power. Such in- 
vestigations replicated in different contexts as, 
for example, with low-income whites and 
blacks in work settings, would help to estab- 
lish limits for generalizing the findings of this 
study and, more importantly, contribute to 
the narrowing of a tremendous gap between 
pressing social problems and knowledge of 


adequate solutions. 
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Cooperativeness and Social Perspective Taking 


David W. Johnson 
University of Minnesota 


The ability to take the perspective of other individuals is commonly assumed 
to be one of the most important aspects of cooperative interaction. While a 
number of theorists have posited that cooperativeness and social perspective 
taking are highly related, there is no direct evidence confirming such an 
assumption. In this study, predisposition to cooperate was compared with 
ability to take the perspective of other individuals both physically and emo- 
tionally. A series of tasks indicating predisposition to cooperate and ability in 
social perspective taking was given to 24 fourth-grade students in a midwestern 
suburban elementary school. The results indicate that there is no relationship 
between predisposition to cooperate and ability to take the physical perspective 
of other individuals, but a strong relationship exists between predisposition to 
cooperate and ability to take the emotional perspective of other individuals. 
No relationship was found between ability to take the physical and emotional 
perspective of other individuals. These results confirm a basic theoretical 
assumption in cooperation theory and empirically link the research and theory 


on cooperation and social development. 


Cooperation and social perspective taking 
are central processes in almost all human 
interaction. It has long been assumed that in 
order to cooperate with other individuals a 
person must have the ability to take the 
perspective of oneself and others in a joint 
situation. In order to discuss this assumed 
relationship between cooperative interaction 
and social perspective taking both must be 
defined. Cooperative interaction is the coordi- 
nation of behavior among individuals to 
achieve mutual goals. Taking the perspective 
of another person is the ability to understand 
how a situation appears to another person and 
how that person is reacting cognitively and 
emotionally to the situation. It is the ability 
to put oneself in the place of others and 
recognize that other individuals may have 
points of view different from one’s own. It is 
often referred to as “role taking” in the social 

' cognitive development literature. Although 
there is some indirect evidence that a rela- 
tionship exists between cooperation and social 
perspective taking (Johnson, 1971; Kelley & 
Stahelski, 1970), the relationship has not 
been directly validated. 


Requests for reprints should be sent to Dr. David 
W. Johnson, Department of Psychological Founda- 
tions, 330 Burton Hall, University of Minnesota, 
Minneapolis, Minnesota 55455. 
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While cooperation is consistently defined in 
research endeavors, social perspective taking 
has been operationalized in several different 
ways and the assumption that a single factor 
(decentering) underlies all operationalizations 
has not received clear support or refutation 
(Cowan, 1966; Irwin, 1973; Rubin, 1973; 
Sullivan & Hunt, 1967). Empirical studies 
have typically focused on a single dimension 
—the ability to take another’s visual perspec- 
tive. In cooperative interaction the ability to 
perceive accurately how others are feeling and 
to understand why they are feeling like they 
do is as important (if not more so) as the 
ability to perceive how they view the situa- 
tion. 

The present study is an attempt to test 
directly the proposition that the relationship 
between cooperativeness and social perspective 
taking exists by comparing the social perspec- 
tive taking abilities of individuals predisposed 
to behave cooperatively with those of indi- 
viduals predisposed to behave competitively. 
Two types of social perspective taking mea- 
sures are included in the study, one focusing 
on perceptual perspective taking and one fo- 
cusing on affective perspective taking. It is 
hypothesized that cooperators demonstrate 
more ability in taking the perceptual and 
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TABLE 1 


MEAN RESPONSE or HIGH AND Low CoOPERATORS ON 
PERSPECTIVE TAKING TASKS 


High Low 
Variable [$$ S t 
M SD M SD 
Perceptual 
perspective 11.08 | 8.49 9.50 | 7.03 | 0.50 
Recognition of 
feeling 2.58 | 1.50 1.25 | 1.36 | 2.28* 
Motive for 
feeling 4.17 | 3.24 | —1.42 | 2.37 | 4.15** 
Total for feeling | 6.42 | 4.37 | —0.17| 3.15 | 4.22** 


Note. n = 12 in each condition; the higher the score, the 
more accurate the perspective taking. 
*p <.05. 
** p < 001. 


affective perspective of others than competi- 
tors. 


METHOD 


The subjects in this study were all white, middle- 
class fourth-grade students in a suburban elementary 
school in a midwestern metropolitan area. Eleven 
were 9 years old, 12 were 10 years old, and 1 was 
11 years old. Fifteen were female and nine were 
male. Students from two different classrooms were 
used. Two white female interviewers were used, one 
a masters level student in elementary school counsel- 
ing and the other a freshman in college. Each jnter- 
viewed 12 subjects. Both received 6 hours of train- 
ing and supervised practice on conducting fourth 
graders through the tasks used. Neither interviewer 
knew the hypothesis of the study. 

Two variables were examined in the study: predis- 
position to behave in a cooperative or a competitive 
manner and ability to take the social perspective of 
another person, Predisposition to behave in a coop- 
erative manner was defined as behaving to maximize 
both another person’s and one’s own outcomes. 
Predisposition to behave in a competitive manner 
was defined as behaving to maximize one’s own 
outcomes while minimizing another person’s out- 
comes. Three tasks were given each subject to 
operationalize this variable; scores on the three 
tasks were added together to obtain a single score for 
predisposition to cooperate or compete. The tasks 
were a marble-pull task developed by Madsen 
(1971), a picture-rating task developed by Crocken- 
berg and Bryant (Note 1), and a choice-cards task 
developed by Kagan and Madsen (1972). Social 
perspective taking is defined as the ability to take 
the perceptual and affective perspective of another 
person. Two tasks were used to operationalize social 
perspective taking: a perceptual perspective task 
developed by Cooper and Flavell (in press) and an 
affective perspective task developed by Rothenberg 
(1970). In the perceptual perspective task the ex- 
perimenter placed a display of wooden blocks be- 
tween herself and the subject, placed matching 
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blocks on their side at the subject’s left, and in- 
structed the subject to “place your blocks so they 
look to you like my blocks look to me.” Five dis- 
plays were used. In the affective perspective task 
the subject listened to a tape recording focusing on 
a common emotion and was asked to describe how 
the actor felt and why he felt the way he did. Four 
different recordings were used. 

The specific procedure for the study was as fol- 
lows: First the two experimenters walked to the 
classroom from which the subjects were drawn. 
They selected two subjects who did not know each 
other well. They then conducted the subjects to 
interview rooms. The subjects first played the mar- 
ble-pull task together. They were then separated 
into private rooms and were given the picture-rating 
task, the choice-cards task, the perceptual perspec- 
tive task, and the affective perspective task, in that 
order. The subjects were then conducted back to 
their classroom. After all the subjects had been 
interviewed, they were each given a prize for being 
in the study and the nature and purpose of the 
study were explained. 


RESULTS 


The results of the study can be analyzed 
in two ways: (a) on the basis of the total 
cooperation score, subjects can be divided 
into high and low groups and ¢ tests con- 
ducted on the responses to the social perspec- 
tive taking tasks and (b) correlations can be 
conducted among the variables. The data is 
presented in both ways. From Table 1 it can 
be seen that there is no significant difference 
between the high- and low-cooperator groups 
on ability to take the perceptual perspective 
of another person. There are, however, con- 
sistent differences in ability to recognize and 
understand feelings of other individuals, Com- 
pared to low cooperators, high cooperators 
are more skilled in recognizing feelings in 
others (t= 2.28, p< .05), understand the 
reasons why the person feels the way he does 
(t= 4.15, p < .001), and thus are signifi- 
cantly more skilled in taking the affective 
perspective of others (£= 4.22, p < .001). 
Thus the hypothesis is confirmed in terms of 
affective perspective taking but not for per- 
ceptual perspective taking. 

_ The correlations indicate that predisposi- 
tion to behave cooperatively is not signifi- 
cantly related to skill in taking the perceptual 
perspective of other individuals but is signifi- 
cantly related to skill in taking others’ affec- 
tive perspective (recognition of feelings, r = 
34, p < .05; motive for feelings, r=.57, p 


CooPERATIVENESS AND SOCIAL PERSPECTIVE TAKING 


< .002; total for feelings, r = .57, p < .002). 
It is interesting to note, furthermore, that 
there is no correlation between skill in taking 
another’s perceptual perspective and affective 
perspective (r = —.03). Ability. to recognize 
feelings accurately and to understand the 
reasons for the feelings are significantly cor- 
related (7 = .52, p < .005). 


Discussion 


The results of this study indicate a strong 
relationship between predisposition to coop- 
erate and ability to take the affective perspec- 
tive of other individuals. These data confirm 
the assumed relationship between cooperation 
and social perspective taking posited in the 
theories of Mead, Piaget, Asch, Heider, 
Deutsch, Madsen, Kohlberg, and other social 
psychologists. The data also empirically tie 
the theory of cooperation more directly with 
the theory of social perspective taking and 
empathy. 

The confirmed relationship between coop- 
erativeness and ability to take the emotional 
perspective of other individuals implies that 
one way to increase individuals’ abilities to 
take the affective perspective of others is 
through experience in cooperative interactions. 
Since theoretically affective perspective taking 
abilities are necessary for cooperative inter- 
action, repeated cooperative interaction in- 
creases a person’s affective perspective taking 
skills. Thus, individuals interested in devel- 
oping affective perspective taking abilities in 
children may wish to emphasize cooperative 
goal structures in schools and children’s play 
activities. 

There are conflict situations, furthermore, 
in which one may wish to increase the coop- 
erativeness of the interaction among individu- 
als. Devising procedures for increasing the 
understanding of the others’ affection per- 
spectives may be a key to increasing coopera- 
tive interaction in a conflict situation. John- 
son (1971) conducted a series of experimental 
studies on role reversal, a procedure for fa- 
cilitating social perspective taking. He found 
that when role reversal is used in conflict 
situations, it induces cooperative behavior 
between participants, clarifies misunder- 
standings concerning participants’ positions, 
increases understanding of the other's posi- 
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tion, increases one’s ability to perceive the 
issue from the other’s perspective, results in a 
reevaluation of the issue and a change of atti- 
tude concerning the issue, and results in the 
person engaging in role reversal being per- 
ceived as attempting to understand the oth- 
er’s position, as an understanding person, as 
willing to compromise, as cooperative, and as 
trustworthy. Thus there is considerable evi- 
dence that role reversal facilitates the devel- 
opment of cooperative interaction among indi- 
viduals. 

The theorists in cooperation and social cog- 
nitive development state that social perspec- 
tive taking abilities are important for coop- 
erative interaction; they do not, however, 
specify which dimensions or types of social 
perspective taking are crucial. Flavell (1968) 
hypothesizes that perceptual perspective tak- 
ing may be easier for young children than is 
psychological perspective taking involving 
thoughts and feelings, and the work of Dy- 
mond, Hughes, and Raabe (1952) support 
this hypothesis. Thus it may be that percep- 
tual perspective taking is a prerequisite for 
the development of the ability to take another 
person’s affective perspective. Irwin (1973), 
however, found evidence that affective per- 
spective taking ability may emerge before 
perceptual perspective taking ability in young 
children, significantly confusing the line of 
reasoning above. Irwin also found no signifi- 
cant relationship between perceptual and 
affective perspective taking abilities, and per- 
ceptual perspective taking was unrelated to 
moral judgment while affective perspective 
taking was significantly related. 

The results of this study indicate that per- 
ceptual and affective perspective taking are 
unrelated: A child who is skilled at one may 
not be skilled at the other. This study also 
demonstrates that affective perspective taking 
ability is related to cooperativeness while 
perceptual perspective taking ability is not. 
On the basis of these results and the work of 
Irwin (1973), it can be tentatively concluded 
that perspective taking abilities are not repre- 
sentative of a single dimension of the ability 
to “decenter” and perceptual perspective tak- 
ing ability which has received so much atten- 
tion in past research is not related to such 
basic social processes as cooperation and 
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moral judgment. Future research in this area 
should focus upon the taking of psychological 
perspectives dealing with cognitions, motives, 
intentions, affects, and the like, rather than 
upon the taking of perceptual perspectives. 
Or, alternatively, the conditions under which 
perceptual perspective taking is related to 
cooperation and other types of perspective 
taking abilities should be demonstrated. It 
is also time that the basic theorizing of such 
individuals as Mead, Asch, Deutsch, Heider, 
Piaget, and Kohlberg be modified to take into 
account the complexity of social perspective 
taking and the different types of abilities 
involved in taking another person’s psycho- 
logical perspective. 
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Physical Attractiveness, Social Relations, and Personality Style 


Dennis Krebs and Allen A. Adinolfi 
Harvard University 


The relationship between physical attractiveness, social contact with members 
of the same and opposite sex, and personality factors was determined. Sixty 
male and 60 female university freshmen who were rejected, accepted, or un- 
known to their same-sex peers served as subjects. A reliable rating of physical 


attractiveness by independent judges s 


howed that rejected subjects were most 


attractive, accepted subjects were next most attractive and unknown subjects 
were least attractive. There was a positive relationship between physical attrac- 
tiveness and dating for females, but not for males. A factor analysis of sub- 
jects’ personality inventory scores and subsequent analyses of variance of per- 


sexes were independent, achieving, 


affiliative and affectionate; and isolate 


defensive, and withdrawn. 


Research on person perception and social 
influence has shown that people commonly 
elieve that aspects of external appearance 
exert a minor effect on impression formation 
nd social responsivity. In the words of Ber- 
cheid and Walster (1972) there is “a demo- 
cratic belief that appearance is a superficial 
and peripheral characteristic with little influ- 
ence on our lives.” (p. 43) As researchers 
such as Hudson and Henze (1969), Perrin 
(1921), and Tesser and Brodie (1971) have 
hown, observers are particularly reluctant to 
admit that their reactions are influenced by 
physical attractiveness. Yet, there is a grow- 
ing body of evidence which shows that the 
average person drastically underestimates the 
influence of physical attractiveness on his 
social behavior. 

Several studies have found that observers 
impute desirable traits to the physically at- 
tractive and undesirable traits to the physi- 
cally unattractive (Byrne, London, & Reeves, 
1968; Dion, Berscheid, & Walster, 1972; 
Walster, Aronson, Abrahams, & Rottmann, 
1966). In particular, Miller (1970) found 
that the physically attractive were judged to 
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sonality factors by sociometric groups 
and 
d subjects were emotionally constricted, 


showed that rejected subjects of both 


ambitious; accepted subjects were 


be more likable, friendly, confident, sensitive, 
and flexible than the physically unattractive. 
Other studies have found that subjects prefer 
to interact with potential social partners who 
are physically attractive. For example, Bris- 
lin and Lewis (1968), Byrne, Ervin, and 
Lamberth (1970), Tesser and Brodie (1971), 
and Walster et al. (1966) found that the 
physically attractive are more desirable than 
the physically unattractive as dating partners. 
Although past studies consistently have 
found that physical attractiveness affects im- 
pression formation and social contact with the 
opposite sex, the extent to which their find- 
ings can be generalized from the experimental 
contexts in which they occurred to everyday 
life remains undetermined. Many studies 
(e.g. Byrne et al., 1968; Dion et al., 1972; 
Miller, 1970; Perrin, 1921) failed to employ 
actual people as stimuli. Physical attractive- 
ness was displayed as an adjective or an 
aspect of a picture. Other studies (e.g. Brislin 
& Lewis, 1968; Byrne et al., 1970; Walster 
et al., 1966) allowed only short-term contact 
between subjects in relatively artificial situ- 
ations such as prearranged computer dates; 
and almost all past studies restricted their 
investigation to the effect of physical attrac- 
tiveness on members of the opposite sex. 
The present study attempted to extend the 
generality of past research by testing the rela- 
tionship between physical attractiveness and 
social contact between members of the oppo- 
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site sex in the natural dating world and 
members of the same sex in a real-life setting. 
Tt also measured for the first time (cf. Ber- 
scheid & Walster, 1974) the personalities of 
the physically attractive and unattractive in 
order to test the validity of the traits observ- 
ers attribute to them. It tested whether the 
physically attractive are actually more likable, 
friendly, confident, sensitive, and flexible 
(Miller, 1970) than the physically unattrac- 
tive and explored the implications of the 
veridicality or lack of veridicality of the 
attributions. 


METHOD 
Subjects and Sociometric Group Placement 


Subjects were 60 males and 60 females previ- 
ously selected by Adinolfi (1970) from a group of 
600 university freshmen. Subjects were selected on 
the basis of the sociometric ratings they received 
from their same-sex dormmates after two months on 
campus. The sociometric test included probes such 
as “Would want as a roommate” and “A person 
you try to avoid.” Subjects were divided into four 
groups. The 30 subjects (15 male and 15 female) 
who were mentioned most frequently in response to 
favorable probes were placed in the Accepted Group. 
The 30 subjects who received an approximately 
average (mostly favorable) number of nominations 
were employed as a Control Group. The 30 subjects 
who were mentioned most frequently in response to 
unfavorable probes comprised the Rejected Group. 
The 30 subjects who were mentioned least (in re- 
sponse either to favorable or unfavorable probes) 
were placed in the Isolated Group. 


Ratings of Physical Attractiveness 


Eight raters (four male and four female) who 
were blind to the sociometric status of the subjects 
rated high school graduation photographs of sub- 
jects on a scale from 1 (extremely unattractive) to 9 
(extremely attractive). The Kendall coefficient of 
concordance for male raters was .71; for female 
raters, .69; and for all raters, .13. The physical 
attractiveness score for each subject consisted of 
his mean rating from the eight raters. 


Measures of Personality, Interpersonal 
Orientation, Academic Achievement, 
and Social Contact 


Subjects responded to several personality question- 
naires: the Jackson Personality Research Form 
(PRF) (Jackson, 1967), the Occupational Intro- 
version-Extraversion Scale of the Strong Vocational 
Interest Blank (SVIB-OIE) (Campbell, 1966); and 
the Fundamental Interpersonal Orientations Test, 
Form B (FIRO) (Schutz, 1960). Several measures 
of academic performance, such as verbal and mathe- 


DENNIS KREBS AND ALLEN A. ADINOLFI 


matical Scholastic Aptitude Test scores and first 
year college grade average were obtained; and, 
finally, subjects were interviewed about their inter- 
personal relations and social activities. 


RESULTS 


Physical Attractiveness and Social Contact 
with the Opposite Sex 


Social contact with the opposite-sex was 
defined as the sum of high school dates for 
(a) the Junior Prom (b) the Senior Prom, 
and (c) New Year’s Eve. Each event was 
assigned equal weight, which allowed a maxi- 
mum total score of 3. The correlation between 
physical attractiveness and opposite-sex s0- 
cial contact for females yielded a Pearson 
r (49) of 454, p < 01. For males the com- 
parable correlation was 7 (52) = 088, p > 5+ 


Physical Attractiveness and Social Contact 
with the Same Sex 


One of the central questions of the present 
study was whether physical attractiveness is 
as powerful a determinant of contact with 
members of the same sex as members of the 
opposite sex. The previously described socio- 
metric instrument supplied a measure of the 
extent to which subjects were accepted, re- 
jected, or ignored by their same-sex dorm- 
mates. 

The mean physical attractiveness ratings.of 
female and male subjects from the four 
groups are displayed in Table 1.2 Predictably, 
subjects from the Accepted Group were rated 
as more physically attractive than subjects 
from the Control Group; and subjects from 
the Control Group were rated as more attrac- 
tive than subjects from the Isolated Group, 
who in turn were rated as least attractive. 
Unexpectedly, subjects from the Rejected 
Group were judged to be the most physically 
attractive. 

The relationship between physical attrac- 
tiveness and social contact with the same sex 


1Data were not available for all subjects because 
a small number of schools attended by some of the 
subjects did not have a junior and senior prom. 

2 All missing data resulted from the unavailability 
of yearbook photographs except for three females 
from the Isolated Group who were originally mis- 
classified (Adinolfi, 1970) because they moved out 
of their dormitories before their dormmates had a 
chance to become acquainted with them. 
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TABLE 1 TABLE 2 
MEAN PHYSICAL ATTRACTIVENESS RATING OF ANALYSIS OF VARIANCE OF ATTRACTIVEN RATINGS 
SUBJECTS IN SOCIOMETRIC GROUPS OF SUBJECTS IN SOCIOMETRIC GROUPS 
Female Male Combined Source SS of P > 
ey Group (A) | 94.101) 3 | 3.499 
sroup (/ 4. š 019 
n| M |SD|n| | sp|n “| SD Sex (B) 2.201 1 245 | 500 
Rejected | 15 | 5.03 | 1.04 | 14| 4.87 | .89| 29 | 4.95 | .98 AXB Su 3 170 | 500 
Accepted | 14 | 4.74 | 1.23 | 13 | 4.67 | .74 | 27 | 4.70 | 1.00 Within 914.399 | 102 
Control | 15 | 4.38 | 1.44 | 13 | 4.35 | 1.09 | 28 | 4.37 | 1.27 
Isolated | 12] 4.15 | 1:34 | 14 | 4.03 | 163 | 26 


4.09 | 1.00 


was tested by computing a 4 X 2 analysis of 
variance on physical attractiveness ratings. 
Table 2 shows that a significant main effect 
was obtained for sociometric group, but that 
the main effect for sex of subject, and the 
Group X Sex interaction were not significant.’ 


Physical Attractiveness and Personality 
Factors 


The relationship among measures of per- 
sonality, interpersonal orientation, academic 
achievement, social contact, and physical 
attractiveness was assessed by performing a 
principal components factor analysis on all 
variables. Five factors were obtained for males 
and five for females, which were rotated 
orthogonally by the Varimax method. Table 
3 (for females) and Table 4 (for males) dis- 
play the resulting factors. 

The extent to which the female factors 
corresponded to the male factors is note- 
worthy. For females, the five factors ac- 
counted for 57.5% of the variance, and for 
males, 54.5%. For both sexes, the strongest 
factor was defined by affiliative, other- 
oriented needs; the second strongest factor 
was defined by high needs for cognitive struc- 
ture, order, and harm avoidance; and the 
third strongest by independence and ambi- 
tion. Factors IV and V, which accounted for 


3 Jt should be noted that the analysis of variance 
supplied a conservative indication of the significance 
of the difference between groups because it employed 
the mean physical attractiveness rating of eight 
raters as its unit of analysis, and, therefore, failed 
to reap the potential power associated with con- 
sistency among raters. An alternate analysis, avail- 
able from the authors on request, tested the extent 
to which the eight raters consistently rank ordered 
the four groups the same on physical attractiveness. 
It produced highly significant differences among all 


groups. 


a small percentage of the variance, were some- 
what different across the two sexes. 

For females, only one factor—Factor III, 
Independent Ambitiousness, contained a posi- 
tive loading from physical attractiveness 
(.586). The constellation of variables that 
best characterized the personality of physi- 
cally attractive females was one of cognitive 
inquisitiveness, achievement, and individual- 
ity. A second factor, Factor Il—Self-Protec- 
tive Constraint—contained a negative loading 
from physical attractiveness, but it was ex- 
tremely weak (—.200). 

Two of the male factors contained loadings 
from physical attractiveness. Both were nega- 
tive. Factor II, Self-Protective Constraint, 
contained a loading of —.581 from physical 
attractiveness, and Factor V, Ascetic Asocia- 
bility, contained a loading of —.310, Physical 
unattractiveness in males contributed to per- 
sonality syndromes which, rather predicta- 
bly, consisted of protective, asocial systems 
of needs. 


Personality Factors and Opposite-Sex 
Social Contact 


Only one factor for females and one factor 
for males was loaded by dating. An earlier 
analysis revealed a positive relationship be- 
tween physical attractiveness and dating for 
females. Therefore, the factor which contained 
the strongest loading from physical attractive- 
ness (Factor III) was expected to contain 
the strongest loading from dating, and indeed 
it did. Dating was most strongly associated 
with the ambitious independent personality 
pattern. 

The earlier analysis failed to reveal a rela- 
tionship between physical attractiveness and 
dating for males, and the male factor which 
contained the strongest loading from physical 
attractiveness (Factor II) failed to contain a 
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TABLE 3 
Varimax Factor LOADINGS: FEMALE 


Factors 


Variables 


it IV 


Jackson Personality Research Form 
Abasement 
Achievement 
Affiliation 
Aggression 
Autonomy 
Change 
Cognition 
Defendence 
Dominance 
Endurance 
Exhibitionism 
Harm avoidance 
Impulsivity 
Nurturance 
Order 
Play 
Sentience 
Social recognition 

Succorance 
Understanding 
FIRO 
Inclusion expressed 
Inclusion wanted 
Control expressed 
Control wanted 
Affection expressed 
Affection wanted 
High school offices 
College academic performance 
Physical attractiveness 
Dating 


—360 
—678 
859 


678 
—763 


751 
—$25 
=517 


631 


324 


554 —378 


—599 


372 


560 
394 


349 


682 


—428 


=373 


307 
586 
614 


Note. Only loadings of .300 and g 
= 15.02%; Faci I (Self-Proteci 
(Defensive Aggressiveness) = 10.76%; 


r were included. The 
Constraint) 
Factor V (Passive 


loading from dating. However, the factor that 
contained the second strongest (negative) 
loading from physical attractiveness, Ascetic 
Asociability, (Factor V) contained a very 
strong (negative) loading from dating. In- 
frequency of dating was related to a lack of 
dominance, aggressiveness, and sentience, and 
a need to avoid harm. 


Personality Factors and Same-Sex Social 
Contact 


The measure of same-sex social contact 
(defined by sociometric group) was not in- 
cluded in the factor analysis because it was 
not defined as a continuous variable. In order 
to test the relationship between social contact 
with the same sex and personality style, the 
mean loading of subjects in each of the four 


iance accounted for by rotated Factor I (Affectionate Sociability) 
Factor III (Independent Ambitiousness) = 9.61%; Factor IV 
fajghnetor 101 fider mbitiousness) = 9.61%; Factor IV 


sociometric groups on each of the five factors 
was computed. Tables 5 and 6 display the 
mean factor scores of subjects in the four 
groups. Analyses of variance were performed 
on the mean scores of the four groups on 
each factor, and a number of planned com- 
parisons were made. In all cases, comparisons 
were made between the mean loading in ques- 
tion and the mean of the remaining three 
loadings (Winer, 1962, pp. 65-70). Tables 5 
and 6 contain the F values associated with the 
analysis of variance for each factor by socio- 
metric group and the probability levels asso- 
ciated with individual comparisons, 

A Tables 5 and 6 reveal that there were more 
significant differences among factor scores 
females than males. Personality styl 
to be more closely associated with 
tions for females than males. Yet, 


of 
e appears 
social rela- 
there was a 


S 
yt 
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TABLE 4 
Varimax Factor LoApincs: MALE 


Factors 
Variables 
I IL II Iv We he 
Jackson Personality Research Form 
Abasement 307 —631 —389 672 
Achievement 475 588 655 
Affiliation 716 —426 719 
Aggression s —420 399 —317 453 
Autonomy — 500 —354 407 —308 645 
Change —414 519 —321 559 
Cognition 758 701 
Defendence 439 209 
Dominance 434 623 —316 760 
Endurance 334 752 730 
Exhibitionism —512 526 596 
Harm avoidance 633 356 623 
Impulsivity —741 650 
Nurturance 598 450 634 
Order 636 438 
Play 335 —669 —393 739 
Sentience —642 479 
Social recognition 621 515 
Succorance 483 —513 510 
Understanding 734 579 
FIRO 
Inclusion expressed 715 398 698 
Inclusion wanted 670 500 
Control expressed 686 545 
Control wanted =572 —366 577 
Affection expressed 741 574 
Affection wanted 746 —313 662 
SVIB Occupational 
Introversion—Extrayersion Scale —398 —406 593 
High school offices —476 291 
College academic performance 306 384 276 
Physical attractiveness —581 3 —310 444 
Dating —667 538 


nce accounted for by rotated Factor I (Affectionate Sociability) 
actor ILI (Independent Ambitiousness) = 11.92%; Factor IV 
07%. 


high degree of consistency among the male other groups on Factor III, Independent Am- 
and female scores. bitiousness. The rejected females also were 
As expected, females and males in the characterized by a low need for Affectionate 
physically attractive Rejected Group scored Sociability (Factor I). 
significantly higher than females and males in Both females and males in the Accepted 
TABLE 5 
Soctomerric GROUP MEAN LoADINGS ON VARIMAX Factors: FEMALES 


Note. Only loadings of .300 and greater were included. The 
= 13.81%; Factor II (Self-Protective Constraint) = 12.837 
(Superordinance) = 8.90% ; Factor V (Ascetic Asociability) = 

S a 


Sociometric group 


Factor 
Rejected Accepted Control Isolated F p 
—.445** .478* -013 132 2.236 
i Toto —.336* —.071 673" 2.558 
Im 363** —.019 —.168 — .532* 2.000 
IV AI7 —.477* —.060 144 1.490 
Vv — 406* 054 ANL .566** 2.242 


Nole. Factor I = Affectionate Sociability, Factor II = Self-Protective Constraint, Factor II = Independent Ambitiousness, 
Pee = Defensive Aggressiveness, Factor V m Passive Withdrawal. A 
* The component of group variance due to the difference between the starred group and the mean of the remaining three groups 


significant at <.10 level. ‘Similarly, ** signifies p < .05. 


sa 
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TABLE 6 
Soctomerric GROUP MEAN LOADINGS ON VARIMAX Factors: MALES 
Sociometric group 
Factor 
Rejected Accepted Control Isolated F ? 
a 5 5 0 
I —.105 PSS ne 228 —.635*** 4.154 OL 
I 238 388 —.198 048 1.237 -305 
IHI .419* —.058 —.128 —.234 1.241 304 
IV 349 —.247 018 —.120 -968 AIS 
Vv .007 — 307 096 395* 1,297 285 
Note, Factor f = Affectionate Sociability, Factor II = Protective Constraint, Factor III = Independent Ambitiousness, 
Factor IV = Superordinance, Factor V = Ascetic Asociability. 
* The component of group variance due to the difference between the starred group and the mean of the remaining three groups 


significant at <.10 level. Similarly, * signifies 


Group predictably scored highest on Factor 
I, Affectionate Sociability. Accepted females 
also displayed a minimal disposition toward 
Defensive Aggressiveness (Factor IV). 

The Control Group of moderately accepted 
females and males contributed very little to 
any of the five factors. Subjects originally 
were placed in the Control Group because 
they received an average number of socio- 
metric nominations. The extent to which they 
were average across the factors is remarkable. 

Females and males in the Isolated Group 
scored high on Factor V, Passive Withdrawal 
(females) and Ascetic Asociability (males), 
which was consistent with their social isola- 
tion. Female, but not male, isolates were 
characterized by Self-Protective Constraint 
(Factor II). The tendency for female isolates 
to score low on Independent Ambitiousness 
(Factor III) seemed consistent with their 
interpersonal orientation, as did the male iso- 
lates’ distinct absence of Affectionate Socia- 
bility (Factor I). In general, female isolates 
seemed more constricted, defensive, and with- 
drawn than the male isolates, who were char- 
acterized more by an absence of affiliative 
needs. 


DISCUSSION 


The findings of the present study support 
a growing body of evidence that shows that 
physical attractiveness exerts a positive influ- 
ence on initial social contact among members 
of the opposite sex. The positive relationship 
between physical attractiveness and dating 
for females, and the tendency for the rela- 
tionship to be stronger for females than males, 
is consistent with the findings from studies 


p <.05, and ** signifies p < 01. 


by Berscheid, Dion, Walster, and Walster 
(1971), Walster et al. (1966), and others. 
However, these studies also found a significant 
positive correlation between physical attrac- 
tiveness and dating for males; whereas the 
present study did not. 

One explanation for the failure of the pres- 
ent study to find that physically attractive 
males date more than physically unattractive 
males may lie in the type of social contact it 
examined. In the situations characteristic of 
past research, where the measure of dating 
was taken in an experimentally contrived 
situation on a one-shot basis, males and fe- 
males were more equal than they were in the 
present study, which measured dating in mul- 
tiple real-life settings. In past studies, if either 
party did not desire the date, neither dated. 
The present findings suggest that the positive 
relationship between physical attractiveness 
and dating for males may not prevail in 
natural settings in which physically unattrac- 
tive males are able to compensate for initial 
refusals by pursuing alternate dates. This 
would appear especially to be the case with 
formal social occasions that everyone isgex- 
pected to attend. 

i j i 
accepted by the seal Some 

J peers were more 
physically attractive than subjects who were 
less accepted or were neglected was consistent 
with past findings on social contact among 
members of the opposite sex. For all except 
the most physically attractive (from the iE 
jected Group), the more physically attractiy, 
a subject, the more he tended to be TE 
as a socially desirable dormmate. 


It is interesting to note that the most 
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physically unattractive subjects were more 
neglected than rejected. The low incidence of 
dating for the physically unattractive custo- 
marily has been interpreted as rejection by 
the opposite sex. The present findings on so- 
cial contact with the same sex suggest a pos- 
sible refinement in this interpretation. In 
order to be actively rejected, one must be 
noticed; but the present data suggest that 
the physically unattractive tend to be ignored. 
It may be less that the physically unattrac- 
tive make a bad impression and more that 
they do not make an impression at all. Judg- 
ing from their self-protective, constrained, 
passive, asocial pattern of personality needs, 
and the fact that they seek professional 
psychological assistance more than subjects 
from the other groups (Adinolfi, 1970), the 
social neglect they experience may be psycho- 
logically more devastating than active rejec- 
tion. 

Unexpectedly, it was the most physically 
attractive who were rejected by members of 
the same sex. To date no study has found a 
negative relationship between physical at- 
tractiveness and social desirability, perhaps 
because no study has measured social contact 
among members of the same sex in a natural- 
ictic setting. 

"There are several possible explanations of 
this effect. One hinges on the finding by Dion 
et al. (1972) and others that a stereotype 
exists purporting “what is beautiful is good.” 
It is possible that when the ambitious, inde- 
pendent, and achieving personality disposi- 
tions that the physically attractive actually 
possess meet with the affiliative, cooperative, 
and social stereotype of their expected person- 
ality style (Miller, 1970), observers react to 
the discrepancy become disillusioned, and 
reject them. Or, more speculatively, the 
physically attractive may become aware of the 
fact that the personality stereotype elicited by 
their physical attractiveness is more favorable 
than their actual personalities, and attempt to 
minimizing social contact. Thus, 
they become independent, individualistic, and 
aloof. The reason why only same-sex peers 
were found to reject the physically attractive 
in the present study may have been because 


they, as dormmates, had much more knowl- 


preserve it by 
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edge of the actual personalities of the physi- 
cally attractive than their dating partners. 

Strictly speaking, it is misleading to equate 
the measures of same- and opposite-sex social 
contact. The measure of dating was more 
superficial, less reliable, and probably less 
valid than the sociometric ratings. To begin 
with, the measure of dating was based on 
self-report. Even though dates to events that 
should have left clear memories were in 
question, subjects could have reported them 
incorrectly. But more importantly, dates on 
formal social occasions are probably not rep- 
resentative of opposite-sex social contact. 
These limitations should be kept in mind in 
interpreting the data on social contact with 
the opposite sex. 

Other explanations of the rejection of the 
physically attractive by the same sex relate 
more specifically to differences in social reac- 
tions than would be expected between the 
sexes. If physical attractiveness is an attri- 
bute primarily employed to enhance relations 
with the opposite sex, it would not be ex- 
pected to exert a positive effect on social 
relations among members of the same sex, who 
may perceive themselves as in competition 
with one another. Dormmates may have re- 
jected their physically attractive peers be- 
cause they wanted to avoid competing with 
them or because they were jealous of their 
success with the opposite sex. Or, physically 
attractive dormmates may have supplied an 
unfavorable source of social comparison. It 
might well be difficult to maintain a high 
level of self-esteem when success with the op- 
posite sex is part of the currency of prestige 
and one is the roommate of the best-looking 
student on campus. Finally, the physically 
attractive may have been rejected because 
they failed to demonstrate a concern for their 
less attractive peers. The individualistic, am- 
bitious, and unaffiliative personality style 
characteristic of physically attractive females 
could well have been abrasive or threatening 
to their dormmates. 

The case of the physically unattractive is 
more clear. They tend to be ignored by mem- 
bers of both sexes. As a result, they are rela- 
tively unable to correct the unfavorable 
stereotype imputed to them. Although they 
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appear not to fulfill the letter of the prophesy 
associated with unfavorable trait attribution 
(perhaps because they never become exposed 
to it), they may fulfill its spirit by failing to 
develop socially effective personality styles. 

The most tragic aspect of the effect of 
physical unattractiveness is its potential to 
initiate a sequence of circular reactions. 
Physical unattractiveness evokes unfavorable 
trait attribution which mediates social avoid- 
ance. Social avoidance preserves the initial 
impression by reducing the probability of 

exposure to disconfirming information; and it 
perpetuates it by depriving the physically 
unattractive of the feedback necessary to 
develop socially effective personalities. 

The main question left unanswered by the 
present study pertains to the causal relation- 
ship among its measures. The causal sequence 
which seems most plausible is as follows: 
early in life physical attractiveness affects 
social reactions, which in turn affects the de- 
velopment of personality. The physically at- 
tractive are admired and pursued. Because 
they are pursued they develop a high level of 
self-esteem, which mediates the development 
of personality dispositions oriented toward 
ambition and success. The physically unat- 
tractive are ignored, which forces them to 
withdraw from social relations; and they 
develop asocial, self-protective personality 
dispositions. 

But the intuitively obvious one-way se- 
quence of causality is almost certainly only 
one facet of a much more complex network 
of relationships. The disparity between the 
reactions of members of same- and opposite- 
sex peers to the physically attractive suggests 
that personality style may affect social rela- 
tions. Once the ambitiousness of physically 
attractive females becomes a personality dis- 
position, they may be rejected by other fe- 
males because of it. Similarly, the socially 

constricted personality styles of physically 
unattractive males may cause them to go 
unnoticed. 

Researchers generally have assumed that 
physical attractiveness is relatively immuta- 
ble, and, therefore, the primary locus of 
causality. However, social popularity could 
cause people to enhance their appearance. 
They could do such things as style their hair, 
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wear cosmetics, employ various kinds of body 
padding, buy complementary clothes, or even 
submit to plastic surgery. Conversely, social 
isolation could cause them to neglect their 
appearance. Or, perhaps more importantly, 
the perception of physical attractiveness could 
be modified by exposure to the personalities 
of the physically attractive even though an 
objective measure of physical attractiveness 
failed to reflect the effect. Researchers have 
measured physical attractiveness by having 
raters rate people with whom they were un- 
acquainted. This practice has controlled for 
the potential influence that the personalities 
of the physically attractive could have on 
ratings of their attractiveness. In the present 
study, for example, the physical attractiveness 
of the subjects was assessed by eight objective 
raters. However, if people who were ac- 
quainted with the subjects, such as their dorm- 
mates, had rated them, they may well have 
rated them differently. Although the initial 
effect of physical attractiveness may be based 
on an objective appraisal, its subsequent ef- 
fect may be influenced by aspects of the per- 
sonalities of the perceived. Further studies are 
now in progress to test the extent to which 
beauty is, in fact, in the eye of the beholder. 
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Cognitive Appraisals and Transformations in Delay Behavior 
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The effects of different cognitive representations of the rewards (outcomes) in 
a delay of gratification paradigm on children’s ability to wait for these 
rewards were investigated. Consummatory (arousing) ideation directed at the 
relevant (contingent) rewards hindered effective delay. In contrast, cognitive 
transformations of the rewards which focused on their nonconsummatory 
qualities and associations significantly facilitated delay behavior more than did 
comparable ideation about similar rewards irrelevant to the delay contingency. 
Finally, consummatory ideation focused on rewards irrelevant to the contin- 
gency also greatly helped to maintain delay. Theoretical implications for the 
role of fantasy and cognitive appraisal in self-control were examined. 


In view of the central importance of the 
“reinforcement” concept in contemporary 
psychology, it is surprising that so little is 
known about how the mental representation 
of rewards affects the individual's pursuit of 
them. There is an enormous discrepancy be- 
tween the theoretical significance attributed 
to rewards in the regulation of goal-directed 
behavior and our lack of understanding of 
how their cognitive representation by the 
subject influences his behavior. The present 
study is part of a program to explore this 
topic by focusing specifically on how a per- 
son’s ideation about the contingent rewards 
in a choice situation affects his ability to 
maintain goal-directed activity until he 
achieves his preferred outcomes. 

Several experiments have indicated that 
the capacity to sustain self-imposed delay of 
gratification depends in part on the extent to 
which the person avoids cues that remind him 
of the rewards (outcomes) that he expects 
and wants but is prevented (interrupted, 
blocked, or delayed) from getting (eg., 
Mischel & Ebbesen, 1970; Mischel, Ebbesen, 
& Zeiss, 1972; Schack & Massari, 1973). To 
increase subjective frustration, one would 
have to focus cognitively on the blocked 
goal objects (e.g, by engaging covertly in 


This study was supported in part by Grant M 
6830 to Walter Mischel from the National Institutes 
of Mental Health, U. S. Public Health Service. 

Requests for reprints should be sent to Walter 
Mischel, Department of Psychology, Stanford Uni- 
versity, Stanford, California 94305. 


254 


anticipatory goal responses) ; to decrease frus- 
tration, one would have to suppress the goal 
objects by avoiding them cognitively. “Frus- 
tration tolerance” in the delay paradigm 
would depend on the subject’s ability to sup- 
press his attention to the rewards while re- 
maining in the frustrative situation until he 
has attained the goal. 

While this interpretation seems reasonable, 
it is probably incomplete. Indeed, Mischel 
and Moore (1973) found that exposure to 
symbolically presented rewards (slides) dur- 
ing the delay period significantly increased 
delay of gratification, This facilitative effect 
of exposure to slides of the rewards was di- 
rectly opposite to the earlier finding that 
visual exposure to the rewards themselves 
greatly decreased delay time (Mischel & 
Ebbesen, 1970; Mischel et al., 1972). The 
earlier findings had been obtained in basically 
the same subject population (i.e., preschool 
children in the same nursery school) and 
from an essentially similar delay paradigm. 
Nevertheless, the Mischel and Moore study 
showed that exposure to slides of the relevant 
rewards enhanced delay behavior more than 
did exposure to comparable distractions 
(slides of similar but reward-irrelevant ob- 
jects and blank slides). The critical difference 
between the reward-relevant attention manip- 
ulations in the earlier experiments and in the 
Mischel and Moore study was that previously 
children had been exposed to the actual re- 
ward objects, but in the Mischel and Moore 
experiment they were exposed to symbolically 


presented images (slides) of these rewards. 
It therefore seems that while attention to the 
rewards themselves decreases delay behavior, 
attention to the symbolically presented re- 
wards enhances delay behavior. 

At this juncture, the overall results on 
attention and imagery in self-control sug- 
gested that the mode of presentation of the 
reward stimuli (i.e., real versus symbolic in 
the form of slide-presented images) has ex- 
tremely significant effects. Why? Mischel and 
Moore (1973) speculated that this pattern 
of results may reflect two different functions 
of reinforcing (rewarding) stimuli that, in 
turn, may have completely different effects on 
self-control behavior. Extrapolating from 
Berlyne’s (1960) distinctions, a stimulus may 
be said to have both a motivating (consumma- 
tory, arousal) function and a cue (informa- 
tive) function. The actual reward stimuli 
probably have a more powerful motivational 
effect than do their symbolic representations 
(i.e., slide images), whereas the latter have a 
more abstract cue function. Attention to the 
real rewards increases the person’s motivation 
to make the blocked consummatory responses 
appropriate to the outcome (e.g., eat it, play 
with it). Since the subject cannot let himself 
make the consummatory response, frustration 
is increased, thus leading to decreased delay. 
In contrast, the cue or informative function 
$ of the symbolic reward stimulus may serve to 
guide and sustain the subject’s delay behavior. 
It may do so by serving as a reminder of the 
contingency in the delay situation (a re- 
minder of what the person will get) without 
being so real as to frustrate him. This inter- 
pretation suggests that exposure to the reward 
stimuli themselves may lead the subject to 
become too frustrated (“aroused”) and ready 
to perform the terminal response, but expo- 
© sure to their slide-presented symbolic repre- 
sentations may preserve the cue function of 
the rewards while reducing their arousal 
value. That is, the “image” of the objects 
may serve more as a “token” or reminder to 
sustain delay behavior, while the real pres- 
ence of the objects interferes with effective 
self-control by generating excessive arousal 
and frustration. 

The foregoing reasoning suggests that the 
effects of attention to the rewards upon delay 
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behavior may depend on how the subject at- 
tends to the reward stimuli rather than on 
whether or not he does attend or how 
the rewards are physically presented. We 
speculated that if attention is focused on 
the motivational (consummatory, arousing) 
qualities of the rewards, the frustration of 
delay should be increased and interfere with 
effective self-control. But if the same rewards 
are transformed by a cognitive focus on their 
nonconsummatory qualities, delay behavior 
would not be impaired and might even be 
facilitated. The present study began to ex- 
plore these theoretical possibilities by investi- 
gating how the impact of attention to the 
rewards in the delay paradigm can be modi- 
fied by specific cognitive transformations 
which the subject performs on the reward 
stimuli. 

For this purpose, just before the start of 
the delay period, children were given brief in- 
structions (plans) designed to encourage them 
to ideate in different ways during the actual 
delay time. The study compared the effects of 
instructions to ideate about the motivational 
(consummatory) qualities of the rewards in 
the delay contingency with instructions to 
cognitively transform the rewards and to 
ideate about their nonmotivational (noncon- 
summatory) qualities and associations. More- 
over, these same two types of instructions 
were used for both the “relevant rewards” 
(the rewards in the delay contingency) and 
the “irrelevant rewards” (comparable objects 
but not relevant to the actual delay contin- 
gency). We expected that consummatory ide- 
ation about the relevant rewards would result 
in less delay than would nonconsummatory 
(transformational) ideation about them. We 
also predicted that consummatory ideation 
about the relevant rewards would impede 
delay more than would comparable ideation 
about the irrelevant reward objects. Finally, 
we hypothesized that the detrimental effects 
of attention to the relevant rewards would 
be eliminated and even reversed if subjects 
transformed the reward objects cognitively by 
focusing in ideation on their nonconsumma- 
tory qualities and associations. Hence, non- 
consummatory ideation about the relevant re- 
wards should facilitate delay more than should 


256 


comparable ideation about the irrelevant re- 
wards (i.e., distraction). 


METHOD 
Design 


Consistent with the basic delay of gratification 
paradigm developed by Mischel et al. (1972), all 
subjects were given a choice between two rewards. 
Then they were allowed to wait (20 minutes) to get 
their preferred choice or to signal at any time to 
receive the less preferred outcome immediately. Two 
different pairs of reward choices were used; half of 
the children chose between one marshmallow versus 
two marshmallows, and half chose between one 
pretzel and two pretzels. All of the children waited 
with both the immediate and delayed rewards in 
their choice pair facing them. The dependent vari- 
able was the length of time each child waited alone 
in the room before signaling for the experimenter 
to return by ringing a bell. 

Before the start of the delay period, experimental 
subjects were given instructions to ideate in different 
ways while waiting. For half of the experimental 
subjects, the ideation instructions dealt with the 
rewards between which the subjects had chosen 
(relevant rewards); for the other half, the instruc- 
tions dealt with comparable objects which the sub- 
ject had not seen before (irrelevant rewards). For 
example, if a subject had been given a choice be- 
tween one versus two marshmallows, the relevant 
reward instructions dealt with those objects; con- 
versely, the irrelevant reward instructions dealt with 
the other objects (pretzels) to which he had not 
been exposed previously and which thus were not 

relevant to the waiting contingency. 

Within the relevant reward and irrelevant reward 
conditions, half of the children received consumma- 
tory ideation instructions and half, nonconsummatory 
(transformational) instructions. The consummatory 
instructions emphasized the consummatory (arousing) 
aspects of the objects, primarily taste and texture. 
The nonconsummatory (transformational) instruc- 
tions emphasized nonconsummatory aspects of the 
objects and their associations with other objects and 
activities. 

Thus four experimental groups were used: (a) 
consummatory instructions for the relevant objects 
(consume relevant), (b) transformational instruc- 
tions for the relevant objects (transform relevant), 
(c) consummatory instructions for the irrelevant 
objects (consume irrelevant), and (d) transforma- 
tional instructions for the irrelevant objects (trans- 
form irrelevant). In addition to the four experi- 
mental treatments, a group which received no 
imagery instructions was included as an additional 
control. 


Subjects and Experimenters 


The subjects were 30 boys and 30 girls attending 
the Bing Nursery School of Stanford University. The 
children ranged in age from 3 years 4 months to 
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5 years 5 months, with a mean age 
months and a median age of 4 years 7 The 
procedures were conducted by one male and two fe 
male experimenters. Twelve subjects were randomly 
assigned to each of four experimental conditio 
one control group. In each condition, all expe 
menters tested an equal number of males and femal 
in each group to avoid systematic biasing effects < 
sex or experimenters. 


of 4 years 


s and 


Procedure 


The experimental room and setting was similar to 
that described previously (Mischel & Ebbesen, 1970; 
Mischel et al., 1972). The experimental room was s 
divided by a wooden barrier on one side of which t 
was a box containing various battery-operated toys 
and interesting games. A small table and chair were 
on the other side of the barrier, and a desk bell was 
on the table. One side of the experimental room 
contained two large one-way mirrors which allowed 
an observer to watch the child during the entire 
procedure. Except for a small vision strip at the 
top of the mirror which permitted observation 
(above the child’s direct line of vision), the mirrors 
were covered with brown paper so the child would 
not be distracted by looking into them. 

During the initial part of the experiment, the 
procedure was essentially the same as that in pre- 
vious studies (Mischel et al., 1972). The experi- 
menter escorted the child into the room, showed the 
child the box of toys, explained that he and the 
experimenter would play with them later in the 
session, and seated the subject in a chair in front of 
the table. The experimenter then showed the child 
how to ring the desk bell on the table and explained 
that by ringing it, the child could always “bring 
back” the experimenter. At this point the experi- 
menter and the child played a “game” which in- 
volved the experimenter stepping out of the room rg 
and returning immediately when the child signaled $ 
by ringing the bell. This game was played three 
times. Following the bell-signal training, the experi- 
menter returned to the table and picked up an 
opaque cake dish which had been placed under the 
table. The cake dish contained either three marsh- 
mallows (arranged with two marshmallows together 
and one marshmallow on the other side of the dish) 
or three pretzels arranged in the same manner, The 
experimenter explained as follows: 


If you sit in the waiting chair and don’t get up, ” 
and wait until I come back by myself, you can 
have the two - But if you don’t want to 
wait, you can ring the bell and make me come 
back anytime you want to. But if you ring the 
bell and make me come back, you can’t have the 
ee As you can have the one 
1t you wait for me to come back by mysel 
can have the two ___. But if you PELNA 
to wait until I come back by myself, you can ring 
the bell and bring me back. But if you ring the bell 
and make me come back, you can’t have the 
two —___ but you can have the one 


r 


ay SO, 
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The experimenter then asked the child three criterion 
questions to assess whether or not the child under- 
stood the instructions: 


What do you get if you wait for me to come 
back? But, if you want to, how can you make me 
come back? If you ring the bell and make me 
come back, what do you get? 


If the child successfully answered these questions, 
the experimenter consulted a slip of paper which 
informed him of the condition to which the subject 
was assigned. In the experimental conditions the 
experimenter then introduced the imagery instruc- 
tions by saying, “And now I have another game you 
can play, if you want to, for as long as you want 
to, while I’m gone. It’s called the ‘think about’ 
game.” For subjects in the control condition, the 
experimenter skipped that section entirely and pro- 
ceeded immediately to the final criterion questions 
which were readministered to the experimental sub- 
jects after the imagery instructions. 


Ideation Instructions 


Subjects in the consume relevant group received 
the following instructions designed to focus attention 
on the consummatory qualities of the objects for 
which they were waiting. 


[For subjects waiting for marshmallows] Look 
at the marshmallows. They are sweet and chewy 
and soft. When you look at marshmallows, think 
about how sweet they are when you eat them. 
When you look at marshmallows, think about 
how sweet they taste. Or you can think about how 
soft they are. When you look at marshmallows, 
think about how soft and sticky they are in your 
mouth when you eat them. Or you can think 
about how chewy they are. When you look at 
marshmallows, think about how chewy and fun 
they are to eat. Now, why don’t you try playing 
“think about”? When you look at marshmallows, 
what can you think about? 

[For subjects waiting for pretzels] Look at the 
pretzels; they are crunchy and salty. When you 
look at pretzels, think about how crunchy they 
are, When you look at pretzels, think about how 
crunchy they are when you bite them. Or you 
can think about how salty they are. When you 
look at pretzels, think about how salty they taste 
when you lick them or chew them. Or you can 
think about how toasty brown they are. When 
you look at pretzels, think about the toasty taste 
in your mouth when you eat them. Now, why 
don’t you try playing “think about”? When you 
look at pretzels, what can you think about? 


In the consume irrelevant condition, the instruc- 
tions were the same except that they always referred 
to the set of reward objects for which subjects were 
not waiting and which was never shown. Thus sub- 
jects waiting for marshmallows were instructed to 
think about consummatory qualities of pretzels j con- 
versely, those waiting for pretzels were instructed to 
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focus on the consummatory qualities of marshmal- 
lows. The only change in the wording of instructions 
was that instead of saying “look at [the objects]” 
the experimenter always said “think about.” 

In the transform relevant condition, subjects re- 
ceived instructions designed to distract attention 
from the consummatory aspects of the objects for 
which they were waiting and to focus instead on 
nonconsummatory qualities of the objects and their 
associations. 


[For subjects waiting for marshmallows] Look 
at the marshmallows; they are round and white 
and puffy. When you look at marshmallows, think 
about how white and puffy they are. Clouds are 
white and puffy too—when you look at marsh- 
mallows, think about clouds. Or you can think 
about how round and white a marshmallow is. 
The moon is round and white. When you look at 
marshmallows, think about the moon. Or you can 
think about how round a marshmallow is on top. 
A ball is round. When you look at marshmallows, 
think about playing ball. Now, you try playing 
“think about.” When you look at marshmallows, 
what can you think about? 

[For subjects waiting for pretzels] Look at the 
pretzels; they are long and thin and brown. When 
you look at pretzels you can think about how 
long and brown they are. A log is long and brown. 
When you look at pretzels, think about logs and 
tree trunks. Or you can think about how round 
and tall they are. A pole is round and tall. When 
you look at pretzels, think about telephone poles, 
or light poles, or fishing poles. Or you can think 
about how round and thin it is. A thin coloring 
crayon or paint brush is round and thin. When 
you look at pretzels, think about coloring or 
painting. Now you try playing “think about.” 
When you look at pretzels, what can you think 
about? 


For the transform irrelevant condition, subjects were 
given the same instructions, but they always referred 
to a set of reward objects for which the subjects 
were not waiting (i.e., did not have in front of them 
and which was never shown). Thus for subjects 
waiting for marshmallows, the instructions referred 
to pretzels and vice versa. Moreover, instead of 
saying “when you look at” the experimenter always 
said “when you think about.” 


Practicing Ideation 


In all experimental conditions, after delivering the 
imagery examples, the experimenter asked the child 
to try playing “think about,” that is, to think aloud 
in the ways suggested by the instructions. If the 
subject volunteered an appropriate response, the 
experimenter responded, “Yes, you know how to 
play ‘think about.’ ” If the subject failed to respond 
appropriately or said nothing, the experimenter 
would repeat the critical images from the “think 
about” instructions, saying, “You could think about 

. , couldn’t you?” At this point the experimenter 


=. 
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TABLE 1 


Mean DELAY Time IN EACH IDEATION 
INSTRUCTION CONDITION 


Ideation instructions | 


Relevance of Control (no 


| Transfor- 


rewards in Control 
ideation Consum- | mational | ‘nstuetions) 
matory (nonconsum- 
matory) 
Relevant* 5.60 13.51 8.44 
Trrelevant" 16.82 4.46 


Note. Data are given in minutes. Maximum possible delay 
time is 20 minutes, All subjects were facing the rewards. 
a To contingency in the waiting situation, 


administered the wait-contingency criterion question 
again. 

In all five conditions, the experimenter left after 
the final criterion questions. Departing instructions 
were the following: “Remember, if you wait for me 
to come back by myself and get the two ‘ 
or if you ring the bell and make me come back and 
get the one , we'll play with all my toys when 
I get back.” In the experimental conditions, the 
experimenter added, “And don’t forget, you can play 
the ‘think about’ game for as long as you want to, 
if you want to, while I’m gone.” 


RESULTS 


Table 1 shows the mean delay time in each 
of the five conditions. An analysis of variance 
for these groups (2 X 2 factorial design with 
a control group added) was computed (see 
Table 2). It showed that there was no main 
effect of ideation instructions (consume-trans- 
form) or of reward relevance, but there was 
a highly significant interaction between these 
two variables, F = 22.45, p < .001. Table 1 
suggests that this interaction was due to high 
delay times in the transform relevant and 
consume irrelevant conditions but low delay 
times in the consume relevant and transform 
irrelevant conditions. 

Inspection of the mean waiting times for 
each group (see Table 1) showed that the 


TABLE 2 
SUMMARY OF ANALYSIS OF VARIANCE 


MS F 


Source df 
Ideation instruction : 

Consume-transform (A) 1 59.65 | 1.09 
Relevance of rewards (B) si 14.12 | <1 
AXB 1 1,231.91 |22.44* 

Error 55 54.89 

*p < 001. 
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TABLE 
CONTRASTS FOR MEAN D 


AY OF GRATIFICATION 


Source | df 


Transform relevant versus | 
consume relevant 
Consume relevant versus 
consume irrelevant 
Transform relevant versus 
transform irrelevant 1 
Error 55 


1 | 374.70) 6.83* 
| 
1 | 754.88 | 13.75%** 


| 491.14 
| 54.89 


| 8.95** 
Ds 


mean for the no-instruction control group was 
intermediate between the two high and two 
low experimental groups* Because the strong 
interaction effect would prevent any overall 
differences between the control group and the 
experimental groups collectively, ¢ tests were 
computed to compare the control group with 
each of the other groups individually, The 
control group was not significantly different 
from the consume relevant or the transform 
irrelevant conditions, ts = .94 and 1.36, re- 
spectively.? But the transform relevant condi- 
tion and the consume irrelevant condition 
yielded delay times significantly higher than 
those in the control group, ¢ = 1.68, p < .05, 
and ¢ = 2.77, p < .005, respectively. 
Contrasts (Winer, 1962) were computed to 
test for hypothesized differences between the 
experimental groups (see Table 3). As pre- 
dicted, delay time was significantly lower 
p < 025, when subjects were instructed to 
ideate about the consummatory qualities of 
the relevant rewards than when they trans- 
formed the relevant rewards by focusing on 
their nonconsummatory qualities and associa- 


1In previous studies, conditions 
the present control group tended EATEN 
mean waiting times (Mischel & Ebbesen, 1970; 
Mischel, Ebbesen, & Zeiss, 1972), However, : 
study comparisons are limited by the fact ch 
the present study, both the Specific contingenc a 
the total delay period to criterion (i.e. he a ni 
JR ee) were different. j kotis 

e lack of a significant differe 
control group and the consume ee E 
suggests that when the relevant reward objects ae 
presented to children with no Specific tate fee 
they tend to ideate about them in a cones z 
fashion, as may have occurred in earlier sd 


(Mischel & Ebbi ; Mi 
Zeiss, 1972). esen, 1970; Mischel, Ebbesen, & 


tions. Also, as predicted, consummatory idea- 
tion led to less delay, p < .001, when it was 
-focused on the relevant rewards than on the 
irrelevant reward objects. Moreover, as hy- 
pothesized, transformational (nonconsumma- 
tory) ideation regarding the relevant rewards 
facilitated delay significantly more, p < .005, 
than did comparable ideation directed at the 
irrelevant rewards, Š 


DISCUSSION 


The results showed that through instruc- 
tions, the child can cognitively transform the 
reward objects that face him during the delay 
period in ways that either permit or prevent 
effective delay of gratification. If the child 
has been instructed to focus cognitively on 
+ the consummatory qualities of the relevant 
i` reward objects (such as the pretzel’s crunchy, 
salty taste or the chewy, sweet, soft taste of 
the marshmallows), it becomes difficult for 
him to wait. Conversely, if the child cogni- 
tively transforms the stimulus by focusing on 
consummatory qualities (by thinking about 
the pretzel sticks, for example, as long, thin 
brown logs or by thinking about the marsh- 
mallows as white, puffy clouds or as round, 
white moons), he can wait for long time 
periods. 

More specifically, the hypotheses were 
strongly supported by the data. Consum- 
matory ideation about the rewards in the 
contingency led to short delay, whereas 
4 transformational (nonconsummatory ) ideation 
about the same rewards led to much longer 

delay of gratification. Most important, such 
nonconsummatory ideation about the rewards 
in the contingency led to significantly longer 
delay than did similar ideation directed at 
comparable rewards irrelevant to the delay 
contingency. Thus, attention to the consum- 
“matory features of the rewards in the con- 
tingency makes delay difficult, but attention 
to their nonconsummatory features and 
associations facilitates delay more than does 
either comparable cognitive distraction (i.e., 
the transform irrelevant condition) or no 
instructions. 

Tt is most interesting, and must be empha- 
sized, that transformations of the reward ob- 
* jects that focus on their nonconsummatory 
qualities provide more than mere cognitive 
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distraction. In this regard, compare the effects 
of instructions that focus on nonconsumma- 
tory qualities of the relevant reward objects 
(i.e., those for which the subject is actually 
waiting) with the same instructions for irrele- 
vant rewards. When the children were in- 
structed to ideate about nonconsummatory 
qualities of the relevant objects (i.e., those 
for which they were actually waiting), their 
mean delay time was 13.51 minutes. In con- 
trast, when subjects were given the same 
instructions with regard to the irrelevant re- 
wards (i.e., comparable but not in the delay 
contingency), their average delay time was 
less than 5 minutes. Thus attention to the 
nonconsummatory qualities and associations 
of the actual reward objects in the delay con- 
tingency substantially enhances the ability to 
wait for these rewards, and does so more ef- 
fectively than when the same ideation in- 
structions focus on comparable objects irrele- 
vant to the delay contingency. The exact 
mechanisms underlying this facilitative effect 
merit further study. We suggest that when 
subjects ideate about the nonconsummatory 
qualities of the relevant rewards and their 
associations, they may remind themselves of 
what they will get if they fulfill the contin- 
gency and thus reinforce their own delay 
without becoming excessively frustrated by 
dwelling on the consummatory qualities of 
what they cannot have. 

It might be argued that the relatively low . 
delay time obtained when instructions dealt 
with ideation for the irrelevant rewards re- 
flects that young children simply have trouble 
thinking about reward objects that are not 
present. Note, however, that the longest mean 
delay time (almost 17 minutes)? occurred 
when subjects were instructed to ideate about 
those same objects but with regard to their 
consummatory qualities (see Table 1). This 
finding is also provocative theoretically. It 
suggests that while consummatory ideation 
about a potentially available object makes it 
difficult to delay gratification, similar con- 
summatory ideation about an outcome that is 
simply unattainable in the situation (i.e., the 
irrelevant rewards), rather than being aver- 

3 Although this was the highest mean delay time, 


it was not significantly greater than that in the 
transform relevant condition. 
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sive, is highly pleasurable and may serve to 
sustain prolonged delay behavior. That is, 
consummatory ideation about reward objects 
that are not expected and not available in the 
delay contingency (the irrelevant rewards) 
may serve as an interesting and effective dis- 
tractor and hence facilitate waiting. In con- 
trast, similar ideation about the relevant but 
blocked rewards heightens the frustration of 
wanting what one expects but cannot yet 
have and, by making the delay more aversive, 
reduces the length of time that one continues 
to wait. 

The conclusion that when the subject is 
aroused (e.g., to attain a goal, to consume a 
reward) and unable to achieve immediate 
satisfaction, additional arousing (consumma- 
tory) ideation is increasingly frustrative and 
hence avoided is congruent with data from 
other studies on goal-related fantasies emerg- 
ing from extremely different paradigms. Re- 
call the fact that aroused subjects avoid goal- 
related ideation, as seen, for example, in the 
tendency of severely food-deprived persons to 
engage in less food-related ideation (Lazarus, 
Yousem, & Arenberg, 1953). In the same 
vein, Clark (1952) noted less sexual imagery 
in the Thematic Apperception Test stories of 
sexually aroused males compared with non- 
aroused males. The present study, while sup- 
porting these conclusions, also points to some 
of the specific conditions that may render 
consummatory ideation positive and facilita- 
tive, rather than debilitating, in its effects; 
namely, when consummatory ideation is di- 
rected at outcomes irrelevant to the psycho- 
logical situation at the moment, it may facili- 
tate the maintenance of behavior toward the 
relevant goals in the situation. For example, 
if subjects are in a sexually arousing but 
sexually frustrative situation, consummatory 
food ideation might help them tolerate the 
delay of sexual gratification (while arousing 
sexual ideation would be frustrative and hence 
avoided). Conversely, if subjects are in a 
hunger-arousing but frustrative situation (i.e., 
one in which eating must be postponed), con- 
summatory sexual ideation might be an excel- 
lent way to help bridge the temporal delay 
before food becomes available (although food- 

relevant consummatory ideation would be 
aversive), Thus, the impact of ideation about 
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goal objects on self-regulatory beh: 
pends on both the type of ideation 
matory versus nonconsummatory) anà - 
particular relation (relevant—irrelev. ant) 

the outcomes in the self-control situatig® 

In sum, in the delay-of-gratificati 
digm, cognitions about the rewards 
trative and impede delay when they 
both of two requirements. Such cognitic 
must focus on (a) the outcomes that are 
rently blocked in the situation and (b) the 
consummatory qualities of those outcor 
contrast, cognitions directed at the noncon- 
summatory qualities of the relevant re 
or which focus on the consummatory qualit 
of outcomes that are not in the current con- 
tingency, facilitate delay of gratification. The 
total pattern of results reveals both the speci- 
ficity and the meaningfulness of the ci 
relationship between thought and a: 
self-control. The findings also show } 
subject’s specific cognitive activities can trans 
form the objective reward stimuli so as to 
either help or hinder his chances of ultimately 
receiving them. 

Additional support for the important role 
of cognitive transformations in delay behavior 
also comes from another source. A follow-11)) 4 
of the Mischel and Moore (1973) experiment 
replicated the original finding that exposure 
to slides of the relevant rewards leads to sig- 
nificantly longer delay than does exposure to 
slides of the comparable rewards that areg 
irrelevant to the delay contingency (Mo 
Mischel, Note 1). The same stud 
showed that the delay-enhancing efforts of 
the relevant slides can be completely obliter- 
ated when subjects are instructed (before the 
delay interval) to ideate about the con 
matory qualities of the relevant ; dg 
while waiting for them. | 

Our studies on cognitive transformation 
also have implications for earlier research 
under the label “cognitive appraisal.” Previ- 
ous research has investigated how the cogni- 
tive appraisal_of threatening stimuli (a film 
of crude, prittitive genital operations) influ- 
ences emotional responses to» :these. stimuli 
(Speisman, Lazarus, Mordkoff, & Davison, 
1964). The findings showed that emotional 
responses were higher when the film was ac- 
companied by a sound track that emphasized 


eur 


In 


a 


the dangers of such an operation as opposed 
to. sound tracks that denied such dangers or 
‘fntellectualized” them in a detached man- 
ner. The authors of this widely cited, inter- 
esting study interpreted their results as due 
to diffsrences in cognitive appraisal generated 
‘by the different sound tracks. The present 
study provides a more direct demonstration of 
the importance of cognitive appraisal because 
the stimulus situation remained identical 
across situations: All children faced the 
identical rewards. (In contrast, in the Spies- 
man et al. study, different sound tracks ac- 
companied the film across groups, thus cre- 
ating different stimulus conditions.) The pres- 
ent study provides a method of manipulating 
cognitive appraisal more directly by means of 
instructions through which subjects may 
transform the identical stimuli into diverse 
cognitive representations. The results demon- 
strate the powerful role of such cognitive 
transformations and support the theoretical 
view that the cognitive encoding of stimuli 
significantly and predictably influences their 
impact on behavior (Mischel, 1973, 1974). 


REFERENCE NOTE 


1, Moore, B., & Mischel, W. Further studies on 
symbolic reward presentation in delay of gratifica- 
tion. Unpublished manuscript, 1973. (Available 
from Walter Mischel, Department of Psychology, 
Stanford University, Stanford, California 94305.) 
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tude direction. Sadler and Tesser ( c 
lected evidence in support of this assumption. 
Subjects were exposed to either a likable or 
dislikable other person whom they were told 
to think about or were distracted from 
thinking about. Then they listed their cogni- 
tions concerning the other person. When the 
stimulus other was likable, subjects who 
thought about him listed more positive cogni- 
tions than subjects who were distracted from 
thinking about him. There were few negative 
cognitions, and they were unaffected by 
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were performed to examine the effects of thought c 
t It was predicted that (a) thought produces a change in at 
sake it more extreme in the initial direction (i.c., polarisation) an 
b) the longer one thinks about the attitude object, the greater the tendency 
urd polarization. In Experiments 1 and 2, the attitude objects were drawn 
m a heterogeneous attitude questionnaire; in Experiment 3, they were news 
Hypothesis b was tested over the following thought durations 
ind 180 sec in Experiment 1; 45, 60, 90, and 180 sec in Experi- 
1 28 and 60 sec in Experiment 3. Experiments 2 and 3 also had 
thereby permitting a test of Hypothesis a. Using 
r or not an attitude became more polarized as the dependent variable, 
Hypothesis b received significant support in all three experiments, and Hy- 
esis a was supported only in Experiment 3. 
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ization 


thought. The dislikable stimulus other pro- 
duced a complementary pattern of results. 
Subjects who thought about him produced 
more negative cognitions than did distracted 
subjects. Also, there were few positive cog- 
nitions, and they were unaffected by thought. 

The second assumption is that there is a 
causal relationship between the cognitions and 
affect associated with some attitude object. 
The more consistent with a particular direc- 
tion an individual’s attitudes are, the more 
polarized will be his attitude. The literature 
also provides ample support for this assump- 
tion. While there are a number of ways in 
which cognitions have been conceptualized 
manipulated, measured, and combined int 
indices, these indices have invariably been 
shown to bear a systematic relationship to the 
overall affect exhibited toward the object 
(e.g., see Carlson, 1956; Greenwald, 196 
Rosenberg, 1956). 

If affect and cognitions are related 
thought tends to make cognitions more cox- 
sistent with the initial affect, it follows thai 
(a) when an individual is cut off from exter- 
nal information concerning some attitude oi 
ject, his attitude (i.e, affect/evaluation ) 
toward that object will tend toward greater 
polarization given that he thinks about the 
attitude object. Since thought involves proc- 
ess and is therefore time dependent, it also 
follows from the foregoing assumptions that 
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(b) when an individual is cut off from exter- 
nal information concerning some attitude ob- 
ject, his attitude toward that object will tend 
toward increasing polarization as the amount 
“of time spent thinking about the object 
. increases, at least over relatively short time 
durations. Naturally these two hypotheses are 
expected to hold only up to a point. Atti- 
tudes that are extremely polarized initially 
cannot become detectably more polarized. 
With an initially neutral attitude, either there 
are no existing cognitions, they are all evalua- 
tively neutral, or they are equally balanced 
(Edwards & Ostrom, 1971) such that neither 
evaluative direction is favored and therefore 
no change is to be expected. Furthermore, 
individuals are bound by reality constraints 
so that their cognitions are not expected to 
` become completely autistic. 
~ These hypotheses were tested in the Sadler 
and Tesser (1973) study. Hypothesis 1 was 
fully supported. Compared with subjects who 
were distracted from thinking about their 
partner, subjects who thought about a likable 
partner had a more positive attitude toward 
him and subjects who thought about a dis- 
likable partner had a more negative attitude 
. toward him. On the other hand, Hypothesis 2 
was not supported. There were three time 
conditions within the experiment (10 sec, 
1 min, and 4 min), as well as a 1-week 
follow-up. Time, however, appeared to have 
no systematic effect. The major purpose of 
the present series of experiments was to 
explore Hypothesis 2 further and to gen- 
eralize the validity of Hypothesis 1 through 
conceptual replication. 


EXPERIMENT 1 


` The first experiment was primarily con- 
cerned with testing the hypothesis that in the 
absence of external information, the longer 
one thinks about some attitude object the 
more polarized will his attitude toward that 
object become. This hypothesis was tested 
„over a variety of nonpersonal attitude objects. 


Method 


Subjects 


Students at the University of Georgia (17 males 
and 23 females) participated in this study as part of 


their introductory psychology course requirements. 
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Subjects were run individually or in groups of 2, 3, 
or 4. 


Procedure 


When each subject arrived for the experiment, he 
was seated at a desk which was partitioned from the 
other desks in the room and was given a booklet. 
Each booklet consisted of eight pages containing a 
single attitude item: the first page contained a prac- 
tice item, and the remaining seven pages contained 
items randomly selected for each subject from a pool 
of 61 heterogencous items. This procedure was em- 
ployed in order to ensure a wide sampling of items 
across subjects. The following are examples of items: 
“Prostitution should be legalized”; “A grading sys- 
tem of satisfactory-nonsatisfactory would encourage 
learning for the sake of learning”; “In many cases, 
revolution is the best way to correct political and 
social problems”; etc. 

The cover page of the booklet contained instruc- 
tions which included the following: 


This experiment is part of an in-depth study 
of attitudes. On each of the following pages is an 
attitude statement which we would like your 
honest opinion on. . . . When told by the experi- 
menter to begin, turn the page and read through 
the attitude statement. As soon as you have fin- 
ished reading the item, turn the booklet over and 
indicate on the first scale the extent to which you 
agree or disagree with that statement by marking 
the scale. [Instructions for using the scale which 
consisted of a 12-cm line anchored with Strongly 
Agree and Strongly Disagree were included.] 
As soon as you have indicated your immediate 
reaction to the statement, turn back to the page 
which had the attitude statement on it, and think 
about the issue until told to stop by the experi- 
menter. As you consider the issue, you may keep 
track of your thoughts by listing what you are 


thinking . . . on the scratch paper provided for 
you. ... If you can keep your attention on the 
attitude item the entire time without jotting 


down notes, you need not write anything down. 
The important thing is that you think about the 
item until the experimenter tells you to stop. At 
the experimenter’s signal, reread the item and then 

. indicate on the scale provided... the ex- 
tent to which you agree or disagree with the item 
at that particular moment. Your attitude may or 
may not completely agree with your past feelings, 
but we are interested only in your attitude at that 
particular moment. [The subject’s initial rating 
was not visible when the second rating was made.] 


When the subject had finished reading the instruc- 
tions, he was given a practice item. The experimenter 
verbally instructed the subject on the procedure 
he was to follow on the practice item as the subject 
completed that item. 


Thought Duration Manipulation 


Ten subjects were randomly assigned to each of 
four thought duration conditions: 30, 60, 90, or 
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% POLARIZATION 
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SECONDS OF THOUGHT 
Ficure 1. Percentage of attitudes becoming polar- 


ized as a function of thought (Experiment 2) and 
time thinking (Experiments 1 and 2). 


180 sec. After completing the practice item, each 
subject proceeded to read, rate, think about (and 
thought-list if desired) for 30, 60, 90, or 180 sec, 
reread, and rerate each of the remaining items. 


Results and Discussion 


Each subject’s initial rating of each atti- 
tude statement was examined. Statements 
that were rated within 4 cm from either ex- 
treme (agree or disagree) were eliminated 
from further consideration because ceiling ef- 
fects would prevent them from becoming 
more extreme. Also neutral statements (i.e., 
within 4 cm of the center of the scale) were 
eliminated, since change in either direction 
could be viewed as polarization. It was hy- 
pothesized that attitudes would become in- 
creasingly more polarized as the amount of 
time spent thinking about them increased. It 
is possible, however, that persons simply be- 
came more positive or more negative over 
time. If there were more items on which the 
subjects were initially positive than negative 
(or vice versa), such a tendency would sys- 
tematically bias the results. Therefore, to ex- 
plore and control for the possibility of a uni- 
directional shift, each of the usable attitude 
items were classified as positive if the subject 
initially agreed with the statement or negative 
if the subject initially disagreed with the 
statement. Two scores were computed for 
each subject: the proportion of positive atti- 
tudes he held which became more polarized 
(i.e., was rated, to the nearest $ cm, more 
Positively after thought) and the proportion 
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of negative attitudes he held w ar 
more polarized (i.e., was rated, to 

4 cm, more negatively after though 

use of this index instead of mean chan 
attitude is dealt with in the General Di 
sion section.) 

Preliminary analysis revealed no system 
atic sex differences, and the sex factor was 
eliminated from further analysis, The scores 
were subjected to a 4 (Thought Duration) x 
2 (Positive versus Negative Attitude)* analy- 
sis of variance with repeated measures on the 
second factor. Neither attitude direction alone 
nor its interaction with thought duration had 
a significant effect on attitude polarization, 
F <1, As predicted, however, polarization 
increased monotonically with thought dura- 
tion, F(3,36) = 4.83, p<.01. The means 
are shown graphically in Figure 1. 

While all four means are ordered as pre 
dicted, clearly it is the 180-sec condition 
which is most different from the rest. It is 
also noteworthy that although the hypothe- 
sis calls for increasing polarization with in- 
creasing thought duration, it is only at 180 
sec that the proportion of attitudes becoming 
more polarized exceeds 35%. Rather low 
levels of polarization were also obtained in the 
other experiments; this finding is discussed 
subsequently. In general, the data from this 
experiment support the conclusion that over a 
wide range of attitudes, increased time spent 
thinking about some attitude produces a 
greater likelihood of that attitude becoming 
more polarized. 


EXPERIMENT 2 


; The previous study supported the hypothe- 
sis dealing with time duration and attitude 
polarization. The second experiment was 


1 Since the items were randomly selected for each 
subject without regard for the subject’s initial atti- 
tude, some subjects did not have usable items in- 
cluding both positive and negative positions. Four 
subjects had no usable positive positions. Positive 
attitude polarization scores were estimated for these 
subjects by averaging their own negative attitude 
polarization score with the mean positive attitude 
polarization score of the remaining subjects. Negative 
attitude polarization scores were estimated in an 
analogous way for the three subjects without a 
usable negative position, that is, by averaging each 
subject’s own positive attitude polarization score 
with the mean negative polarization score, 
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undertaken to replicate those findings as well 
as to provide a test of the hypothesis that 
thought. (regardless of duration) produces 
greater polarization than no thought over a 
general sample of nonperson attitude objects. 
In this study an attempt was made to more 
systematically evaluate the effects of initial 
direction and extremity of attitudes. This was 
accomplished by pretesting subjects on a 
large number of items and then selecting for 
each subject those attitude items that con- 
formed to experimental requirements. 


Method 
Subjects 


In partial fulfillment of their introductory psy- 
chology course requirement, 32 female and 29 male 
students at the University of Georgia participated in 
this experiment. Subjects were randomly assigned to 
no-thought (m= 11), 45-sec thought (2 = 13), 60-sec 
thought (m= 12), 90-sec thought (= 13), and 180- 
sec thought (#=12) conditions without regard to 
sex. (Soon after the data for the no-thought, 60-sec, 
and 180-sec condition were collected, it was felt 
that it would be useful to have additional time 
points, and the 45- and 90-sec conditions were run.) 


Procedure 


Subjects filled out an attitude questionnaire con- 
sisting of 62 heterogeneous attitude statements in 
a group-testing session. Each item was followed by 
a 12-cm line labeled Strongly Disagree and Strongly 
Agree at the endpoints. Each subject indicated his 
attitude by putting a slash mark through the 12-cm 
lite below the statement. 

For each subject, eight items were selected to pro- 
vide two replications of a 2 (Attitude Direction: 
Agree versus Disagee) X 2 (Attitude Extremity: Low 
versus Moderate) design.? With .0 cm indicating 
maximal disagreement and 12.0 cm indicating maxi- 
“mal agreement, the selected attitudes fell into the 
following ranges: moderate disagreement, 2.0-3.5 
cm; low disagreement, 4.0-5.5 cm; low agreement, 
6.5-8.0 cm; and moderate agreement, 8.5-10.0 cm. 


2Jt proved impossible to select attitudes meeting 
all the specifications for several subjects. Of the 61 
scores in each. condition, 2 scores were estimated for 
low disagreement, 7 for moderate disagreement, 4 
for low agreement, and 3 for moderate agreement. 
Estimates were made by averaging each subject's 
own mean polarization score (over the items which 
were usable) with the mean of the condition for 
which the estimate was being made. For example, 
if a subject had no low-agreement attitude, his score 
im that cell was estimated by taking his own mean 
degree of polarization over the remaining three cells 
and averaging it with the mean degree of polariza- 
tion associated with the low-agreement cell over all 
subjects providing data in that cell. 
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When subjects arrived in groups of from one to 
four for the second session some 1 to 4 weeks after 
the group-testing session, they were seated at desks 
with partitions between them, Each was handed an 
attitude questionnaire booklet. On the cover page it 
was mentioned that the booklet contained a number 
of attitude statements about which subjects were to 
express their current feelings. Subjects in the thought 
conditions were further instructed to think about 
each issue after reading the statement until given a 
signal from the experimenter. They were to jot 
down a phrase or word for each thought during this 
process. Finally, all subjects were instructed how to 
use the 12-cm scale to indicate their attitude on 
each issue. 

Subjects in the no-thought condition then pro- 
ceeded to read and immediately indicate their atti- 
tude on each item. Subjects in the thought condi- 
tions read, thought about each item for 45, 60, 90, 
or 180 sec, and then indicated their attitude on each 
of the items. The items that each subject responded 
to were the eight that had been previously selected 
for him. 


Results and Discussion 


The subject’s attitude at the time of the 
initial group testing was compared with his 
attitude at the end of the second session for 
each item. The number of items (0, 1, or 2) 
on which the subject became more polarized 
(ie., his score, to the nearest $ cm, changed 
in the direction of his initial attitude) in each 
of four attitude categories served as the de- 
pendent variable. Since a preliminary inspec- 
tion of the data revealed no significant sex 
effects, the data for males and females were 
combined. A 2 (Attitude Direction) x 2 
(Attitude Extremity) X 5 (No Thought ver- 
sus 45, 60, 90, or 180 sec of Thought) un- 
weighted means analysis of variance with 
repeated measure across the first two factors 
was employed. Attitude direction had no sig- 
nificant effect on polarization either alone, 
F<1, or in interaction with thought, 
F = 2.40. Similarly, attitude extremity had 
no significant effect on polarization as a main 
effect, F=1.09, or in interaction with 
thought, F < 1. 

The effects of thought are shown in Fig- 
ure 1. (The main frequency of polarization 
measure was converted to percentage of polar- 
ization in order to make it comparable to the 
metric system used in Experiment 1.) It was 
predicted that among those given an oppor- 
tunity to think about their attitudes, the 
more time they spent thinking, the more po- 
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larized their attitude should become. This, in 
fact, was the case, as shown in Figure 1. The 
F associated with time within thought only 
approached significance, F(3, 56) = 2.20, $ 
< .10. However, the longest time (180 sec) 
differs significantly from the shortest time 
(45 sec) by individual comparison, F = 5.66, 
p < .05. Also, the overall F statistic is not 
sensitive to the predicted order of means. 
When this ordering is taken into account by 
testing the linear trend over time within 
thought, the null hypothesis is rejected at a 
more conventionally acceptable level of sig- 
nificance, F (1,56) = 14.59, p < .01. These 
time within thought data provide a nice 
replication of the results of Experiment 1. 

It was also predicted that subjects given an 
opportunity to think about their attitude 
would become more polarized than would 
those not given the opportunity. This is 
clearly not the case. The no-thought condi- 
tion mean is nonsignificantly, F < 1, higher 
than the combined thought conditions mean. 
The failure of the no-thought condition to 
result in lesser polarization than the thought 
conditions is distressing. Perhaps thought 
listing initially decreases the probability of 
polarization, and this effect dissipates over 
time. A close examination of the procedure 
used in this study, however, suggests another 
explanation which is more consistent with the 
initial reasoning. Subjects in the no-thought 
condition were simply asked to record their 
attitude. On the other hand, subjects in the 
various thought conditions were asked to list 
their thoughts concerning their attitudes. Per- 
haps subjects assumed that this thought list- 
ing would be viewed as a kind of defense of 

_ their attitude, and they therefore moderated 
their position so as to appear to have a more 
defensible position. (Such moderation shifts 
have been observed when a subject anticipated 
discussing his attitude with another, cf. Cial- 
dini, Levy, Herman, and Evenbeck, 1973). 
The net result of such a process would be 
to lower the polarization curve in the 
thought conditions relative to the no-thought 
condition. 


EXPERIMENT 3 


Experiment 3 was designed to provide an 
additional test of Hypotheses 1 and 2. Al- 
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though Sadler and Tesser (1973) found that 
thought produced more polarized attitudes 
than did no thought (ie. distraction), this 
was clearly not the case in Experiment 2. In 
Experiment 2, subjects in the thought condi- 
tions listed their thoughts, and subjects in 
the no-thought condition did not. In contrast, 
subjects in the present experiment, except for 
the opportunity to think about a particular 
attitude object, had identical tasks inter- 
vening between pre- and posttest. This should 
provide a setting for a more unambiguous 
test of Hypothesis 1. 

The hypothesis that the longer one thinks 
about some attitude object, the more like! 
it is that his attitude will polarize was 
quately supported in Experiments 1 and 
However, the stimuli for both studies wei 
drawn from the same pool, and the proce- 
dures were quite similar. Furthermore, using 
different stimuli and a different procedure, 
Sadler and Tesser (1973) failed to confirm 
the hypothesis. In the interest of generality, 
it was felt that an additional test of the hy- 
pothesis with new stimuli and procedure 
would be useful. 

Also, in Experiments 1 and 2, duration 
was the only parameter of thought manipu- 
lated. In the present study, frequency of 
thought was also explored. There is much 
evidence to suggest that the more frequently 
a subject is merely exposed to an attitude 
object, the more he tends to like it (cf. 
Zajonc, 1968). Additional evidence suggests 
that the effects of frequency of exposure are 
not always unidirectional but may, in some 
instances, lead to decreased liking, For exam- 
ple, Brickman, Redfield, Harrison, and Cran- 
dall (1972, Experiments 1 and 3) have 
shown that when the initial attitude toward 
some object is negative, the attitude becomes 
even More negative with repeated exposure, 
These findings are consistent with the present 
ae hr equates “mere exposure” to 

g.” It is impossible, however, to 
rule out the possibility that the subject is 
getting additional information from the stim- 
ulus itself each time he is e d to i i 
tude polarization in such Eien 
the resul i =a pnutices may be 

t of this new information rather th 
self-generated information. = 


Specific predictions concerning the effects 


EFFECTS oF TIME AND THOUGHT ON ATTITUDE POLARIZATION 267 


of frequency when total duration is held con- 
stant are difficult to make. On the one hand, 
one could argue that each time an individual 
begins to think about an attitude object, he 
tends to approach it from a different perspec- 
tive, and his thought processes are focused. 
As he continues thinking, he tends to con- 
tinue along the same cognitive track and his 
attention begins to wane. Therefore, with a 
greater number of instances of short-duration 
thought, a broader sample of the cognitions 
are touched upon with more concentration 
than is the case with fewer instances of long- 
duration thought. To the extent that there is 
any merit to these speculations, one would 
predict that polarization is a positive function 
of frequency of thought. On the other hand, 
it appears equally plausible to argue that there 
is a good bit of cognitive inertia. In order for 
thought to have an impact on cognitions, one 
must sustain thought about the relevant cog- 
nitions. Frequent episodes of rather short- 
duration thought might interfere with this 
process. If this is true, one would expect po- 
larization to be negatively related to fre- 
quency of exposure where duration is con- 
trolled. Finally, there is the possibility that 
frequency is unrelated to polarization. 


Method 

Subjects 

In partial fulfillment of their introductory 
psychology course requirement, 54 female and 
56 male students at the University of Georgia 
participated in this experiment. They were 
randomly assigned, in equal numbers, to 
treatment conditions without regard to sex. 


Procedure 

Subjects were recruited for an experiment 
concerned with aesthetic taste and general in- 
formation. When each subject reported for 
the experiment, he was seated at a table fac- 
ing a screen. On the table was a board con- 
taining a signal light next to a button marked 
Think, eight screw heads labeled from A to 
H, and a stylus. 

Each subject was told to indicate how each 
of 20 slides of news photos * “makes you feel 


3 These photos were selected by the authors from 
several photo essay books. They were all in black 
and white and were chosen so as to elicit a range 


—that is, whether it makes you feel good or 
bad, pleasant or unpleasant” by touching the 
stylus to one of the screw heads labeled A 
(maximally unpleasant) through H (max- 
imally pleasant). After the subject indicated 
that he understood the rating procedure, he 
was shown a focus slide and each of the 20 
stimulus slides for about 2 sec each. This was 
intended to give him an idea of what the 
range of slides was like before rating them. 
The subject was then shown each slide again, 
and he indicated how each made him feel. 

Initial attitude direction was manipulated 
by selecting a slide rated C (moderately nega- 
tive) for half of the subjects and a slide rated 
F (moderately positive) for the remaining 
subjects. 

The signal light was illuminated, and the 
subject was told by the experimenter, 
I am now going to show you one of the slides again, 
Whenever the signal light that is on now is illumi- 
nated, I want you to think about the particular 
photograph that I’m about to show you. I’d just like 
you to think about what the photograph is like, how 
you feel about various features of it, what you like 
or dislike about it, whether it makes you feel good 
or bad, and so on. You are supposed to just think 
about the photograph and how you feel about it the 
whole time that the signal light is illuminated... . 
Now, except for this one time, when the signal light 
is illuminated, the photograph will not be shown at 
the same time. You will not be shown the photo- 
graph again but are just to think about it when the 
signal light is on. 


Subjects were not instructed to list their 
thoughts or keep track of them in any way 
for the experimenter. 

Following these instructions, the slide pre- 
viously selected for the subject was projected 
on the screen for 10 sec. The signal light was 
illuminated during the same 10 sec and then 
turned off. 

Subjects were then instructed how to fill 
out a booklet consisting of four pages of 
multiple-choice general information questions. 
These questions were used to prevent the sub- 
jects from thinking about the target photo- 
graph. 

The duration and frequency manipulations 
were accomplished by varying the length and 
number of times the Think signal light was 


of feelings from positive (e.g., a smiling child) to 
negative (e.g., a poverty-stricken woman sitting in 
a dilapidated shack). 


Us). a.” 
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TABLE 1 


NUMBER or Supjects WHOSE ATTITUDE BECAME 
MORE POLARIZED IN EXPERIMENT 3 


Thought condition 


% eae EN 
Acude change l |O thought, 


condition | Short dura- | Long dura- 
tion tion 
(28 sec) (90 sec) 
Polarized 1 7 14 
Not polarized 21 37 30 


illuminated. In short-duration conditions, the 
light was illuminated for a total of 28 sec; in 
the long-duration condition, the light was 
illuminated for a total of 60 sec. In the low- 
frequency condition, the Think light was 
illuminated once for the total duration e; 
28 or 60 sec) after the subject worked 4 min- 
utes on the information items. In the high- 
frequency condition, the Think light was il- 
luminated four times (i.e., 7 or 15 sec each 
time), once after each minute the subject 
worked on the general information questions. 
In the no-thought condition, subjects worked 
on the information items for 4 minutes and 
the Think light was not illuminated. 
Finally subjects were told, 

I would now like you to once again indicate how 
you feel about the photograph that I showed you 
before, since sometimes people's feelings shift even 
over as short a period of time as this. Of course, 
you may or may not feel the same way about the 
photograph. Just indicate how the photograph makes 
you feel now.4 


There are a couple of aspects of the pro- 
cedure worth noting. First, since the no- 
thought group differed from the thought 
groups with respect to thought listing in Ex- 
periment 2, no subject, regardless of condition, 
was asked to list, make known to the experi- 
menter, or keep track of his thoughts in any 
formal way in this experiment. Second, all 
subjects, regardless of condition, worked on 
the information questions for 4 minutes. In 


4In an attempt to increase the overall rate of 
polarization, about halfway through the experiment 
an additional sentence was added to this instruction: 
“Sometimes people’s feelings become even more 
pleasant than they originally were, and sometimes 
they become even more unpleasant.” Since this 
instruction did not have the desired effect and 
seemed to create too great a “demand,” it was 
dropped from the present experiment, as were the 
subjects who had received it. 
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sum, conditions differed in that subjects were 
either given an opportunity to think about 
the object or not, and that opportunity wa 
either long or short, with a frequency of on 
or four. 


Results and Discussion 


The data were analyzed in terms of whether 
or not a subjects attitude became more 
polarized. The attitude of subjects in the posi- 
tive direction condition were about as likely 
to become more polarized (18%) as those in 
the negative direction condition (22%). Sub- 
jects who thought about the photograph once 
were about as likely to exhibit a shift to a 
more extreme attitude (23%) as those who 
thought about the photograph four times 
(25%). Perhaps the difference between 
thinking about the picture one or four times 
was not extreme enough for measurable ef- 
fects to have resulted; or frequency of 
thought might be irrelevant to attitude polari- 
zation when duration of thought is controlled; 
or whatever cognitive work is accomplished 
by thought is unaffected by interruption or 
by renewed effort. It is worth noting that the 
typical “mere exposure” effect of increased 
positivity with higher frequencies of exposure 
was also not obtained on this index. This is 
not too surprising in view of the differences 
between this study and the typical mere ex- 
posure study. In the present study the di fer- 
ence between frequencies was small, frequency 
was a between-subjects factor (cf. Zajonc, 
Swap, Harrison, & Roberts, 1971), frequency 
was not confounded with duration, and ‘he 
stimulus was not exposed but only tho ht 
about. 

Duration of thought with frequency 
constant did have an effect. (Table | 
sents frequency of polarization collapsed « 
attitude direction and frequency of thou; 9 : 
Subjects who thought about the photog: 
for 60 sec were twice as likely to exhibit A 
tude polarization (32%) as were subjects vi 
thought about the photograph for 
(16%). Although this difference falls < 
short of conventional significance level 
.07, Fisher’s exact test) it provides a repi 
tion of the time effects found in Experiments 
1 and 2 with different materials and a dif: 
ent procedure, 
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Finally, the hypothesis that thinking about 
an attitude object should produce greater 
polarization than not thinking about the atti- 
tude object was supported. Only 1 of the 22 
subjects in the no-thought condition had a 
more extreme attitude on the posttest com- 
pared with 24% of the subjects in the com- 
bined thought conditions (p < .03, Fisher’s 
exact test). Furthermore, a pilot study using 
slides of paintings instead of news type photos 
yielded similar results. Two of 14 subjects 
had attitudes which became more polarized in 
the no-thought condition compared with 12 
out of 21 subjects in thought conditions (p 
< .01, Fisher’s exact test). 


GENERAL DISCUSSION 


Taken as a whole, the results of the three 
experiments reported here seem to provide 
adequate support for the two major hypothe- 
ses. The hypothesis that thought (regardless 
of duration) is more likely to result in atti- 
tude polarization than no thought was not 
supported in Experiment 2, but it was strongly 
supported both in the pilot study for Experi- 
ment 3 and in Experiment 3 itself. 

The hypothesis that given an opportunity 
to think about some attitude object, the longer 
one spends time thinking, the more likely it is 
that one’s attitude will polarize was also con- 
firmed. It is noteworthy that confirmation 
was obtained in two different paradigms with 
two different kinds of stimuli, complex atti- 
tude statements and news type photos. 

Although both hypotheses seem to have 
been supported and replicated, there remains 
an intriguing anomoly in the data. The de- 
pendent measure reported in each study was 
not degree of polarization but rather the pro- 
portion of attitudes changing in a polarized 
direction, or probability of polarization. While 
mean degree of polarization followed a 


- roughly similar pattern over the experiments, 


there were no significant differences on the 
measure. The nature of the probability of 
polarization index suggests a reason for the 
difference. The index treats no change and 
depolarization as equivalent. Suppose in- 
creases in polarization are accompanied by 
increases in depolarization (and decreases 
result in no change). The means would be 
relatively unaffected by thought, and the in- 


terpretation of the results would shift from 
one dealing with polarization to one dealing 
with change per se. In order to examine this 
possibility, a probability of depolarization 
index was constructed. In Experiment 1, the 
relationship between time and depolarization 
was, with one reversal, negative (mean de- 
polarization for 30-, 60-, 90-, and 180-sec 
conditions was 58%, 40%, 52%, and 31%); 
in Experiment 2, the relationship over dura- 
tion within thought was also negative (mean 
depolarization for 45-, 60-, 90-, and 180-sec 
conditions was 57%, 53%, 46%, and 42%; 
for the no-thought condition, 43%); in Ex- 
periment 3, there was more depolarization in 
the no-thought condition (50%) than the 
combined thought conditions (39%), although 
there was no difference associated with dura- 
tion within thought. It appears that it is not 
change per se that is affected by thought, and 
the lack of sensitivity with mean degree of 
change cannot be accounted for in these terms. 
Perhaps degree of polarization is simply a 
less reliable index than whether or not an 
attitude changes in a more polarized direc- 
tion. With the former index, one or two ex- 
treme shifts in either direction can have a 
much greater impact on a condition mean than 
is possible with the latter index. 

A casual glance at Figure 1 and Table 1 
reveals that the absolute level of probability 
of polarization is quite low: It reaches .5 
only after 3 min in Experiments 1 and 2 and 
never reaches .5 in Experiment 3 (however, 
the long-duration period in that experiment 
was only 60 sec). One explanation for this is 
that since the items were all selected to be 
different from the neutral point of the scale, 
their means would tend to regress back to the 
neutral point, and hence, the amount of 
polarization would appear to be less. Statisti- 
cal regression may provide a partial expla- 
nation, but it does not appear to provide a 
full account. Although the items used were 
all somewhat different from the theoretical 
neutral point of the scale, there is no reason 
to expect that they are very divergent in 
either direction from the empirical population 
mean on the item. Another explanation for 
the low probability of polarization is the fact 
that the subjects wrote their thoughts con- 
cerning each attitude object. The subjects 
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may have regarded this as a kind of defense 
of their attitude, and a moderate position is 
more defensible given any but a univalent set 
of cognitions. This explanation, however, does 
not hold for the data collected in Experiment 
3, where subjects did not note their thoughts. 
On the other hand, perhaps one should not 
view this as a particularly low level of polari- 
zation. Each of the curves were monotonic 
over time, which suggests that over longer 
periods of time, the probability of polarization 
would be higher. 
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Comparison of Social Judgments of Creativity and Intelligence 


Betty B. Rossman and Harry F. Gollob 
University of Denver 


Social judgment of creativity and intelligence was investigated by having 
subjects give trait ratings for stimulus profiles of actual art students based on 
abilities information alone, based on personality information alone, based on 
biographical information alone, and, finally, based on the total set of all three 
types of cue information. Regression equations differed depending upon 
whether intelligence or creativity judgments were being predicted. For all types 
of information, there was a significant amount of variation unique to creativity 
judgments which could not be accounted for by intelligence judgments, and 
vice versa. However, the proportion of variation unique to each type of trait 
judgment was smallest when subjects had only abilities information available 
and was approximately four times larger when judgments were based on the 
total set of three types of cue information. Implications of this finding for 


the study of creativity are discussed. 


Creativity represents a scientifically elusive 
concept of current interest from both a psy- 
chometric and educational point of view, but 
one which has not been carefully examined 
from a judgmental point of view. Existing 
research has suggested that creativity mea- 
sures do not account for noteworthy variation 
which is independent of that accounted for by 
intelligence measures, and, on this basis, sev- 
eral investigators have questioned the useful- 
ness of pursuing creativity as an empirically 
distinct concept (e.g., McNemar, 1964; Ver- 
non, 1964; Wallach & Kogan, 1965). This 
well-known discriminant-convergent valida- 
tion controversy notwithstanding, some in- 
vestigators (e.g., MacKinnon, 1967; Rossman 
& Horn, 1972; Taylor, 1964) are still of the 
opinion that creativity is an identifiable con- 
cept, but one which is multifaceted and may 
have to be assessed by using more diverse 
measures than those typically used in the 
abilities-achievement domain. 

This article takes a different approach to 
this problem by looking not at how people 
perform on tests of creativity and intelligence, 
but rather by looking at how people form 
judgments of creativity and intelligence. Cur- 
rently, such judgments often serve as criteria 
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for the validation of proposed measures of 
creativity. Therefore, we reasoned that it 
would be useful to try to understand more 
fully how people go about making judgments 
of creativity and intelligence and to compare 
the ways in which these judgments differ 
from each other. Even though current abili- 
ties tests may not discriminate adequately 
between creativity and intelligence, it is 
known that people do make these two differ- 
ent kinds of judgments with some frequency 
and confidence. Thus, we evolved the follow- 
ing three hypotheses regarding the relation- 
ship between judgments of creativity and in- 
telligence. 

From the review of creativity research by 
Freeman, Butcher, and Christie (1968) we 
note that, in general, attempts to separate 
creativity and intelligence using tests in the 
cognitive domain have been unsuccessful. For 
example, work by Hasan and Butcher (1966), 
Cropley (1966), and the reanalyses which 
Burt (1962) and Marsh (1964) made of the 
Getzels and Jackson (1962) data failed to 
demonstrate clear differences between cogni- 
tive measures of creativity and intelligence. 
In addition, Taylor, Smith, Ghiselin, and 
Ellison (Note 1) found that ability tests were 
not effective in distinguishing between people 
who had been designated as creative or non- 
creative based on real-life achievements. Thus, 
it is not surprising that researchers such as 
MacKinnon (1962, 1965, 1967) have empha- 
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sized the importance of assessing biographical, 
temperamental, and motivational factors in 
addition to ability factors when investigating 
creativity. Taken together this information 
suggests that the often reported substantial 
relationship between creativity and intelli- 
gence may be especially high due to the fact 
that many studies investigating this relation- 
ship emphasized only abilities-oriented infor- 
mation. Thus, we hypothesized that judgments 
of creativity and intelligence would be most 
highly related when only ability information 
was available and would display substantial 
divergence when a variety of cue information 
was available. 

Second, while we thought that judgments 
of creativity and intelligence would be sub- 
stantially related, we also thought that people 
would hold different impressions of what is 
important for judging creativity as opposed 
to intelligence. Thus, we hypothesized that 
when judges are provided with diverse infor- 
mation, the regression equations which best 
predict creativity and intelligence judgments 
would be different both in terms of the cues 
included in the equations and in terms of the 
weights given to any cues which were com- 
mon to both equations. And, finally, we hy- 
pothesized that the cues which would be con- 
sidered most important by judges for making 
one type of judgment would be more effective 
in accounting for that type of judgment than 
in accounting for the other type of judgment. 

Wiggins, Hoffman, and Taber (1969) have 
examined individual differences in judgments 
of intelligence, but did not investigate judg- 
ments of creativity. They worked with stimu- 
lus profiles generated by using factorial com- 
binations of standard cue values which fall 
primarily into the abilities-achievement do- 
main. In order to examine our hypotheses, 
we used stimulus profiles which differed from 
theirs in at least two respects: we provided 

judges with profiles containing a wide variety 
of abilities, biographical, and personality in- 
formation; and our stimulus profiles were for 
actual college students who had been tested 
and judged previously with regard to creativ- 
ity and intelligence (Rossman & Horn, 1972). 
This allowed us to see if judges used informa- 
tion differently depending upon whether they 
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were making judgments of creativity or in- 
telligence and to compare judgments formed 
previously by professors and peers of our 
actual stimulus persons with those formed by 
the subjects in the present study who knew 
our stimulus persons only through descriptions 
we provided. 


METHOD 
Overview 


College student subjects completed two replications 
of a questionnaire which consisted of four sections. 
Each of the four sections presented a different set of 
information relevant to each of the 18 stimulus 
persons. Within each section of the questionnaire, 
subjects considered the information presented and 
made separate judgments of the creativity and 
intelligence for each of the 18 stimulus persons. One 
section of the questionnaire, P, provided only person- 
ality cue information; another section, A, presented 
only abilities information; another section, B, pre- 
sented only biographical cues; and a final section, 
T, contained the total set of the three types of cues 
for each stimulus person. Although each subject 
made four different judgments of creativity and 
intelligence for the same 18 stimulus persons, they 
were led to believe that they were making judgments 
of 72 (i.e., 4X 18) different people. Thus, the basic 
research design can be represented as a seven-way 
analysis of variance which includes subjects, 2 
replications, 4 types of cue information sections, 18 
stimulus persons, 2 kinds of trait judgments, 6 orders 
of presentation of the cue information sections, and 
2 sexes, in which subjects are nested in order and 
sex, and all other factors are crossed. 


Subjects, Order, and Replication 


A total of 24 college students from an Introduc- 
tory Psychology class participated in this study to 
fulfill part of their research requirement. Two males 
and two females were randomly assigned to one of 
the six order groups, an order group being defined 
by the order in which different sections of cue infor- 
mation were presented in the questionnaire (ie, 
PAB, APB, BAP, ABP, BPA, PBA). The total cue 
section was presented last in all questionnaires. For 
e ple, in the PAB order group, subjects rated the 
creativity and intelligence of all 18 stimulus persons 
on the basis of personality cues alone, then on the 
basis of ability cues alone, then on the basis of 
biographical cues alone, and finally on the basis of 
personality, ability, and biographical cues combined. 
The two replications were obtained by having sub- 
jects complete the questionnaire twice, once in each 
of two 1-hour sessions separated by about 1 week. 
The replication questionnaires completed by a given 
subject were identical except that for each subject 
the order of stimulus profiles within each cue sec- 
tion was varied between replications. 


SOCIAL JUDGMENT OF CREATIVITY AND INTELLIGENCE 


Stimulus Profile Construction 


The basic data used in making up stimulus pro- 
files came from a previous study (Rossman & Horn, 
1972) in which half of the sample consisted of 94 
college students majoring in art for whom the fol- 
lowing information was obtained: abilities and non- 
abilities measures related to creativity and intelli- 
gence; biographical characteristics related to creativ- 
ity and intelligence; and “criterion” judgments of 
creativity and intelligence from peers and faculty 
members. From these 94 students, the 18 stimulus 
persons were selected such that 6 of the stimulus 
persons were at least one standard deviation above 
the mean on peer creativity judgments, 6 were close 
to this mean, and 6 were at least one standard 
deviation below this mean in order to achieve some 
degree of between-profile variability in levels of 
judged creativity. 

Information provided by the results of previous 
studies (e.g., Anastasi & Schaefer, 1969; Domino, 
1970; Rossman & Horn, 1972; Taylor & Holland, 
1964) and by pilot subjects who were asked to list 
traits which they felt described highly creative 
people and highly intelligent people, was used in 
selecting the final personality, abilities, and biograph- 
ical cues. The 21 cues selected were those which 
seemed most relevant to the assessment and judg- 
ment of creativity and intelligence which also had 
reasonable counterparts in the measures obtained by 
Rossman and Horn. For each of the 18 stimulus 
persons, their real scores on the cue variables num- 
bered from 1 to 19 in Table 1 were separately coded 
into the three categories above average, average, and 
below average with average merely representing the 
arithmetic mean for the 18 stimulus persons. The 
two remaining cues numbered 20 and 21 were di- 
chotomous. In making up the stimulus profiles the 
recoding of the 21 cues was considered preferable to 
presenting actual scores on each cue variable be- 
cause the recoding would reduce subject confusion 
resulting from trying to integrate many widely 
varying numbers and also would give the cues more 
meaning relative to the sample of profiles being 
judged. Since we wanted to select cues which were 
most salient in affecting judgments of creativity and 
intelligence, we made no attempt to select cues with 
any particular patterns of intercorrelations. As it 
turned out, the correlations among the cues, based 
on the code values assigned to them (i.e., = above 
average, 2=average, 1= below average, and 1= 
yes or 0= n0, depending on the type of variable), 
ranged from —.72 for independence with high 
school grade point average to .73 for independence 
with enjoying unusual things. The matrix of inter- 
correlations among all the cues and criterion judg- 
ments is presented in Table 1. 


Questionnaire Construction 


The 21 cues for each of the 18 stimulus persons 
were assembled into a questionnaire having the four 
sections mentioned above, with the P section in- 
cluding cues numbered 1-5 in Table 1, the A section 
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containing cues 6-13, the B section having cues 14-21, 
and the T section presenting all 21 cues for each 
profile. Although the first three sections were pre- 
sented in different orders for the different order 
groups, there was no formal separation between 
sections. Thus, subjects simply worked through the 
questionnaire making creativity and intelligence judg- 
ments for each profile they encountered, being in- 
structed to think of each of the 72 profiles as be- 
longing to a different real person. The 18 profiles 
within each cue section were put in a different 
shuffled order for each subject. In part, the subjects 
were instructed to respond as follows: 


In working in the area of creativity we have 
found four sources of data which have in common 
the fact that information was gathered about col- 
lege students who were majoring in art. We have 
used information from these sources to make up 
profiles for particular students who participated in 
the research, and it is these profiles that you are 
being asked to judge. 

Each of the following pages contains a profile 
for a real person who is majoring in art at a 
University similar to this one. Therefore, when 
information on a profile shows that the person 
described is above average in, say, independence, 
that means that he is above average in indepen- 
dence for a college student. We have used the 
labels “above average,” “average,” and “below 
average” to make your task somewhat easier and 
more meaningful than it might be if we expected 
you to make sense out of lots of specific numerical 
scores. 

We want you to carefully study each profile in 
turn and make separate judgments of how cre- 
ative and how intelligent you think the person 
described is. Each profile represents information 
about a different real person so please try to make 
separate judgments for each profile. Placed below 
each profile are the following scales: 


yuncep creativity: (for a college student) 


-7 —6 -5 —4 —3 —2 -1 0 +1 42 +3 +4 +5 +6 +7 


High 
Creativity 


Average 
Creativity 


JUDGED INTELLIGENCE: (for a college student) 


~7 —6 —5 —4 —3 —2 —1 0 +1 +2 +3 +4 +5 +6 +7 


High 
Intelligence 


Low Average 
Intelligence Intelligence 


Instructions for using these rating scales were then 
presented. 


Direct Judgments oj Cue Importance 


At the end of the second session each subject 
completed a final questionnaire which asked him to 
think of a college student whom he knew personally 
and whom he regarded as highly creative. The sub- 
ject was then asked to mark all of the 21 cues on 
which he thought that that person was above 
average for a college student and then to rate that 
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person on scales identical to those presented above 
as to his overall degree of creativity and intelligence. 
He also was asked to do this for a different college 
student he knew whom he regarded as highly in- 
telligent. Finally, each subject was asked to rank 
from 1 to 5 the five cues he regarded as most im- 
portant in making his judgments of creativity and 
then was asked to do this for his judgments of in- 
telligence. This was done to provide some informa- 
tion as to whether the kinds of cues subjects consid- 
ered to be most important for making judgments 
would show up as being most important in regres- 
sion equations used to predict the creativity and 
intelligent judgments. 


RESULTS AND DISCUSSION 


In the following discussion we first consider 
results of the study with regard to general 
factors in the design and then present evi- 
dence more directly concerned with the con- 
vergent-discriminant validation issue. 


Effects Due to Replication, Order, Cue Type, 
Sex, and Profile 


Separate analyses of variance were per- 
formed for the creativity and intelligence 
judgments. The six factors included replica- 
tion, order, cue type, profile, sex, and subjects 
nested within order and sex in which subjects 
were regarded as random and all other factors 
were regarded as fixed. Sources of variation 
involving repeated measurements of the type 
obtained in the present study typically vio- 
late the homogeneity of covariance assump- 
tion. Therefore, the Geisser-Greenhouse con- 
servative procedure (Winer, 1971) was em- 
ployed for testing levels of significance for 
sources of variation which included profile 
and cue type. The only effects which were 
significant at at least the .05 level in both 
the creativity and intelligence analyses were: 
order, profile, cue type, and the Profile x 
Cue Type interaction. 

In both analyses the means for the profile 
main effect indicated that subjects saw differ- 
ences among the stimulus profiles. The Cue 
Type X Profile interaction indicated that 
-profiles were ranked somewhat differently de- 
pending upon the type of cue information 
used to make the judgment. Although the cue 
type main effects were statistically significant, 
the differences between the lowest and high- 
est means were fairly small, with the range 
being .27-1.53 for intelligence judgments and 
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.19-.77 for creativity judgments. The order 
main effect means for creativity judgments 
were ordered from high to low, BPA, BAP, 
ABP, APB, PAB, and PBA, and those for 
intelligence judgments were ordered identi- 
cally except for an ABP-APB reversal. 
Though order may be an interesting variable 
for further study, the present order effect was 
significant at only the .05 level and a follow- 
up pilot study showed a nonsignificant order 
effect. Thus we did not consider these order 
effects compelling enough to warrant further 
analysis or discussion in the present context. 


Differences between Judgments of Creativity 
and Intelligence 


The first question to be investigated in our 
data is whether our subjects’ judgments of 
creativity and intelligence differed in a reli- 
able fashion. Since significance tests derived 
from conventional ways of answering this 
question were not appropriate for our data 
due to the lack of independence of the 18 
mean ratings of the stimulus persons, we used 
a different and less conventional approach in 
order to test significance. The same basic 
analysis was done separately for the creativ- 
ity and intelligence judgments which were 
made on the basis of ability information 
alone, on the basis of personality informa- 
tion alone, on the basis of biographical infor- 
mation alone, and on the basis of the total 
set of cue information. To illustrate the type 
of analysis done, the following paragraph 
describes the analysis of the creativity judg- 
ments which were made on the basis of ability 
cues. 

First, a Subject (Sex) X Profile X Replica- 
tion analysis of variance in which sex was a 
between-subjects variable and profile and 
replication were within-subjects variables was 
performed using creativity judgments from 
the ability cue section as the dependent varia- 
ble. All sources of variation which included 
profile as a factor were then partitioned into 
a part which could be accounted for by the 
intelligence judgments based on ability cues 
and an orthogonal part which could not be 
accounted for by these intelligence judgments. 
This partitioning was done by first construct- 
ing a profile contrast by taking the mean 
ability-cue intelligence rating for each of the 


276 


18 profiles and scaling these means so they 
summed to zero (see Winer, 1971, for a dis- 
cussion of use of contrasts in analysis of 
variance). After computing the weights which 
defined the contrast, standard analysis of 
variance techniques were used to do the de- 
sired partitioning and significance testing. Of 
central interest in answering our question are 
the results of the partitioning of the profile 
main effect. If that portion of the creativity 
judgment profile main effect which is ac- 
counted for by the intelligence contrast is 
significant, this means that the correlation 
between mean creativity and intelligence judg- 
ments is significant (i.e., that there is a sig- 
nificant amount of overlap between the two 
measures). In addition, if the creativity vari- 
ation which is not accounted for by the in- 
telligence contrast is significant, this means 
that a statistically significant amount of reli- 
able variation in mean creativity judgments 
is not accounted for by the mean intelligence 
judgments. As mentioned above, this type of 
analysis was carried out for creativity and 
intelligence judgments from each of the four 
cue-type sections of the questionnaire. In none 
of the analyses were sources of variation 
which involved sex or replication noteworthy. 
The amount of overlap between mean cre- 
ativity and intelligence ratings was significant 
in every case (p < .0001). However, discrimi- 
nant. validity was supported by the fact that 
the variation in creativity judgments which 
was not accounted for by the mean intelli- 
gence judgment contrasts was highly signifi- 
cant for all four types of information (p < 
0001). For ability, personality, biographical, 
and total information, respectively, the 
amount of total creativity variation not ac- 
counted for by the intelligence ratings was 
about 16%, 24%, 17%, and 49%. Four di- 
rectly analogous analyses were done for the 
intelligence ratings, but using the mean cre- 
ativity ratings to construct the contrast. For 
each type of information there was a signifi- 
cant (p < .05 for biographical data and p < 
.0001 for all other data) amount of variation 
in mean intelligence ratings which was not 
accounted for by the mean creativity ratings. 
Taken together, these analyses suggest that 
our subjects’ judgments of creativity and in- 
telligence differed from each other in some 
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systematic fashion. The following 
discuss specific ways in which these 
differed. 


Judgments of Creativity and Intel 
Function of Different Types of In) 


Analyses of total cue judgments. in 
by using a linear regression model t 
gate the role of different types of in! 
in our subjects’ judgments of cre y 
intelligence. Two regression equations were 
calculated: one using the mean profile creativ- 
ity judgments based on total cues a ri 
terion and allowing mean profile jud 
either creativity or intelligence based on the 
separate personality, abilities, or biographical 
information sections to enter as predictors; 
and the other using the mean profile intelli- 
gence judgments based on total information 
as a criterion and allowing all of the sam 
predictors to enter. Regression equations were 
calculated using the stepwise multiple regre 
sion program BMD-02R (Dixon, 19 { 
that the additional variance provided b» 
new predictor would maximize R? and could 
be examined. Retaining the order in whi 
variables were selected by the progran 
of the following equations resulted in © 1: 
ple R? of .98, with each predictor « nti 

for statistically significant additio: 
tion: * 
Cy = .53C, + -26C, + .26C,, 20 (4) 


I, = 571, + 32, + .131, - 


1 Using a procedure which is describ 
detail in Gollob and Rossman (1973, pp 
the significance of each additional 
Equations 1 and 2 was tested by allow t pre 
dictor in turn to define a contrast « ij 
the preceding ones and then ascertaining 
variance contribution of the contrast dei 
successive predictor was statistically sig 
Subject in Sex X Replication X Profile 
variance. For both the creativity and 
judgments each predictor in turn d 
significantly in accounting for the total 
effect variation (p< .01). In addition, + 
Greenhouse test using F with 1 and 22 
freedom indicated that the residual pro 
tion (i.e., that main effect variation not 
for by the appropriate regression equation) 
significant for either the intelligence judgments o 
the creativity judgments. Thus, it is clear that 
Equations 1 and 2 did well in accounting for th 
variation in mean creativity and intelligence jud 
ments. j 
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in which C and I, respectively, indicate 
whether a creativity or an intelligence judg- 
ment is involved, and the subscripts t, a, p, 
and b are used, respectively, to indicate 
whether the type of information on which 
the judgment was based was total cues (i.e., 
all information), ability information alone, 
personality information alone, or biographical 
information alone. Although all predictor 
variables could have entered each of the 
equations, the C, judgments were best pre- 
dicted by other judgments of creativity, and 
the I, judgments were best predicted by other 
judgments of intelligence. This is what one 
would expect if indeed our subjects had dif- 
ferent conceptions or stereotypes of creativity 
and intelligence. 

Regression equations for C, and I, were 
also calculated for each subject to provide 
comparison values for Equations 1 and 2 
which were based on the mean data. The step- 
wise regression algorithm was allowed to se- 
lect the best three-predictor regression equa- 
tion for C, and for I; for each subject. Exami- 
nation of these results indicated that these 
equations were quite similar to those based on 
the mean data and resulted in an average R° 
for predicting C; of .78 and an average R? 
for predicting I, of .81. Moreover, when each 
individual subject’s Ca, Cy, and Cp judgments 
were used to predict his C; judgments a mean 
R? of .76 resulted; and when each subject’s 
In, Ip, and I, judgments were used to predict 
his I, judgments, a mean R? of .73 resulted. 
Thus, it is clear that Equations 1 and 2 are 
also useful in accounting for the judgments 
of individual subjects. 

We now examine the criterion differentia- 
tion problem by using the Ca, Co, Crp, Ta, Th, 
and Ip judgments which were obtained in the 
present study to predict the “criterion” judg- 
ments of creativity and intelligence which 
Rossman and Horn (1972) obtained in an 
academic environment. The criterion judg- 
ments are the actual creativity and intelli- 
gence ratings given previously by peers and 
faculty members who knew the 18 art stu- 
dents. Conceptually it is possible to think of 
Rossman and Horn’s criterion judgments as 
similar to the Ce and I; judgments obtained 
in the present study, since they are also over- 
all judgments of creativity and intelligence. 
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Thus, any differences between the equations 
obtained for these criteria and Equations 1 
and 2 may be of interest. Being unable to 
test adequately the significance of each pre- 
dictor in these equations, we elected to retain 
only those predictors which accounted for at 
least an additional 15% of the variation. The 
resulting stepwise regression equations were: 


Peer Creativity Ratings 


22 Lp tr slid perso lon = LN 
Faculty Creativity Ratings 
= 381, + 3.41 (R = 61), 
Peer Intelligence Ratings 
= 101, + .091, + 3.38 (R? = .50), 
Faculty Intelligence Ratings 
= .281, + 3.35 (R = .50). 


These equations differ rather strikingly from 
Equations 1 and 2 in that both creativity and 
intelligence ratings from the Rossman and 
Horn (1972) study were best predicted by 
intelligence judgments. These equations sug- 
gest that all of the actual criterion judgments 
from this previous study may be better un- 
derstood in terms of our subjects’ judgments 
of intelligence, while our subjects’ judgments 
of creativity are not important in accounting 
for the Rossman and Horn criterion judg- 
ments of either creativity or intelligence. In 
order to understand this interesting outcome 
it is important to remember that both types 
of criterion judgments were made in an aca- 
demic setting in which the type of informa- 
tion available to judges is more likely to be 
relevant for judgments of intelligence than for 
judgments of creativity. Let us now look 
more carefully at how the type of informa- 
tion available to judges may influence judg- 
ments of creativity and intelligence. 
Discriminant validity as a function of the 
diversity of information available to the 
judge. In the introduction we hypothesized 
that judgments of creativity and intelligence 
would show more divergence when a wide 
range of information was available to the 
judge than when only ability information was 
considered in making the judgments, Indirect 
support for this hypothesis was provided in 
our above comparisons of regression equa- 
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Ficure 1. Proportion of common, unique and 
unreliable variation in creativity and intelligence 
judgments based on ability cues (upper circles) and 
based on total cues (lower circles). 


tions for criterion judgments. More direct 
evidence in support of this hypothesis is pre- 
sented in Figure 1. 

Separately for abilities information alone 
and then for abilities, personality, and bio- 
graphical information combined (i.e., for the 
total cue section), the creativity and intelli- 
gence judgments obtained in the present study 
were partitioned into the following compo- 
nents as illustrated in Figure 1: unreliable 
variation in intelligence judgments; reliable 
but unique variation in intelligence; variation 
in intelligence which was reliable but over- 
lapping with creativity; reliable but unique 
variation in creativity; and unreliable varia- 
tion in creativity judgments.? Examination of 


*Here we describe the method by which the 
percentages shown in Figure 1 were computed. Let 
T, G, I, and Cz denote the mean intelligence and 
mean Creativity ratings obtained for the 18 stimulus 
Profiles on the first and second replication, respec- 
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the upper portion of Figure 1 indicates that 
when only abilities information was available 
to judges, 84% of the variation in judgments 
of creativity and intelligence was common 
variation. However, looking now at the lower 
portion of Figure 1, when subjects were pro- 
vided with more varied cues which provided 
biographical and personality information in 
addition to abilities data, the overlapping vari- 
ance dropped to 51%. It is clear from the 
figure that this large difference in degree of 
overlap is not due to differences in the reli- 
ability of the judgments. When judgments 
were based on abilities information alone, 
only 11% of the reliable variation in creativ- 
ity judgments was unique to creativity, 
whereas, when the creativity and intelligence 
judgments were based on total cue informa- 
tion, 45% or over four times as much reliable 
variation in creativity judgments could not be 
accounted for by the intelligence judgments. 


tively. Then the squared test-retest reliabilities for 
the mean intelligence and mean creativity judgments 
based on one replication are (rn, r)* and (rc, o:)*, 
respectively. 

Therefore, the proportion of unreliable variation in 
mean intelligence judgments (indicated by the slashed 
line section on the left side of Figure 1) is given by 


1 = (rr,,1,)2, 


and the proportion of unreliable variation in mean 
creativity judgments (indicated by the slashed line 
section on the right side of Figure 1) is given by 


1 — (ro,,0,)*. 


ince the estimates of the proportion of unreliable 
variation in creativity and intelligence judgments refers 
to the unreliability of the ratings given in a single 
replication, it was desirable also to estimate the propor- 
tion of variation common to creativity and intelligence 
judgments based on one replication, Since the values of 
(rr, c)%, (1y,¢9)% (T1,0,)2 and (rp,¢,)? each provide 
equally reasonable estimates of the proportion of varia- 
tion common to intelligence and creativity judgments 
based on one replication, the following quantity was 
used to estimate the proportion of common variation 
(indicated by the gray section of Figure 1): 


Clen c)? + (tnc)? + (rm01)? + (tr, c,)*]. 
Finally, the Proportion of reliable variation unique 
to intelligence judgments (indicated by the white 
section in Figure 1) was computed by 
(t1119) — HEr)? + (rr, 04)? + (Fig, 01)? + (rp, c)?], 


and the proportion of reliable variation unique to crea- 
tivity judgments (indicated by the black section in 
Figure 1) was computed by 


(ey, ¢)? — Clen, o)? + (ric)? + (tinc)? + (tn 0:)2. 
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Similarly, about four times more variation in 
intelligence judgments is unique to intelli- 
gence judgments when all three types of in- 
formation are available to judges. 

This type of partitioning was also done 
separately for judgments based on biographi- 
cal and personality information. For bio- 
graphical and personality cues, respectively, 
it was found that only 15% and 21% of the 
reliable variation in creativity judgments 
could not be accounted for by the intelli- 
gence judgments. Thus, although much less 
striking, the results were similar to those ob- 
tained when judgments were based on ability 
information alone. 

The preceding analyses suggest that there 
may be little basis for distinguishing between 
creativity and intelligence judgments when 
only abilities information is available. Per- 
haps the narrow range of information re- 
stricts the operation of subjects’ implicit 
theories or stereotypes relating to differences 
between creativity and intelligence. However, 
when judges are allowed to consider simul- 
taneously all three types of information, 
their judgments of the concepts are clearly 
distinguishable. Before considering this point 
further, let us turn to some analyses which 
clarify the role played by particular cues in 
our subjects’ judgments of creativity and in- 
telligence. 


Judgments of Creativity and Intelligence as a 
Function of Specific Cues 


Cues selected by subjects. Recall that in the 
final questionnaire subjects were asked to 
select and rank five of the 21 individual cues 
that they considered most important in judg- 
ing creativity, then to think of a highly cre- 
ative person they knew and check those cues 
of the 21 on which they would regard this 
person as above average, and finally to rate 
this person on creativity and intelligence. 
Subjects also performed this task with regard 
to intelligence and a highly intelligent person 
they knew. In all cases, the five cues a sub- 
ject ranked as most important for making one 
gment were also marked as char- 


type of jud n 5 
acteristic of the person classified as high on 
that trait. With regard to our hypothesis that 
judgments of creativity and intelligence dif- 


fer systematically, we also wanted to examine 
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whether different cues were judged important 
for the the different judgments and, whether 
the cues judged most important for creativity 
did better in predicting creativity judgments 
than intelligence judgments and vice versa. 

One set of three cues for creativity and a 
different set of three cues for intelligence had 
the highest sum of ranks across subjects and 
also were most frequently chosen as most 
important. With regard to importance in 
judging creativity these three cues were ad- 
venturousness, cleverness, and independence. 
Using these cues in a stepwise multiple re- 
gression equation to predict creativity and 
intelligence judgments (the order in which 
the predictors were added to the equation in 
each case has been retained): 


C, = 1.2 adventurousness + .25 cleverness 
+ .14 independence — 3.10 
I, = — .48 independence 


+ .50 adventurousness 
+ .35 cleverness + .11, 


The multiple R’s for Ce and I, were .31 and 
.22, respectively. Thus, the creativity-impor- 
tant cues did somewhat better in predicting 
creativity than intelligence judgments. The 
weights given to the cues also seem to differ 
depending on whether creativity or intelli- 
gence judgments are being predicted. Of in- 
terest is the change in sign of the regression 
coefficient for independence which suggests 
that more independence is perceived as a sign 
of creativity while less independence is per- 
ceived as a sign of intelligence. 

The three most important cues for judging 
intelligence were overall grade point average 
(GPA), vocabulary ability, and need for 
achievement (nAch). Using these three cues 
to predict I, and C, resulted in the following 
equations (the order of the predictors is 
again retained): 


I, = 1.53 overall college GPA 
+ 1.02 vocabulary + .28 nAch — 5.54 
C, = .11 vocabulary + .47 nAch 


+ .48 overall college GPA — 3.34, 


The resultant multiple R’s were .66 for I; and 
19 for Ce In this case the intelligence-im- 
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portant cues did much better in predicting dent's major was the most important cue for 
intelligence than creativity, predicting both of these types of criterion 
Thus our hypothesis gained support, Diiler- judgments, 
ent cues were considered most important for In sum, whether selected by our subjects 
judging creativity as opposed to intelligence, or the computer, the cues which were most 
and the judged tralt-appropriate cues were in important for judgments of creativity were 
fact more effective in predicting the traits for diferent from those which were most impor- 
which they were judged appropriate. It also is tant for judging intelligence. In addition, 
interesting that personality cues were judged while personality cues were more important 
most important for creativity, whereas abil- for our subjects’ judgments of creativity, abil- 
ity cues were judged most important for in- ity cues were more important in accounting 
telligence, for both the intelligence judgments from the 
Cues selected by a stepwise regression pro» present study and the 
cedure, A related consideration is bow well from the Rossman and Horn study, These 
the three “best” specific cues do in predicting results, in conjunction with the results re- 
creativity and intelligence when the regression ported in the 
algorithm, rather than subjects’ judgments, is creativity and intelligence are not nearly as 
used to determine which are best and whether sharply differentiated by judges who know the 


these best cues are different for creativity and stimulus persons in an academic setting as 
intelligence, The relevant stepwise equations they are by judges who are presented with 
are: more diverse types of information upon which 
Cy = 1,46 creative products pr Ba = ware oa 

+ .79 adventurousness study of creativity and intelligence. 

+ 60 unconventionaliam — 5.47 

Ty = 1,31 overall college GPA we Concrumone s 
+ .90 high school GPA results strongly support the Idea that 
: 49 vocabulary — 4,20, subjects have distinct conceptions of creativ- 
; ity and intelligence, but that diverse types of 
These equations resulted in multiple R's of information must be available in order for 
ABB for C, and 83 for I). Note first that even the differences between the conceptions to 
the predictor cues selected by the regression greatly affect judgments of these traits, The 
| Program are different for judgments of crea- fact that we found strong evidence tor dis- 
_ tivity and intelligence. Another point of in- criminant validity in judgments of creativity 
terest is that there is more agreement between and intelligence when diverse information was 
| mabjects’ choices and the regression pro: available and found only weak evidence for 
| tedure choices as to which cues are most im discriminant validity when abilities informa- 
portant for intelligence than there is for cre Uon alone was available has at least three 
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Some Determinants of Perceived Control 


Camille B. Wortman 
Northwestern University 


An experiment was conducted to test the hypothesis that an individual will 
feel control over an outcome if he causes the outcome and if he knows before 
causing it what he hopes to obtain. Subjects were shown two consumer items 
and told that they would get to win one by a chance drawing. Two marbles 
of different colors were placed in a can and mixed up. One third of the 
subjects were told that the experimenter would pick a marble to determine 
their prize and were told beforehand which marble stood for which prize. 
Another third were told to select a marble to determine their prize and were 
told beforehand which marble stood for which prize. The remaining subjects 
were told to select a marble to determine their prize but were not told until 
after they had picked their marble which marble stood for which prize. 
Subjects then received a marble which led them to win either the item they 
preferred or the item they did not prefer. The results strongly supported the 
hypothesis: Subjects who caused their own outcome and knew beforehand 
what they hoped to obtain perceived themselves to have more control over 
the outcome, more choice about which outcome they received, and more 
responsibility for their outcome than subjects in the remaining conditions. 
These results were replicated in a second experiment. The relationship between 
these studies and previous experiments on control were explored, and some 


possibilities for future research on control were considered. 


The idea that individuals strive for mas- 
tery and control over their environment is not 
new and has played a central role in many 
theoretical statements (see, e.g., deCharms, 
1968; G. A. Kelly, 1955; H. H. Kelley, 1971; 
White, 1959; Woodworth, 1958). Recently, 
several experiments have been conducted to 
test the psychological implications of per- 
ceived control. Experiments by Glass and 
Singer (1972) have demonstrated that sub- 
jects who have perceived control over 
aversive stimuli show fewer decrements in 
performance and greater tolerance for frustra- 
tion following exposure to such stimuli 
than subjects who do not. There is some evi- 
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dence (see, e.g., Corah & Boffa, 19/0; Geer, 
Davison, & Gatchel, 1970; Hokanson, De- 
Good, Forrest, & Brittain, 1971; Stotland & 
Blumenthal, 1964) that subjects experience 
less autonomic reactivity to stressful stimuli 
when they perceive themselves to be in con- 
trol of the stimuli. Pervin (1963) found 
that when given a choice, subjeci: to 
control the administration of the stimulus 
rather than have the experimente: 

it. The work of Seligman and hi 

(see, e.g., Seligman & Maier, 1967; Se! ; 
Maier, & Geer, 1968; Seligman +) has 
shown that organisms trained to control aver- 
sive stimuli fare better when subjected to un- 


controllable stimuli than organisms without 
such training. Trained animals do exhibit 
the pattern of behavior of giving ip, called 


“Jearned helplessness” by Seligm:n (1974). 
And organisms exhibiting such helpless or de- 
pressed behavior can be “cured” if they are 
forced to emit behavior that changes their 
environment and are thus confronted with the 
evidence that they can control their environ- 
ment. 

Since perceived control has generally been 
shown to benefit the organism, it is importar 
to determine what factors influence 


is) 
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of control. Some early studies suggest the 
intriguing possibility that merely causing a 
behavioral outcome may produce feelings of 
control. For example, Lerner and Matthews 
(1967) conducted a study that was presented 
to subjects as an experiment on learning. 
Pairs of subjects were told that one of them 
would be assigned to a “shock” condition, 
while the other would be in a more desirable 
(“control” or “reward”) condition. The de- 
cision as to which subject was assigned to 
which condition was to be determined strictly 
by chance, by which of two slips of paper 
each subject selected from a bowl. Some sub- 
jects were shown a bowl with two slips, 
shock and control, and were either asked to 
pick first or were left with the remaining 
slip. Subjects tended to view the person 
who picked out- the first slip, and thus 
“caused” both outcomes, as more responsible 


_for both subjects’ fates than the experimenter 


or the other subject, even though the out- 
comes were obviously determined by chance. 

Several other investigators (see, e.g., Per- 
vin, 1963) have conducted experiments in 
which subjects were asked to administer 
aversive stimuli to themselves, and their reac- 
tion to this stimulation was compared to that 
produced when the stimuli were administered 
by the experimenter. In general, these studies 


_ have concluded that aversive stimulation is 


more tolerable in the former condition than 
the latter. Thus, subjects seem to react dif- 
ferently if they cause an outcome for them- 
selves than if they experience the same out- 
come, but it is caused by someone else. Per- 
haps the most intriguing aspect of this find- 
ing is its irrationality: Given that the sub- 
ject receives a certain number of shocks, and 
has no choice about their intensity or dura- 
tion, what difference should it make who 
presses the button that turns the shocks on or 
off? 

These results provide suggestive evidence 
that causing a certain outcome leads to feel- 
ings of control over the outcome. However, 
most of the experiments manipulating who 
administers the stimulus have been beset by 
methodological problems. In several of these 
experiments (see, €.8-; Haggard, 1946; Ho- 
kanson et al., 1971, Experiment 1; Staub, 
Tursky, & Schwartz, 1971, Experiment 2) 
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control and predictability have been con- 
founded, since subjects in the self-adminis- 
tered condition not only administered the 
shocks but had knowledge of when the 
shocks would occur, Second, in at least one 
of these experiments (LePanto, Moroney, & 
Zehnhausern, 1965), the duration of the 
aversive stimulus was apparently not kept 
constant in the two experimental conditions. 
Third, although most of these investigators 
have conceived of this variable in terms 
of control, not one has included a check on 
the manipulation by asking subjects if they 
indeed felt more control when they admin- 
istered the stimulus than when the experi- 
menter did. In the current experiment, an 
attempt was made to examine the theoretical 
possibility that merely causing an outcome 
for oneself leads to the perception of control 
while avoiding the problems that have char- 
acterized most previous studies. 

In addition to causality, a second possible 
determinant of perceived control was exam- 
ined in the current study. An interesting ex- 
periment by Strickland, Lewicke, and Katz 
(1966) has suggested that a person who has 
foreknowledge of what he wants to obtain 
before causing an outcome will feel more con- 
trol over the outcome than a person who does 
not have such foreknowledge. Subjects in this 
experiment were told that they would be roll- 
ing dice. It was explained to them that they 
would each make a bet and throw the dice 30 
times. Subjects were led to believe that they 
could earn money if they won their bets. 
Strickland et al. (1966) were primarily in- 
terested in the level of conservatism or riski- 
ness that would be exhibited by their subjects. 
They predicted that subjects who were asked 
to bet after they had thrown the dice but did 
not know the outcome would be more con- 
servative than subjects who were asked to 
bet before throwing the dice. The results sup- 
ported the authors’ reasoning. 

These findings may have occurred because 
subjects who were asked to bet before throw- 
ing the dice felt that they could in some way 
influence the outcome. Since subjects in both 
conditions “caused” the outcome by throwing 
the dice, these results imply that causality 
alone may not be sufficient to result in feel- 
ings of control. They suggest that individuals 
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feel they are more in control if they know 
what they hope to obtain before causing the 
outcome than if they do not. Although this 
is an interesting possibility, Strickland et al. 
(1966) did not include a dependent measure 
on control, so it is impossible to tell whether 
subjects who bet before throwing the dice 
perceived themselves to have control over 
the outcome. 

Taken as a whole, these studies suggest that 
in order for one to perceive control over a 
behavioral outcome, two factors are neces- 
sary: One must cause the outcome, and one 
must have foreknowledge of exactly what he 
hopes to obtain. The main purpose of the 
present experiment was to test this hypothe- 
sis. 

An additional purpose of the experiment 
was to explore some variables that may medi- 
ate the effects of perceived control. One possi- 
bility is that subjects who believe that they 
have control over an outcome have greater 
feelings of choice and responsibility for the 
outcome. In order to test this possibility, 
questions concerning perceived choice and 
responsibility were included as dependent 
measures. 

In order to ascertain the importance of 
causality, the following variable was decided 
upon: The outcome that some subjects re- 
ceived was determined by the color of a mar- 
ble they obtained by reaching into a can con- 
taining two marbles of different colors; the 
outcome that others received was determined 
by the color of a marble picked by the ex- 
perimenter. In both cases, the marbles were 
placed into the can and mixed up by the per- 
son who ultimately selected the marble, and 
the person choosing the marble could not see 
into the can. This may be regarded as a 
rather strict test of the hypothesis that caus- 
ing an outcome leads to feelings of control, 
since in all cases, the outcome was obviously 
determined by chance. This manipulation 
avoids confounding causality of the outcome 
with such factors as predictability or the 
length of time that the outcome is experi- 
enced. In order to ascertain the importance of 
foreknowledge (i.e., knowing before the out- 
come occurs what one hopes to obtain), some 
subjects were told before the marble was 
picked which color of marble stood for which 
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outcome, and some were not told this until 
after their marble had been picked out. Ques- 
tions concerning perceived control, choice, 
and responsibility were included as dependent 
measures. 

In order to check on the possibility that re- 
actions to controlled or uncontrolled outcomes 
may depend upon the outcome’s valence, 
attractiveness was included as a variable. In 
departure from previous studies, however, 
only positive outcomes were used. The out- 
comes employed in the study were consumer 
items. Half of the subjects received an item 
that they found very attractive; the remain- 
ing subjects received a minimally attractive 
outcome. The decision to use positive out- 
comes was made because of a possible am- 
biguity in the use of negative ones. It was 
felt that the use of negative outcomes may 
lead subjects to feel that they had been taken 
advantage of or treated unfairly and that such 
feelings might interact with the independent 


variables to produce artifactual differences in “| 


subjects’ responses. 

The experiment contained the following 
conditions: an experimenter-caused-foreknowl- 
edge condition, in which the experimenter 
picked a marble and assigned the subject an 
outcome, and the subject knew beforehand 
which marble stood for which outcome; a 


subject-caused-foreknowledge condition, in ~< 


which the subject picked a marble and as- 
signed himself an outcome and knew before- 
hand which marble stood for which outcome; 
and a subject-caused no-foreknowledge condi- 
tion, in which the subject picked a marble 
and assigned himself an outcome but was not 
told until after he had picked the marble 
which color stood for which outcome. Cross- 
cutting these conditions was the attractiveness 
variable: Half of the subjects received the 
marble which resulted in their winning a 
highly attractive item; the remaining sub- 
jects received the marble which resulted in 
their winning a minimally attractive outcome, 
If the hypotheses suggested earlier were cor- 
rect and both causality and foreknowledge 
were necessary for perceived control, the sub- 
ject-caused-foreknowledge subjects should 
perceive themselves to have more control than 
subjects in the other two conditions, and sub- 
Jects in the other two conditions should not 
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differ from one another in feelings of per- 
ceived control. No predictions were made con- 
cerning the attractiveness variable or interac- 
tions between the two variables.' 


EXPERIMENT 1 


Method 
Overview 


Subjects had signed up for an experiment on young 
people’s attitudes and values and were first asked to 
rate eight consumer items according to how much 
they liked each one. The experimenter then ex- 
plained that since two of the manufacturers had 
given her a large number of the items, she was going 
to give an item to each student who participated. 
Each subject was shown two items, one that he had 
previously rated high and one that he had rated 
low, and was told that these were the items that 
had been donated. Subjects were told that since it 
was necessary to give away equal numbers of the 
items, the item each person received would be 
determined by chance, by the color of a marble 
pulled from a coffee can. Each subject was asked to 
take a red and blue marble from a dish of marbles 
and hand them to the experimenter, who placed 
them in an empty coffee can. One third of the sub- 
jects were told that the experimenter would pick a 
marble from the can and were told which color of 
marble stood for each prize. One third were told 
that they should pick the marble and were told 
before picking which color of marble stood for each 
prize. The remaining subjects were told that they 
should pick a marble but that after they picked the 
marble they would find out which color stcod for 
each prize. Half of the subjects obtained the marble 
that led them to win the highly attractive prize; 
the remaining subjects received a marble assigning 
them to the less attractive prize. After a few min- 
utes, subjects were asked to complete questionnaires 
concerning their perceptions of control; their feel- 
ings of choice, responsibility, and fairness; as’ well 
as their opinion of the prize, the experimenter, and 
the experiment itself. 


Subjects 


The subjects were 65 male undergraduates: from 
the Lake Forest College, in Lake Forest, Illinois.* 


1%t was decided that an experimenter-caused 
no-foreknowledge condition was not essential for the 
design. Extensive pretesting had revealed that sub- 
jects in an experimenter-caused-foreknowledge con- 
dition experienced no feelings of control whatsoever. 
Therefore, it was thought to be extremely unlikely 
that subjects in an experimenter-caused no-fore- 
knowledge condition would experience significantly 
less control than these subjects. 

20f the 65 subjects who participated in the 
experiment, 58 were jncluded in the analyses. Of the 
remaining subjects, 1 was discarded because he 
rated all of the consumer items below the midpoint 
on the rating scale, and 1 was excluded because he 
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Subjects were recruited by means of posters offering 
$2.50 for their participation in a study on young 
people’s attitudes and values. None of the subjects 
had previously participated in psychological re- 
search, and the majority of the students had no 
background in psychology. Subjects were tested in 
groups of three; each experimental session lasted 
approximately one hour. 


Procedure 


When the subjects arrived, they were reminded 
that the experiment concerned young people’s atti- 
tudes and values. They were told that the experi- 
menter would be asking them many questions about 
their opinions. It was explained that their responses 
would be used to study a variety of problems, in- 
cluding how values change and cross-cultural dif- 
ferences in values. 

The experimenter told the subjects that she first 
wanted to get some information concerning the kinds 
of consumer items that are attractive to young 
people today. They were then told that in a few 
moments, they would be taken to a booth contain- 
ing eight consumer items, and they would be asked 
to examine these items very carefully. Subjects were 
also told that they would be given 5 minutes to 
examine these items and they would then be asked 
to rate them. 

At this point, each subject was taken to a sepa- 
rate booth, where he saw eight items in front of him. 
Four had been selected to be very attractive (e.g. a 
pottery coffee mug), and four had been selected to 
be minimally attractive (e.g., a scratch pad). Sub- 
jects were given 5 minutes to inspect the items in 
any order they wished. After examining the items, 
subjects were given a questionnaire on which they 
were asked to rate each item according to how much 
they liked it. They were also asked to indicate 
whether they owned any of these items, and if so, 
if the items they owned were similar to those pre- 
sented here. 

After a few moments, the experimenter collected 
subjects’ ratings and explained that she would next 
like the subjects to rate advertisements from a 
current magazine. They were told that their ratings 
would provide useful information about their reac- 
tions to current products. In reality, this was a 
filler task. While the subjects were working on these 


indicated that he did not want either of the con- 
sumer items that were available. In addition, 5 sub- 
jects were excluded because they believed that their 
initial ratings of the items had influenced the items 
that the experimenter said were available. These 
subjects were in the following experimental condi- 
tions: 3 in the subject-caused foreknowledge-high 
attractive; 1 in the subject-caused no foreknowledge- 
high attractive; 1 in the subject-caused no fore- 
knowledge-low attractive; 1 in the experimenter- 
caused foreknowledge-high attractive; and 1 in the 
experimenter-caused foreknowledge-low attractive. 
The inclusion of these subjects would in no way 
have altered the results reported or the conclusions 
drawn. 
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ratings, the experimenter examined the consumer 
item ratings that they had made previously. For 
each subject, two consumer items were selected to 
serve as the outcomes in the experiment. The experi- 
menter selected one item for each subject that was 
rated high, but at least 3 points from the top of the 
scale, and one that was rated low, but at least 3 
points from the bottom of the scale. Items rated 
at or very near the end of the scale were avoided, 
since such items would have made it difficult to 
detect changes in attractiveness. Of course, an item 
was not picked to serve as an outcome for a par- 
ticular student if he indicated that he already owned 
it. Using this procedure, most students received an 
opportunity to win the items that they had rated 
second highest and seventh highest. 

The experimenter then collected the subjects’ ques- 
tionnaires concerning current advertisements. While 
doing so, she mentioned that she realized answering 
so many questionnaires can get tiring. She told the 
subjects that she was going to do something now to 
break up the questionnaires, something that she 
would not normally do until the end of the study. 
She explained that she had obtained the consumer 
items from several manufacturers in the Chicago 
arca. She pointed out that two of the manufacturers 
were very happy that their products were going to 
be used in this study because of the free publicity 
and had therefore donated a large sample of their 
items. She explained that since the manufacturers 
had been kind enough to provide such a large supply 
of these items, she was going to give one to each 
student who participated in addition to the $2.50. 
She mentioned, however, that since she had to give 
away equal numbers of the items, it would not be 
possible for students to choose which item they 
wanted: 

In a moment I’ll come around to each booth 
and show you the two items that are available. 
Which of these items you get will be determined 
by chance—by the color of a marble that is 
drawn from a coffee can, This way, everyone will 
have a fair chance of getting the item that he 
prefers. 


The subjects’ attention was then directed to a 
dish of red and blue marbles, a red and blue mat, 
and an empty coffee can that had been placed in a 
corner of each booth. All subjects were asked to 
pick a red and a blue marble out of the dish and 
hand them to the experimenter when she came by. 
They were also asked to check the coffee can to 
make sure nothing was in it. The experimenter then 
came around to each booth and showed each sub- 
ject which two items had been “donated.” She 
placed an item that the subject had rated high on 
one of the colored mats and an item that the sub- 
ject had rated low on the other colored mat. 

At this point, the experimenter turned over a 
card informing her of the experimental condition, 
and the instructions differed depending upon the 
condition to which the subjects had been assigned. In 
the experimenter-caused-foreknowledge condition, the 
subjects were told: 
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I will put the two marbles that you hand me 
in the coffee can, mix them up, reach in without 
looking, and pull one out. If I pull out the red 
marble, you will get the item that is on top of the 
red mat. If I pull out the blue marble, you will 
get the item that is on the blue mat. 


In the subject-caused-foreknowledge condition, the 
experimenter said: 


I would like you to put the two marbles in 
the coffee can, mix them up, reach in without 
looking and pull one out. If you pull out the red 
marble, you will get the item that is on top of 
the red mat. If you pull out the blue marble, you 
will get the item that is on the blue mat. 


In the subject-caused no-foreknowledge condition, 
subjects were told, 


I would like you to put the two marbles in the 
coffee can, mix them up, reach in without look- 
ing, and pull one out. After you pull one out, you 
can turn over the two small cards I will put on 
your desk. The cards will tell you which item you 
will get if you pulled out the red marble and 
which item you will get if you pulled out the 
blue one.® 


The purpose of letting the subjects select the two 
critical marbles from a large dish of marbles was to 
minimize suspicion in the experimenter-caused con- 
dition. It was felt that if the experimenter selected 
the marbles, subjects might think that the marbles 
differed in size or texture and that the outcome was 
rigged. Marble colors were counterbalanced so that 
in half the sessions getting the red marble meant 
getting the highly attractive item, while in the 
remainder of the sessions getting the blue marble 
meant getting the highly attractive item. 

After explaining how the marble was to be picked, 
the experimenter went around to each booth so 
that each subject could pick or receive a marble 
and then said to him, “Okay, that means you get 
this item.” Prior to receiving their marble, subjects 
were instructed to keep their reactions to the prize 
“to themselves,” so as not to “bias the other sub- 
jects’ opinions.” Half of the subjects received a 
marble assigning them to the attractive prize; the 


*The purpose of using these cards rather than 
having the experimenter simply tell the subject what 
item his marble stood for was to minimize suspicion. 
It was felt that problems with suspicion would be 
especially likely in the low-attractive outcome con- 
dition. Suppose the experimenter were to tell the 
subject after he had picked out the red marble: 
“Okay, that means you get the back scratcher in- 
stead of the candle.” The subject might conclude 
that the experimenter had never intended to give 
him the candle and would have told him that he 
had won the back scratcher regardless of which 
marble he picked. By picking a red marble and then 
turning over cards that said: “Red = Back Scratcher” 
and “Blue = Candle,” a subject can see that this is 
clearly not the case. 
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remaining subjects received a marble assigning them 
to the minimally attractive prize. 

After each subject’s outcome had been determined, 
a second filler task was instituted. It was reasoned 
that a time delay before reevaluation of the out- 
comes might be desirable (cf. Walster, 1964). 
Therefore, subjects were given a task that took about 
5 minutes but that would permit them to think 
about the items they had obtained if they wished to 
do so. It was explained to the subjects that in addi- 
tion to obtaining information about their attitudes 
and values, “we would also like to obtain some in- 
formation about the kinds of things young people 
typically think about.” Subjects were given a pencil 
and a sheet of blank paper and were asked to keep 
a record of their thoughts for 5 minutes. While the 
subjects worked on this task, the experimenter went 
to the supply closet and got the items that each 
subject had won. She placed the item that had been 
won next to each subject so that he would not have 
doubts about actually receiving the item. 

After 5 minutes had passed, the experimenter col- 
lected the papers and informed the subjects that she 
would like them to rate the consumer items again. 
The subjects were handed a rating sheet, along with 
a title sheet that provided a rationale for a second 
rating. This sheet informed the subjects that “we 
are interested in how people’s opinions of consumer 
items change as they become familiar with the 
items.” It was pointed out that some investigators 
feel that individuals become more favorable in their 
attitudes, while others believe that individuals be- 
come negative. The sheet instructed subjects to take 
a few moments to reexamine the items and to rate 
them according to how they felt about them “now.” 

After completing these ratings, subjects were told: 


Now I would like you to think back to the part 
of the experiment where I put two items out in 
front of you and told you that you would be 
receiving one of these items. We're interested in 
what your feelings were during this time, and the 
next questionnaire asks you some questions about 
your reactions to this part of the study. 


This questionnaire contained questions designed to 
check the effectiveness of the experimental manipula- 
tions and to ascertain subjects’ feelings of control, 
choice, and responsibility. Subjects were also 
asked whether they perceived the outcome of the 
experiment as fair. 

After completing this questionnaire, one final 
request was made of the subjects. They were told 
that the individuals who were funding the research 
were interested in how the experimenters were 
“coming across” to the students. Subjects were given 
an “anonymous” questionnaire that asked them to 
evaluate the experimenter and to comment on vari- 
ous aspects of the experiment. They were instructed 
to place their completed questionnaire in a sealed en- 
velope and to place it in a box in the front of the 
room. 

“Alter this final questionnaire had been collected, 
a postexperimental interview was conducted in which 
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the experimenter probed for suspicion. Subjects were 
given additional details about the actual purpose 
of the experiment and were encouraged to ask as 
many questions as they wished about the experi- 
ment. They were permitted to keep the consumer 
item they had won. At the end of the interview, 
subjects were paid the $2.50 they had been promised 
and were asked not to discuss the experiment with 
other potential subjects. 


Results 


Check on the Effectiveness of the 
Manipulations 


From the subjects’ ratings, there is no 
doubt that the attractiveness manipulation 
was effective. The mean for the high-attrac- 
tive item was 15.5 on a 20-point scale; the 
mean for the low-attractive item was 3.81, 
and there was no overlap between the dis- 
tributions. As an additional check on the at- 
tractiveness manipulation, subjects were asked 
to indicate which item they hoped they would 
get. All but one subject said he hoped he 
would get the item that he initially rated as 
more attractive. Although there was no check 
on the causality and foreknowledge manipula- 
tions, the results leave little doubt that these 
manipulations were effective. 


Major Analyses 


In order to evaluate the effect of the treat- 
ments and provide a test for the assertion 
that two attributes—causality and foreknowl- 
edge—are necessary for perceived control, 
the following orthogonal comparisons were 
performed: (a) number of attributes (the 
subject-caused-foreknowledge subjects versus 
the subjects in the other conditions): Since 
the subject-caused-foreknowledge condition is 
characterized by both causality and fore- 
knowledge, while the other conditions were 
each characterized by only one, this compari- 
son was thought to be appropriate to test the 
hypothesis that both are necessary for per- 
ceived control; (b) attributes (the experi- 
menter-caused-foreknowledge subjects versus 
the subject-caused no-foreknowledge sub- 
jects): Since each of these conditions contains 
only one attribute—causality or foreknowl- 
edge—no significant differences were expected 
to emerge. However, the comparison was in- 
cluded to test the possibility that one of the 
variables might be sufficient to induce feel- 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS FOR THE 
PERCEIVED CONTROL QUESTION 


Causality and foreknowledge 
Attrac- Experi- j ject- 
tiveness menter || | Sublets | as subieet 
caused foreknowl- no fore- 
foreknowl- edge knowledge 
edge 
Low 
M 1.70 6.50 70 
SD 5,39) 8.08 2.21 
High 
M 2.00 4.80 4.25 
SD 6.32 5.09 6.69 


Note. Subjects were asked to rate how much they felt, they 
could influence the color of marble they received on a scale 
ranging from 0 (not at all) to 20 (very much). For the subject- 
caused no-foreknowledge-high-attractive condition, n = 8; for 
all other conditions, n = 10. 


ings of perceived control, while the other 
variable was not; (c) attractiveness (subjects 
receiving the high-attractive item versus the 
low-attractive item); (d) the number of at- 
tributes X attractiveness interaction; and (e) 
the attributes X attractiveness interaction. 
Perceived control. Subjects were asked to 
indicate how much they felt they could influ- 
ence the color of marble they received, and 
their mean ratings of perceived control are 
presented in Table 1. The analyses provide 
strong support for the hypothesis that both 
causality ahd foreknowledge are necessary 
for feelings of control. Subjects in the subject- 
caused-foreknowledge condition felt they 
could influence what color of marble they 
received significantly more than did subjects 


TABLE 2 


Means AND STANDARD DEVIATIONS FOR THE 
PERCEIVED CHOICE QUESTION 
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in the other two conditions, F (1, 52) = 4.815, 
< .033. Subjects in the experimenter- 
caused-foreknowledge condition and the sub- 
ject-caused no-foreknowledge condition did 
not differ in their feelings of control (F < 1). 
Neither the main effect for attractiveness 
nor the interactions revealed any significant 
differences between conditions. 

Choice and responsibility. Subjects were 
asked to indicate how much choice they felt 
they had about which item they got and how 
responsible they felt they were for the fact 
that they got this item instead of the other 
one. Their ratings of choice and responsibil 
ity are presented in Tables 2 and 3. It can 
be seen that these ratings closely parallel 
subjects’ ratings of control. A comparison 
of the subject-caused-foreknowledge condition 
with the other two conditions revealed a 
strong effect for both choice, (1,52) = 
42.869, p < .001, and responsibility, F (1, 52) 
= 17.130, $ < .001. Again, there were no sig 
nificant differences between the experimenter- 
caused-foreknowledge condition and the sub 
ject-caused no-foreknowledge condition (both 
Fs <1), which suggests that causality or 
foreknowledge alone does not affect perceived 
choice or responsibility. No significant differ- 
ences were obtained for the attractiveness vari- 
able or the interaction between the variables. 

An examination of the standard deviations 
presented in Tables 1, 2, and 3 reveals that 
there is quite a bit of within-cell variability 
in subjects’ perceptions of control, choice, and 
responsibility. Therefore, the data deviate 


TABLE 3 


MEANS AND STANDARD DEVIATIONS FOR THE 
PERCEIVED RESPONSIBILITY QUESTION 


Causality and foreknowledge Causality and foreknowledge 
Attrac- Experi- AS sis Attrac- Experi- 
dvema [mee | Subject | sunje tiveness | R&P | Subject- | Subject- 
faud | foreknowi- | “natfors= caused PaT E 
eens edge knowledge fore no wis edge | knowledge 
Low Low 
M -30 8.20 -00 M 2.00 8.80 1.00 
SD 95 7.58 -00 SD 4.83 9.44 3.16 
Ea 00 7.30 2.88 a8 
j k 5 5 M -00 6.50 3.13 
SD -00 3.56 4.52 SD .00 4.86 5.52 


Note. Subjects were asked to r 
choice on a scale ranging from 0 (i 


plete freedom of choice), 


ate their level of perceived 
no choice at all) to 20 (com- 


on a scale ranging 
pletely responsible). 


Nole. Subjects were asked to rate their level of responsibility 


from 0 (not at all responsible) to 20 (com- 


e 
= 
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somewhat from a normal distribution. Al- 
though analysis of variance is regarded as 
relatively robust under conditions of non- 
normality, nonparametric tests were per- 
formed on the data as a precautionary mea- 
sure. Subjects were divided at the median 
according to their feelings of control, choice, 
and responsibility, and chi-square tests were 
performed on the frequency data. A compari- 
son of the subject-caused-foreknowledge con- 
dition with the remaining conditions again 
revealed highly significant effects for con- 
trol, x?(1) = 12.258, p < .001; choice, y7(1) 
= 28.323, p< .001; and responsibility, x? 
(1) = 14.730, p < 001. No significant differ- 
ences emerged from a comparison of the ex- 
perimenter-caused-foreknowledge and the sub- 
ject-caused no-foreknowledge conditions or a 
comparison of the high- versus low-attractive- 
ness conditions, 

Outcome evaluation data. As noted previ- 
ously, subjects were asked to rate each item 
upon entering the experiment and again after 
receiving one of the items. This made it pos- 
sible to compute changes in attractiveness for 
both the obtained item and the nonobtained 
item. In order to provide an overall measure 
of outcome evaluation, each subject’s change 
in attractiveness for the nonobtained item 
was subtracted from his change in attractive- 
ness for the obtained item. 

These results revealed no effects of any 
kind on the outcome evaluation measures. 
Causality and foreknowledge did not affect 
changes in evaluation. Subjects did not re- 
spend differently to a highly attractive out- 
come than to a minimally attractive one, and 
there were no interactions between the vari- 
ables. An examination of the data suggests 
that subjects tended to derogate the obtained 
item relative the nonobtained item in one 
cell: the experimenter-caused-foreknowledge— 
low-attractiveness condition. If subjects who 
receive a low-attractive outcome are consid- 
ered separately, a comparison of the experi- 
menter-caused-foreknowledge subjects with 
the other two groups reveals that they show 
a marginally significant tendency to dero- 
gate the obtained outcome, F(1, 52) = 2.894, 
p < .095. : 

Although subjects in the subject-caused- 
foreknowledge cells manifest feelings of choice 
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and responsibility, they do not show a “dis- 
sonance effect” and enhance their rating of 
the obtained item, In fact, they show a slight 
tendency to derogate the obtained outcome 
relative to the nonobtained one. Further- 
more, the correlation between the subjects’ 
ratings of perceived control, choice, and re- 
sponsibility and their evaluation of the out- 
comes are minimal and do not even approach 
significance. Interestingly, the control, choice, 
and responsibility ratings are intercorrelated 
with one another. Perceived control ratings 
show a within-cell correlation of .48 with per- 
ceived choice ratings and of .35 with per- 
ceived responsibility ratings; perceived choice 
and responsibility ratings are correlated at the 
-61 level. All of these correlations are signifi- 
cant beyond the .05 level. 


Additional Measures 


On the questionnaire dealing with control, 
responsibility, and choice, subjects were also 
asked to state their subjective probability of 
winning, to indicate which item they expected 
to win, and to indicate whether they felt the 
procedure was fair. All means for the prob- 
ability question were quite close to the actual 
probability of .50, and none of them differed 
significantly from one another. This suggests 
that the majority of subjects in the experi- 
ment perceived the probability accurately. 

When asked which item they expected to 
win, subjects could indicate that they ex- 
pected to win the attractive item or the un- 
attractive item, or they could avoid making 
an expectation. A chi-square analysis per- 
formed on these frequency data revealed that 
subjects who won the high-attractive item 
indicated that they expected to do so, while 
subjects who won the low-attractive item 
indicated that they expected to do so, y7(2) 
= 6.81, p < .05. This provides corroboration 
for Walster’s (1967) hypothesis that after an 
event has occurred, individuals exaggerate the 
extent to which they would have predicted it. 

Subjects were also asked two questions 
about the fairness of the procedure: first, how 
fair it was that they got the item they re- 
ceived instead of the other one and, second, 
how fair the procedure was for deciding who 
gets which item “in general.” The means for 
the first question were not found to be sig- 
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TABLE 4 


MEANS AND STANDARD DEVIATIONS FOR SUBJECTS’ 
FEELINGS THAT THE OUTCOME Was Fixep 


Causality and foreknowledge 
Attrac- Experi- i ey 
tiveeess ante. Subject- Subject- 
Be caused caused 
foreknowl- | foreknowl- no fore- 
eae edge knwoledge 
Low 
M 10.60 2.20 4.60 
SD 8.07 4.64 8.26 
High 
M Er MEAN -80 4.37 
SD -63 1.62 8.21 
Note. Subjects were asked to rate their feelingson a scale 


ranging from 0 (No, not at all) to 20 (very much s0). 


nificantly different from one another. How- 
ever, analyses performed on the second ques- 
tion revealed a main effect for outcome. 
Subjects who received a highly attractive 
outcome perceived the procedure to be sig- 
nificantly more fair than did subjects who 
received a minimally attractive outcome, 
F(1, 52) = 5.967, p < 018, 

On the “anonymous” questionnaire com- 
pleted next, subjects were asked to indicate 
whether they thought the outcome was 
“fixed” and to give their opinion of the ex- 
perimenter. On an open-ended question, sub- 
jects were asked to think back to the part of 
the experiment where the marble was drawn 
and to indicate if anything about that part of 
the experiment had bothered them or if the 
experimenter had done anything during that 
time that had annoyed them. Only 1 of the 
58 subjects responded that he felt the out- 
come might have been rigged. This subject 
was in the experimenter-caused-foreknowl- 
edge-low-attractive-outcome condition and in- 
dicated that he thought the experimenter 
“knew which marble she was picking out.” 
Most of the subjects wrote that nothing had 
bothered them. A few indicated that they 
wished they had had free choice, and some 
commented that they were unhappy with the 
item they received. However, it never oc- 
curred to the vast majority of the subjects 
that anything was amiss, This suggests that 
subjects’ suspicions concerning the outcome 
they received were minimal, even in the 
“perimenter-caused-foreknowledge condition. 
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On the next page of the questionnaire, sub- 
jects were faced with a question that asked 
specifically whether they ever had the feeling 
that the outcome was fixed, “that the experi- 
menter somewhat cheated to make sure that 
you got one item instead of the other,” The 
cell means and standard deviations for this 
question are presented in Table 4, A compari- 
son of the subject-caused-foreknowledge sub- 
jects with subjects in the remaining two 
conditions revealed a significant effect, F 
(1, 52) = 3.995, p< -051, with subjects in 


remaining condi- 
tions. There was also a significant effect for 


the outcome as significantly more 
high-attractive 
experimenter- 
with the sub- 
subjects re- 


in the experi- 
menter-caused-foreknowledge cell viewed -the 


c the experimenter- 
caused-low-attractive-outcome cell not only 
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more incredulous that causing an outcome 
results in feelings of perceived control only 
when the subject has foreknowledge concern- 
ing what outcome he hopes to obtain. A 
second experiment was conducted for the 
purpose of replicating this finding. 


EXPERIMENT 2 
Method 


This experiment was a 2 X2 factorial design in 
which foreknowledge and outcome attractiveness 
were varied. The method was quite similar to that 
of Experiment 1, except that all subjects caused 
their own outcome by picking a marble from a bowl, 
and the outcomes obtained were foods to be tasted 
rather than consumer items. The subjects, 28 male 
Duke University students who participated in order 
to satisfy a course requirement, entered the lab in 
groups of three. They were told that the purpose 
of the study was to examine people’s attitudes 
toward food. It was explained that the aim of the 
project was to reduce the frequency of poor eating 
habits and malnutrition. After listening to this spiel, 
each subject went to a private booth where he saw 
eight foods in front of him. The foods were in 
individual containers. Half had been selected to be 
quite attractive (e.g. chocolate torte); the remain- 
ing foods were selected to be unattractive (e.g. 
fried grasshoppers). After inspecting the foods for 
several minutes, subjects were asked to rate them 
on a 21-point scale according to how much they 
liked each one. 

Subjects were then given a filler task, a question- 
naire asking them to rate a variety of food products. 
~ While they were working on this task, the experi- 
menter studied their initial ratings and selected a 
high- and a low-rated food to be used as outcomes 
for each subject. The experimenter collected the 
filler questionnaire. She then informed the subjects 
that they would be asked to consider two of the 
foods in more detail than they had in the initial 
ratings. She came around and set aside the two 
foods that had been predetermined for each subject 
as his outcomes, Then she informed the subjects 
that “an important component of people’s attitudes 
toward foods is how much they like their taste” 
and told them that they would be tasting one of 

the foods before them. It was explained that they 
} would not be able to choose which food they 
tasted, since subjects must be “randomly assigned” 
to conditions. They were told that which food they 
would taste would be determined by chance, by the 
color of marble that was picked from a bowl 
containing two marbles of different colors. 

Subjects were then given a bowl containing a red 
and a blue marble. All subjects were required to 
“reach in without looking and pick out a marble. 
Subjects in the foreknowledge condition were told 
before picking: “If you pick the red marble, you 
will get to taste the food on your left. If you pick 
the blue marble you will get to taste the food on 
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your right.” Subjects in the no-foreknowledge condi- 
tion were told, “After you pick out a marble, please 
turn over the cards I have put down in front of 
you. They will tell you which marble stands for 
which food.” Half of the subjects picked the marble 
that assigned them to the attractive food; the re- 
maining subjects picked the marble assigning them 
to the unattractive food, 

Subjects were then given another filler task before 
making the final ratings of the foods. They were 
asked to indicate their prior experience with some of 
the foods. After approximately 5 minutes had passed, 
subjects were asked to rerate the foods and to 
complete a questionnaire designed to measure their 
reactions to the study. The questionnaire asked sub- 
jects about perceived control, choice, and responsibil- 
ity. After completing this questionnaire, subjects 
were given an opportunity to taste the foods if they 
so desired but were not required to do so. All sub- 
jects were then probed for suspicion and debriefed. 


Results 


The means for subjects’ feelings of control, 
choice, and responsibility are reported in 
Table 5. A two-factor analysis of variance 
revealed that subjects in the foreknowledge 
condition perceived themselves to have sig- 
nificantly more control, F(1,24) = 5.740, 
p < .025, and significantly more choice about 
which food they ate, F(1,24) = 9.092, p < 
.006, than subjects in the no-foreknowledge 
condition. The analysis for the responsibility 
question fell short of significance, although 
the means are clearly in the predicted direc- 
tion (see Table 5). As in the previous study, 


TABLE 5 


Means ror SUBJECTS’ PERCEPTIONS OF CONTROL, 
CHOICE, AND RESPONSIBILITY IN EXPERIMENT 2 


Foreknowledge 
Attractiveness — — — — 
No foreknowledge Foreknowledge 
Low 
Control 7.00* 11.71 
Choice 7.00% 11.71 
Responsibility 10.00° 13.86 
(n = 5) (n= 1) 
High 
Control 6.71 17.44 
Choice 1.71 12.11 
Responsibility 9.57 16.33 
(n= 7) (n = 9) 


à Subjects were asked how much they felt they could influ- 
ence what color of marble they received on a scale ranging from 
0 (not at all) to 31 (very much). 

b Subjects were asked how much choice they felt they had 
about which food they got to taste on a scale ranging from 0 
(no choice at all) to (complete freedom of choice). 

«Subjects we! how responsible they thought they 
were for the fact that they got to taste “‘this food" instead of 
the other one on a scale ranging from 0 (not at all responsible) 
to 31 (completely responsible). 
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there were no effects for the attractiveness 
variable or interactions between the variables. 
Again, there were no significant effects for the 
outcome evaluation data. 


Discussion 


The most important finding to emerge from 
this research is that merely causing a chance 
outcome and having foreknowledge about the 
consequences induces feelings of perceived 
control. It is perhaps surprising that such 
subtle variables as who picks the marble 
that determines the chance outcome and 
whether the subject knows what each marble 
stands for at the time should influence per- 
ceived control. These variables produced ef- 
fects even though subjects were specifically 
told that the outcome would be determined 
by chance. The major contribution of the 
present study, then, is the delineation of two 
important determinants of perceived control, 

Although it seems inherently plausible that 
most manipulations of perceived control might 
affect subjects’ attributions of choice and re- 
sponsibility as well, these variables have not 
previously been included as dependent mea- 
sures. The present research demonstrates that 
causality and foreknowledge produce feelings 
of choice and responsibility as well as feelings 
of control. It is somewhat remarkable that 
subjects felt any choice or responsibility at 
all for the outcome, since they were told that 
the outcome was determined by chance. The 
relatively high intercorrelations between con- 
trol, choice, and responsibility in the present 
studies suggest the possibility that the effects 
of perceived control reported in previous 
studies are mediated by feelings of choice and 
responsibility. The present results clearly sug- 
gest that these variables should be taken more 
seriously by individuals studying controlled 
and uncontrolled outcomes. 

One surprising thing about the results of 
these experiments was the failure to find any 
significant differences between subjects’ evalu- 

ation of their outcomes in the various experi- 
mental conditions. Particularly since a rela- 
tionship has been found between control, 
choice, and responsibility, one might expect 
subjects to evaluate controlled outcomes more 
favorably than uncontrolled outcomes. There 
is not even the weakest trend in the data to 
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suggest that this is the case. Of course, it is 
important to determine whether there is really 
no relationship between control over an out- 
come and evaluation of that outcome or 
whether the present experiments were not 
designed well enough to detect such a 
difference. 

Whether control is defined as the ability 
to perform an instrumental response that 
terminates the stimulus, access to an “escape 
button” that will turn off the stimulus, or 
self-administration of the stimulus, there is , 
little consistent evidence that individuals view 
outcomes that they “control” more favorably 
than outcomes they do not “control.” Geer, 
Davison, and Gatchel (1970) manipulated 
control by leading subjects to believe that 
they had avoided an aversive stimulus by 
speeding up their reaction time. There were 
no significant differences in pain thresholds 
between subjects who had perceived control 
and subjects who had not. F urthermore, sub- £ 
jects with control did not rate the experience 
as any less irritating than subjects without 
control. Using a very similar procedure, 
Glass, Singer, Leonard, Krantz, Cohen, and 
Cummings (1973) found evidence that sub- 
jects who believed they could control electric 
shocks recalled them as less painful than sub- 
jects who did not, In a study by Corah and 
Boffa (1970) and two experiments by Glass 
and Singer (1972), control was manipulated \ 
by informing subjects of an “escape button” 
and giving them “free choice” about using it. 
Corah and Boffa found only a weak trend 
when they asked subjects to make discomfort 
ratings of the shock. In the two experiments 
by Glass and Singer, subjects were asked to 
indicate how distracting, irritating, and un- 
pleasant the noise was. In the first study, sub- 
jects with control rated the noise as signifi- 
cantly less aversive on only one of these ad- ï 
jectives (irritating), although ratings for a 
second adjective (distracting) approached 
significance. In the second experiment, there 
were no significant differences on any of these 
items although there was a marginally sig- 
nificant tendency for subjects with control to 
view the noise as less unpleasant. Even Glass 
and Singer did not take these differences 
seriously (see Glass & Singer, 1972, p. 68) 
and they mentioned that subsequent experi 
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ments in their laboratory had not replicated 
these effects. In fact, they dismissed a disso- 
nance or Bem-type explanation for their data 
by pointing out that a significant difference 
in the subjective ratings did not occur (see 
Glass & Singer, pp. 68-69). 

Most of the experiments manipulating who 
administers the aversive stimulus have in- 
cluded subjective measures, and none have 
found evidence that control affects subjective 
ratings of the aversive stimulus. Haggard 
(1946) found reliable evidence that subjects 
who administer their own shock show less 
autonomic arousal than those who do not. 
Although he indicated that subjects were also 
asked to rate “the subjective strength of the 
shocks,” Haggard never reported on this mea- 
sure in his results section, most likely be- 
cause significant differences were not ob- 
tained. Pervin (1963) manipulated whether 
the shocks were administered by the sub- 
ject or the experimenter and found that 
although subjects preferred subject-adminis- 
tered shocks, they did not rate these shocks 
as less painful than experimenter-adminis- 
tered ones. Staub et al. (1971, Experiment 1) 
also manipulated who administered the shocks 
and found no significant differences in pain 
thresholds reported by the subjects. Stotland 
and Blumenthal (1964) exposed subjects to 


J a testing situation and asked them for sub- 


jective ratings of anxiety. They found no sig- 
nificant differences between a group that had 
some control over the situation and a group 
that did not. Taken as a whole, these studies 
do not suggest that there is a clear tendency 
for individuals to evaluate controlled out- 
comes more favorably than uncontrolled ones. 

There are clearly many unanswered questions 
concerning individuals’ evaluation of outcomes 
they receive. First of all, the significant mar- 
ginal results obtained in some of the experi- 
ments on perceived control (see, €.g., Corah 
& Boffa, 1970; Glass & Singer, 1972; Glass 
et al., 1973) are tempting. They suggest that 
there may be some subtle and elusive condi- 
tions under which perceived control does af- 
fect outcome evaluation. Second, many in- 
vestigators have manipulated variables that 
would seem to be conceptually related to con- 
trol and have found that these variables do 
affect outcome evaluation. For example, 
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Pervin (1963) and Staub et al. (1971, Ex- 
periment 2) manipulated whether the subject 
could predict the occurrence of an aversive 
stimulus and found that predictability re- 
duced pain thresholds and affected subjective 
estimates of the stimulus. It is not clear why 
predictability should affect outcome evalua- 
tion differently than perceived control. To 
complicate matters further, other experiments 
manipulating predictability (see Glass & 
Singer 1972; Haggard, 1946) have failed to 
find a relationship between predictability and 
outcome evaluation, even though their ma- 
nipulations were virtually identical to those 
used by Pervin and Staub et al. Clearly, the 
relationship between these variables and 
evaluation of one’s outcomes is in need of 
more research attention. 

Suggestions for additional research are also 
provided by the responses of subjects in the 
experimenter-caused condition who received a 
low-attractive outcome. Subjects in this cell 
tended to view the procedure as unfair and 
concluded that the outcomes were “fixed” 
once that possibility has been suggested to 
them. They also showed a tendency to dero- 
gate the outcome they obtained. These find- 
ings suggest that individuals react quite nega- 
tively when they receive a relatively unat- 
tractive outcome that they do not cause. It is 
important to determine whether this negative 
reaction is a reliable one, and if so, whether 
it is ameliorated as time passes. 

Since these results suggest that the fore- 
knowledge variable produces reliable differ- 
ences in perceived control, it is appropriate 
to ask why this is the case. One possibility 
is that in this condition, subjects make an 
effort to get a certain marble—they may 
reach into the dish in a certain way, for ex- 
ample—and this attempt at control leads to 
feelings of actual control. Hopefully, future 
research will shed light on this question. 

There are some practical implications of 
the present results that might be mentioned. 
They imply that in any situation where it is 
necessary to allocate treatments randomly, 
recipients’ reactions are likely to be more 
favorable if they are led to believe that they 
have some control over the process. In the 
present study, subjects who picked their own 
marble and had foreknowledge of what each 
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marble stood for viewed the outcome as less 
“fixed” than subjects in the remaining condi- 
tions. This technique might be used by ad- 
ministrators of social programs who wish to 
randomly assign participants to treatment and 
control groups in order to evaluate the effec- 
tiveness of their program. Obviously, judg- 
ments that the outcomes are fixed may have 
detrimental effects on their research project. 
The present results suggest that such judg- 
ments will be less likely if the randomization 
process is set up so that participants feel that 
they have some control over it (see Campbell, 
1971, for a more detailed discussion of these 
issues). 

In conclusion, the present research has 
added to our knowledge concerning control 
by delineating two important determinants of 
perceived control and by demonstrating a 
relationship between perceived control and 
feelings of choice and responsibility. Many 
questions still remain unanswered, but the 
results suggest that control is indeed a rich 
area for further investigation. 


REFERENCE NOTE 


1. Campbell, D. T. Methods for the experimenting 
society. Unpublished manuscript, Northwestern 
University, 1971, (Manuscript submitted for 
publication, 1974. 
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Do People Have Inflated Views of Their Own Ability? 


Judith Weiner Regan, Holly Gosselink, Janet Hubsch, and Elizabeth Ulsh 
Wells College 


An experiment tested a need-for-self- 
appraisals of ability, deny criticism, an 


esteem notion that people inflate self- 
d overrespond to praise. Actors and 


bystanders rated actors’ performances on a nonobjective task after hearing 
the performance praised or criticized or before hearing it evaluated. No evi- 


dence of seli-enhancement was found; 


to the contrary, actors compared to 


bystanders rated themselves harshly, lowered their ratings after criticism 

equally, and showed relief after praise. A second experiment compared actor 

and bystander ratings of actors who expected evaluation of their performances 

or who expected no evaluation. Results suggested self-derogation by actors as 
~ a defense against possible loss of self-esteem. 


One of the most popular notions in psy- 
chology is that people need high self-esteem. 
Psychological theories stemming from this 
view range from the clinical theories of 
Freud and Rogers to the experimental social 
psychology models of Dittes (1959) and 
Walster (1965). One of the implications of 
this notion which has received considerable 
experimental attention is that need for high 
self-esteem causes a person to have an in- 
flated and distorted view of his own ability. 
People are hypothesized to be self-enhancing 
and defensive in their self-evaluations. Spe- 
cifically, three predictions derive from this 
hypothesis: People should inflate evaluations 
of their own ability and skill, they should 
believe praise, and they should defend against 
criticism. 

Gergen (1971) sounds this theme repeat- 
edly in his recent discussion of the self- 
concept. He says for example, “[a] person 
learns to seek out the feeling of being es- 
teemed for its own sake and to avoid the 
feeling of being valueless. To feel esteem for 
self is akin to one’s most basic experience of 
well being” (p. 69). “We are hungry for com- 
pliments but skeptical of criticism” (p. 48). 
“We may come to believe we are something 
simply because we wish to be. . . . Specif- 
ically, there is a strong tendency to select out 
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those cues that provide positive confirma- 
tion—that is, gratify one’s aspirations” (p. 
SED 

Most research appears to confirm the pre- 
diction of seli-enhancement. However, there 
are three stringent criteria which should be 
met for an adequate test: (a) Self-esteem 
needs must be aroused in a situation in 
which distortion of one’s ability level is pos- 
sible. This requires that the ability dimension 
be important enough to arouse self-esteem 
needs and that actors be unsure of how skilled 
they are on this dimension. (b) The reac- 
tions of actors should be compared with the 
reactions of bystanders, and this comparison 
should be made in the absence of evaluations 
from others about the actor’s ability, after 
praise, and after criticism. If people engage 
in ego enhancement, the actor should rate 
himself higher, believe praise more, and be- 
lieve criticism less than a bystander does 
when rating the actor. The actor and by- 
stander should independently rate the actor’s 
performance which the bystander observes. 
As Gergen (1971) has noted, correlational 
studies showing that actors and people who 
know them usually agree in their evaluations 
of the actor’s ability do not rule out the 
possibility that actors have merely convinced 
the observers of their self-appraisals. (c) 
Ratings by the actor and bystander should be 
obtained of the actor’s ability as well as the 
actor’s and bystander’s liking for the praise 
giver or critic. Many theories predict that 
criticism is painful and the critic disliked 
(and vice versa for praise). The need-for- 
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self-esteem hypothesis adds the prediction 
that criticism is denied and not allowed to 
affect the recipient’s self-evaluation. 

None of the studies which seems to show 
that people are defensive and self-enhancing 
in their evaluations of their ability has met 
all three criteria, and none is therefore con- 
vincing. For example, the first criterion is not 
met by studies in which the skill level being 
rated was not both uncertain and important. 
Deutsch (1960) and Deutsch and Solomon 
(1959) compared actors’ and observers’ rat- 
ings of the actors’ performances as well as 
their reactions to praise and criticism. They 
found little indication of self-enhancement by 
actors rating their own ability, but recent 
studies (Dutton, 1972; Jones & Schneider, 
1968; Skolnick, 1971) have suggested that 
Deutsch and Solomon’s subjects regarded the 
ability as unimportant and were convinced of 
their level of ability before they heard the 
interpersonal evaluations. Under such circum- 
stances esteem needs may not operate. 

Several studies have shown that people 
change their self-evaluations more after praise 
than after criticism (cf. Wylie, 1961), but 
this effect is not necessarily the result of self- 
esteem needs. Everyone, including people not 
ego involved, might find negative information 
simply less credible or valuable than positive 
information. As stated in the second criterion, 
actors must show this difference in responsive- 
ness to praise and criticism more than observ- 
ers. 

Evidence which is often cited as showing 
that people are motivated by self-esteem 
needs in their self-evaluations is the fre- 
quently demonstrated effect that praise givers 
are liked, whereas derogators are disliked 
(e.g., Aronson & Linder, 1965; Dittes, 1959; 
Harvey, Kelley, & Shapiro, 1957; Jones, 
Gergen, & Davis, 1962). These studies should 
not be taken as evidence of ego needs be- 
cause they did not compare actors and by- 
standers (the second criterion), Perhaps 
everyone (target and bystander) finds people 
who praise more likable than people who 
criticize. On the other hand, a few studies 
have met the second criterion. For example, 
Jones, Hester, Farina, and Davis (1959) 
compared actors and bystanders, showing 
that actors disliked a credible derogator more 
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than bystanders, but the experiment did not 
determine the effect of the feedback on evalu 
ations of the actors (the third criterion) 
Perhaps actors were more distressed than by 
standers by the criticism but also more re 
sponsive to it in reevaluating their ability 
Both the second and third criteria must be 
met if we are to distinguish distress from de 
nial and elation from self-enhancement. 

The present experiment attempted to meet 
all three criteria specified above to see ii 
people are motivated by esteem needs in thei: 
self-evaluations of ability. A bystander ob- 
served an actor perform a task for which no 
objective means of evaluation was available. 
The bystander and the actor then heard the 
performance praised or criticized. Instead of 
measuring evaluations of the performance 
twice, before and after feedback, which would 
have made subjects self-conscious about their 
reactions to the feedback, an after-only de- 
sign was used. Each bystander and actor rated 
the actor’s performance after hearing it 
praised or criticized. Two additional groups 
of subjects (actors and bystanders) rated the 
performance without having heard it evalu- 
ated. An ego-involving task was chosen and 
the feedback was made personally threatening 
or personally gratifying to evoke whatever 
defensiveness subjects might have. 


EXPERIMENT 1 
Method 


The experimental design called for 12 treatments: 
actors and bystanders received either no feedback, 
praise, or criticism, and two experimenters (the sec- 
ond and third authors) were each responsible for 
half of the subjects in each treatment. The subjects 
were all females at Wells College. Each subject was 
randomly assigned to an experimenter and to an 
actor or bystander treatment before she arrived at 
the experiment. 


Actor Condition 


Experimenter A met the actor subject and took 
her through the first of two small, interconnecting, 
soundproof rooms. She pointed out a microphone 
and one-way mirror in the first room and seated the 
subject in the second room facing a dummy micro- 
phone and the blind side of the mirror, She surrep- 
titiously activated a hidden tape recorder. 

The subject was led to believe that she was taking 
Part in a project to determine the Personality traits 
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which distinguish good and poor interviewers. The 
subject would be interviewed by a novice inter- 
viewer who would make some inferences about the 
subject on the basis of her answers. The subject 
was given four written questions supposedly from 
the interviewer and was told that she could think 
about her answers for 5 minutes and then answer 
two questions aloud for 5 minutes. The questions 
were designed to be common and nonfactual, for 
example, “Why did you choose a women’s college, 
Wells in particular?” The experimenter explained 
that the interviewer would listen to the subject's 
answers, tell the subject her evaluation, and then 
the subject would describe how accurate and insight- 
ful she thought the interviewer had been, The inter- 
viewer would remain behind the one-way mirror “to 
eliminate extraneous influences” from the subject’s 
judgment. 

The experimenter left the subject alone, closing 
the soundproof door. She randomly assigned the 
subject to one of three feedback conditions, and 
after giving the subject 5 minutes to answer the 
questions she spoke to the subject over the micro- 
phone and told her that after a few moments “to 
collect her thoughts” the interviewer would give her 
impressions. 

In the positive and negative feedback conditions, 
after about 30 seconds a female “interviewer” gave 
what was in fact a prerecorded evaluation of the 
subject’s performance and intuitions about the sub- 
ject’s personality. The feedback was designed to 
sound like a student giving off-the-cuff first impres- 
sions. 

The positive feedback praised the subject’s an- 
swers for being interesting, well-organized, etc. and 
praised the subject for having confidence in her 
ideas, handling a stressful situation well and showing 
interest in what she was asked to do. The same 
person made the critical tape, and she noted roughly 
parallel attributes although with negative com- 
ments: giving uninteresting and narrowminded 
answers, lacking confidence, using standard ideas, 
handling stress poorly, etc. The evaluator was 
pleasant and sounded apologetic for being critical. 

When the interviewer's feedback was finished the 
experimenter entered the subject’s room, gave her 
the questionnaire to answer and left the room while 
the subject filled it out. The subject was finally 
questioned for suspicion and then debriefed with 
care to ensure that she realized the evaluation was 
fictitious. She was asked for permission to use the 
tape recording of her experimental session. 

In the no-feedback condition, after telling the 
subject to expect the interviewer’s remarks mo- 
mentarily, the experimenter entered the subject’s 
room and said that before the subject heard the 
interviewer's evaluation she should rate her own 


he subject was given the standard 


performance. Tl 
questionnaire and asked to answer only the self- 


evaluation question. The subject’s suspicion of the 
manipulations was then explored; she was debriefed 
and permission to use her tape recording was ob- 


tained. 
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Bystander Condition 


Each bystander subject was randomly yoked to an 
actor subject. Experimenter B met the subject, led 
her through the first room pointing out the micro- 
phone and one-way mirror, and seated the subject 
in the second room at the table on which the tape 
recording of the actor subject was now placed, The 
experimenter said, “You will be listening to the tape 
of an experiment run by my partner a few days 
ago. The subject was unaware of being taped. Please 
listen carefully and then I will explain your role to 
you.” The subject and experimenter listened to the 
explanation and instructions given to the actor. Then 
the experimenter stopped the tape and explained 
that the subject would hear the recorded student’s 
answers and the interviewer's evaluation and would 
be asked to evaluate how accurate and insightful 
she thought the interviewer had been. The subject 
was given a copy of the questions, the tape re- 
corder was turned back on, and the subject was left 
alone. 

The subject heard the answers and then either 
heard one of the two feedbacks from the inter- 
viewer or heard Experimenter A interrupt the re- 
corded student to ask for her self-evaluation prior to 
the feedback. Since the tape recordings were un- 
marked, Experimenter B did not know which feed- 
back the subject would hear. After the feedback or 
no-feedback treatment, the experimenter entered the 
subject’s room, turned off the tape recorder, gave 
her a questionnaire, and left her alone to answer it. 
Subjects were debriefed and all subjects were prom- 
ised and later given a copy of the results. 

The questions for actor and bystander conditions 
were basically the same with appropriate pronoun 
changes. Four questions were answered on 11-point 
scales with higher numbers indicating more positive 
ratings. They were the following: “How accurate 
was (her) (the interviewer's) evaluation of (your) 
(her) answers? How good do you think (your) 
(her) answers were? How much do you think the 
interviewer liked (your) (her) answers? Overall, 
how would you rate her as an interviewer?” There 
were, in addition, five 7-point semantic-differential- 
type scales for rating the interviewer: cold-warm, 
unfairfair, critical-complimentary, unlikable—likable, 
insensitive-perceptive. Higher numbers indicated 
more positive ratings. Bystander subjects also 
answered questions to ascertain whether they recog- 
nized the voice of the actor subject they heard, 


Results 


Eight bystander subjects evenly distributed 
among feedback conditions thought they rec- 
ognized the actor subject. Rather than elimi- 
nate so many subjects from the analysis, we 
decided to keep them although doing so works 
against effective manipulation of the feed- 
back variable. 

Three subjects were lost from the actor 
condition because trouble with the tape re- 
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TABLE 1 


RATINGS ror THE Question, “How Mucu Do You 
THINK (SHE) (THE INTERVIEWER) LIKED (Your) 
(HER) ANSWERS?” IN EXPERIMENT 1 


Role Praise Criticism 
Actor 8.8 1.7 
Bystander 7.9 2.6 


Note, 1 = not at all; 11 = very much. 


corder prevented adequate taping of their 
performances. An additional actor subject 
took part in the experiment twice and her 
second performance was eliminated. Bystand- 
ers yoked to these actors were also elimi- 
nated. Two subjects confessed to having heard 
about the experiment, and their experimental 
sessions were terminated before the dependent 
measures were obtained. All 10 of these sub- 
jects were replaced and a total of 60 subjects, 
evenly distributed among conditions, remained 
in the sample. 

The success of the manipulation of praise 
and criticism can be seen in answers to the 
question, “How much do you think (she) 
(the interviewer) liked (your) (her) an- 
swers?” The two experimenters were included 
as one of the factors in analyses of variance 
on these and other data, but since there were 
no significant experimenter effects, the results 
for the experimenters have been combined in 
tabular presentations. The difference between 
praise and criticism shown in Table 1 is 
highly significant, F (1,32) = 377.25, p< 
.001. A related measure is the semantic differ- 
ential dimension of “critical-complimentary,” 
which also shows a strong effect of feedback, 
F (1,32) = 206.22, p < .001. The interaction 
effects on these two measures is discussed 
later. 

The primary dependent variable was the 
question, “How good do you think (your) 
(her) answers were?” The results presented 


TABLE 2 


RATINGS FOR THE Question, “How Goop Do You 
THINK (Your) (HER) ANSWERS WERE?” 
IN EXPERIMENT 1 


Role No evaluation Praise Criticism 
Actor 45 6.5 3.8 
Bystander 6.5 6.4 5.0 


Nole. 1 = very poor; 11 = very good. 
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in Tables 2 and 3 show that feedback 
cantly affected evaluations of the acto 
formances and that actors rated then 
significantly less positively than their 

ers rated them. The prediction that for 
self-esteem leads to self-enhancem 

thus not confirmed. This becomes eve 

when we examine the results in mor 

The self-esteem notion predicts di 
response to criticism. Criticism shou! 
self-evaluations less than it affects 
tions by observers. An analysis of 
performed only on the no-feedbac! 
criticism treatments shows that criticis 
ered the evaluations of both actors and ob 
servers, F (1,32) = 4.46, p < .05. Moreover, 
in spite of the no-feedback difference, ciili- 
cism lowered the evaluations of actors and 
observers equally (for the interaction, F < 
1). Actors apparently did not respon’ 
criticism by defensively refusing to 
their self-evaluations. 

Self-esteem theory suggests that actors raise 
their evaluations in response to praise 1 
than observers because they are eager 
enhancement. The results do show that ac 
responded to praise significantly more 
bystanders (for the interaction, F (1, 
4.4, p < .05). However, the effect is < 
tirely to the low no-feedback self-eva 
by actors, which makes their relatively strong 
response to praise look more like relief than 
like ego enhancement. 

If actors were motivated by desire for self- 
enhancement, we might expect to sec 
rejecting the critic and accepting the 
giver more than bystanders. Actors did dis- 


TABLE 3 
ANALYSIS OF VARIANCE ON ANSWERS TO THE Q 
“How Goop Do You Tunk (Your) (H 
ANSWERS WERE?” IN EXPERIMENT | 


Source df MS E 


Actor/bystander (A) 
Feedback (B) 
Experimenter (C) | 
AXB 


20* 


16.017 6 


F] 
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tinguish the critic from the praise giver as 
less likable, F (1,32) = 18.8, p< .005, a 
worse interviewer overall, F (1,31)* = 4.9, 
p<.05, and colder, F (1,32) =36.6, p= 
001. They described the critic and praise 
giver as equally sensitive (F < 1) and accu- 
rate (F <1). In none of the comparisons 
above did the interaction between actors and 
observers approach significance; that is, ac- 
tors and bystanders evaluated the interviewer 
very similarly. On one rating of the inter- 
viewer, actors and bystanders disagreed some- 
what, but the direction of the difference was 
opposite to that expected from a notion of 
ego defense. In ratings of fair-unfair, subjects 
in both actor and bystander conditions rated 
the critic as less fair than the praise giver, 
F (1,32) = 7.0, p < .025. In addition, there 
was a tendency for actors to see the praise 
giver as less fair than bystanders saw her, 
while seeing the critic as more fair than by- 
standers saw her, F (1,32) = 2.1, p< -18. 
Clearly none of the ratings of the evaluator 
suggest that actors accepted the praise giver 
or rejected the critic to protect their self- 
esteem. 

A possible alternative mode of defense to 
maintain high self-esteem is to distort the 
message itself so that it is regarded as more 
positive than it really is. We tested this by 
asking actors and observers to rate how much 
they thought the interviewer liked the actors’ 
answers. As shown in Table 1, actors thought 
the critic was more unfavorable and the praise 
giver more favorable than bystanders thought 
she was, F (1,32) = 7.95, p< 01. A similar, 
albeit weaker, interaction occurred on ratings 
of critical-complimentary, F (1,32) = 3.8, $ 
= 06. These results not only do not fit the 
notion of defensive distortion but suggest 
heightened sensitivity by actors to both posi- 
tive and negative feedback. 

While actors did not inflate their self- 
evaluations, they seem not to have responded 
to all of the same motivations and/or cues as 
bystanders. The correlation between actors’ 
and bystanders’ ratings was .33 which is sig- 
nificant at p < -01 but moderate. Actors rated 


themselves less positively than bystanders 


rated them and this self-derogation was sig- 


1 One subject failed to answer this question. 
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nificant in the no-feedback treatment (p < 
05). 

One possible explanation of the difference 
in the no-feedback treatment is that actors 
were preparing for a possible negative evalu- 
ation from the interviewer. They may have 
been protecting themselves from losing self- 
esteem by taking a low profile in advance. 
Jones et al. (1959) also speculated that their 
actor subjects who were most harsh on 
themselves, “through this mechanism of self- 
derogation . . . may have prepared for the 
worst” (p. 368). Self-derogation as a defense 
mechanism is plausible only if holding a low 
self-opinion is less painful than the experi- 
ence of losing self-esteem. Recent evidence 
(Aronson & Linder, 1965; Mettee, 1971) 
suggests that loss of esteem is indeed more 
unpleasant than constant low esteem. 

If this explanation of the finding that ac- 
tors rated themselves relatively harshly is 
correct, the effect should depend on the fact 
that actors anticipated being evaluated by 
the interviewer. Actors might not have rated 
themselves lower than bystanders if no eval- 
uation had been anticipated. To test this 
explanation a second experiment was per- 
formed. 


EXPERIMENT 2 


Method 


Thirty-eight female students in introductory psy- 
chology who volunteered to participate in research 
for course credit were randomly assigned to one of 
four conditions: actor or bystander and expecting 
evaluation from an interviewer or not expecting 
evaluation. The procedure was almost identical to 
that used for the no-feedback condition in Experi- 
ment 1. 

When the subject arrived at the experimental room, 
Experimenter C randomly assigned her to be an 
actor or a bystander. She then introduced the ex- 
periment as previously described, telling actor sub- 
jects that they would answer questions orally, re- 
ceive feedback from an interviewer, and then would 
evaluate the interviewer. The session was surrepti- 
tiously tape-recorded in the actor conditions. As the 
actor subject answered the questions orally, the 
experimenter randomly assigned her to the evalua- 
tion or no-evaluation condition. 

If the subject was in the evaluation treatment, 
the experimenter entered the room when the sub- 
ject’s 5 minutes for answering the questions were 
over and gave her the same questionnaire as used 
in Experiment 1. She pointed to Question 2 (“How 
good do you think your answers were”) and said, 
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TABLE 4 


RATINGS FOR THE Question, “How GooD Do You 
Tuk (Your) (HER) ANSWERS WERE? 
IN EXPERIMENT 2 


Variable Evaluation No evaluation 
5.56 6.4 
Actor O ci 
Bystander 7.11 .2 
4 (9) (10) 


Note. n is in parentheses, 


“While the interviewer is thinking over her evalua- 
tion, would you please answer this question?” This 
condition duplicated the no-feedback treatment in 
Experiment 1. : 

If the actor was assigned to the no-evaluation 
treatment, the experimenter entered the room when 
the 5-minute answer period was over and delivered 
the following message: 


Actually, you are a pretest subject and will not be 
evaluated. There is no interviewer. The experiment 
was run today for my practice so that we can 
make sure things will run smoothly when we 
actually begin the experiment. So there was no 
interviewer listening to you, and in fact, I didn’t 
hear you either because your microphone isn’t 
hooked up. But before you leave would you mind 
filling out some questionnaires for me? 


The subject was then given the same questionnaire 
as in the evaluation condition and was finally de- 
briefed. 

The tape recordings of these sessions were un- 
marked and the experimenter was thus able to re- 
main unaware of whether the recorded actor was 
anticipating evaluation. When a subject was assigned 
to a bystander condition, the experimenter selected 
one tape at random and gave it to the bystander 
subject. She listened to the initial recorded instruc- 
tions with the subject and explained that the sub- 
ject was to listen to the rest of the recording and 
later give her impression of the interviewer. She 
then told the subject to signal her when the re- 
corded subject began to fill out her questionnaire 
and left the subject alone: As the actor subject began 
to fill out the questionnaire the experimenter en- 
tered the room and gave the bystander question- 
naires parallel to the ones filled out by actors. Thus, 
before the bystander evaluated the actor’s perform- 
ance she knew whether the actor expected to be 
evaluated by the interviewer. Bystanders were then 
asked if they recognized the voice on the tape and 

were debriefed. 


Results 


The primary results can be seen in Table 
4. Actors rated themselves significantly more 
harshly than bystanders rated them when 
they anticipated feedback from the inter- 
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viewer (contrast between independent means, 
Winer, 1971; F (1,34) = 4.89, p <.05). 
This difference replicates the difference found 
in Experiment 1. As expected, the no-evalu- 
ation cells do not differ, but the interaction is 
short of significance, F (1,34) = 3.12, p= 
.09. We expected actor subjects to rate them- 
selves more harshly when they anticipated 
evaluation than when they anticipated no 
evaluation. This difference, while in the ex- 
pected direction, is not significant. 

Several questions were asked to see if 
actors defended against possible criticism by 
discounting the situation or the questions as 
good indicators of their ability. Questions 
asking subjects to rate how nervous they 
were and how adequate the questions were 
showed no differences among the four condi- 
tions. One question asked, “To what extent 
did this type of situation allow (you) (her) 
to reveal (your) (her) true abilities?” The 
situation was seen as allowing actors to re- 
veal their true abilities to a greater extent by 
actors and bystanders in the no-evaluation 
condition than by subjects in the evaluation 
condition (p < .025). Thus, there was no 
evidence of defensiveness by actors other than 
self-derogation in anticipation of evaluation. 


GENERAL DISCUSSION 


To summarize the results of the two ex- 
periments briefly: Actors’ evaluations of their 
own ability were less positive than bystand- 
ers’ evaluations of them, and this was par- 
ticularly strong when feedback was antici- 
pated but not yet known. Actors accepted 
criticism equally as much as bystanders ac- 
cepted it. Praise was responded to more by 
actors than by bystanders, apparently in 
relief from an initially harsh self-evaluation. 
Ratings of the evaluator were similar for 
actors and bystanders, and ratings of her mes- 
sage suggested heightened sensitivity by ac- 
tors to both criticism and praise. These com- 
parisons are contrary to predictions of self- 
enhancement, but defense against loss of self- 
esteem is indicated. - 

It is interesting to note that these results 
conflict not for the most part with previous 
evidence but with the conclusions drawn from 

that evidence. Here, as previously, people 
changed their self-evaluations more after 
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praise than after criticism. Without compari- 
son with an observer we would not have 
realized that after praise the actors’ self- 
evaluations were only as high as observers 
rated them prior to feedback. Similarly, this 
study, like many previous studies, showed 
that critics are liked less than praise givers. 
The present study showed that dislike of the 
critic did not necessarily indicate rejection of 
her message. 

We should note that the results of these 
experiments show only relative self-deroga- 
tion by actors vis-a-vis bystanders. It may be 
that observers gave actors sympathy votes or 
the benefit of the doubt as is in fact suggested 
by the nonsignificantly higher evaluation by 
bystanders in the evaluation than in the no- 
evaluation condition of Experiment 2. Since 
the task was deliberately nonobjective, we do 
not know whether actors or bystanders were 
closer to being accurate. In fact, relative ac- 
curacy might vary depending on the nature 
of the task. We can say only that actors rated 
themselves less positively than observers rated 
them on an ego-involying, nonobjective task. 

While many questions are raised by the 
present study, the implications of previous 
research have not been upheld when all three 
of the criteria described earlier have been 
met. Subjects did not enhance evaluations of 
their own ability; they did not deny criti- 
cism, and they did not overreact to praise. 
Need for self-esteem may affect self-evalu- 
ations in some situations. Here it seemed only 
to lead those anticipating evaluation to de- 
fend against possible loss of self-esteem. 
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Effects of Intergroup Orientation on Group Attitudes 
Proxemic Behavior 


Allen H. Ryen and Arnold Kahn 
Iowa State University 


The present study examined the effects of five levels of intergroup orientation 
on (a) member evaluations of both ingroup and outgroup and (b) proxemic 


behavior both within and between 


groups. Twenty-four subjects divided into 


triads were assigned to each condition: (a) alone, (b) coacting, (c) cooperat- 


ing, (d) competing no feedback, and (e 


Members of alone and coacting groups 


) competing with win-lose feedback. 
chose random seating patterns, and 


coacting group members did not differentiate evaluatively between own and 
other group. Subjects in cooperating groups sat near both ingroup and out- 


group members, and displayed a slight 


no-feedback group members sat near oi 
outgroup members, rating own group much higher than other group. 
to the losing group, relative 


feedback caused subjects to sit closer 


bias favoring own group. Competing 
ther ingroup members but far from 
Winning 
to no- 


feedback conditions, while losing groups tended to sit as far from the win- 


ning group as possible; winning 


feedback caused even higher own-group 


evaluations than in the no-feedback conditions, while losing feedback pro- 


duced relatively low own-group ratings, 


The results suggest intergroup orien- 


tation provides (a) expectations about involvement with the outgroup and its 


effects on group goal 


attainment which 
evaluations, and (b) normative information 


influences ingroup and outgroup 
as to how to place oneself with 


regard to own- and other-group members, 


When individuals form into two or more 
groups and interact they often become an- 
tagonistic, developing positive attitudes 
toward their own group and negative atti- 
tudes, stereotypes, and prejudicial behavior 
toward the other group (e.g., Harding, Pro- 
shansky, Kutner, & Chein, 1969; Sherif, Har- 
vey, White, Hood, & Sherif, 1961; Tajfel, 
1969). Recent evidence by Kahn and Ryen 
(1972; Ryen & Kahn, 1970) and Rabbie and 
Wilkens (1971) suggests that mere classifica- 
tion into one group and exclusion from the 
other is sufficient to produce such an own- 
group bias. Kahn and Ryen (1972) inter- 
preted this phenomenon in terms of Heider’s 
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(1958) balance theory: Membership classifi 
cation into a group produces a positive u 
relation with it which implies a posit 

timent relation; exclusion from th 

group yields either a negative unit relation or 
the absence of a unit relation which implies a 
less positive sentiment. 

The orientation 
toward the other has also been shown to affect 
Own-group bias, 
distinctive attitudinal and possibly behavioral 
expectations, 
and intergrou 
for particular 
Structures (Sa 
tation is one ii 


er group goals were 


} t interdependent” (cooperative 
orientation) or “contriently interdependent” 


ich success by 
roup) 


EFFECTS OF INTERGROUP ORIENTATION ON Group ATTITUDES 


affected the interpersonal relations both with- 
in and between groups. 

Kahn and Ryen (1972) found that a com- 
petitive orientation produced a greater own- 
group bias than a cooperative one. Appar- 
ently, the greater the threat of the outgroup 
on the goal acquisition or stability of the 
ingroup the greater the reinforcement of the 
ingroup, and the greater the ingroup cohesive- 
ness (liking; see Lott & Lott, 1965) and own- 
group bias (cf. Coser, 1956; Sherif et al. 
1961). Kahn and Ryen (1972; Ryen & Kahn, 
1970) and Dion (1973) have found that 
group success and failure increase and de- 
crease the degree of bias, respectively. Kahn 
and Ryen (1972) interpreted these findings 
as reflecting the differential reinforcing prop- 
erties of the ingroup relative to the outgroup. 

Though the effects of intergroup orienta- 
tion of intra- and intergroup attitudes have 
been well documented, the effects of orienta- 
tion on intra- and intergroup behavior are 
less well explored (Sommer, 1969; Tajfel, 
1969). The present study was an attempt to 
examine the effects of intergroup orientation 
on intra- and intergroup proxemic behavior 
as well as on own-group bias. 

Recent evidence (Sommer, 1969) suggests 
that both intra- and intergroup spatial be- 
havior reflect the operation of cultural norms 
or expectations for various orientations. Som- 
mer (1969) suggests that the nature of the 
interpersonal and intergroup relationships 
(goal ‘orientations) determines seating behav- 
ior. Sommer has shown that cooperating indi- 
viduals prefer close distances with a side-by- 
side pattern, while competing persons choose 
far distances with a face-to-face pattern, and 
coacting pairs choose patterns which mini- 
mize intimacy. Hall (1966) suggests we de- 
velop normative proxemic relationships which 
are functional for particular types of social 
interaction. As the nature of the interaction 
becomes less personal and involved, the nor- 
mative distance for interaction increases; vio- 
lations of these spatial expectancies exert con- 
siderable influence on the perceptions of 
status, communication patterns, and affective 
reactions. The consistent seating patterns and 
small variances reported in interpersonal dis- 
tances across numerous studies (e.g., Batche- 
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Ficure 1. Diagram of the seating arrangements 
of the experimental room including the cooperative 
and competitive seating choices predicted by the 
cultural norm model. 


lor & Goethals, 1972; Sommer, 1969) attest 
to the usefulness of a normative model. 

Assuming intergroup orientation affects 
group proxemic behavior similarly to the 
way interpersonal orientation affects indi- 
vidual proxemic behavior (Sommer, 1969), 
then the cultural expectancy model would sug- 
gest orientation, per se, should predict seating 
distances and patterns. Results from the own- 
group bias literature suggest intergroup inter- 
action, actual or anticipated, is necessary to 
produce a cognitive differentiation between 
own group and other group (Kahn & Ryen, 
1972; Rabbie & Horowitz, 1969). Thus, co- 
acting groups should not display distinctive 
seating patterns with respect to either own- 
or other-group members. Cooperative group 
members should sit closer to one another than 
those in competitive groups, and the pattern- 
ing should resemble that for individuals 
(Sommer, 1969). The anticipated seating pat- 
terns derived from the cultural expectancy 
model are illustrated in Figure 1. Cooperating 
group members should choose seats close to 
one another and close to the other group at 
seats labeled “cooperative,” indicated in Fig- 
ure 1 by as and bs. Competing groups should 
choose seats which maintain maximum dis- 
tance from the other group, but with mini- 
mum distance from the ingroup sitting to- 
gether at the seats labeled “competitive,” in- 
dicated in Figure 1 by As and Bs. Alone and 
coacting groups were expected to choose co- 
operative and competitive seats only at chance 
levels. 
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In addition to coacting, cooperating, and 
competing orientations, the present study in- 
cluded additional competitive groups which 
were given feedback as to whether they won 
or lost, which was not given in the other 
orientation conditions. Past own-group bias 
research suggests that relative to no-feedback 
conditions, winning increases the bias and 
losing decreases it (Kahn & Ryen, 1972), 
On this basis, from the reinforcement view- 
point, winning group members would be pre- 
dicted to sit closer to one another and farther 
from members of the other group, while 
losing group members would be predicted to 
sit farther from one another and closer to 
the other group. In contrast, predictions from 
the cultural norm position would suggest that 
winning and losing produce differences in 
individual and group status, which would be 
manifested in different seating patterns for 
winning and losing groups, compared to 
groups receiving no feedback (Lott & Som- 
mer, 1967; Mehrabian & Friar, 1969). Thus, 
winning groups should take positions of 
higher prominence at the closer and more cen- 
tral cooperative seats (see Figure 1), while 
losing should lead one to assume an inferior 
space, a more distant competitive position, 


METHOD 
Subjects 


The subjects were 123 male students enrolled in 
introductory psychology courses at Iowa State Uni- 
versity who volunteered in return for course credit, 
Subjects were run in triads, and all members of a 
given triad were initially unacquainted. The data 
from one triad were deleted because the subjects’ 
confessed prior knowledge of the experimental ma- 
nipulations. This left eight triads (24 subjects) in 
each of the five experimental conditions. 


Design and Overview 


Five levels of the independent variable, intergroup 
orientation, were included in a one-way completely 
random analysis of variance design. The conditions 
were (a) alone, (b) coacting, (c) cooperating, (d) 
competing without win-lose feedback, and (e) com- 
peting with win-lose feedback. Following perform- 
ance of the experimental task, subjects were requested 
to take seats in another room. The distance subjects 
sat from others in their own group, the distance from 
other group members, and the pattern of seating 
choices were recorded nonobtrusively, Subjects then 
completed scales measuring the evaluation of their 
own and the other group, 
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Procedure 


When subjects arrived for the experiment they 
were immediately and randomly assigned to one of 
two experimental rooms and told they were members 
of that group. If two subjects arrived simultane- 
ously, each was assigned to a different room. When 
both groups had been filled with three members 
each, taped instructions were played which gave the 
intergroup orientation and explained the nature of 
the task. The same instructions were given simul- 
taneously to each group. 

Each group worked on the task for 15 minutes. 
After this time, the subjects were told the follow- 
lowing: “Please stop. The experiment is now fin- 
ished. If you would please take a seat outside in the 
waiting room, the experimenter will be with you 
in about 5 minutes to give you your experimental 
credit cards, as soon as he has finished recording 
the data.” 

Exactly three minutes after subjects were in- 
structed to wait in the adjacent room, seating 
choices, and intra- and intergroup distances were 
recorded nonobtrusively. After recording the dis- 
tances and seating patterns, the own-group bias 
measure was administered. 

Following completion of these scales the subjects 
were given credit for their participation, ques- 
tioned about the manipulations, fully debriefed, and 
dismissed. 


Task 


The experimental task was identical for all sub- 
jects. It required a group effort in solving a series 
of simple number problems. More specifically the 
task involved a trial-and-error process on the part 
of the subjects in matching a list of simple digit 
patterns. Group choices for a Particular digit se- 
quence were registered by activating switches on an 
apparatus, corresponding to ther choices. This ex- 
perimental apparatus was connected to a control 
panel in an adjacent room. Whether a given choice 
was “correct” or “incorrect” was indicated by lights 
on the apparatus which were controlled by the 
experimenter. Groups were required to make at least 
one choice every 5 seconds. 

In reality, there were no correct or in 
choices, and all feedback was Programmed Bate 
experimenter, Consequently, all groups were given 


mately every 30 Seconds, such th; 

30 s y at all groups re- 
gardless of experimental condition received ae 
identical performance feedback. hd 

The task was purpose} i 
, ly ambiguous so that sub- 
jects would find assessing inputs and outcomes dif- 


Intergroup Orientations 


The alone condition was used 
assess seating biases inherent i 
ture of the Seating accommoda: 


as a control to 
in the physica} struc- 
tions. Only one triad 
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at a time was present in this condition, and the 
possibility of the existence of another group was 
not mentioned. The coacting condition was included 
to assess the effects of a second, but noninteracting 
group on subsequent group evaluations and seating 
behavior. In this condition, subjects were informed 
that another group was present, but they were led 
to believe this was merely for experimental con- 
venience and that the performance of each group 
was entirely independent. In the cooperating condi- 
tion, each group believed their own performance was 
interdependent with the performance of the other 
group such that good performance by either group 
resulted in the simplification of the task for the 
other group. In the competing no-feedback condi- 
tion, each group believed that its successful solution 
of a problem resulted in increased task difficulty for 
the other group, and vice versa, such that the task 
success of either group was detrimental to the 
accomplishments of the other. 

In these first four conditions, the groups received 
no evaluation of their performance. In the fifth 
condition, competing with feedback, the orientation 
manipulation was identical to the competing no- 
feedback condition, except that after task comple- 
tion, one triad was publicly declared the winner and 
the other the loser. Winning and losing were based 
on the relative number of problems solved which 
was arbitrarily determined by the experimenter. 


Room 


The layout of the large room is illustrated in 
Figure 1. The room was 40X40 feet (12.192 X 
12.192m and contained four large tables (4 X 6 feet, 
1.22 X 1.83m) with six chairs around each (see 
Figure 1). All 24 chairs were neatly placed around 
the tables, and the tables were cleared. The tables 


“were arranged symmetrically such that two were 


equidistant from the entrances of both experimental 
rooms, one near (12 feet, 3.66m) both rooms and 
one far (27 feet, 8.23m) from both rooms. The two 
remaining tables were placed at an intermediate dis- 
tance, each centered 15 feet (4.57m) from the 
entrance of one room, and 24 feet (7.32m) from 
the other room. 


Dependent Measures 


Seating choices were observed through a hidden 
opening in the control room door. These choices 
were recorded on data sheets using as reference 
points the rectangular features of the tables. Subse- 
quent estimates of distance between subjects were 
based on knowledge of chosen seat, and the average 
distance between any two seats. 

Tntragroup seating distance was defined as the 
average distance among all three members of a 
group. In those conditions in which two groups were 
present during the experimental session, intragroup 
seating distances were averaged across both groups 
to yield a single intragroup distance measure per 
experimental session, since a group’s seating dis- 
tances may not have. been independent of the seating 


patterns of the second group. Intragroup seating 
distance was indexed by the average distance 
between members of different groups. 

Seating patterns were assessed by recording the 
incidence of sitting at particular seating positions 
and comparing this to predicted seating choices. 
Two other measures of seating pattern were derived 
to compare degrees of subject clustering across con- 
ditions, which are described in detail in the results 
section. 

The own-group bias measure (Kahn & Ryen, 
1972) consists of 11 semantic-differential-type adjec- 
tive pairs which yield a composite evaluation of 
own group and other group. In determining own- 
group bias, other-group ratings were subtracted from 
own-group evaluations; the more positive the score, 
the greater the own-group bias. In making evalua- 
tions of one’s own group, subjects were explicitly 
asked to exclude themselves from the group rating. 

Incidental data were also collected on the visual 
behavior of individuals toward own- and other- 
group members. Shoulder orientation was considered 
to be a reasonably stable index of visual orientation. 
Average comfortable head position is perpendicular 
to the shoulder plane, and thus visual orientation 
was defined as the deviation measured in degrees 
between this perpendicular and the geometric center 
of a group. Visual orientations for both own group 
and other group were calculated from the data 
sheets on which shoulder orientations had been 
estimated in reference to the rectangular features 
of the experimental room. 


RESULTS 
Own-Group Bias 


The evaluations of own group, other 
group, and the difference between these two 
measures—own-group bias—are presented in 
Table 1. After homogeneity of cell variance 
was demonstrated using Hartley’s statistic, 
Finax(4. 23) = 1.42, p > .05, individual ¢ tests 
were performed to test the hypothesis that 
own-group bias scores differed significantly 
from zero, As predicted, all conditions pro- 
duced significant own-group biases (ps < .02, 
dj = 23 using ¢ tests) with the exception of 
the coaction condition, in which there was a 
slight but nonsignificant bias, t(23) = 2.03, 
p< .10. In line with past research, competi- 
tion was expected to produce the greatest 
own-group bias, cooperation a moderate bias, 
and coaction the least bias. Winning was ex- 
pected to increase the bias and losing to de- 
crease it, relative to the competition no- 
feedback condition. The means for own-group 
bias, presented in Table 1, confirm these 
expectations. An analysis of variance for the 
own-group bias scores yielded a significant 
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TABLE 1 


EVALUATIONS OF OWN GROUP, OTHER GROUP, AND Own-Grovp Bras 
AS A FUNCTION OF ĪNTERGROUP ORIENTATION 


Intergroup orientation 


Dependent measure 


Coaction Cooperation omnes Win Lose 
e 66.83" 
Own group 61.50" 65.04" 69.92 71.58" 
a 59.08. 60.13 61.71 61.75 63.42 
gine men Bi 4.91Þb 8.21° 9.83° 3,418 


Own-group bias 


Note. For own-group ratings, those mi 
Keuls; for own-group bias those means 


overall effect for the experimental conditions, 
F(3.91) = 6.29, p < .001. Losing in competi- 
tion significantly decreased the bias relative 
to the no-feedback competition, (34) = 3.18, 
p < 01, but winning did not significantly in- 
crease the bias, £(34) = .99, ms. Also con- 
sistent with past research, own-group bias 
appeared due more to variations in own- 
group evaluation, F(3,92) = 7.53, p < .001, 
than other group evaluation, F (3, 92) = 1.84, 
p< .202 


Intragroup Seating Distance 


An analysis of variance revealed a signifi- 
cant overall effect for the orientation ma- 
nipulations on intragroup distances, F (4, 19) 
= 4.93, p<.01. The average intragroup 
seating distances within conditions is pre- 
sented in Table 2. In line with planned pre- 
dictions from the normative model, ¢ tests 
showed that subjects in the alone and coact- 
ing conditions sat significantly farther from 
one another than did subjects in the other 
conditions (ps < .05; for smallest difference, 
t(6) = 3.04) with no significant differences 
within the noninteracting conditions (alone 
and coacting) or within the interacting con- 
ditions (cooperative, competive no feedback, 
competitive win, and competitive lose). The 

1The split-half reliability for the semantic differ- 
ential instrument was r (94) =.65 for own group 
ratings and r (94) =.74 for other group ratings. 
Own-group and other-group scores were highly 
correlated, r (94) =.77, much in line with the find- 
ings of Wilson and Miller (1961) and Wilson and 
Kayatani (1968). Though this finding is typically 
taken to mean that esteem for own group varies 
directly as a psychological function of esteem for 
other group, this relationship may be inflated due 
to the effects of correlated errors in the repeated 
Measures design. 


eans having common superscripts are not significantly different at the .05 level by Neuman 
having common superscripts are not significantly different at the ,05 level by ¢ test. 


relationship between intragroup distance and 
own-group bias was not significant, 7(94) 
= — .17, p < 102 


Intergroup Seating Distance 


An analysis of variance performed on the 
average intergroup distances yielded a sig- 
nificant effect of the orientation manipula- 
tions, F (3,12) = 5.12, < .05, and the means 
are presented in Table 2. Consistent with 
normative expectations, cooperating groups 
sat closest to one another and groups com- 
peting with no feedback sat farthest away, 
with coacting and competing groups with 
feedback sitting at intermediate distances. 
All distances between treatment groups were 
significantly different from one another (p 
< .05 using ¢ tests) except between coacting 
and competition with feedback conditions. 
The relationship between intergroup distance 
and own-group bias was not significant, 
(94) = 12, p > .10. 


Seating Patterns 


On the basis of past research on dyadic 
seating positions and general normative ex- 
pectations, alone and coacting individuals 
were expected to take positions randomly, not 
preferring a given seat to any other. Co- 
operative groups were expected to occupy the 
same table, with each team taking opposite 
sides, and competitive no-feedback groups 
were expected to take seats at opposing 
tables, around the most distant corner from 
the other team. The hypothesized cooperative 
and competitive positions are illustrated in 
Figure 1. Winning competitive groups were 
hypothesized to assume the more central co- 


All b values reported are two-tailed. 


T 


TABLE 2 


MEAN IntTRAGROUP AND INTERGROUP SEATING 
DISTANCES AND STANDARD DEVIATIONS 
AS A FUNCTION OF INTERGROUP ORIENTATION 


Intragroup distance | Intergroup distance 


aren (in feet) (in feet) 
orientation 
M SD M SD 
Alone 7.508 2.08 = — 
Coaction 7.38" 2.91 10.88* 
Cooperation 4.25 1.15 5.74 
Competition 4.67» 1.12 14.39° 
Win 4.79% 1.22 | 
9.58" 
Lose 4.88 1.35 


| 


Note. Entries in columns with common superscripts are not 
significantly different at the .05 level by £ tests. 


operative positions, while losing competitive 
groups were expected to choose the more 
distant competitive seats. The frequency with 
which the cooperative and competitive seating 
positions were occupied with respect all others 
is displayed in Table 3. By inspection it is 
obvious the obtained seating patterns are not 
at all random. Generally, alone and coacting 
groups sat randomly, cooperative group mem- 
bers sat adjacent to one another at the same 
table as the other group members, and com- 
peting no-feedback members sat near own- 
group members at the farthest possible posi- 
tions from the other group. Furthermore, 
winning groups tended to sit in the near co- 
operative positions, while losing groups tended 
to assume distant positions, either in com- 
petitive seats or at the table containing these 
seats. 

Because the seating pattern data were not 
amenable to a chi-square analysis for good- 
ness of fit due to the small predicted cell 
frequencies for cooperative and competitive 
positions, orthogonal analyses were performed 
on two measures of clustering across the six- 
subject experimental units: incidence of any 
two subjects sitting together at a particular 
table and incidence of any two subjects 
sitting adjacent to one another, given they 
are seated at the same table. For the first 
analysis, a matrix of all possible two-subject 
seating patterns at a given table was, derived; 
two subjects were either both sitting at a 
“3 Generous thanks are 
for his assistance in deriving 
for the seating position data. 


due to Dr. Leroy Wolins 
the clustering analyses 
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TABLE 3 


EXPECTED AND ACUTAL SEATING FREQUENCIES AS A 
FUNCTION OF INTERGROUP ORIENTATION 


Seating positions* 


Intergroup |, 
orientation 

Cooperative | Competitive | Other 
Alone 24 3( 3) 4( 3) 17 (18) 
Coaction 24 Sirs) 7 (3) 12 (18) 
Cooperation | 24 22 (24) 1 (0) 1( 0) 
Competition | 24 1( 0) 16 (24) 7 (0) 
Win 12 11 (12) 0( 0) 1( 0) 
Tose 12 | 2(0) 5 (12) | 5 


Note, Values within the parentheses are expected frequen 
a See F - 1 for description of cooperative, competitive, 
and other seating positions. 


particular table or they were not. If two sub- 
jects were sitting at the same table, that cell 
entry was scored 1, and if they were not, 0. 
This scoring procedure is demonstrated in 
Table 4 for two groups sitting, all at a single 
table. Next, all the values in the off-diagonal 
of the matrix were summed. This measure 
yields scores ranging from 2, if subjects are 
sitting as randomly as possible, to 6, if the 
triads are sitting together but at a different 
table from the other triad, to 15, if all sub- 
jects are sitting together at the same table. 
In the second analysis, similar matrices were 
constructed for each six-man composite, and 
their seating patterns were scored for inci- 
dence of subjects sitting adjacent to one an- 
other at a table. Since there were 15 possible 
two-subject combinations in the off-diagonal, 
but only six possible adjacencies given only 


TABLE 4 


SAMPLE SCORING MATRIX FOR TABLE 
CLUSTERING MEASURE 


Subject identification 
Subject 
identifi- Group one Group two 
cation ET A 4 
A | B | C X | M | Z 
Group one 
1 1 1 1 1 
B 1 1 1 1 
Cc 1 1 1 
Group two 
x 1 1 
y 1 
Z 


Nole. Incidence of any combination of two subjects sitting at 
the same table (scored 1), for the instance when all six subjects 
sit together at the same table. 


308 ALL 


TABLE 5 


EVIATION (IN DEGR 7 INTRAGROUP AND 
VisuaL ORT TION FROM THE 

3 xs oF Bota GROUPS AS A 
sROUP ORIENTATION 


isual | Intergroup visual 


Intragroup 


Intergroup orien- orientation orientation 
Alone 44.04 
Coaction 38.13 
Cooperation 74.04 
Competition 31.83 
Dice LES 


six subjects per matrix (and thus nine pos- 
sible nonadjacencies), if two subjects were 
adjacent, they were given a score of 9, and if 
nonadjacent, —6. This measure yields a score 
range from —90, if none of the six subjects 
sit adjacent, to a score of 0, if all six subjects 
sit adjacent. 

Analyses of the clusteri 
significant effects of the 
both for clustering at a given table, F(3, 12) 
= 9.13, p < .01, and for seating adjacencies 
around a given table, F(3,12) = 11.27, p 
<.01. The means for the table clustering 
measure were: coacting, 4.25; cooperating, 
12.75; competitive no feedback, 5.00; and 
competitive with feedback, 6.50. Using 
Newman-Keuls analysis, only the mean for 
cooperating was significantly different from 
the other means (p < .05). For the seating 
adjacency measure, the obtained means were: 
coacting, — 67.50; cooperating, —11.25; com- 
petitive no feedback, —56.25; and competi- 
tive with feedback, —48.75. Again, using 
Newman-Keuls analysis, only the mean for 
cooperating was significantly different from 
the others (p < .05). In addition, these two 
measures of seating patterning were relatively 
independent of one another, r(14) = .36, 


dj = < .20. 


ng measures showed 
dependent variables 


Visual Orientation 


The mean visual orientation data are sum- 


ized in Table 5. In light of the uncertain 
reliability of the data, a formal analysis was 
Je; however, a number 


not considered justifiab 

of interesting trends are suggested. In general, 
members of interacting groups (cooperating, 
competing no feedback, competing with feed- 
back) tended to orient more closely toward 


mar: 
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own-group locus than noninteracting group 
members (alone, coacting). This effect þe- 
comes somewhat more pronounced in the 
group competition case. Further, competing 
s under no-feedback and losing 


group member: ; 
conditions tended to orient markedly away 


from the other group, avoiding visual contact 
with them; while winning groups and co- 
operating groups faced more directly toward 
the other groups. 


Discussion 


The present study shows that intergroup 
orientation affects not only the bias group 
members’ display in favor of their own group, 
but determines seating pattern and distance 
as well. The proxemic data suggest that 
studies demonstrating own-group bias are not 
merely artifacts resulting from subjects rating 
both the ingroup and outgroup (Gerard & 
Hoyt, 1974). The proxemic results clearly 
support the cultural norm position. Seating 
choices within a group, as predicted by the 
cultural norm model, were dependent on 
whether one’s group interacted with the other 
group. In noninteracting conditions (alone, 
coacting) individuals sat randomly, with 
group spatial boundaries difficult to distin- 
guish; in the interacting conditions (cooper- 
ating, competing without feedback, competing 


win, competing lose) group members sat 


together at the same table with small dis- 
tances among them. The variances in distance 
in these latter conditions were small. 

The seating patterns between groups were 
also in line with the cultural expectancy 
model, which predicted that different inter- 
group orientations would produce different 
seating positions. Coacting groups sat ran- 
domly with regard to one another; cooper- 
ating groups sat together at the same table; 
competing groups without feedback sat as far 
as possible from each other; groups winning 
in competition approached the other group 
sitting at cooperative positions, while groups 
losing in competition sat in the competitive 
seats. The striking correspondence between 
the anticipated seating choices predicted from 
the normative model and those obtained (see 
Table 3) could not, we believe, have been 
obtained using another model. 

A simple reinforcement position would pre- 


ee 


EFFECTS OF INTERGROUP ORIENTATION ON Group ATTITUDES 309 


dict a positive relationship between own- 
group bias and intragroup seating distance 
and a negative relationship between own- 
group bias and intergroup seating distance 
(Lott & Lott, 1972; Mehrabian, 1968; 
Mehrabian & Diamond, 1971; Schneiderman 
& Ewens, 1971). That such relationships 
were not obtained does not imply that rein- 
forcement considerations are always absent. 
We would suspect that in situations in which 
(a) intergroup orientation is unclear, (b) fur- 
niture arrangements do not produce clear 
seating expectancies, or (c) cues for nor- 
mative expectations are minimal (e.g., free 
seating in an empty room), the liking for 
others more directly determines seating 
distances. ` 

In line with past research (Kahn & Ryen, 
1972; Rabbie & Horowitz, 1969), intergroup 
orientation also strongly affected own-group 
bias. Own-group bias was least for coacting 
groups and greatest for competing no-feed- 
back and competing win conditions. The bias 
in favor of one’s own group was significant 
for all but the coacting condition. Interest- 
ingly, the various orientations did not affect 
attitudes toward the other group, but rather 
attitudes toward own group, or cohesive- 
ness (Lott & Lott, 1965). As the relationship 
with the other group moved from indepen- 
dence with regard to own-group outcome 
(coaction), through a positive relationship 
(cooperation), to a negative relationship 
(competition), the cohesiveness of the own 
group increased while attitudes toward other 
group remained relatively constant. That is, 
the greater the involvement with the outgroup 
and its threat to the ingroup’s independent 
goal attainment, the more cohesive one’s 
group (cf., Coser, 1956; Sherif et al., 1961). 

The proxemic and attitude data, taken 
together, suggest that a cooperative orienta- 
tion is most conducive to intergroup harmony 
(cf. Sherif et al., 1961) since the group mem- 
bers have only a moderate preference for 
their own group, and they sit in a manner 
conducive to intergroup communication, Com- 
petition, on the other hand, produces not 
only greater evaluative differences between 
own and other group, but leads to seating 
patterns which make intergroup interaction 


difficult. 


Winning in competition produces approach 
behavior which might be interpreted as either 
a friendly gesture or an attempt to take the 
best, most prominent position. The own- 
group bias data suggest the latter interpreta- 
tion. Losing in competition leads to distant 
seating and low own-group evaluation, which 
suggests lowered confidence in the ingroup 
and a willingness to demure to the fringe. 

In summary, orientation appears to provide 
a group member with two distinct kinds of 
information: where one should sit with regard 
to own and other group and how one should 
feel about own and other group. However, 
these two types of information appear inde- 
pendent in that seating distance and own- 
group bias are unrelated. Seating patterns 
seem to be predicated on cultural norms 
which are apparently well learned, while the 
group attitudes are based on the extent to 
which the other group is involved and might 
interfere with one’s own-group goal. 

The fact that such differences in evalua- 
tion and proxemic behavior were found with 
only minimal interaction both within and 
between groups, and in a situation in which 
there was no tangible reward for the behav- 
iors of the groups, suggests that both own- 
group bias and group proxemic behavior are 
quite robust phenomena. The present study 
shows that intergroup orientation not only 
produces differential ingroup-outgroup evalu- 
ations, but also provides normative expecta- 
tions concerning proxemic behavior with re- 
spect to own group and other group. Thus, 
not only can orientation affect the develop- 
ment of intergroup attitudes, it can also in- 
fluence future intergroup relations by medi- 
ating the amount of intergroup contact and 
the quality of intergroup affect and communi- 
cation. If one holds relative negative senti- 
ments toward another group, and proxemic 
factors act as a barrier to meaningful per- 
sonal communication, then any trends toward 
intergroup antagonism would be exacerbated 
(Ehrlich, 1973). 
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Generality of Learned Helplessness in Man 


Donald S. Hiroto and Martin E. P. Seligman 
University of Pennsylvania 


Learned helplessness, the interference with instrumental responding following 
inescapable aversive events, has been found in animals and man. This study 
tested for the generality of the debilitation produced by uncontrollable events 
across tasks and motivational systems. Four experiments with college students 
were simultaneously conducted: (a) pretreatment with inescapable, escapable, 
or control aversive tone followed by shuttlebox escape testing; (b) pretreat- 
ment with insoluble, soluble, or control discrimination problems followed by 
anagram solution testing; (c) pretreatments with inescapable, escapable, or 
control aversive tone followed by anagram solution testing; (d) pretreatments 
with insoluble, soluble, or control discrimination problems followed by 
shuttlebox escape testing. Learned helplessness was found with all four ex- 
periments: Both insolubility and inescapability produced failure to escape and 
failure to solve anagrams. We suggest that inescapability and insolubility both 
engendered expectancies that responding is independent of reinforcement. 
The generality of this process suggests that learned helplessness may be an 


induced “trait.” 


Inescapable aversive events presented to 
animals or to men result in profound inter- 
ference with later instrumental learning (e.g., 
Hiroto, 1974; Overmier & Seligman, 1967; 
Seligman & Maier, 1967; Thornton & Jacobs, 
1971). If a subject can escape the aversive 
event, later instrumental behavior remains 
normal. This phenomenon has been inter- 
preted as learned helplessness (Maier, Selig- 
man, & Solomon, 1969; Seligman, Maier, & 
Solomon, 1971). This interpretation claims 
that organisms learn that responding and 
reinforcement (e.g., shock termination) are 
independent when shock is inescapable. Such 
learning undermines the motivation for initi- 
ating instrumental responses. 

Is learned helplessness a specific state 
which only impairs performance in situations 
similar to original training, or does it impair 
a broad range of behavior? It is possible that 
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specific environmental cues of training medi- 
ate the interference. In contrast, Hiroto 
(1974) and Miller and Seligman (1973) 
hypothesized that expectancy of indepen- 
dence is an internal state of the organism 
that is broadly transferred. 

If interference with learning occurs fol- 
lowing uncontrollable events from very dif- 
ferent response and stimulus modalities, 
learned helplessness would seem to be a 
stable and pervasive process. To put it loosely, 
is learned helplessness a state or trait? This 
question is of particular interest since learned 
helplessness has been postulated as under- 
lying human depression (Seligman, 1973, in 
press a; Seligman, Klein, & Miller, in press). 

Hiroto (1974) reported results which dem- 
onstrated parallel behaviors between animals 
and man in a learned helplessness paradigm. 
One group received aversive loud noise which 
it could escape by button pressing. A second 
group received inescapable noise, and a third 
group received no pretreatment. All groups 
then received controllable noise in a two-way 
shuttlebox. As with animals, the inescapable 
group tended to sit and take the noise with- 
out responding, while the escape and no- 
pretreatment groups escaped readily. A per- 
sonality measure, external control of rein- 
forcement (Rotter, 1966), as well as the in- 
structions of chance both produced passivity 
similar to the effects of inescapability. Hiroto 
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FIGURE 4. Group means, standard deviations, and significance levels of three 
dependent variables for the instrumental pretreatment-instrumental test for 
helplessness experiment: mean number of failures to escape. 


concluded that all three variables—inescapa- 
bility, externality, and chance instructions— 
engender a similar state, that is, the expec- 
tancy that reinforcement is independent of 
responding. 

The Hiroto study, like most others in the 
helplessness literature, tested its subjects in a 
situation similar, but not identical, to the one 
in which helplessness is trained: Both train- 
ing and testing used aversive loud noise and 
an instrumental problem. The study we report 
trains its subjects in either instrumental help- 
lessness with loud noise or insoluble cognitive 
problems and then tests them in both new 
instrumental or cognitive problems which are 
soluble. It should be pointed out that insolu- 
bility in a cognitive task is formally anal- 
ogous to inescapability, since in both the 
probability of reinforcement (correct or in- 
correct, or shock or no shock) is independent 


of responding. So we determine whether in- 
strumental helplessness or mastery transfer 
to cognitive task and whether cognitive help- 
lessness or mastery transfer to instrumental 
tasks. 

METHOD 
Overview 


The study consisted of four independent, but si- 
multaneously conducted, experiments. Two of the 
four experiments used an instrumental (Inst.) or 
cognitive (Cog.) pretreatment followed by an 
instrumental test for helplessness (Inst.-Inst., Cog.- 
Inst.). The other two experiments used an instru- 
mental or cognitive pretreatment followed by a cog- 
nitive test for helplessness (Inst.-Cog., Cog.-Cog.). 
Each experiment consisted of three groups which 
received escapable (soluble), inescapable (insoluble), 
or a control pretreatment prior to the test for help- 
lessness. Figure 1 presents the design for the Inst.- 
Inst. and Cog.-Inst. experiments, and Figure 2 pre- 
sents the design for the Inst.-Cog. and Cog.-Cog. 
experiments. z 
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Subjects 


Ninety-six undergraduate students, consisting of 
51 men and 45 women, at the University of Penn- 
sylvania participated in the study. They responded 
to advertisements for a study in noise pollution and 
were advised prior to their participation that they 
might be exposed to a “slightly unpleasant” tone. 
Two women refused to participate and were re- 
placed. None of the subjects had previously par- 
ticipated in an experiment using an aversive stimu- 
lus. Each subject was paid $2.00 at the end of the 
experiment and debriefed. 


Apparatus 


Instrumental pretreatment. The apparatus was a 
spring-loaded button in the center of a 12-in. (30 
cm.) circular base. Symmetrically on each side of the 
button were two 24-V de lights. In the escapable 
condition, pressing the button four times terminated 
the tone. In the inescapable condition the button 
had no effect on the tone. 
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Cognitive pretreatment. A series of four-dimen- 
sional stimulus patterns used in previous discrimi- 
nation learning studies (Levine, 1966, 1971) was 
the cognitive task. Each of the four dimensions had 
two associated values: (a) letter (A or T); (b) 
letter color (black or white); (c) letter size (large 
or small); (d) border surrounding letter (circle or 
square). In the soluble condition, one value of one 
of the dimensions, for example, square border, was 
always correct. In the insoluble condition, no value 
was consistently correct. Levine (1971) provides a 
detailed description of the patterns. Each pattern 
was on a 4 X 6-in. (10 X 16 cm.) “wire-index” card. 

Soluble instrumental test task. The apparatus was 
a modified Turner and Solomon (1962) human 
shuttlebox. Moving a knob from one side to the 
other escaped and avoided the tone. Specifications of 
the manipulandum have been described elsewhere 
(Hiroto, 1974). 

The aversive stimulus in both instrumental tasks 
was a 3,000 hertz tone emanating from an Eico 
audio generator (Model 377). The tone was pre- 
sented to the subject at 90 decibels through cali- 
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Ficure 6, Group means, standard deviations, and significance levels of three 
dependent variables for the cognitive pretreatment-instrumental test for help- 
lessness experiment: mean trials to criterion for escape acquisition. 


brated Grason-Stadler earphones (Model TDH-39), 
and as we see the tone was generally judged “mod- 
erately aversive.” 

Soluble cognitive test task. A series of 20 ana- 
grams taken from a list of five-letter anagrams 
(Tresselt & Mayzner, 1966) was used as the cogni- 
tive test task. The anagrams were placed indi- 
vidually on 4X 6-in. (10X 16 cm.) wire-index 
cards and composed of # in. (.64 cm.) letters spaced 
jis in. (.48 cm.) apart. Examples of the anagrams 
were: (a) I A RD T; (b) BI A T H; (c) 
ULATF; (d) ERLKC. The letter order 
for all anagrams was 3-4-2-5-1. This enabled us to 
look at both the time to solution of each anagram 
and trials to catching on to the pattern. 

Communication between subject and experimenter 
was conducted with a two-way intercom. Control- 
ling circuitry was located in an adjacent room sepa- 
rated by a one-way mirror. All response variables 
were measured by (1/100-sec.) times for the instru- 
mental tasks and a (1/100-min.) stopwatch for the 
cognitive tasks. 


Procedure 


The subjects were assigned into 1 of the 12 
possible groups generated by the four experiments 
each with the three pretreatment contingencies. The 
first study used an instrumental pretreatment (but- 
ton press) followed by a second instrumental task 
(shuttlebox) to test for helplessness. The second 
study used a cognitive task (Levine discrimination 
problems) for the pretreatment followed by the 
instrumental task. The third and fourth studies used 
an instrumental and cognitive pretreatment task, 
respectively, and followed by the cognitive test task. 
There were 24 subjects in each experiment. 

The three groups pretreated with the instrumental 
task were: (a) escapable (E) subjects who received 
45 trials of unsignaled escapable noise; (b) ines- 
capable yoked (E) subjects who received the iden- 
tical 45 trials of unsignaled inescapable loud noise; 
and (c) control (C) subjects who passively listened 
to the identical number and duration of tones as E 
and Ë subjects. The C group differed from E group 
in that C was instructed merely to sit and listen to 
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the tones, while E was instructed to try to terminate 
the tones (see “Instructions,” below). Both C and 
E received identical patterns of tones, since we 
deemed it important that the inescapable group 
differ from the control group not in habituation to 
tone, but only in perceived inescapability (Geer, 
Davison, & Gatchel, 1970; Glass, Singer, & Fried- 
man, 1969). The groups pretreated with the cogni- 
tive task were: (a) soluble (S) which received three 
soluble concept identification problems; (b) insolu- 
ble (S) which received three identical concept prob- 
lems but with no consistent correct values; (c) 
control (C) which inspected the three problems but 
like the instrumental control group received no 
instructions to solve the problems and no feedback. 

Instructions for instrumental pretreatment. The 
button press-shuttlebox and button Press-anagram 
groups were pretreated with the same instrumental 
button pressing task. Each subject was initially 
informed that the study involved listening to noise 
and was given the option of leaving after listening 
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to brief samples of the tone. The control groups 
were told only “From time to time a loud tone 
will come on for awhile. Please sit and listen to it.” 
The E and Ē groups were told “From time to time 
a loud tone will come on for awhile. When that 
tone comes on there is something you can do to 
stop it.” The function of the two lights on the 
apparatus was next described, but only to E and E 
groups, 


There are two lights located on this base. The 
lights will tell you how the noise on each trial 
was controlled. If you find the way to stop the 
noise then the green light marked ‘S-out’ will 
momentarily flash on after each time you stop 
the loud tone. If you don’t stop the tone then 
the red light marked ‘time out’ will flash when 
the tone stops. Remember, when the green light 
flashes on this means that you have stopped the 
tone. But if the red light flashes this means you 
did not stop the tone but that it stopped auto- 
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Ficure 7. Group means, standard deviations, and significance levels of three 
dependent variables for the cognitive pretreatment-instrumental test for hel 
lessness experiment: mean number of failures to escape. 2 
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Ficure 8. Group means, standard deviations, and significance levels of three 
dependent variables for the cognitive pretreatment-instrumental test for help- 
lessness experiment: mean response latency. 


matically. Taking the earphones off or disman- 
tling the apparatus is not the way to stop the 
noise. 


The pretreatments consisted of 45 unsignaled 
trials with the 90-decibel tone. If a subject in the E 
group failed to terminate the tone it lasted for 5 
secs. The onset of the red (failure) light was corre- 
lated with the end of the 5-sec. interval. The onset 
of the green (success) light was correlated with 
subject’s termination of the tone. The intertrial 
interval (J7I) ranged from 10 to 25 secs. with a 
14-sec. mean ITI. Triads of subjects in all groups 
received identical, yoked durations of tone. At the 
conclusion of the pretreatments subjects rated the 
aversiveness of the tone. 

Instructions for cognitive pretreatment. The Le- 
vine discrimination-anagram and Levine discrimina- 
tion-shuttlebox groups received the following in- 


structions to introduce the cognitive task: 


you will be Jooking at cards 


In this experiment n 
d has two stimulus pat- 


like this one. Each car 


terns on it. The sample patterns are composed of 
five different dimensions and two values associ- 
ated with each dimension. The dimensions and 
their values are [expérimenter described each 
dimension and value]. Each stimulus pattern has 
one value from each of the five dimensions. 


Here the C instructions ceased and the rest of the 
instructions were given to S and § groups. 


I have arbitrarily chosen one of the ten values as 
being correct. For each card I want you to choose 
which side contains this value, and I will then 
tell you if your choice was correct or incorrect. 
In a few trials you can learn what the correct 
value is by this feedback. The object for you is 
to figure out what the answer is so you can 
choose correctly as often as possible. 


Five sample trials of a single five-dimension prob- 
lem was first presented. This clarified the task of 
finding the “correct” value. The C group was merely 
shown the sample trials without any clarifying in- 
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structions. All subjects were asked if there were any 
questions. No subject asked if the experimental 
problems were soluble or insoluble. 

The experimental stimulus patterns were composed 
of four dimensions. Three different problems were 
Presented in blocks of 10 trials each. At the end of 
each 10-trial problem S and § groups were asked for 
the correct answer. The criterion for acquisition was 
subject identifying the correct value after each 10- 
trial block. Six subjects were discarded for not 
meeting criterion on all of the three problems and 
were subsequently replaced. The § group received 
a predetermined schedule of “correct” and “incor- 
rect” regardless of what value was guessed. In this 
manner reinforcements were independent and not 
contingent on the § groups responding. The schedule 
of reinforcements were: (a) C-I-I-C-C-I-I-C-C-I 
for the first problem; (b) I-C-I-C-C-I-C-L-C-I for 
the second; and (c) I-C-I-C-1-C C-I for the 
last problem, In addition, the S group was told 
“that’s the wrong answer” when subject tried to 
guess the correct value after each problem. 
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Instructions to the S and § groups were “We are 
now starting a new problem. You do not know at 
this point if I have chosen a different value for this 
problem. I will continue telling you if you are 
correct or incorrect.” 

The C group was instructed to “Please continue 
studying each stimulus pattern carefully and turn 
each card when I ask you to.” 

The S and § groups were allowed 10 secs, to make 
a decision before the experimenter warned them that 
a decision must be made within 5 secs. In other 
words, a trial could never be longer than 15 secs., but 
no subject took longer than 10 secs. on any trial. 

Instructions for instrumental test trials. The in- 
strumental test trials were conducted at a different 
location but within the same experimental room as 
the pretreatments. The manipulandum was covered 
until subject received the following instructions: 


You will be given some trials in which a rela- 


tively loud tone will be presented to you. When- 
ever you hear the tone come on there is some- 
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Ficure 10. Group means, standard deviations, and significance levels of three 
dependent variables for the instrumental pretreatment-cognitive test for help- 
lessness experiment; mean number of failures to solve. 
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The sliding knob was always located at the mid- 


thing you can do to stop it. Taking the earphones 
off or dismantling the apparatus is not the way to 
stop the noise. I'll answer all questions and pay 
you for your time at the completion of the study. 
Uncover the apparatus and we'll begin. 


point of the manipulandum such that the subject 
could slide the knob with equal ease to either the 
left or right end of the box. A 24-V de warning light 
at the midpoint of the manipulandum cover was on 
for 5 secs. before and terminated when the 5-sec. 
tone began. The test phase consisted of 20 signaled 
10-sec. trials with the 777 ranging from 10 to 45 
secs. and a mean ITZ of 21 secs. 
The appropriate response was moving the knob 
} to one side of the manipulation to throw the micro- 
switch controlling the stimulus light or noise. On 
the next trial, moving to the opposite side escaped 
or avoided the noise. The instructions specified 
l escape contingencies only, but an avoidance response 
was possible by sliding the knob to the appropriate 


side before the noise began, A response latency 
under 5 secs. terminated the warning light and 
avoided the tone. If the subject d'd not terminate 
the light (latency less than 5 .) or escape the 
tone (between 5 to 9.99 secs.) a latency of 10 secs. 
was given for that trial. At the completion of the 
test phase subjects rated the unpleasantness of the 
3,000 hertz tone and completed questionnaires relat- 
ing to the pretreatment and test tri 
Instructions for cognitive test trials. The cogniti 
test was located in the same location as the instru- 
mental test task. The following instructions intro- 
duced the trials: 


You will be asked to solve some anagrams. As 
you know anagrams are words with the letters 
scrambled. The problem for you is to unscramble 
the letters so they form a word, When you've 
found the word tell me what it is over the inter- 
com system. Now [subject’s name], there could be 
a pattern or principle by which to solve the ana- 
grams. But that’s up to you to figure out. T 
can’t answer any questions now. After the experi- 
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ment is over I'll answer all questions and pay 
you for your time. 


All 20 anagrams were soluble and had the same 
letter sequence. The anagrams could be solved indi- 
vidually; but the easiest method was to learn to use 
the letter sequence. The anagrams were selected such 
that only one word could be arranged with each 
anagram. But there were two instances in which 
subjects found additional words. The anagram 
B L O EN (“noble”) was rearranged as Nobel” 
by three subjects, but experimenter disqualified this 
answer since it was a proper name. On these occa- 
sions subject was asked to try again, and each sub- 
ject eventually found the acceptable word. The 
anagram UNATJ (“jaunt”) was seen as “junta” 
by three different subjects and scored as acceptable 
answers (experimenter was tempted to award these 
subjects a medal). In cases in which subject gave a 

nonsense word experimenter replied, “that’s not a 

word, please try again.” 

Three dependent variables were analyzed on the 
instrumental and cognitive test tasks. The instru- 
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mental measures were: (a) trials to criterion for 
escape acquisition, defined as subject completing 
three consecutive escape responses; (b) number of 
failures to escape, defined as the number of trials 
with latencies of 10 secs.; and (c) the mean latency 
for the 20 trials. Three analogous measures were 
analyzed for the cognitive test task: (a) trials to 
criterion for anagram solution was defined as sub- 
ject solving three consecutive anagrams in less than 
15 secs. each. (Reaching this criterion meant that 
subject recognized the principle of fixed-letter se- 
quence, The definition, although arbitrary, was 
highly reliable. When subject “caught on” to the 
anagram construction, latencies dropped dramatically 
from an average of 45 secs. to well below 15 secs.) ; 
(b) number of failures to solve, defined as the 
number of trials with latencies of 100 secs., the point 
at which the trial ended; (c) mean response latency 
for the 20 anagrams. The last two measures for the 
instrumental and cognitive tests parallel the indices 
reported in the human and animal learned helpless- 
ness literature. 
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lessness experiment: mean response latency, 


es 


A 


GENERALITY OF LEARNED HELPLESSNESS IN MAN 


20 


16 


14 


10 


MEAN TRIALS TO CRITERION 


sD. 6.30 


So.us.e 


321 


S.D. 6.74 


Contra. 


eae L_., ai 
P>.05 


Ficure 12. Group means, standard deviations, and significance levels of three 
dependent variables for the cognitive pretreatment-cognitive test for helplessness 
experiment; mean trials to criterion for anagram solution. 


RESULTS 


Overall, interference was produced by in- 
escapability and insolubility in three of the 
four experiments: Inst.-Inst.;  Inst.-Cog., 
Cog.-Inst. The Cog.-Cog. experiment did not 
show significant effects although the results 
were in the nredicted direction. 

P 
Pretreatment Trials 


All subjects who received the escapable or 
soluble pretreatments performed appropri- 
ately to the relevant contingencies during 
pretreatment. No significant differences in the 
number of trials to escape the tone by button 
pressing were found between the button-press 
escapable group followed by the instrumental 
task and the button-press escapable group 
followed by the cognitive test. Likewise, no 


differences appeared in solution of the Levine 
discrimination pretreatment between the solu- 
ble group followed by the cognitive test and 
the soluble group followed by the instru- 
mental test (all ps > .05). All subjects in the 
escapable and soluble pretreatments reached 
the criteria of learning in pretreatment. 

A check on the pretreatment manipulation 
was conducted to assess the effectiveness of 
the inescapability and insolubility manipula- 
tions. The subjects in the helplessness-induc- 
tion groups (i.e., E and 5) believed they had 
no control over solution. Two items from a 
questionnaire relating to controllability in the 
pretreatment provided the relevant informa- 
tion. The items were worded to determine if 
subject attributed uncontrollability to himself 
or to the task. Question 1 asked if subject 
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believed “you couldn’t solve the problem” 
while the second asked if subject believed the 
“problem was unsoluble—that it couldn’t be 
solved.” Answers were recorded on a 7-point 
scale with higher scores denoting greater in- 
solubility, All E and 5 groups believed they 
lacked the ability to solve their respective 
tasks relative to E and S groups. Subjects in 
the E groups had a mean rating of 4.4 on 
Question 1 and the S groups a rating of 4.9. 
This compares with a rating of 2.25 for the 
E groups. The difference between the pooled 
E and § groups versus pooled E and S groups 
was significant, F (1,62) = 43.54, p < .001. 
In addition, the E and S groups believed the 
problem was unsolvable. The mean rating for 
E groups on Question 2 was 4.63 while the 
mean rating for S groups was 4.31; the E 
groups had a rating of 2.37 on the same ques- 
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tion and S groups rated the question at 1.56. 


The difference between the pooled E and 5 
versus pooled E and S groups was significant, 
F (1, 62) = 5.13, p < .001. 


Helplessness Test Trials + 


Inst.-Inst. The group pretreated with ines- 
capable tone in button pressing escaped sig- 
nificantly more poorly in the shuttlebox than 
the escapable and control pretreated groups 
on all measures. Figures 3, 4, and 5 present 
these results and significance levels for trials 


1Since it was predicted that groups pretreated 
with inescapability (insolubility) would demon- 
strate an interference to learning relative to groups 
without such pretreatments, the following statistical 
tests were one-tailed: E versus Ë, E versus C, S 
versus 5, and § versus C; tests between escapable 
(soluble) and control pretreated groups were two- 
tailed: E versus C and S versus C. 
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Ficure 13. Group means, standard deviations, and significance levels of three 
dependent variables for the cognitive pretreatment-cognitive test for helplessness 
experiment: mean number of failures to solve. 
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Ficure 14. Group means, standard deviations, and significance levels of three 
dependent variables for the cognitive pretreatment-cognitive test for helplessness 


experiment: mean response latency. 


to escape criterion, number of failures to es- 
cape, and mean latency. 

Cog.-Inst. The group pretreated with in- 
soluble discrimination problems did signifi- 
cantly worse on shuttlebox escape than the 
soluble and control pretreated groups in the 
shuttlebox. Figures 6, 7, and 8 present these 
results and significance levels for trials to 
escape criterion, number of failures to escape, 
and mean latency. 

Inst.-Cog. The group pretreated with ines- 
capable tone in button pressing generally was 
worse at solving anagrams than the soluble 
or control pretreated groups. Figures 9, 10, 
and 11 present these results and significance 
levels for trials to criterion for anagram solu- 
tion, number of failures to solve, and mean 
latency. ~ 

Cog.-Cog. No significant differences were 
found on anagram performance following in- 


` 


soluble, soluble, or control discrimination 
problems. Figures 12, 13, and 14 present the 
trials to criterion for anagram solution, num- 
ber of failures to solve, mean latency, and 
significance levels for anagram solution. 
Questionnaire scores. The subjects pre- 
treated with inescapable tone were yoked to 
subjects pretreated with escapable tone such 
that density, frequency, and duration of tone 
were equated. It is possible that differential 
subjective aversiveness of tone have produced 
the subsequent retarded performance. How- 
ever, all subjects rated the pretreatment tone 
as moderately unpleasant on a 7-point scale. 
The pretreated E groups rated the tone at 
3.91, while E groups rated the stimulus at 
3.70. The C groups who passively listened to 
the 3,000 hertz tone rated the same stimulus 
somewhat lower at 3.14. These differences 
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were not significant, F (2,45) = 1.02, p> 
05. 

Additional post hoc comparisons on the 
questionnaires were conducted. There were no 
differences in “amount of trying” between 
pretreated groups in either the pretreatment 
or test trials (p > .10), nor did they change 
in amount of trying from pretreatment to 
test trials (p > .10). All subjects were asked 
if they felt “frustrated” during any part of 
the experiment. The E and S groups rated the 
pretreatment as more frustrating than the 
pooled E and S groups (45 vs. 2.2, respec- 
tively), F (1,62) = 42.3, P< 001. Within 
the helplessness-induction groups two S pre- 
treated groups indicated greater frustration 

than the two E pretreated groups (5.3 vs. 3.6, 

respectively), F (1,62) = 6.96, p < .018. 

Since only the cognitive pretreated group 
tested with anagrams failed to show helpless- 
ness, we repeated the Cog.-Cog. procedure 


Donatp S. Hiroto AND MARTIN E. P. SELIGMAN 


problems rather than three. The subjects, ap-\ 
paratus, and procedures were identical to the 

Cog.-Cog. experiment reported above, except 

that the S group (n = 8) received four 10- 

trial block insoluble problems, the S group 

(n = 8) received four soluble problems, and 

the C group ( = 8) looked passively at four 

10-trial block problems without attempting 

solution. 

When four insoluble problems were used 
interference with anagram solution was found. 
Figures 15, 16, and 17 present the trials to 2 
solution criterion, number of failures to solve, 
and mean latency to solution as well as sig- 
nificance levels for anagram performance of 
the three pretreated groups. 


Discussion 


There were three main findings of the 
experiment: (a) a group pretreated with an 
inescapable aversive tone showed greatly de- 
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tlebox; a control group and an escapable pre- 
treated group performed well in shuttlebox 
escape. This replicates the findings of Hiroto 
(1974) and again demonstrates a learned 
helplessness phenomenon in man, directly 
parallel to learned helplessness in dogs, cats, 
and rats (cf. Seligman, in pressb, for a 
` cross-species review). (b) A group pretreated 
with. four insoluble discrimination problems 
` was. ‘debilitated at solving later anagrams 


groups. This demonstrates that learned help- 
lessness can be produced within cognitive 
tasks, without aversive unconditioned stimuli 
or instrumental components. (c) Cross-modal 
helplessness was also found. A group pre- 
” treated with insoluble cognitive problems was 
debilitated at instrumental escape. Interest- 
ingly, this group was just as debilitated as 
the group pretreated with instrumental ines- 
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Ficure 16. Group means, standard deviations, and significance levels of three 
dependent variables for the revised cognitive pretreatment-cognitive test for 
helplessness experiment: mean number of failures to solve. 


capability. In addition, a group pretreated 
with inescapable tone was debilitated at ana- 
gram solution to the same extent as the group 
pretreated with four insoluble discrimination 
problems, 

The finding of cross-modal helplessness is 
of considerable theoretical interest. Initially, 
critics of the animal helplessness findings 
argued that failure to escape in a shuttlebox 
following inescapable shock might have re- 
sulted from a competing motor response, 
rather than a more general “organismic’” de- 
bilitation (e.g., Miller & Weiss, 1969). Maier 
(1970) and Seligman and Maier (1967) dis- 
confirmed this peripheral interpretation by 
using very different training and testing 
situations. These situations, however, were 
similar at least in the fact that pretreatment 
and testing both involved the same uncondi- 
tioned stimulus-shock-and instrumental re- 
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dependent 


sponding. Our present findings provide the 
coup de grace against any peripheralist in- 
terpretation. No competing motor response 
could generate cognitive interference from 
instrumental inescapability or instrumental 
interference from cognitive insolubility. 
Alternatively, neither frustration, differen- 
tial aversion of tone, nor demand character- 
istics (Orne, 1962) seem able to account for 
these differences. Frustration was experienced 
to a greater extent in the inescapable and 
insoluble pretreatments than in the soluble, 
escapable, or control pretreatment groups. 
Frustration is usually interpreted as energiz- 
ing, rather than deenergizing, motivation 
(eg, Amsel & Roussel, 1952). One can 
choose to interpret frustration post hoc as 
rigidifying rather than energizing responses, 
but then the explanation becomes difficult to 
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differentiate from learned helplessness, which 
is not post hoc (e.g., Schmeck & Clements, 
1971; Schmeck & Ribich, 1969). Since the 
groups all reported similar levels of aversive- 
ness to the tone, this variable does not ex- 
plain the findings. Finally, during our de- 
briefing no subject gave evidence of having 


detected the purpose of the study or what 
” 


was predicted of him. 

One limitation on the generality of these 
effects should be mentioned. The subjects 
clearly perceived both tasks, as different as 
they are, as part of the same experiment. We 
do not know whether any learned helpless- 
ness was carried out of the laboratory. Fu- 
ture research with unobtrusive tests should 
determine the extraexperimental generality of 
our effects. 


What is the process of learned helplessness, 


TIENT A 


then? If- the interference produced was both 
short-lived and highly specific to the condi- 
= tions of training, we might call it a transient 
} and peripheral “state.” Since it is general 
across motivations and tasks, however, we 
suggest; that the process induced by uncon- 


\_rate the expectancy that responding and rein- 
forcement are independent generalizes widely. 
` Seligman (1973, in press a) and Seligman 
et al. (in press) proposed that learned help- 
lessness is a model for depression in man. 
This speculation entails the consequence that 
our procedure for producing debilitation 
should also produce those symptoms associ- 
ated with mild depression: self devaluation, 
psychomotor retardation, feelings of sadness, 
êc. Future, research should answer these 
questions. 
In conclusion, we have produced learned 
. help'*ssness in humans. The process engen- 
dered” debilitate’ performance well beyond 
the condition under which helplessness is first 
trained. We suggest therefore that learned 
f helplessness may involve a trait-like system 
of expectancies that responding is futile. , 
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This laboratory investigation using 6 
the role of three disinhibiting 
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What kinds of forces can lead crowd mem- 
bers to behave in uncivil, brutish, and inhu- 
mane ways toward other people? Although 
some crowd members may be persons with 
antisocial histories attracted by the excite- 
ment of the crowd (Akers & Fox, 1944; 
Brown, 1954), certainly many other crowd 
members are normally law-abiding citizens 
who seem to fall prey to crowd “contagion” 
(LeBon, 1903). Zimbardo (1969) has taken 
the main thrust of LeBon’s argument—the 
idea that men are transformed into mindless 
brutes when immersed in a crowd—and recast 
the approach in scientifically testable terms. 
While LeBon’s idea of group mind is a rather 
nebulous concept, Zimbardo’s theory of de- 
individuation quite clearly delineates “input,” 
“output,” and mediating variables in a test- 
able form. 

Zimbardo (1969) specified several input 
variables such as group presence, anonymity, 
and altered responsibility which, according to 
the theory, lead to deindividuation. “Output” 
resulting from the deindividuated state is 
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4 college students as sub: s 
variables in producing both physical aggression 
dividuation. Altered 


aggression. No interaction pro 
responsibility and varying cognitions sup- 
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d Deindividuation 
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jects assessed 


responsibility, cognitive set, 
and all three variables 


factorial design, 
duced significant 


ation which relates certain input varia- 
ionnaire data indicated that the increase 
internal mediational factors such as 
disinhibiting forces may produce in- 
necessarily producing a deindividuated 


wild, impulsive, and irrational behavior that 
tends not to be under discriminative cue con- 
trol. Inferred subjective changes which medi- 
ate between input and output may be such 
internal events as loss of self-awareness, re- 
duced concern for social evaluation, narrowed 
focus of attention, and diminished awareness 
of the past and future. Zimbardo hypothesized 
a series of relationships between these varia- 
bles, and the theory has in the past few 
years produced an impressive amount of re- 
search. For example, several recent studies 
have investigated the effect of anonymity on 
aggression (Diener, Westford, Dineen, & 
Fraser, 1973; Zabrack & Miller, 1972; Zim- 
bardo, 1969). The impact of group presence 
has also been assessed by several investigators 
(Diener, Westford, Diener, & Beaman, 1973; 
Fraser, Kelem, Diener, & Beaman, in press; 
Paloutzian, 1972). Festinger, Pepitone, and 
Newcomb (1952) studied the correlation be- 
tween disinhibited behavior in a group and 
the degree to which group members paid 
attention to one another as individuals. Some 
investigators have considered deindividuation 
to be exclusively a group phenomenon (Can- 
navale, Scarr, & Pepitone, 1970), but Zim- 
bardo (see 1969, p. 256) considered it to be 
an intraindividual process susceptible to group 
Hoa as well as other antecedent varia- 

es. 


DEDIVIDUATED BEHAVIOR 


Thus far, there has been insufficient re- 
search directed at investigating a number of 
other input variables which should, according 
to ‘the theory, lead to deindividuated behav- 
ior, nor has the link between these unexplored 
variables and an internal state of deindividu- 
atiðn been established. The present study was 
directed at answering two related questions: 
(a) What influence do several deindividuat- 
ing input variables, previously neglected, by 


` a researchers, have on behavior? (b) Is the ag- 


gressive “behavior released by these input 
variables accompanied by an internal state of 
deindividuation? Two “input variables,” al- 
tered responsibility and cognitive set, were 
manipulated, and their effect on physical ag- 
“gression and the deindividuated state was 
assessed. In addition, the effect of aggressive 
modeling on physical aggression and, more 
importantly, on the deindividuated internal 
state was measured. 

Zimbardo (1969) hypothesized that dif- 
fusion of responsibility among crowd mem- 
bers would act to disinhibit antisocial behav- 
ior, Tilker (1970) found that when subjects 
received feedback that the victim was suf- 
fering in a Milgram (1963) shock experiment 
paradigm, they were more likely to intervene 
in an attempt to stop the affair if they had 
been designated by the experimenter as re- 
sponsible for the proceedings. In the present 
study, responsibility was also manipulated in 
order to assess the effects of this variable on 
aggressive behavior, with the prediction that 
the less an individual feels that he is responsi- 
ble for his behavior, the more aggressive he 
will be. In the Tilker experiment, subjects 
observed aggression and were made to feel 
responsible or not responsible for aggression 
they were observing, whereas in the present 
study subjects were made to feel responsible 
or not responsible for aggression they them- 
selves could enact. Thus, a more direct form 
of responsibility, that for one’s own actions, 
was manipulated in the present study. 

The second variable manipulated in this 
study was cognitive set, a factor not explic- 
itly mentioned by Zimbardo (1969) but one 
which seems to hold promise in a viable 
theory of deindividuation and crowd behavior. 
Zimbardo stated, “Cognitive-verbal-intellec- 
* tual activities are anathema to the spontane- 
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ous behavioral releases we are talking about” 
(p. 258). It would seem that initial cognitive 
activity which justified disinhibited behavior 
would free the participant from further eval- 
uative cognitions. If crowd members can 
place their aggressive and destructive acts 
into a cognitive structure in which aggres- 
sive acts seem moral and justified, then 
the resulting behavior may be antisocial. 
For example, members of lynch mobs may 
view the murder they commit as “justice” 
and therefore not perceive their act as aggres- 
sive at all. Similarly, looters in a riot may 
perceive their own behavior not as immoral 
stealing but rather as “getting back our due 
from gougers.” It seems probable that once a 
person justifies his behavior, critical self- 
observation-evaluation of that behavior will 
diminish. 

Geen and Stonner (1973) have shown that 
the interpretations a subject places on ob- 
served violence influence his subsequent level 
of aggressiveness. While the Geen and Ston- 
ner endings related to subjects’ interpreta- 
tions of viewed violence, the present study 
explored how the subject’s cognitions about 
the type of behavior he was actually to per- 
form affected his aggressive behavior. As 
Bandura (1973) has noted, a variety of cri- 
teria are used in judging acts as “aggressive,” 
and he speculated that such judgments influ- 
ence behavior. As yet, there has been no ex- 
perimental evidence bearing on this point. 

The third variable manipulated in the 
present study was aggressive modeling shown 
in a movie. Modeling has been extensively 
studied, and a relationship to aggressive be- 
havior has been established (Bandura, 1973). 
This variable was chosen because it has been 
so thoroughly researched and shown to fre- 
quently disinhibit aggression. Assessment of 
the behavioral effect of aggressive modeling 
was completed, but this was of secondary 
importance to the question whether or not an 
increase in aggression produced by viewing 
an aggressive model is accompanied by dein- 
dividuation. A basic question is the follow- 
ing: Does a state of deindividuation generally 
occur along with disinhibited antisocial be- 
havior, or does the former phenomenon occur 
only in more limited circumstances? 

While some evidence has been accumulated 
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on the relationship of input variables to overt 
behavior, very little research has been done 
on the internal state of deindividuation. Zim- 
bardo (1969) uncovered evidence that anony- 
mous college coeds in a group were less at- 
tentive to the personality (nasty vs. nice) of 
their “shock” victim than were nonanony- 
mous girls, The data, as the deindividuation 
model predicted, showed that anonymous 
girls’ aggressive behavior was not affected by 
a relevant discriminative cue. This is the only 
evidence known to the investigators concern- 
ing the internal deindividuated state. None 
of Zimbardo’s other “input variables” have 
been related to a deindividuated state, nor 
riables such as modeling been 
individuation. The model is es- 
i device (see Zimbardo, 


have other v 


related to de 


sentially 

1969, such leaves open the 
questior whic variables or combi- 
nations o m wil irily be accompa- 
nied by i nportant step in 
the deve ent í of deindividu- 
ation i t the relation- 
ship bet ivior and 
deindivid ); tate that 
antisocial bel panied 
by a de ardo, 
1969, p. 2 1av- 
iors such as the overt € ler 
emotions which are normally inhibited, €s- 
pecially in males in our society, would be 


ions of deindividuation. 
It is important to establish empirically 
whether the antisocial behavior that often 
follows from disinhibiting variables such as 


modeling is usually accompanied by deindi- 
viduation. 


In summary, the present experiment was 
designed to assess the effects of altered re- 
sponsibility, cognitive set, and aggressive 
modeling upon subsequent aggressive behav- 
ior. It was predicted that all three manipula- 
tions would lead to increased aggression. A 
further purpose of this study was to assess 
the degree to which both input variables and 
aggressive behavior were accompanied by a 
deindividuated internal state characterized by 
diminished self-awareness, lessened attention 
to the environment, and depersonalization of 


the victim. 
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METHOD 

Overview 

The study was a 2X2 X2 factorial design with 
two levels in the cognitive set condition (game vs. 
aggressive), two levels in the model condition (ag- 
gressive VS. nonaggressive movie models), and two 
levels of responsibility (responsible vs. not responsi- 
ble). The dependent measure of physical aggression 
was a composite score of aggressive behavior dis- 
played by the subject toward a passive confederate. 
The deindividuated state was assessed on a ques- 
tionnaire following the aggression session. 


Subjects 


Subjects were 64 University of Washington male 
introductory psychology students who earned bonus 
points for participation. Groups of subjects were 
randomly assigned to the cell conditions, with 8 
subjects in each group. 


Procedure 


Subjects entered the experiment as a group, and 
the complete group was instructed by the male 
experimenter, who described the role-playing task. 
Participants were told that in a “dimly lit room” 
nearby, they would find a role player sitting on the 
floor and that while they were in that room, they 
could do various things to him, Subjects were 
instructed that upon entering the room they would 
find a table of materials available to them. The 
materials were as follows: three buckets of news- 
paper balls (for throwing), a large bowl of rubber 
bands (for shooting or snapping), sponge rubber 
bricks (for hitting or throwing), a plastic gun’ for 
shooting Ping-Pong balls, and three Styrofoam 
swords (for hitting). Subjects were not told how 
long they would be in the experimental room (2 
minutes) so that the period would not be perceived 
as strictly time bound. Subjects took turns going into 
the experimental room alone after the instructions 
and movie presentation (which is described sub- 
sequently). 

The actual task the participants were to perform 
was never clearly defined, which left subjects free 
to do what they wished. The experimenter stated 
that one important purpose of the study was to 
discover subjects’ reactions and feelings after the 
activity session. Subjects were also told that, as in 
all psychology experiments, they were free to leave 
at any time and did not have to do anything they 
did not want to do in the experiment. 

Observers behind a two-way mirror recorded the 
actions of the participants. Each time an act was 
performed such as throwing a ball, the behavior was 
recorded on a rating form. The total number of 
times each act was performed was multiplied by a 
constant scaled aggressiveness score for that act. 
and the sums of these products were the subjects? 
total aggression score. The constant scaled aggres- 
siveness score for an activity was the average value 
a a 10-point scale given by five raters who had 
een familiarized with the activities. The scale 
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ranged from 0 (not at all aggressive) to 10 (ex- 
tremely aggressive), The constant scaled aggressive- 
ness scores are shown in Table 1. Soft and kard 
were defined behaviorally in rating practice sessions. 
An experimenter demonstrated hits of varying mag- 
nitudes and classified them. After this practice, pilot 
subjects were run and the agreement of ratings were 
high. The raters and role player were unaware of 
the subjects’ conditions. 

Reliability data was obtained by having two raters 
record the activities of six different subjects across 
conditions, The total scores of the subjects as re- 
corded by each of the two raters were then corre- 
lated to arrive at an estimate of reliability of the 
scoring technique. A Pearson product-moment cor- 
relation of .86 was obtained between the two sets 
of scores, which indicated that the aggressive activ- 
ity of the type used in this study can be reliably 
measured, 

Model. An aggressive or nonaggressive movie of 
a “person who had been in a similar role-playing 
situation in the past” was shown to the group of 
subjects during the instructions. It was empha- 
sized that the movie was meant to give them an 
idea of the situation, the instruments, and what the 
room looked like but was not meant to imply that 
they should do what the other role player did. In 
order to avoid possible demand characteristics in 
the modeling manipulation, the experimenter stressed 
that the movie role player’s actions were not neces- 
sarily appropriate or inappropriate but simply what 
one person chose to do in the experiment, In one 
condition the movie model was very aggressive, 
hitting the other role player repeatedly with the 
swords and rubber bricks, dragging him around the 
room, and so on. In the other condition, the movie 
model was nonaggressive, never hitting the other 
role player but rather talking to him and pacing 
around the room. 

Cognitions. Two different types of cognitive set 
were induced by the instructions. In the “game” 
instructions for cognitive set, subjects were told to 
think of the situation as a game. The object of the 
game, they were told, was to do as many things as 
possible, as quickly as possible, in the short timed 
period. The instructions continued, “It’s like a water 
‘fight where everybody splashes as much water as 
possible but nobody gets hurt, everybody partici- 
pating because they want to and because it’s fun.” 
The rationale given for this game was that the ex- 
perimenters wanted to record the activity and then 
assess the subjects’ feelings and thoughts after this 
fast physical activity. 

In the “aggressive” cognitive set condition, sub- 
jects were told that the situation was a test of 
aggression to be thought of much like the violence 
in the world except that nobody could be physically 
hurt. They were told that they were to do as much as 
possible, as quickly as possible, in the short timed 
period. The instructions continued, “Think of the 
situation like a person in the army who shoots 
others or stabs them in the chest or clubs them to 
death, except that here you are just role-playing 
the situation, and nobody can get hurt.” Subjects 
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TABLE 1 
Constant SCALED AGGRESSION SCORES 
Category Score 

Hitting swords 

Hitting light 3.3 

Hitting hard 6.9 

Jabbing 5.5 
Throwing 

Taped newspaper balls 4.6 

Sponge bricks 30 
Rubber bands 

Shooting 5.0 

Snapping TS 
Manhandling 

Dragging around room 7.6 

Picking up by underarms 7.0 

Shoving 7.5 

Kicking lightly hed 
Miscellaneous 

Shooting gun at pacifist* 5.6 

Dumping bucket of balls on pacifist 5.6 

Other 


a Tallied once per segment. 
b Unique acts were scored by five raters on an individual basis. 


were reminded to be fast, since the experimenters 
were essentially interested in the subjects’ feelings 
and thoughts after fast physical activity. 

It was predicted that those in the game condition 
would be significantly more aggressive than those in 
the aggressive cognitive set group. Those viewing 
the activity as a game could become active with- 
out questioning the justifiability of their actions, 
since games in our culture are generally considered 
to be “wholesome” activities. On the other hand, 
those in the aggressive cognitive set group were 
asked to behave “aggressively” toward someone who 
had done nothing to them, behavior usually consid- 
ered immoral in our culture. These subjects, then, 
could not as easily justify aggressive behavior, and 
therefore it was predicted that they would be rela- 
tively nonaggressive. In both conditions the experi- 
menter described the activities in a nonemotional 
tone. It was emphasized that the experimenters were 
not really interested in personally evaluating the 
subjects. The order of presentation of the cognitive 
set instructions and the modeling movie was counter- 
balanced for different groups. 

Responsibility. In the responsible condition, sub- 
jects were told that even though the study was a 
scientific analysis of behavior where someone else 
had assigned them a role, they, as free individuals, 
were still responsible for their own actions at all 
times. In the nonresponsible conditions, subjects 
were told that the experimenter would, of course, 
take full responsibility for the subject’s actions. The 
responsibility manipulation came at the end of the 
instruction period. 

Questionnaire. At the end of the role-playing ses- 
sion, subjects were asked to complete a question- 
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naire. The form asked questions about the subjects’ 
feelings toward their “victim” (to assess dehumani- 
zation of the other role player), about the length of 
time of the session (to assess time distortion), and 
about the frequency of their own aggressive behav- 
iors (to assess self-observation-evaluation). There 
were eight bipolar adjective rating scales on which 
the subject was to rate his victim (e.g., strong— 
weak). To the extent that subjects dehumanized 
their victim, lower ratings should have been given. 
An additional item asked how similar the victim 
was to the subject, also a possible reflection of 
dehumanization. The measure of time distortion was 
the difference between the subject's estimate of the 
length of time of the aggression session and the 
actual time period (120 sec). 

The questionnaire also assessed the frequency of 
certain thoughts while subjects were in the session 
on bipolar rating scales ranging from thought about 
very frequently to never thought about. These 18 
items assessed the degree to which subjects thought 
about their own individuality, about the possibility 
that their victim might get hurt, about guilt feel- 
ings, and about their own past and future. Under a 
state of deindividuation, one would expect dimin- 
ished frequency of thoughts in each of these cate- 
gories. Subjects answered all questions on a 7-point 
bipolar scale. Another set of items assessed subjects’ 
recall of various cues in the aggression room, some 
of peripheral importance (e.g., “How many posters 
were on the wall?”) and some of more central im- 
port (eg, “How much did the other role players 
weigh?”). One would expect that deindividuated 
subjects, although cognizant of cues of central im- 
portance, would be insensitive to incidental stimuli. 

A last series of questions provided checks on the 
manipulations. One series of these checks acertained 
whether subjects had perceived and remembered 
the manipulations, for example, whether they could 
report who the experimenter had said was responsi- 
ble for the role playing and whether the movie 
model was aggressive or not. A second set of ques- 
tions asked how uninhibited each of the various 
manipulations made subjects feel; a third check on 
the cognitive set manipulation was the question 
whether subjects saw the role-playing activity as a 
game or as aggression. 

At the completion of the questionnaire, partici- 
pants were given a letter from the primary experi- 
menter, which thanked them for their participation 
and asked them to drop by his office if there were 
questions about the research or if they wanted to 
discuss any negative feelings they may have had 
about the experiment. Participants were thoroughly 
debriefed by a second experimenter whose sole task 
was to talk to subjects about the experiment and 
to make sure no one had any negative reactions after 
participation in the study. Subjects were told they 
had merely been playing a role and that their 
behavior did not indicate what they would do in 
real life. As far as could be ascertained, subjects 
were interested in the experience, but none were 
upset by it. At the end of the quarter, all partici- 


pants were given a detailed letter describing the 
purposes and results of the study. 


RESULTS 


Checks on the Manipulations 


Checks on the manipulations showed that 
the variations between conditions had the 
intended effects. In response to an open-ended 
query about the amount of aggression dis- 
played by the movie model, 91% of the sub- 
jects in the aggressive model condition gave 
answers indicating “a lot,” whereas in the 
nonaggressive model condition, 50% of the 
subjects correctly reported that the model 
displayed no aggression. When the responses 
to the question were divided into two discreet 
categories, with “some” or “a lot” answers 
constituting one category and “none” re- 
sponses the other, it was revealed that a 
significant number of subjects saw and re- 
membered the activities of the movie model 
to whom they were exposed, x? (1) = 12.65, 
p < .001. Those missing the question did so 
because they left it blank. Another item 
asked subjects to indicate on a 7-point scale 
the degree to which the movie changed their 
personal inhibitions (anchored by more in- 
hibited and less inhibited). The mean score 
for the aggressive model group was 4.56 and 
for the nonaggressive group, 3.16. This sig- 
nificant difference, F (1,56) = 11.65, p< 
.01, indicates that subjects believed watching 
an aggressive model lowered their own inhi- 
bitions. 

The question on the responsibility manipu- 
lation asked subjects whom the experimenter 
had designated as responsible for their ag- 
gressive acts. All subjects in the nonresponsi- 
ble group and 91% in the responsible group 
answered correctly, x? (1) = 53.3, p < .001, 
which demonstrated that subjects did perceive 
and remember the responsibility manipula- 
tion. In addition, nonresponsible subjects 
reported that when the experimenter indicated 
that he alone was responsible for their ac- 
tions, they felt significantly more likely to 
perform the aggressive activity than did the 
responsible subjects, F (1,56) = 53.3, p< 
001. On this 7-point scale, ranging from 
more likely to do the activity to less likely to 
do the activity, the average scores were 3.72 
for the nonresponsible subjects and 4.69 for 
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the responsible subjects. The questionnaire 
data referring to the movie model and the 
responsibility manipulations suggest that both 
manipulations affected aggression levels 
through disinhibition of aggressive respond- 
ing. 

The data for the check on the cognitive set 
manipulation were not as clear. A significant 
number of subjects, X (1) = 36.6, p< .01, 
correctly reported whether the experimenter 
described the activity as a game or as aggres- 
sion in their condition, with 81% correctly 
stating the answer in the game condition and 
94% doing so in the aggressive cognitive set 
condition. However, when asked whether they 
saw the expected role behavior as aggressive 
or nonaggressive, subjects’ reports did not 
produce a significant main effect for cognition 
but yielded a significant interaction with the 
modeling variable, F (1, 56) = 4.04, p < .05. 
Subjects in the conditions where the model’s 
behavior and the cognitive set were not con- 
cordant (e.g., game instructions paired with 
inactive model) rated the role behavior as 
being more aggressive than did those who had 
viewed a model whose behavior seemed to 
follow from the cognitive set instructions. 
Another question asking subjects how inhib- 
ited they felt about the activities in which 
they engaged also revealed a similar signifi- 
cant Cognitive Set x Type of Model inter- 
action, F (1,56) = 4.52, p < .05. 

Because of the possibility that the effects 
of the manipulations of the present study 
could be confounded by demand character- 
istics (Orne, 1962), an attempt was made to 
assess the presence of these influences, The 
postsession questionnaire asked, “What did 
you think the real purpose(s) of this experi- 
ment was after the experimenter gave you 
your instructions?” Out of 64 students, 3 
correctly expressed one or more of the experi- 
mental hypotheses. Other students answered 
that the purpose of the experiment had to do 
with aggression or fast physical activity but 
did not express awareness of the manipulated 
variables. In the debriefing interview, sub- 
jects were also asked about their awareness 
of the manipulations. Several stated that they 
were influenced by the instructions but did 
not know that these instructions were manipu- 

s lated, The study was run early in the quarter 
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TABLE 2 
CELL MEANS FOR PHYSICAL AGGRESSION SCORES 


Game cognition Aggressive cognition 


Model 


Not Not 
Respon- Respon- 
nipe | “sible | respon- | Repon 
Aggressive 462.9 | 198.5 | 212.8 | 116.4 
SD 359.1 | 198.4 | 251.5 132.2 
Nonaggressive 123.3 92.3 23.1 12.1 
SD 162.0 | 149.4 32.4 23.6 


and used students from an introductory psy- 
chology class in which social psychology and 
deception had not yet been discussed. 


Aggression Data 


Aggression scores were derived as follows: 
Scaled scores were multiplied by the fre- 
quency of each activity, and the sum of these 
products for all activities performed by a 
given subject was his aggression score, (The 
score for one subject was derived by extrap- 
olating his score from a 30-sec period because 
his aggression became so excessive that the 
session had to be ended by the experimenter.) 

The mean aggression scores are shown in 
Table 2. It will be remembered that three 
subjects revealed awareness of what might 
be termed “demand characteristics.” That is, 
three subjects stated awareness that one or 
more of the manipulations were designed to 
influence the subjects’ aggressive behavior, 
Analysis of the aggression scores was com- 
pleted without these subjects’ scores but re- 
vealed the same pattern of significant results 
as when their data were retained, As Table 
3 indicates, all three manipulations were 
effective in producing significantly higher 
aggression scores. None of the interactions 
between the three factors were significant, 
The strongest main effect was due to mod- 
eling: Subjects who viewed the aggressive 
model were on the average approximately 
four times as aggressive as those viewing the 
nonaggressive model. 

One of the most striking findings in the 
study was the large difference between the 
cell designed to minimize personal inhibitions 
and the cell designed to maximize them. 
Mean aggression scores for these two cells 
were 462.9 and 12.1, respectively. When all 
three situational variables acted to lower 
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TABLE 3 


ANALYSIS OF VARIANCE ON PHYSICAL 
AGGRESSION SCORES 


Source FERIA F | b | w? 
Modeling (A) 547,545 | 14.72 | .001 7 
Cognitions (B) 262,695 7.06 | .010 -08 
Responsibility (C) | 162,157 4.36 | .039 -05 
AXB 23,039 -62 | .441 aS 
AXC 101,625 2.73 | .100 .03 
BXC 35,330 | .95 | 336 | — 
AXBXC 21,206 59 | 453 = 

Error 37,206 


inhibitions, subjects were almost 38 times as 
aggressive as when the situation operated to 
increase inhibitions. This strong finding indi- 
cates that the three experimental manipula- 
tions acting in concert are indeed powerful. 


Deindividuation Data 


The questionnaire completed by subjects 
after the aggression session contained items 
which, in addition to checks on the manipula- 
tion, roughly fell into five categories: feelings 
toward the victim, perception of time passage 
in the session, individuated thought content 
during the session, memory of various external 
cues in the role-playing room, and estimates 
of the frequency of one’s own aggressive be- 
havior. These items were analyzed in two 
distinct ways. First, aggression scores were 
correlated with subjects? questionnaire data, 
which provided an indication of the relation- 
ship between uninhibited aggressive output 
behavior and the components of an internal 
deindividuated state. Second, an analysis of 
variance by independent variable conditions 


TABLE 4 


DEINDIVIDUATION DATA CORRELATIONS 
WITH SCALE AGGRESSION SCORES 


ee ler hy _No. rs 
Deindividuation area cotlation eenia 
Dehumanization of victim 
(8 items) +.016 |1outof8 
Perceived similarity of the 
victim (1 item) —.097 |Ooutof1 
Time distortion (1 item) +.047 |Ooutof1 
Individuated thinking 
(18 items) +.002 | 3 out of 18 
Memory of cues (11 items) —.088 | 4out of 11 
Estimates of frequency of 
specific behaviors (2 items) +.513 | 2 out of 2 


ay = 21 or higher. 


was completed on each of the items, which 
reflected the degree of the relationship be- 
tween the input situational variables and a 
state of deindividuation. Both types of 
analyses are important if a network of rela- 
tionships between input, mediational, and 
output behaviors is to be uncovered. 

Table 4 shows the average correlation be- 
tween aggression scores and a number of 
questionnaire items grouped into categories. 
In most cases these were Pearson product- 
moment correlations, except in the case where 
questionnaire items were discreet right- 
wrong categories in the memory section; for 
these items point-biserial correlation was 
used. One would expect that if aggressive be- 
havior was usually accompanied by a deindi- 
viduated internal state, the first five averaged 
correlations in Table 4 would be sizable, 
whereas the last would be small. The first 
five correlations represent the relationship 
between an internal deindividuated state and 
the aggressive behavior in this study. Clearly 
these relationships are quite small. The last 
averaged correlation, that between self-esti- 
mates of aggressive behaviors (sword hitting 
and ball throwing) and subjects’ aggression 
scores is quite large, considering that the 
frequency of either behavior by itself was not 
perfectly correlated with a subject’s overall 
aggression score. These estimates tended to 
be quite accurate, which indicated that the 
subjects were quite aware of their own be- 
havior during the aggression session and 
could accurately remember this behavior. The 
pattern of the correlations suggests strongly 
that aggressive behavior was not accompanied 
by an internal deindividuated state. But one 
might wonder about the few significant corre- 
lations which did occur in the first five cate- 
gories. Generally, these were of low magni- 
tude (.21 to .35) and were not uniformly in 
the direction one would expect if aggressive 
subjects were more deindividuated. For ex- 
ample, there was a significant positive corre- 
lation (r = +.26) between a subject’s aggres- 
sion score and the amount of time he reported 
worrying about hurting his victim, an under- 
standable relationship in light of the high 
levels of aggressive intensity exhibited by 
some subjects but a relationship not predicted 
by deindividuation, which would credit the 
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increased aggression to an internal state 
where moral considerations would not enter 
the subject’s awareness. The evidence seems 
quite strong, then, that aggressive behavior 
was not produced nor accompanied by a de- 
individuated internal state. 

The data also indicate that the three ex- 
perimental manipulations, although produc- 
ing increases in aggression, did not result in 
an internal state of deindividuation. Each of 
the 39 deindividuation items was analyzed 
ina 2 X 2 X 2 analysis of variance, and only 
6 of the 117 main effects (3 for each of the 
39 items) were significant at the .05 level, 
which is about the number one would expect 
just by chance. The few significant figures 
for the deindividuation items revealed no 
apparent pattern and were not always in the 
direction one would expect if the manipula- 
tions produced deindividuation. On two addi- 
tional items which asked subjects for esti- 
mates of their own aggressive behavior, there 
were several significant differences between 
groups, and these differences were in the di- 
rection of the actual behavior differences 
which existed between conditions. Three of 
the six main effects reached the .05 level of 
significance. These two items again reflected 
the ability of subjects to observe and remem- 
ber their own aggressive behavior. 


Discussion 


Magnitude of Aggression 


The results of this experiment support the 
hypothesis that certain variables break down 
personal inhibitions and release antisocial 
behavior, in this case, aggression. The manip- 
ulations implemented in this study all led to 
increased aggression, and analyses of post- 
session self-report data lent credence to the 
idea that these increases in aggression were 
generally’ due to lower personal restraints, 
Although no direct evidence is yet available, 
it seems reasonable to surmise that the forces 
which released aggression in this controlled 
laboratory study are often forces that are 
present in real-life crowds. Real-life crowd 
members may commit acts which are shock- 
ing, especially in view of the large number of 
normally law-abiding citizens who seem to go 
along with the group. It is instructive to note 
in this regard that the most aggressive condi- 
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tion in this experiment revealed surprisingly 
high levels of aggressive behavior. To achieve 
a score of 463 (the average score for the 
highest condition), subjects could, for exam- 
ple, hit the other role player hard 65 times 
with the foam sword within a 2-minute 
period. The aggressive activity was often 
feverish in this condition, and in one case the 
session had to be prematurely terminated 
because of excessive aggressiveness (a subject 
was hitting the role-playing experimenter 
quite hard on the ears and face). 

It should be recalled that the instructions 
to subjects stressed their freedom to do as 
they pleased and to perform only acts with 
which they could feel “comfortable.” The ad- 
vantage of the aggression measure used in 
this study compared to the ubiquitous shock 
apparatus (Buss, 1961) is that it allows sub- 
jects wide freedom of response and demands 
face-to-face physical confrontation with the 
victim if aggression is to be enacted, An indi- 
cation that nonaggressive behavior was possi- 
ble in the room with the confederate, who sat 
passively on the floor, was given by some of 
the behaviors of nonaggressive subjects, for 
example, talking to the other role player, jug- 
gling the balls, and throwing things at the 
posters on the walls. The disinhibiting manip- 
ulations produced a high amount of aggres- 
sion, then, in the absence of direct pressure 
by the experimenter to be aggressive, in con- 
trast with the Milgram (1963) obedience 
studies, where there was continuous direct 
pressure from the experimenter for subjects 
to shock the victim. Aggressive behavior was 
released by the disinhibiting variables with 
only minimal experimenter demands for ag- 
gressive activity. The release of high levels of 
aggression such as occurred in this study in 
the absence of strong motivation to aggress 
is supportive of Zimbardo’s (1969) concep- 
tualization that wild and impulsive behavior 
is an automatic result of deindividuating in- 
put variables. 


Experimenter Effects 


There is substantial evidence that the ag- 
gression results were due to the manipulations 
and not to demand characteristics (Ome, 
1962). A check for demand characteristics 
revealed that only three subjects spontane- 
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ously expressed awareness of experimental 
hypotheses, and in debriefing interviews, it 
was found that few additional subjects ex- 
pressed awareness of the manipulations. Those 
that did express such awareness in the inter- 
view were suspicious of the movie. Great ef- 
forts were made in the instructions to avoid 
suspicion, and subtle demands and only 
naive subjects were used. Great freedom of 
responding was allowed. Yet in some con- 
ditions a large amount of aggression was 
manifested, far more than would have been 
“required” by subtle demand characteristics, 
Furthermore, the checks on the manipulation 
indicated generally that the manipulations 
did tend to raise or lower inhibitions. This 
entire accumulation of evidence indicates, 
then, that demand characteristics were of 
minimal importance in producing subjects’ 
aggressive behavior. 


Internal Deindividuated State 


The data indicated that an internal dein- 
dividuated state did not accompany aggres- 
sive behavior overall or any of the three dis- 
inhibiting variables. The method of assessing 
deindividuation, of course, leaves open the 
possibility of error. A questionnaire may be a 
particularly insensitive means of measuring 
deindividuation. This criticism seems most 
appropriate to subjects’ reports of how they 
felt toward their victim because subjects may 
have tried to appear to be objective and ra- 
tional and therefore given a positive or neu- 
tral report on their victim. Other categories 
such as time estimation, memory of environ- 
mental cues, and memory of one’s own ag- 
gressive behaviors would appear much less 
influenced by the nature of the measurement, 
It would be difficult for a subject to answer 
defensively on such items, and yet, they too 
fail to reveal deindividuation. In fact, mem- 
ory of one’s own aggressive behaviors reveals 
an individuated pattern in which subjects 
have accurately observed their own behavior. 
The post hoc nature of the questionnaire is 
another possible source of error, but again, 
the dehumanization of the victim items are 
those which seem most distortable due to the 
fact that the questionnaire was completed 
after the aggression session had been termi- 
nated. It is possible too that individuated 
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thinking could have been falsely reported 
after the aggression session, but it is hard to 
imagine how time perception, memory for 
cues, or memory of one’s own aggressive be- 
havior could have been falsely reported. Al- 
though certain categories of the questionnaire 
were definitely subject to error, the fact 
remains that the whole pattern of results 
points not to deindividuation but, if any- 
thing, in the opposite direction, Subjects, re- 
gardless of condition or aggression level, 
seemed to be uniformly aware of self and 
their surroundings. 

Zimbardo’s (1969) theory does not specify 
the necessary or sufficient input conditions 
for deindividuation to occur. Rather, the 
theory specifies a number of input variables, 
and the question of their relationship to a 
deindividuated internal state is left to em- 
pirical inquiry. The failure in the present 
study to find a relationship between input 
variables and deindividuation does not in any 
sense disconfirm the theory of deindividua- 
tion but rather indicates that the input varia- 
bles used in this study were not sufficient in 
themselves to produce an internal deindi- 
viduated state. The results uncovered in the 
Present study confirm Zimbardo’s (1969, p. 
259) contention that disinhibition of wild, 
irrational behavior can occur in the absence 
of deindividuation, Perhaps, as Cannavale et 
al. (1970) have asserted, deindividuation is 
solely a group phenomenon, High priority 
should be given to studies which assess the 
relationship of a deindividuated internal state 
to variables most central to the theory (group 
presence and anonymity). 


Disinhibition of Aggression 


It appears that cognitive set and altered 
responsibility can affect the level of aggres- 
sive behavior. The present findings on altered 
responsibility indicate that responsibility for 
one’s own behavior as well as for the behavior 
of others (Tilker, 1970) is an important 
determiner of the course of action one takes. 
Self-report data indicated that subjects be- 
lieved they felt more prone to do the aggres- 
sive activity if told that they were not re- 
sponsible for their actions. Feeling unrespon- 
sible for one’s acts may release normally 
restrained behavior in two ways. First, one is 
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not normally punished for his acts if others 
are responsible, and second, one may not 
reflect as much on the morality of the act if 
another is responsible. Some of Milgram’s 
(1963) subjects were willing to continue on 
to the highest shock levels only after they 
were reassured that the experimenter ac- 
cepted complete responsibility for their ac- 
tions. In the present experiment, subjects 
were almost twice as aggressive if they did 
not feel responsible as were those who were 
made to feel responsible for their actions. 
It was found that the cognitive interpreta- 
tion a subject was told to place upon a possi- 
ble action he could take affected the course of 
this behavior. The result extends the findings 
of Geen and Stonner (1973), which dealt 
only with the cognitive interpretation of 
viewed violence. If subjects were told to view 
the role behavior as a game, they were more 
than twice as aggressive on the average as 
those told to view the behavior as aggression. 
The present study also demonstrated the 
strong effect of aggressive models. If aggres- 
sion such as that manifested in the most un- 
restrained condition of the present study can 
be released in college students being observed 
in a psychology laboratory, how much ag- 
gression can be released when these forces 
are acting in the “real world”? The present 
authors are currently conducting research to 
ascertain the extent to which these forces act 
upon members of college campus crowds. 
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Reactions to Severity and Nature of Threat Among Children of 
Dissimilar Socioeconomic Leyels 
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The purpose of this study was to explore the conditions under which lower- 
and middle-class preschool children react differently in the “forbidden toy” 
paradigm. In a 2 X2 X 2 design, 40 middle- and 43 lower-class subjects were 
threatened with loss of a gift of money (mild or severe) or loss of approval 
(mild or severe) if they played with an attractive toy. Rankings of the for- 
bidden toy before and after a 5-minute temptation period indicated opposite 
reactions in middle- and lower-class subjects. For middle-class subjects, k 
threatening loss of approval produced an effect consistent with a prediction 
derived from dissonance theory, that is, devaluation of the toy under mild 
rather than severe threat. Threatening loss of small or large quantities of 
money generated intermediate and similar amounts of derogation. For lower- 
class subjects, threatening mild or severe loss of approval produced moderate 
and similar amounts of devaluation. But threatening loss of money produced 
an effect opposite to that predicted by dissonance theory, that is, more de- 
valuation under threat of large than small money loss. This reversal resists 


both dissonance and reinforcement-familiarity interpretations. 


In a ground-breaking experiment, Aronson 
and Carlsmith (1963) introduced the forbid- 
den toy paradigm. Children were asked to 
rank order their relative preferences for five 
toys. The second-ranked toy was then forbid- 
den to the children under conditions of either 
mild or severe threat for transgression. After 
a 5-minute temptation period, in which the 
children successfully refrained from playing 
with the forbidden toy, a final ranking of 
the toys indicated decreased liking for the 
forbidden toy in the mild-threat condition 
but not in the severe-threat condition. This 
finding was interpreted within a cognitive dis- 
sonance framework. According to dissonance 
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theory, the severe threat provided the chil- 
dren with a very good reason for not playing 
with the toy while the mild threat aroused 
cognitive dissonance. It was assumed that the 
children experienced dissonance in the mild- 
threat condition because they had insufficient 
reason for refraining from playing with an 
attractive toy. Presumably to reduce disso- 
nance, the children concluded that the toy 
must not be that attractive after all, and 
devalued it accordingly. 

The Aronson and Carlsmith finding has 
been replicated by a number of studies (Carl- 
smith, Ebbesen, Lepper, Zanna, Joncas, & 
Abelson, 1969; Lepper, Zanna, & Abelson, 
1970; Ostfeld & Katz, 1969; Turner & 
Wright, 1965), including those studies which 
used either long- or short-term behavioral 
avoidance as a dependent measure (Freed- 
man, 1965; Pepitone, McCauley, & Ham- 
mond, 1967), and one study (Lepper, 1973) 
which suggests that the insufficient justifica- 
tion provided by a mild threat may induce a 
child not only to devalue the toy, but to 
reason that he is an honest person as well. 
These findings suggest that the successful use 
of mild rather than severe threat in control- 
ling a child’s behavior is more efficacious in 
the long run because it promotes internaliza- 
tion of values, which facilitates resistance to 
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temptation in the absence of external super- 
vision, 

There is evidence, however, that the disso- 
nance prediction may not hold for all chil- 
dren. Ostfeld and Katz (1969), working with 
both middle- and lower-socioeconomic-class 
children, found, as expected, that middle-class 
children derogated the toy under mild threat 
though not under severe threat. But the 
lower-income children responded in an oppo- 
site manner; that is, they derogated the for- 
bidden toy more when they received harsh 
rather than mild threat. Ostfeld and Katz 
offered an alternative to dissonance theory, 
the reinforcement-familiarity hypothesis, to 
account for the results of both middle- and 
lower-class subjects. It was assumed (a) that 
the child’s verbal report (i.e., devaluation of 
the forbidden toy) reflects the effectiveness 
of the threats, (b) that children respond best 
to the disciplinary technique they are most 
accustomed to receiving and (c) since lower- 
income families use harsher threats than mid- 
dle-income families, severe threat is more 
likely than mild threat to generate devalua- 
tion of a forbidden behavior among lower- 
class children while the reverse is true for 
middle-class children. 

Two observations regarding the methodol- 
ogy of the Ostfeld and Katz study deserve 
comment. First, the experimenter did not 
leave the subjects alone during the tempta- 
tion period, but instead “busied herself for 
three minutes in another part of the room” 
(Ostfeld & Katz, 1969, p. 206). It is there- 
fore possible that some of the children might 
have perceived themselves under surveillance 
and consequently did not feel the freedom of 
choice that is necessary to arouse dissonance. 
Second, the experimenter was not blind to the 
experimental condition of the subjects. Thus, 
it is also possible that the experimenter’s ex- 
pectancies could have influenced the final 
rankings of the toys. Since the Ostfeld and 
Katz study appears to offer the only findings 
inconsistent with that predicted by dissonance 
theory, one purpose of this study was to con- 
duct the “forbidden toy” experiment with 
lower-class subjects in a manner which avoids 
the problems pointed out above.’ 

1It is noted that Biaggio and Rodrigues (1971) 
have reported a nonsignificant trend in this direc- 
tion for lower-middle-class, Brazilian children. 
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A second aim of this study was to explore 
the viability of the reinforcement-familiarity 
hypothesis. One of the assumptions crucial to 
this hypothesis is that children from lower- 
income families are more familiar than chil- 
dren from middle-class families with harsh 
verbal threats and therefore respond most 
effectively to such threats. Consequently, Ost- 
feld and Katz in their study delivered a 
harsh threat of punishment in an angry tone 
of voice. It is possible, however, that harsh 
threats are both effective and familiar to 
lower-income children. Thus, in the present 
study an attempt was made to select a 
threat that would be effective, but not harsh 
or necessarily more familiar to lower-class 
than to middle-class children. If the rein- 
forcement-familiarity hypothesis is correct, 
middle-class and lower-class children should 
respond to such a threat in a similar manner. 

Finally, a third objective was to investi- 
gate the effects of threats involving the loss of 
different types of positive stimuli. Research 
in the area of behavior modification suggests 
that material reinforcers are more effective 
than social approval for lower-socioeconomic- 
class children (Risley, 1968). Generalizing 
from this literature, it was assumed that, for 
lower-income children, the level of threat in- 
corporating the loss of a valued commodity 
would be more easily discriminated than 
threats involving the loss of differential 
amounts of adult approval. However, it was 
also assumed that threatening the loss of a 
material reinforcer in an even-tempered, mat- 
ter-of-fact fashion is a disciplinary technique 
that is not more familiar to lower-class than 
middle-class children. The present study, 
then, employed threats involving the loss of 
two classes of positive reinforcers, loss of 
money and loss of adult approval. 


METHOD 
Subjects 


The subjects were 83 preschool children, with a 
mean age of 58 months. Approximately one-half of 
the subjects were from families of low socioeconomic 
status and one-half from families of middle and 
upper-middle socioeconomic status. Each group had 
about the same number of boys and girls and each 
ranged in age from 42 to 66 months. For the low- 
income group, 43 black children were selected from 
a predominantly black, day-nursery school located 
in a low socioeconomic neighborhood in St. Peters- 
burg, Florida. The sample constituted all of the 4- 
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and 5-year-old children in attendance at the school. 
The median family income for these subjects was 
$4,800, a figure that approximates the government’s 
poverty threshold for an urban family. Parents of 
these subjects secured their income largely from 
semiskilled and unskilled occupations or welfare. 
For the middle-class group, a total of 40 children 
were obtained from two day-nursery schools located 
in an upper-middle socioeconomic, suburban area of 
St. Petersburg. The sample exhausted the population 
of 4- and. 5-year-olds at both schools. With few 
exceptions, fathers of these subjects were profes- 
sionals or high-level businessmen. 


Procedure 


Following an introduction by the teacher, two 
white male experimenters escorted subjects in pairs 
from their classroom to another section of the 
school, where each subject was taken individually 
to one of two experimental rooms by an experi- 
menter, From this point on, each experimenter 
independently subjected his subject to the following 
procedure. Upon entering the room, the experimenter 
announced that he had a gift for the subject and 
then proceeded to slowly count out 20 pennies. The 
subject was informed that the 20-penny gift was his 
to keep and take home, but would be placed aside 
for the time being until he had played with some 
toys. The subject was seated before a table on which 
there were five toys—a friction-powered, custom- 
ized car; a miniature piano; a varicolored ball; a 
mechanical “popcorn” popper; and a kaleidoscope. 
After the subject was acquainted with the toys, his 


"relative preferences for all five toys were assessed 


‘by the process of elimination. The subject was first 

asked to select his favorite toy; that toy was then 
removed from the table and the child was instructed 
to select the most preferred toy from among those 
remaining, and so on, until all five toys had been 
ranked. All rankings were double-checked by the 
method of paired comparisons.? 

When the toys had been ranked, the experimenter 
announced: “I have to leave for a few minutes, but 
while I’m gone you can play with any of the toys 
except the [second-ranked toy|.” The experimenter 
then placed the second-ranked toy in a designated 
position at the back of the table which allowed 
later determination as to whether it had been 
moved. Although sequestered from the other toys, 
the forbidden toy along with the 20-pennies re- 
mained on the table within the child’s sight and 
reach, 

At this point, the threat was introduced. Subjects 
were randomly assigned to one of four experimental 
conditions in which nature of threat (loss of ap- 
proval vs. loss of money) and level of threat 
(mild vs. severe) were varied. Special care was 
taken so that all of the threats were spoken in an 
equally serious, but not angry or harsh tone of 
voice, In the mild-threat-loss of approval condition, 
the experimenter announced: “If you play with 
the [second-ranked toy], PII be upset with you.” 


2Tnitially 93 subjects participated, but 10 were 
eliminated due to inconsistent rankings. 
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In the severe-threat-loss of approval jition, the 
experimenter said: “If you play with [second- 
ranked toy], I would be very angry LNA 
would take all of my toys and go hon r 
come back. I would think you were 
baby.” This particular manipulation v 
because it was judged to represent prim 
of the experimenter’s approval, and for < 
purposes, since it had been used in a previous 
(Aronson & Carlsmith, 1963). 

For the loss of money condition, the experi: 
varied mild and severe threat by declaring: “11 
play with the [second-ranked toy], I'll be upset with 
you and TIl have to take away 5 (all ur 
pennies.” When delivering the verbal i 
volving money loss, the experimenter also | 
demonstrated to the subject the number o 
at stake. 

After delivering the threat, the exp: left 
the subject alone in the room for 5 m 
end of the temptation period, the seco 
menter (who was blind to the experiment 
tion of the subject) entered the room and checi 
whether the subject had played with the forbidden 
toy. (None of the subjects transgressed.) T ond 
experimenter then placed the forbidden | 
the others and recorded the subject's | 
ranking of the toys using the sam 
described earlier. Afterwards, the subje: vere t 
that their teacher would give them their pe 
later and then were escorted back to their classrox 
where they rejoined structured activity, maki: 
intersubject communication unlikely. 
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RESULTS AND Discussio 


Initial analyses, by # tests, revealed n 
significant effects of sex of subjects or 
experimenters. Thus, the data from the two 
experimenters and the males and females 
were combined in all subsequent analyses 
The dependent measure was the change 
relative preference for the forbidden toy, 
Since the second-ranked toy was 
forbidden, change scores could r: 
+1 to —3. These scores were subir toa 
2 (Socioeconomic Status) X 2 (Nature of 
Threat) X 2 (Level of Threat) an: 
variance for unequal ms. This analy. elded 
a significant, F (1, 75) = 4.60, p < .05, main 
effect of nature of threat, which indicated 
more devaluation of the forbidden toy in the 
loss of money than in the loss of approval 
condition. Of importance to the major issue 
addressed by this study, however, was the 
finding of a significant, F (1,75) = 8.76, p < 
.01, Socioeconomic Level X Level of Threat 
interaction. Inspection of the data in this 
relationship revealed, as expected, that mid- 
dle-class subjects devalued the forbidden toy 
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more in the low-threat condition than in the 
high-threat condition. By contrast, lower- 
class children derogated the forbidden toy 
more in the high-threat than in the low- 
threat condition. In addition, a significant, 
BOSE = 11.44, p < .001, Nature of 
Threat X Level of Threat interaction ap- 
peared. Interpretation of these statistical 
effects can best be facilitated by an inspec- 
tion of the means for all conditions, which are 
presented in Table 1. 


Middle-Socioeconomic-Class Sub jects 


Tt can be seen in Table 1 that in the loss 
of approval condition, the behavior of the 
middle-class children was highly consistent 
with that of past studies, Derogation of the 
forbidden toy occurred under low threat, but 
the value of the toy was slightly enhanced 
under high threat. The difference between 
these means was significant (p < .01).8 No 
such pattern emerged in the loss of money 
condition. Rather, when the subjects were 
threatened either with the loss of 5 or all of 
their pennies, they devalued the forbidden 
toy an intermediate amount. The difference 
between the amount of derogation produced 
by high and low threat in the loss of money 
condition was not significant, nor did the 
mean of either level significantly differ from 
the mean amount of devaluation in the loss 
of approval-low-threat condition. The mean 
in the high-threat—loss of approval condition, 
however, was significantly different (p < .05) 
from the mean in the high-threat-loss of 
money condition and marginally different (p 
< .10) from the mean in the low-threat-loss 
of money condition. 

To summarize, threatening loss of large 
and small quantities of money produced simi- 
lar amounts of devaluation that were slightly 
less than that generated by the threat of 
mild loss of approval, while the mean of the 
condition involving threat of severe loss of 
approval reflected increased liking for the 
forbidden toy. Thus, the effect predicted by 
dissonance theory occurred in the loss of ap- 
proval condition, but not in the loss of money 
condition. 

Since Pepitone et al. (1967) have shown 
that a significant dissonance effect can be 


8 Individual comparisons were made by use of 
the Newman-Keuls procedure. 


TABLE 1 


MEAN CHANGE IN ATTRACTIVENESS or 
Forsmpen Toy 


Socioeconomic class 
Nature and level 
of threat aes eae 
Middle Lower 

Loss of approval 

Low threat —1.70 —A0 

n 10 10 

High threat 50 —.50 

n 10 12 
Loss of money 

Low threat —.80 —.64 

n 10 11 

High threat —1.10 —1.80 

n 10 10 


Note. MS within = 1.36, 


produced by threatening the loss of objects 
which differ in value, it is likely that many 
of the children from middle-income families 
did not discriminate in value between the 
loss of 5 and 20 pennies. It should be men- , 
tioned that high threat in this condition pro- 
duced high variability; that is, the majority 
of subjects either increased their liking for 
the toy (+1) or devalued it severely (—3), 
suggesting that the threat was mild for some 
and strong for others. Perhaps this reflected 
differences among middle-income families in 
the value placed on money. It is parentheti- 
cally noted that some middle-class children 
appeared to treat the 20-penny gift in a 
cavalier fashion, while others did not, 

According to dissonance theory, all of the 
threats that produced devaluation were effec- 
tive because they were potent enough to in- 
hibit transgression, but at the same time 
weak enough so that the subjects attributed 
the reason for their restraint to dislike for 
the toy rather than to the strength of the 
threat. Paradoxically, then, the findings for 
middle-class subjects suggest that threatening 
the loss of something material (such as a 
youngster’s allowance) may be most effica- 
cious for those children who place little value 
on the commodity in jeopardy. For the disso- 
nance process to occur, however, it is impor- 
tant that the threat be at a level sufficient to 
initially deter the child from engaging in the 
prohibited behavior. 


Lower-Socioeconomic-Class Subjects 


Table 1 shows that the pattern of results 
for lower-class subjects differed from that of 
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middle-class subjects in two principal ways. 
First, as hypothesized, the level of threat in 
the loss of approval condition produced no 
significant effect. Instead, each threat level 
induced a rather moderate amount of devalu- 
ation. Second, the greatest amount of devalu- 
ation occurred under high threat rather than 
low threat in the loss of money condition. 
Although lower-class subjects behaved con- 
trary to middle-class subjects, the differences 
between the means were not as strong as 
those reported for middle-class subjects. For 
lower-class subjects, the mean in the high- 
threat-loss of money condition differed from 
the means in all of the other conditions be- 
yond the .10 level of significance.* Thus, the 
reverse to that predicted by dissonance theory 
appeared in the loss of money condition, but 
in the loss of approval condition, high and 
low threat produced similar amounts of 
devaluation. 

It should be noted that all threats were 
sufficient to deter the children from playing 
with the forbidden toy. However, lower-class 
subjects did not respond differentially to the 
mild and severe threats involving loss of 
approval in the posttreatment ranking. If 
devaluation of the forbidden toy among 
lower-class subjects represents internalization 
of value (which has yet to be established), 
the present findings suggest that threatening 
strong or mild loss of approval to influence 
long-term behavior change may prove less 
effective than threatening the substantial loss 
of something concrete. 

In sum, the reactions of lower-class sub- 
jects were opposite those of middle-class sub- 
jects and do not appear to be an artifact of 

4 The mean in the high-threat-loss of money con- 
dition differed from all other means beyond the .05 
level by two-tailed £ tests. However, the more con- 
servative Newman-Keuls method was used because 
not all comparisons were orthogonal. In a subse- 
quent study conducted by the authors, in which 
loss of different amounts of candy was used to 
manipulate threat, the finding of a positive associa- 
tion (p < .01) between threat level and devaluation 
was restricted to those black subjects who were from 
lower-income families, that is, below the median of 
$400 per month. Black subjects from higher-income 
families behaved in a manner similar to that of 
middle-class subjects in the loss of money condition 
in the present study. These findings suggest that the 
reversal is linked to class rather than race and that 
a stronger effect would have been obtained in the 


present study had only very low-income subjects 
been used. 


experimental procedure. The findings for 
lower-class subjects were consistent with 
those in the Ostfeld and Katz study, in which 
high threat (a harsh threat of punishment) 
produced more derogation than low threa 
(mild loss of approval) in lower-class sub 
jects. The threats involving loss of money in 
the present study, however, were not admin- 
istered in an ill-tempered manner, nor did 
they suggest physical punishment. Since 
threatening loss of money in an even-tem- 
pered fashion is not assumed to be a disci- 
plinary practice more characteristic of lower 
class than middle-class families, the reinforce 
ment-familiarity hypothesis does not readily 
account for the results. Thus, the findings of 
both the Ostfeld and Katz study and th: 
present investigation remain a challenge to 
the generality of dissonance theory. 
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Leader Effectiveness and Leadership Conferral as Determinants 
of Helping in a Medical Emergency 
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This study investigated the contribution of an individual’s qualification for 
discussion group leadership (LGD) and the method of leadership conferral, 
election versus appointment, upon his group’s response to a medical emer- 
gency. Five-person, face-to-face groups with high LGD leaders responded 
more frequently and more rapidly to a confederate member's diabetic reaction 
than did groups with low LGD leaders. Low LGD leaders were frequently 
overthrown, while groups with high LGD leaders experienced continuity of 
leadership. Conferral process had no discernible effect on helping or over- 
throws. It was concluded that a victim is the more fortunate if his group’s 


leader is assertive. 


Studies of altruism, good samaritanism, 
and other forms of helping behavior can be 
divided into two types: those of naturally 
occurring social aggregates, such as noninter- 
acting groups of subway riders (Allen, 1972; 
Piliavin, Rodin, & Piliavin, 1969), and those 
based in laboratory situations employing con- 
federates and highly controlled, often tape- 
recorded, interactions (e.g., Darley & Latané, 
1968; Latané & Darley, 1969; Latané & 
Rodin, 1969). As yet the problem of emer- 
gency helping behavior in interacting face-to- 
face groups has not been treated. Indeed, it 
has been suggested (Korte, 1969) that groups 
may function to impede the delivery of help 
through inaction, which minimizes the reality 
of the emergency, or through diffusion of re- 
sponsibility. The function of leadership in 
these situations has also not been examined. 
The present investigation bridges these gaps. 

Latané and Darley (1969) have shown 
that variables in the immediate social en- 
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vironment are more critical determinants of 
individuals’ reactions to emergencies than are 
general personality concepts such as social 
responsibility, locus of control, or alienation. 
Their work is thus consistent with Krebs’s 
(1970) comprehensive review of the altruism 
literature which concludes that personal trait 
variations have failed to show any large or 
consistent relationship to helping. Perhaps it 
is the particular laboratory paradigms em- 
ployed that have led to this conclusion. The 
use of severely contrived situations that re- 
strict communications may well act to mini- 
mize the contribution of social—personality 
differences. Thus, while appropriate for cer- 
tain classes of problems, this methodological 
strategy of rigged communication in pseudo 
groups effectively precludes the investigation 
of how freely interacting groups might deal 
with a crisis. For the same reason, this para- 
digm has not allowed for the assessment of 
leader influence in the emergency situation, 
The present study is concerned with the ef- 
fects of leader qualifications and mode of 
leadership conferral upon group responses to 
a medical emergency involving one of its 
members. 


Leadership and Group Effectiveness 


A common distinction in the group process 
literature is that between emergent and ap- 
pointed leaders (cf. Hollander, 1964). To 
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emerge as a leader of an initially leaderless 
group discussion was shown (Bass, 1961) to 
require several types of activity and skill: 
initiation or formulation of goals and prob- 
lems, organization of the group’s thinking, 
clarifying other individuals’ responses, out- 
lining the discussion, and summarizing, gen- 
eralizing, and formulating conclusions. These 
skills constitute a member’s qualifications for 
the leadership role (Bass, 1949, 1961; Hein- 
ecke & Bales, 1953) and define the discussion 
group leadership (LGD) ability variable 
used in this study. Peer nomination research 
(Hollander & Webb, 1955) indicates that 
group members choose a leader who members 
see as best qualified according to group 
standards of judgment. Thus emergent lead- 
ers are seen by group members as possessing 
those qualities, skills, and resources necessary 
for group locomotion. 

By contrast, appointed leaders gain incum- 
bency independent of the preferences of the 
group members. Depending on the validity of 
the selection procedures used, such a leader 
may or may not actually possess qualities 
necessary to direct group activity. Since ap- 
pointed leaders will vary in their qualifica- 
tions, it is necessary to see them in action. A 
crisis situation allows for such an observa- 
tional test of a leader’s ability to initiate 
structure, especially for the case of an in- 
formal, unplanned group activity such as the 
management of a stressful emergency situa- 
tion. 

Even though a medical crisis represents a 
task quite different from the group’s usual ac- 
tivities, there should nevertheless be increased 
centralization of leadership in an emergency 
(Jones & Gerard, 1968) where the leader is 
initially seen as competent. Torrence (1967) 
has, for example, shown that groups under 
stress prefer continuity of leadership from 
one situation to the next when such leader- 
ship is feasible. On the other hand, Hamblin 
(1958) reported that when leaders did not 
bring a quick end to a crisis, they were de- 
posed. Groups apparently have little tolerance 
for ineffective leaders when the stakes are 
perceived as high. 

The present investigation employs group 
reactions to a crisis under three different con- 
ditions of leadership: emergent-elected lead- 
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ership, appointed leadership, and pseudo- 
elected leadership (wherein each member be- 
lieves the group has elected the least compe- 
tent member as leader). To the extent that 
leader qualification is the main determinant of 
helping in a crisis situation, we can predict 
that groups headed by emergent-elected lead- 
ers will respond most quickly; those headed 
by pseudo-elected leaders will respond least 
quickly; while those headed by appointed 
leaders should, because of their variable quali- 
fications, show intermediate efficiency. When 
the transition from discussion to crisis man- 
agement occurs, the likelihood of a coup is ex- 
pected to be directly related to leader ineffec- 
tiveness; that is, pseudo-elected leaders will 
be most often overthrown, and emergent- 
elected leaders will be least often overthrown. 


MeErtHOD 
Subjects 


Participants were 156 male undergraduates, vir- 
tually the entire population enrolled in the intro- 
ductory psychology course at Villanova University. 


Procedure 


The present study employed 39 five-person 
groups. In each, four members were actual subjects, 
while the fifth was a confederate. Subjects were 
seated about a rectangular table in a “group dy- 
namics” laboratory room equipped with concealed 
audio monitoring and one-way observation facili- 
ties. They were told that the purpose of the study 
was “to look at group processes in order to gain a 
clearer understanding of how people work together,” 
To this end, the experimenter indicated that the 
group would be given a series of problem situations 
which they were to discuss among themselves and 
arrive at a consensual solution to each problem (cf. 
Wallach, Kogan, & Bem, 1962). 

At this point the experimenter, a mature, grey- 
haired male graduate student, invited participants 
in all groups to introduce themselves and to de- 
scribe their academic and outside interests, such as 
sports and hobbies. When the confederate’s turn 
came to talk, he said that he did not go in much 
for sports because he had diabetes and took insulin. 
Following this “get acquainted” go-around, the 
experimenter told all groups that 


Td like to give you one of these problems that 
I've been talking about to see how you do on it 
and to make sure there are no problems with it, 
It’s also going to be necessary for one of you to 
act as group leader, and we'll decide who that 
w be after you run through this sample prob- 
lem. 


¥ 


DETERMINANTS oF HELPING 


The experimenter read the sample problem and 
told the group to come to a decision within 5 min- 
utes time and to write down their answer and the 
reasons why they had come to this conclusion. 

As the group discussed the sample problem, two 
concealed judges rated each membet’s leadership 
potential according to the LGD criteria formulated 
by Bass (1949). 

Manipulation of leadership conferral. Upon com- 
pletion of the sample problem, the group leader 
was selected according to one of the following three 
procedures (with 13 groups in each condition) : 


1. In the emergent-elected-leader condition, the 
-experimenter instructed each group member to write 
on a slip of paper the first name of the person he 
thought would be the best one to lead the group. 
The experimenter appeared to count the votes and 
then informed the group who had been selected. In 
all cases the person actually “elected” by the sub- 
jects was also the one whom the concealed judges 
would have chosen on the basis of his performance 
during the discussion of the sample problem, He 
was the member who scored highest on LGD cri- 
teria (e.g., solution suggestions, discussion control) 
during the sample discussion. 

2. For the pseudo-elected-leader condition, the 
procedure was similar to that of the emergent- 
elected condition except that the election results were 
rigged. Group members were led to believe that they 
had elected as leader the member whom the judges 
had scored as lowest of the four on LGD criteria. 

3. In the appointed-leader condition, no balloting 
procedure occurred. The experimenter simply desig- 
nated a member of the group to be leader without 
justifying his selection. To ensure variable LGD 
qualification, the third person arriving for the ex- 
periment was always selected as appointed leader. Of 
these, six were rated high on LGD and seven were 
rated low on LGD, although they were not neces- 
sarily the highest or lowest in the group 


Upon completion of the leadership conferral ma- 
yipulation, all groups were told that they would be 
given five problems of the type they had just solved. 
‘In order to convey a sense of responsibility and the 
‘isolation of the group from the surveillance of the 
research staff, the experimenter indicated; 


What I want you to do is this: The leader will 
read what the problem is, and then all of you will 
discuss it and come to a conclusion as to what 
should be done. When you have done so, the 
leader will bring the result to me in my office 
down the hall, where I will look at the way you 
solved the problem. I will then give him the next 
problem to be solved. Remember that you will 
have 5 minutes to solve each problem, so be sure 
to work within that framework of time. Are 
there any questions before I give you the first 
problem? a 


After answering any questions, the experimenter 
gave the leader a typed sheet with the first problem 


and left the room. 
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Medical emergency. During the discussion of the 
fourth problem, the confederate, who had been act- 
ing in an increasingly agitated and restless manner 
and had contributed little to the group discussion, 
began to complain that he was having a diabetic 
reaction and that he needed some sugar. The request. 
was an explicit one which has been shown to in- 
crease the probability of helping (Yakimoyich & 
Saltz, 1971). The confederate said that he was un- 
able to get the sugar himself, felt that he might 
pass out, and said that he needed help. He looked 
in the direction of the leader as he said these things. 
The general effect was one of increasing urgency, 

Measurement of helping behavior. Our two con- 
cealed judges observed and recorded specific group 
reactions during the diabetic reaction. The major 
dependent variables were the following: the occur- 
rence of a criterial helping response, defined as a 
person leaving the laboratory to directly aid the 
victim or to find the experimenter, and the latency 
of this response. During the crisis, judges observed 
occurrences of continuity and/or change in leader- 
ship. The occurrence of an overthrow was scored 
when a person other than the designated leader 
either mediated interaction with the victim or 
directed the helping response. 

The experiment was ended either upon the occur- 
rence of helping or, if no one left the room, after 
3 minutes had elapsed from the confederate’s explicit 
request for sugar. Regardless of the occurrence of 
helping, the experimenter returned to the laboratory 
and summoned an assistant to ostensibly take the 
victim to the infirmary. At this point, participants 
were told of the true purposes of the study and 
were then separately interviewed for their reactions. 


RESULTS AND DISCUSSION 
Plausibility of the Manipulation 


Judged from the postexperimental inter- 
views and observed reactions during the 
emergency situation, participants treated the 
emergency as a real one. Only 4 of 156 sub- 
jects indicated that they had had doubts 
about the genuineness of the confederate’s 
difficulties. When told, during debriefing, of 
its contrived nature, subjects expressed sur- 
prise and/or relief that the diabetic was only 
an actor. More than 90% of the subjects felt 
that their participation had been a “signifi- 
cant learning experience” and stated a desire 
to cooperate with “any future research 
efforts of this kind.” 


Helping Behavior 

We find a strong relationship between 
experimental treatment and the quality of 
group reaction to the emergency. Of the 
groups with emergent-elected leaders, 84% 
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Ficure 1. Cumulative group response latencies. 


(11 out of 13) sent someone out of the room 
for help before the end of 3 minutes time, 
compared with 46% (6 out of 13) for the 
appointed-leader groups and 23% (3 out of 
13) for the groups with pseudo-elected lead- 
ers, x°(2) = 66.2, p < .001. Reaction time 
to helping shows the same pattern. Assigning 
a score of 180 sec to groups which fail to 
help, we find that emergent-elected-leader 
groups were quickest to help after the seizure 
(X = 46 sec), appointed-leader groups were 
slower in reaction time (X = 73 sec), and 
pseudo-elected-leader groups were slowest (X 
= 103 sec). For purposes of analysis, the 
reaction time for each group was transformed 
into a “speed score” (Darley & Latané, 
1968) by taking its reciprocal in seconds and 
multiplying by 100. The effect of this trans- 
formation was to deemphasize differences 
between longer time scores, thus reducing the 
contribution to the results of the arbitrary 
3-minute time limit. The mean speed score 
for the emergent-elected, appointed, and 
pseudo-elected treatments were .53, .37, and 
16, respectively, with the overall differ- 
ence F(2, 36) = 236.0, p < .001, and those 
between-pairs of treatments being highly 
significant. 
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Figure 1 further elaborates on the temporal 
course of helping by presenting a cumulative 
plot of proportion of groups which inter- 
vened at any time following the medical 
emergency. For example, it can be seen that 
60 sec after the seizure, 62% of the 
emergent-elected-leader groups, 38% of the 
appointed-leader groups, and only 15% of the 
pseudo-elected-leader groups had taken deci- 
sive action. The negatively accelerated curves 
indicate a decrease in the likelihood of help- 
ing with the passage of time. In no case did 
a group act during the third minute. 

To gain some perspective on the relative 
contribution of leader LGD and leadership 
conferral variables to helping, we divided 
appointed leader groups into those judged on 
the practice problem to have had high or low 
LGD leaders and compared their responses 
with corresponding high LGD (emergent) 
and low LGD (pseudo) elected leader groups. 
These results are shown in Table 1. A chi- 
square analysis of these data, presented in 
Table 2, indicated that the overall rela- 
tionship was a significant one (p< .01). 
However, the component analysis displayed 
therein reveals that only leader LGD is 
significantly related to helping (¢ = .49, 
p < 005), 


Leadership Continuity and Change 


In any group confronted with an emer- 
gency such as the present one, there is the 
possibility that the designated leader might be 
perceived as so ineffective that another group 
member is motivated to assume the dominant 
role of directing the rescue effort. 

In all 13 groups headed by emergent- 
elected leaders, the designated leader main- 
tained charge during the emergency situation. 
In these groups the leader himself controlled 
the proceedings and directed his followers. It 


TABLE 1 


EFFECTS or DISCUSSION Group LEADERSHIP AND 
CONFERRAL PROCESS ON HELPING BEHAVIOR 


, 


Leader type Conferral process Help No help 
A-low LGD Elected 3 
B-low LGD Appointed 2 3 
C-high LGD Elected 11 2 
D-high LGD Appointed 4 2 


Note. LGD refers to discussion group leadership ability. 


DETERMINANTS oF HELPING 


TABLE 2 
COMPONENT ANALYSIS OF HELPING BEHAVIOR 


Source | x aj 

Leader LGD 

(A+B vs. C + D) 9.26 lrag 
Conferral 

(A +C vs. B +D) 01 1 
Interaction 

(A +D vs. B +C) 2.08 | 1 

Totals 12.01 3* 


Nole. LGD refers to discussion group leadership ability. 
* Component chi-squares do not sum to total due to appli- 
cation of Yates's correction for component values. 
1, 


was he who asked the victim what was wrong 
and what specifically needed to be done; it 
was he who delegated someone to carry out 
necessary tasks (“Get the experimenter,” “Go 
get him some sugar, fast”). In emergent- 
elected-leader groups, then, it was the original 
leader who initiated structure during the cri- 
sis, while the other members remained as 
followers. 

Events were different for appointed and 
pseudo-elected-leader groups. Only six ap- 
pointed leaders maintained their authority. 
Included in this total are only one of the 
seven low LGD appointed leaders and five of 
the six high LGD appointed leaders. The low 
LGD leaders of the pseudo-elected treatment 
were overthrown in all but 4 of 13 groups. A 
chi-square analysis of overthrow occurrence 
indicated a significant overall relationship, 
x°(3) = 20.58, p< .001. The component 
analysis again revealed that only the LGD 
component (¢ = .63, p < .001) was signif- 
cantly related to helping. 

In the typical overthrow, the group mem- 
ber who assumed the helm during the emer- 
gency was the one with the highest LGD 
rating. Such coups took place after the con- 
federate had made his plea for help, and the 
leader was either hesitant to act or turned to 
others in the group for counsel. At this point, 
the high LGD member would take charge and 
then delegate someone else to get help, In 
only 5 of 39 groups, distributed evenly across 
treatments, did the leader actually leave the 
room himself to accomplish the helping. In 
all other cases he delegated this job, and he 


stayed with the victim. 
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TABLE 3 


EFFECTS or DISCUSSION GROUP LEADERSHIP AND 
CONFERRAL PROCESS ON LEADERSHIP CONTINUITY 


= 
Leader 
Leader type and 
conferral process 
Maintained | Overthrown 
A-low LGD; Elected 4 9 
B-low LGD: Appointed 1 6 
-high LGD; Elected 13 0 


D-high LGD: Appointed 5 1 


Note. LGD refers to discussion group leadership ability, 
CONCLUSIONS 


The results show that group effectiveness 
in helping a member with a medical emer- 
gency was related to the LGD rating of initi- 
ally designated group leaders (i.e., their 
assertiveness in guiding group progress and 
solution of a group-discussion task). The 
manner in which the leader came to power— 
by appointment or election—was unimpor- 
tant in comparison to the behaviors of the 
leader himself. Groups headed by low LGD 
leaders were less able to make the transition 
from discussion to crisis intervention tasks 
than those headed by high LGD leaders. 
They frequently experienced a change in 
leadership, which took up considerable time; 
while high LGD leader groups had the bene- 
fits of continuity of leadership. 

An alternative argument that “the times 
make the man” suggests that an emergency 
might bring out the best in a low LGD 
leader, causing him to rise to the occasion. 
No support for this position was obtained. 
Even the clear role demands (e.g., confeder- 
ate’s specification of exact helping require- 
ments) of the emergency could not muster 


TABLE 4 
COMPONENT ANALYSIS OF LEADERSHIP CONTINUITY 


Source x? df 

Leader LGD 

(A +B vs. C + D) 15.47 ‘hd 
Conferral 

(A +C vs. B + D) .65 1 
Interaction 

(A +D vs. B +C) 1.23 1 

Total® 20.58 3 


Note. LGD refers to discussion group leadership ability. 

a Component chi-squares do not sum to total due to appli- 
cation of Yates’s correction for component values, 

*p <.001. 
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low LGD individuals; hence, the frequent 
necessity for coups in groups headed by such 
reticent individuals. 

The data indicate, then, that while de- 
signed to identify leadership only in discus- 
sion settings, LGD scores also proved valid 
as predictors of action in crisis. The medical 
emergency modeled by the present experiment 
revealed a smooth transition from discussion 
to crisis management in groups headed by 
high LGD leaders, while groups headed by 
low LGD leaders foundered badly and could 
have cost a real victim dearly. 
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Based on a social learning formulation of marital interaction, this study 
investigated the positive and negative social reinforcement behaviors exchanged 
within distressed marital, nondistressed marital, and stranger dyads. Behav- 
ioral data. gathered from home and laboratory interactions were used to 
study distressed and nondistressed marital interaction, In the laboratory, a 
„repeated measurements design facilitated comparisons of marital and stranger 
interactions. Results from the natural environment indicated that distressed 
couples, relative to nondistressed, emitted fewer pleases and more displeases, 
engaged in more conflicts and fewer recreational activities together. Labora- 
tory data from a behavioral coding system also indicated that distressed 
couples engaged in fewer positives and more negatives during casual conver- 
sation and problem solving than did nondistressed couples. Both distressed 
and nondistressed married dyads were more negative and less positive than 
stranger dyads. Theoretical and practical implications of the findings are 


discussed, 


In spite of the ubiquity of marital re- 
lationships there have been relatively few 


empirical studies of marital interaction (cf.° 
Olson, 1970). Those behavioral approaches to - 


marital interaction which have recently been 


reported focus primarily on intervention’ 


strategies (e.g, Liberman, 1970; Patterson 
& Hops, 1972; Stuart, 1969; Thomas, 
Carter, & Gambrill, 1969; Weiss, Hops, & 


Patterson, 1973). The nonbehavioral litera-- 


ture deals mainly with studies of attitudinal 
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relationships or more objectively with novel 
analogues of dyadic communication (e.g., 
Ryder, 1968). While somewhat less of a prob- 
lem for the behaviorally oriented studies, the 
interface between laboratory observations of 
marital interaction and extralaboratory inter- 
actions remains virtually untouched by re- 
searchers. The present study investigated the 
exchange of facilitating and aversive behav- 
iors between spousal and stranger dyads 
within a social learning formulation and at- 
tempted to bridge the gap between exchange 
interactions observed in a laboratory and 
those taking place outside of the laboratory 
setting. A second aim was to provide data on 
the trait-versus-state distinction applied to 
marital ‘interaction: The question is whether 
marital “distress” is limited to the specific 
relationship or whether it is confined to the 
person who carries it from situation to 
situation. 

The social learning formulation applied to 
marital interactions embodies both reinforce- 
ment and exchange theory concepts. It is 
assumed that the reinforcing value of or the 
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range of resources controlled by the spouses 
in their day-to-day interactions must be re- 
lated to reported marital satisfaction. A 
recent empirical demonstration of this has 
been offered by Wills, Weiss, and Patterson 
(Note 5) who showed that daily exchanges 
of pleasing and displeasing behaviors as well 
as quality of extradyadic events combined to 
predict relationship satisfaction (the multiple 
R=.508, p < .001). Distress in a marital 
relationship may be attributed to an imbal- 
ance in the fair exchange of social reinforcers 
(Patterson & Reid, 1970), to a breakdown in 
mutual reinforcement (Stuart, 1969), and to 
the development of coercive interaction pat- 
terns, such that the behavior of one spouse 
is controlled by aversive stimuli dispensed by 
the other spouse (Patterson & Hops, 1972). 

The “disturbance” in marital relationships 
arises, therefore, from the methods used by 
the spouses to bring about change in one 
another. Whereas use of positive control 
(i.e., positively reinforcing desired behaviors) 
would be the method of choice, partners in 
ongoing intimate relationships seem to rely 
more heavily on the use of aversive control 
(ie, negative reinforcement and punish- 
ment). Without tracing the systematic de- 
velopment of conflict and reliance on coercion 
(Patterson & Hops, 1972), one can see that 
in established marital interactions conflict can 
be viewed as a demand for immediate change 
in the behavior of the other. If the demanding 
is unpleasant, or more intensely aversive, any 
compliance response on the part of the other 
serves to reinforce the use of demanding by 
the initiator and, through negative reinforce- 
ment, strengthens the compliance response of 
the other. Yet, if the compliance response is 
some half-hearted delaying response (“I'll 
fix it later”), it follows that what is being 
reinforced in this paradigm is a kind of avoid- 
ance or withdrawal. It is also highly likely 
that the use of aversiveness as a behavior 
change technique will be taken over by both 
partners, so that we can assume an escala- 
tion of reciprocal aversiveness without con- 
comitant problem solving, that is, the issue 
which began the “discussion” is never 
resolved. Coercion and aversive control may 
temporarily stop the offending behavior of the 
spouse but it is also assured that the pattern 
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will repeat in the future. Increasingly the 
small day-to-day annoyances become the 
“significant trivia” of marital interaction and 
are the focus of these preprimed escalations. 

The social learning analysis summarized 
above underlies the following approach for 
the study of marital interaction: Distressed 
and nondistressed couples were seen as pro- 
viding different interaction patterns concerned 
with exchange of pleasantness and aversive- 
ness both in the laboratory and in the 
home environment. Moreoyer, these differ- 
ences would be reflected in the exchange of 
positive and aversive stimuli in ongoing inter- 
actions. Further, it was assumed that avoid- 
ance patterns resulting from reliance on coer- 
cion and ayersive control strategies would be 
associated with a reduction of shared recrea- 
tional time. Finally, because it was assumed 
the conflict generating behaviors were learned 
in a specific dyadic interaction, there would 
be less likelihood that they would be observed 
when members of a distressed relationship 
interacted with strangers. 

More specifically the study proposed to 
test the following propositions: Distressed 
spouses, relative to nondistressed spouses, 
attempt to change partner behaviors largely 
through negative or aversive instrumental be- 
haviors to the relative exclusion of positive 
reinforcing behaviors. Significantly less posi- 
tive and significantly more negative social 
reinforcement would characterize the inter- 
action patterns of distressed spousal dyads, 
and these differences would be found across 
two types of interaction settings. 

The reported daily interactions of distressed 
and nondistressed couples in their natural 
home environments can be differentiated in 
terms of positive and negative behavior ex- 
changes. Spouses’ assessment of pleasing and 
displeasing events would differentiate the dis- 
tressed and nondistressed groups such that 
the former would report significantly fewer 
pleasing events and significantly greater dis- 
pleasing events. The reported frequency of 
daily conflicts would likewise differentiate the 
two groups. 

On the assumption that withdrawal results 
from continued reliance on aversive control, 
it was predicted that distressed, relative to 
nondistressed, marital dyads would report 


pare 


significantly fewer recreational activities en- 
gaged in together and more engaged in either 
alone or with others. 

Finally, on the assumption that marital 
partners train one another over time in the 
use of aversive control, and that reciprocity 
and coercion operates dyadically, it was hy- 
pothesized that relative to stranger dyads, 
marital dyads would be significantly less 
positiye and more negative to one another. 


a METHOD 
Design 


There were three phases to the design: subject 
selection constituting distressed-nondistressed dyads, 
laboratory interaction conditions, and data collection 
fronr the naturalistic environments of the subjects. 
(a) Subject selection was accomplished by means of 
self-report and interview observation measures. The 
designations distressed and nondistressed were made 
from these data. (b) Laboratory interaction’s main 
variables were status (distress-nondistress), intimacy 
(spousal-stranger dyad), and setting (free conversa- 
"tion-problem solving). These were combined in the 

. following manner: In each of three interactions 
touples interacted as a married dyad, a stranger 
dyad with an opposite sex distressed partner, and a 
stranger dyad with an opposite-sex nondistressed 
partner. Thus, each subject contributed data from 
three interactions in each of two settings (conversa- 
tion vs. problem solving). (c) In the naturalistic 
spouse observation condition, daily observations of 
spouse behavior provided measures of exchanged 
please-displease data, as well as reported conflicts. 


Subjects 


Twenty-four paid volunteer couples (N = 48) were 
solicited from the community by newspaper adver- 
tisements stressing either “couples currently ex- 
periencing marital distress” or “happily married 
couples.” Subjects were paid $17.00 for participation 
“in a marriage research project” at the University 
of Oregon. While every attempt was made to dis- 
sociate the research project from treatment, couples 
were told that if requested, treatment would be 
made available to them after the study was com- 
pleted. (All couples in the distressed group exercised 
the treatment option either immediately after the 
study or within 12 months following its completion, 
whereas none of the nondistressed couples have done 
so to this date.) The resulting two groups of non- 
distressed and distressed couples, respectively, were 
relatively similar in average couple age (25.5 and 
26.0 years), length of marriage (4.2 and 3.6 years) 
and educational level (16-1 and 15.0 years). 
Number of children was somewhat different, .42 and 
1.5 for nondistressed and distressed, respectively. 

Selection procedures. Three sources of data were 
used in making group assignments. Locke and 
Wallace (1959) developed a now commonly used 15- 
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item marital adjustment test, which successfully dis- 
criminated maladjusted from well-adjusted couples. 
In the present study, an average couple score less 
than 100 or greater than 105 as one criterion for 
inclusion of couples into distress or nondistressed 
groups, respectively. Second, partners responded to 
68 potential marital conflict items, indicating the 
desire for significant behavior change in self or 
spouse. A total of 15 or more items indicating 
spousal disagreement or agreement, but extreme 
desire for one or the other to change a particular 
behavior, was required for inclusion in the distressed 
group (cf. Weiss et al., 1973, for more description). 
The third source of data was a rating of interview 
behavior. A standardized one-half hour conjoint 
interview was conducted which covered actuarial 
data (schooling, length of marriage, etc.) and each 
couple’s statement of difficulties. The interaction 
rate, dominance, and supportive and critical behav- 
iors were noted by the interviewer, who made a 
rating on an 8-point scale expressing his/her confi- 
dence judgment of distress. Those couples rated 4 
or less (“possible couple is distressed” to “certain 
couple is distressed”) were scored distressed. 

Final couple selection was to have been based 
upon agreement of the three measures, but in 5 of 
the 24 dyads agreement on two of the three mea- 
sures was employed because of difficulty enlisting a 
sufficient sample of subjects. 


Dependent Measures 


Marital Activities Inventory. This inventory was 
designed to assess the kind and frequencies of some 
85 typical recreational and pastime activities engaged 
in alone, with spouse, and with others during the 
previous 30 days (e.g., “Watched T.V.” “Played 
cards”). 

Pleasures, displeasures, and conflicts. Couples were 
carefully instructed in the independent collection of 
pleasures (Ps) and displeasures (Ds) in their natural 
environment. These were defined as daily records of 
observed pleasing and displeasing behaviors engaged 
in by one’s spouse (cf. Weiss et al., 1973; Wills et 
al, Note 5). (Eg. “spouse took out garbage,” 
“spouse was critical of my appearance,” etc.) Couples 
also kept a daily diary of conflicts or arguments 
occurring between them, 

Marital Interaction Coding System. The dependent 
measures for the laboratory phase of the study were 
derived from a behavioral coding system, the Marital 
Interaction Coding System (MICS), which included 
28 individual code categories defining such behav- 
iors as problem solving and positive and negative 
social reinforcement. The behavioral codes derived 
from the MICS are presented in Table 1. The on- 
going interaction was coded in parallel 30-second 
lines. Videotapes of the dyadic interactions were 
coded by two trained observers who met a minimum 
standard of 70% point-by-point observer agreement. 


Laboratory Dyadic Interaction 


Couples interacted in each of two laboratory ob- 
servation settings, videotapes of which were later 
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TABLE 1 


Major SuMMARY CATEGORIES OF THE MARITAL 
INTERACTION CODING SYSTEM 


Measure Code 


Positive social reinforcement (SR +) 


Verbal 
Nonverbal 


Agreement, approval, humor 


Assent, laugh, positive physical contact, 
smile 


Negative social reinforcement (SR —) 


Complain, criticize, deny responsibility, 
excuse, put down, interrupt, disagree 


No response (ignore), not tracking (inat- 
tention), turn off 


Verbal 


Nonverbal 


Problem solving 


Problem solution, accept responsibility, 
compromise 


Verbal 


Note. The following unclassified codes were not used in the 
present anal: i ficient frequency or insignificance: 
attention, command, compliance, noncompliance, normative, 
negative solution, question, and talk, 


scored using the MICS. During the first 4 minutes 
of “free conversation,” couples were instructed to 
talk about anything, “while we're setting up the 
equipment.” During the second setting, a 10-minute 
period of problem solving, dyads were asked to 
resolve their differences of opinion regarding typical 
marital conflicts that they had read about earlier 
(described below). 


Problem-Solving Task 


A modification of Olson and Ryder’s (1970) In- 
ventory of Marital Conflicts (IMC) was used. Sub- 
jects read 18 vignettes about typical marital conflicts, 
written so that the husband received information 
slanted toward making one spouse, usually the wife, 
look responsible for the conflict, while the opposite 
slant was presented to the wife. During the observed 
problem-solving interaction, dyads discussed only 
those situations about which there was disagreement 
as to the person primarily responsible for the alleged 
marital conflict. They were instructed to come to 
some resolution of the problem discussed. (See Olson 
& Ryder, 1970, Birchler, 1972, or Vincent, 1972, for 
a more explicit account of the advantages and use 
of the IMC.) 


Procedure 


Phone contact. Respondents to the newspaper ad- 
vertisement were first contacted by phone and told 
that the project was to be a study of interaction 
among married people, requiring 4-5 hours for which 
up to $17 would be paid. Couples could have an 
opportunity for marital counseling after the study 
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was finished. Demographic information was obtained 
and subjects were instructed to fill out the screening 
tests sent to them by mail. 

Screening interview. Respondents who met the 
screening criteria on the paper-and-pencil measures 
were recontacted and scheduled in groups of 10 
couples for a brief interview. At this time the per- 
sonal interview was held, everyone filled out the 
Marital Activities Inventory, and couples were given 
instructions in how to collect daily interaction data 
(Ps and Ds). 

Administration of the problem-solving task. Couples 
were scheduled in groups of two distressed and two 
nondistressed couples for the laboratory phase of the 
study. The groups met with one experimenter and 
were administered the IMC. Each subject was pro- 
vided with an answer sheet and instructed to read 
about the 18 typical marital conflicts and note for 
each case whether the husband or wife was primarily 
responsible for the conflict. 

Following administration and scoring of the IMC, 
distressed and nondistressed married couples and ad 
hoc stranger (distressed, nondistressed, and mixed) 
dyads interacted in a prearranged sequence, counter- 
balanced for order effects. Both the conversation and 
problem-solving periods were videotaped, but for the 
conversational setting subjects were told that, “Whjle 
the videotape equipment is being set up, you may 
talk for a couple of minutes about anything you 
like, except the vignettes.” After 4 minutes, the 
experimenter returned and instructed the couple to 
proceed with their problem-solving discussion, at- 
tempting to resolve a sample of the problems about 
which they had disagreed. The experimenter then 
gave the subjects the numbers of five disagreed upon 
items, which corresponded to a joint answer sheet 
the couple was to complete. It was emphasized that 
they dis the problems thoroughly, complete each 
item before going on to the next, and in the course 
of their discussion feel free to stand up for their 
own opinions, 


RESULTS 


Multiple £ tests, supplemented by correla- 
tional analysis, were used to test the major 
a priori hypotheses. ¢ tests at the p <.05 
level were considered significant, although in 
view of the number of hypotheses, any par- 
ticular t test meeting only the p < .05 level of 
significance must be interpreted with caution 

For the laboratory phase of the study, 
videotapes of all interactions were coded by 
two trained observers, blind to conditions 
using the MICS, Average point-by-point ob- 
server agreement for the 144 separate inter- 
actions was .75. Data were expressed in terms 
pn per minute (RPM). 

e several individual code categori 
the MICS were combined a priori, a 
comprehensive measures of positive social re 
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inforcement (SR+) and negative social rein- 
forcement (SR—), with verbal and nonverbal 
behaviors included within each (cf. Table 1). 


Dyadic Interaction in the Laboratory 
(MICS) $ 


Marital distress and the exchange oj social 
reinforcement. Table 2 presents for spousal 
dyads, mean couple RPM figures (MICS 
data) for positive and negative emitted social 
reinforcement behaviors in each of the inter- 
action setting. Hypotheses were confirmed in 
three of the four comparisons. One-tailed £ 
tests indicated that when coded negative be- 
haviors were observed in spousal interactions, 
distressed were discriminated from nondis- 
tressed couples both in casual conversation 
and in problem-solving settings (p < .05). 
RPM means for nondistressed and distressed 
spouses during problem solving were .74 and 
1.10, respectively. During casual conversation 
the nondistressed and distressed means were 
.73 and 1.07, respectively. For positive behav- 
iors, the discrimination appeared strongest 
in the problem-solving situation (p < .025). 
Means for nondistressed and distressed cou- 
ples were 1.93 and 1.49, respectively. The 
t test for free conversation did not reach the 
p<.05 level of significance. Means were 
2.77 and 2.10 for nondistressed and dis- 
tressed couples, respectively. 

Distressed marital dyads emitted negatives 
at a mean rate almost 14 times that for non- 
distressed dyads. In addition, relative to non- 
distressed marriages, distressed couples ex- 
changed significantly less positive behavior in 
the problem-solving setting; there was a 
similar, but nonsignificant difference for the 
conversational setting. 

Social reinforcement, marital distress, and 
level of intimacy. Table 3 presents a compari- 
son of the emitted behavior of individuals in 
spousal and averaged stranger interactions. 
The table contains the mean individual rates 
per minute of positive and negative social 
reinforcement. For example, the first column 
(spousal) reflects individuals’ emitted behav- 
iors toward their spouse, followed by their 
averaged emitted behavior to the two stran- 
gers (averaged strangers). (E.g., Person A’s 
smiles toward Stranger B and Stranger C 
were averaged and entered as a score.) In 
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TABLE 2 


INDIVIDUALS’ MEAN RATE PER MINUTE AND SIGMAS 
FOR POSITIVE AND NEGATIVE SOCIAL REINFORCEMENT 

IN DISTRESSED AND NONDISTRESSED SPOUSAL 
Dyaps (MICS Dara) 


Positive social Negative social 
reinforcement reinforcement 
Dyad status 
Conver- | Problem | Conver- | Problem 
sation solving sation solving 
Nondistressed 
1 2.77 1.93 13 .14 
= 1.88 60 48 .60 
Distressed 
M 2.10 1.49 1.07 1.10 
z 1.02 J -68 81 
t 1.467 | 2.164** | 1.982* | 1.724* 
*p <.05, one-tailed. 


+*+ p < 1025, one-tailed. 


all cases, one-tailed ż tests for the differences 
between the means of emitted behavior 
toward spouse and strangers were significant. 
That is, married individuals emitted about 
twice as much negative and about one-third 
less positive reinforcement to spouses than 
they did when interacting with strangers. 

Considering separately the behavior of non- 
distressed and distressed individuals it can 
be seen that nondistressed individuals emitted 
about 30% less positive and 80% more nega- 
tive behavior to spouses than strangers. Dis- 
tressed individuals emitted about 40% less 
SR+ and more than twice as much SR— to 
spouses than strangers. 

In sum, married individuals observed inter- 
acting in a laboratory setting emitted more 
negative and less positive social reinforcement 
toward spouses than they did toward stran- 
gers. This pattern was consistent for indi- 
viduals from either distressed or nondistressed 
marriages. 

The major hypotheses concerning the data 
in Tables 2 and 3 were combined for group 
comparisons and are illustrated graphically in 
Figures 1 and 2. The predicted ordering of 
means (emitted SR+ and SR—) for the 
three major kinds of dyadic interactions 
were: distressed married dyads, nondistressed 
married dyads, and average stranger dyads, 
in increasing order of means for SR+, and 
decreasing order for SR— (the effect was 
expected to be noted especially during prob- 
lem solving). Figure 1 depicts SR+ means 
(RPM) for each group in each of the two 
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TABLE 3 


MEAN RATE PER MINUTE AND SIGMAS FOR POSITIVE AND Necative SOCIAL REINFORCEMENT FOR 
INDIVIDUALS IN SPOUSAL AND STRANGER DYADS (COMBINED SETTINGS) 


Positive social reinforcement Negative social reinforcement 
Individual status he 
Spousal | Averaged : Spousal | Averated ! 
Nondistressed A 
M 2.35 3.34 3.20* 74 Al 3.60 
z 1.09 1.54 42 33 
Distressed 
M 1.80 3.03 4.69** 1.09 51 5:027 
z -78 1.18 46 36 
Combined 
T 2.07 3.18 5.48** 91 46 6,06** 
z 97 1,36 47 35 


* p < .005, one-tailed, 
** p < .001, one-tailed. 


settings. For problem solving all three group 
means were significantly different in the pre- 
dicted direction. (For respective compari- 
sons of distressed vs. nondistressed dyads, 
t = 2.164, p< .025; for nondistressed vs. 
averaged stranger, ¢= 3.741, p< .001.) 
During casual conversation there was a sig- 
nificant difference between nondistressed mar- 
ried dyads and the stranger dyads but, as 
noted earlier, the difference between dis- 
tressed and nondistressed married dyads was 
not significant. (The respective ¢s were: dis- 
tressed vs. nondistressed, ¢ = 1.467, ns; for 
nondistressed vs. averaged stranger dyads, 
t = 2.407, p < .025.) 

Figure 2 depicts the same dyadic (RPM) 
means for the SR— measure. Once again 
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Ficure 1. Mean rate per minute emitted positive 
social reinforcement (SR+) for distressed married, 
nondistressed married, and averaged stranger dyads, 


during problem solving, the expected order 
was confirmed. There were significant differ- 
ences between distressed married dyads and 
nondistressed married dyads and also between 
the latter and averaged stranger dyads. Re- 
spective fs were: distressed versus nondis- 
tressed, £ = 1.724, p < .05; for nondistressed 
versus average stranger dyads, ¢ = 2.885, 
p < .005. For the negative behavior category 
in casual conversation, there was a signifi- 
cant difference between distressed and non- 
distressed marital dyads (as noted earlier), 
but the difference between the nondistressed 
and stranger dyads did not reach significance, 
(Respective ts were: distressed vs. nondis- 
tressed, £ = 1.982, p < .05; for nondistressed 
vs. averaged stranger dyads, t = 1.695, ns.) 

In sum, there were significantly different 
mean RPM positive and negative social rein- 


1.25 
—* Problem solving 


O-==0 Conversation 


1.00 
= 
fa 
= 75 
1 
f3 
G 
50 
25 


DISTRESSED 


NON-—DISTRE: 
DYADS S 


STR, 
DYADS ONCER 


DYADS 
Ficure 2, Mean rate per minute emi 
‘ E emiti i 
social reinforcement (SR—) for ea Aa 
nondistressed married, and averaged stranger dyads. 


355 


Murtmetuop ANnatysts or SOCIAL REINFORCEMENT EXCHANGE 


TABLE 4 


CONSISTENCY or Inpivipuat’s EMITTED POSITIVE BEHAVIORS IN SpoUsE-AVERAGED STRANGER 
AND STRANGER-STRANGER INTERACTIONS 


Spouse-average stranger Stranger-stranger 


Source dyad Conversation Problem solving Conversation Problem solving 
H w H | w H | w H w 
$ | 
Nondistressed 44 aS 35 Ad .60** Le BYE sii 27 
Distressed 34 —.03 OSS 09 Aes 37 153r st 


Note. H (W) paeas podna whose emitted behavior is considered, husbands or wives. 


*p <.05, one-tailed. 

** p < .025, one-tailed. 
2 p <01, one-taile 
#4 p < .005, one-tailed. 


forcement scores observed among the three 
dyadic combinations. When engaged in prob- 
lem solving, stranger dyads interacted most 
positively and least negatively; nondistressed 
married couples were intermediate in these 
respects, and as expected, distressed married 
couples exchanged the most negative and least 
positive amounts of behavior. 

Consistencies in emitted postive and aver- 
sive behaviors. To what extent are the forms 
of emitted positive and negative behaviors 
traitlike, in that partners essentially emit these 
behaviors at relatively comparable levels to 
strangers’ as well as to their spouses? Tables 
4 and 5 present the results of the correla- 
tional analyses pertaining to this question, for 

- positive and negative emitted behaviors, re- 
spectively. Each table reflects the consistency 
between behaviors emitted to spouse and 
averaged stranger and the consistency in be- 
haviors emitted to two strangers. Correlations 
are presented separately for the distressed, 
nondistressed factor. 

From Table 4, it can be seen that indeed 
there were consistencies in positive behaviors 


emitted to strangers: Six of the eight rs were 
significantly different from chance. However, 
spouse to stranger consistencies were not gen- 
erally found (seven of eight rs were not sig- 
nificantly different from chance). Further, the 
distress versus nondistress factor did not seem 
important in these analyses. 

Table 5 presents the same type of data for 
the negative MICS categories. All spouse to 
average-stranger correlations were not signifi- 
cantly different from chance. The stranger- 
stranger correlations indicate that only for 
individuals from distressed marriages was 
there evidence of traitlike negative behavioral 
output. This would suggest that individuals 
from distressed relationships may be more 
likely to respond similarly from person to 
person with regard to output of negative 
behaviors. 


Marital Interaction in the Natural 
Environment 

Pleases, displeases, and conflicts. It was 
predicted that distressed marital relationships, 
relative to nondistressed, would be character- 


TABLE 5 
CONSISTENCY OF ĪNDIVIDUAL’S EMITTED NEGATIVE BEHAVIORS IN SPOUSE-AVERAGED STRANGER 


AND STRANGER-STRANGER INTERACTIONS 


Spouse-average stranger 


Stranger-stranger 


Source dyad Conversation Problem solving Conversation Problem solving 
H w H w H woos wo 
sae | a a a -a & e a 
istresset ~ eee s 
husbands or wives, F a: 


Note. H (W) indicates individuals whose emitted behavior is considered, 


- *p <.025, one-tailed. 
** p < .005, one-tailed. 
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TABLE 6 


Five-Day MEAN 
AND CONFLIC 
AND D. 


Dyad 


Sake Pleases | Displeases | Conflicts a 
displeases 
Nondis- 
ressed 259.67 13.63 1.00 29.66 
Se 131.67 41.38 3.42 4,30 
2.871*| 4.780* 4.404" 


3.841" 


ized by a significantly lower exchange of 
pleases and a significantly greater exchange 
of displeases. Likewise, the number of con- 
flicts reported by the two groups was pre- 
dicted to favor the distressed group. Table 6 
presents the average total frequency per cou- 
ple of these reported events (summed over 
a 5-day period). The ratio of Ps/Ds con- 
trolled for the differences in time spent to- 
gether by the couples." It can be seen that the 
predicted differences were al significant. 

Marital Activities Inventory. The kinds 
and frequencies of spouse-accompanied recre- 
ational activities were reflected in the Marital 
Activities Inventory: Together, Alone, and 
with Others. It was assumed that distressed 
marriages, relative to nondistressed, are char- 
acterized by aversive interaction and eventual 
withdrawal from such aversiveness. It was 
expected that distressed partners would en- 
gage in fewer spouse-accom) panied activities 
and more other-accompanied activities than 
nondistressed partners. The numbers in Table 
7 represent the total frequency of each be- 
havior engaged in, summed across kinds of 
activities. The analyses presented indicate 
that, as predicted, individuals from distressed 
marriages engaged in fewer activities with 
their spouses (¢ = 3.848, $ < 001), and in 
more activities with others than those from 
nondistressed marriages (t = 2.147, p< 
.025). The proportion of all activities engaged 

1 Ratio “scores based on small denominators make 
it conceivable that a very small percentage of sub- 
jects could account for most of the differences be- 
tween the distressed and nondistressed dyads. How- 
ever, inspection of the data indicated that approxi- 
mately equal numbers of subjects contributed scores 
above and below the means (for both pleases and 
displeases) within the distressed and nondistressed 
groups, respectively. 
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in with spouse was computed to control for 
differences in activity levels. Nondistressed 
subjects engaged in a significantly greater 
proportion of activities with spouse than dis- 
tressed subjects (£ = 3.304, p < .005). 


Discussion 


Several aspects of a social learning formu- 
lation of marital distress were explored in 
this study. Marital distress was viewed as a 
reliance on coercion and aversive control as 
characteristics of the dyadic interaction. 
While not claiming to be a causal demonstra- 
tion of the genesis of marital distress, the 
study did provide relatively objective data 
about marital interaction in two observational 
settings. 

In the laboratory situation, distressed 
spousal couples were significantly more nega- 
tive and less positive in their behavioral in- 
teractions than were nondistressed spousal 
couples. Further, in their own home environ- 
ment, distressed couples reported significantly 
lower please to displease ratios than did non- 
distressed couples; indicating that over a 
5-day observation period the relative ex- 
change of displeases was greater for the dis- 
tressed group. Because of the aversiveness of 
one’s partner in an essentially negative-going 
relationship, it was predicted, and found, that 


distressed partners would avoid one another ` 


by engaging in a greater frequency of non- 
spouse events and sharing significantly fewer 
recreational events with spouse. In sum, both 
the laboratory findings and the subjects’ re- 
ports of their behavior in a natural environ- 
ment were in agreement. Not only did part- 
ners in distressed and nondistressed marriages 
report affective differences (on self-report and 


TABLE 7 


MARITAL ACTIVITIES INVENTORY: 
Rerortep Mean 30-Day FREQUENCY or ACTIVITIES 
Done TOGETHER, ALONE, AND WITH OTHERS 
NONDISTRESSED VERSUS DISTRESSED 


Dyad — | Frequency | Frequency | Frequency | Mean pro- 
aad [Resor [Erguer |S | uation of 
with spouse 
Nondis- 
tressed | 541.29 270 
Distressed | 344.25 30398 203.30 us 
3.817 | 661 | —2086* | 35301 


*> < 025, one-tailed. 
> < 005, one-tailed. 
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j spouse-observation measures), they also be- 

a haved differently toward their spouses. 
These results hold particular promise for 
he newly developed MICS scoring system 
(Hops, Wills, Patterson, & Weiss, Note 2) 
which allows observers to record fairly specific 
_ehaviors occurring between partners in an 
going discussion. Unlike other more con- 
nt-orlented coding systems (e.g., Olson & 
yder, 1970), the relationship between what 
observed in behavior and the code content 
straightforward: and relatively noninferen- 
al (e.g., Bales, 1951), The kinds of negative 
ehaviors therefore are largely restricted to 
hat is possible in a two-person discussion 
ven the propriety of what one can say and 
) in a nonprivate ‘setting. Inspection of the 
gative social, reinforcement classification 
Table 1) indicates that the negative behav- 
ors reflect a kind’ of personal disdain for the 
other; ignoring or nonfacilitating kinds of 
utterances and gestures. Problems of reliabil- 
z and measurement are minimized since 
he codes do not deal with more abstract re- 
tionship dynamics, such as power or intent 
f.. Guttman, Spector, Sigal, & Epstein, 
1971). However, unlike more basic coding 
systems (cf. Winter, Ferreira, & Bowers, 
1973), there is enough substance to the MICS 
system to provide for the codification of sev- 
11 positive, negative, and verbal and non- 

3 rbal behaviors. 

: Given the emphasis on small behavioral 
| ` indications of support or disdain, it is par- 
icularly interesting to note that the dis- 
bo essed, nondistressed spousal dyad differences 
_-vere greatest for exchange of positives during 
problem solving. Particularly in the context of 
resolving differéhces, ‘the exchange of positive 
social- reinforcement was significantly greater 
‘for. nondistressed compared to distressed 
spousal dyads. Data from the study by Vin- 
cent (Vincent, Weiss, & Birchler, Note 4) 
indicated that the problem-solving categories 
3 of the MICS also discriminated between the 
distressed and nondistressed groups, and as 
expected the distressed spousal dyads dis- 
played significantly fewer problem-solving 
behaviors. In preliminary analyses, using the 
MICS to differentiate distressed from non- 
distressed spousal dyads, the magnitude of the 
difference was greater for social reinforce- 
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ment, relative to problem-solving behaviors. 
In particular, nonverbal behaviors, such as 
smiles, head nods, frowns, and eye contact 
appeared to be the most discriminatory. In an 
analogue study by Kahn (1970) it was found 
that couples scoring in the maladjusted range 
of the Locke-Wallace Marital Adjustment 
Scale (Locke & Wallace, 1959) engaged in 
and reported less accuracy in nonverbal com- 
munication than happily married couples. 
Further research is needed regarding non- 
verbal behavior since it may be less suscepti- 
ble to the demand characteristics of the lab- 
oratory environment and perhaps more sensi- 
tive to the vicissitudes of marital distress. 

A second major issue in the present study 
concerns the communication within husband— 
wife dyads versus married-stranger dyads. As 
Ryder pointed out (Ryder, 1968), common 
sense would lead one to predict increased 
facilitative and supportive behaviors to be 
exchanged between intimates (married dy- 
ads) relative to strangers. Stuart (1969) dis- 
cussed the “reinforcement-attraction” hy- 
pothesis whereby A’s attraction to B is pre- 
sumably determined in part by B’s ability to 
mediate certain rewards or reinforcements to 
A. Winter et al. (1973) hypothesized that 
married couples, more used to the quid pro 
quo relationship, would be able to reach more 
satisfying decisions than stranger couples. 
Such reasoning suggests that married couples 
(intimates) should be able to get along better 
in communicating and making decisions than 
ad hoc dyads (strangers). 

In contrast to this thinking, the social 
learning formulation of marital interaction 
discussed earlier suggests that sooner or later 
processes of aversiveness, coercion, and with- 
drawal may predispose most such relation- 
ships to a progressive breakdown of mutual 
reinforcement, effective problem solving, and 
marital satisfaction. In the present study such 
a breakdown was apparent for the distressed, 
relative to mnondistressed, marital dyads. 
However, what differences would one expect 
between stranger dyads and happily married 
couples? Ryder (1968) reported the use of a 
disagreement eliciting analogue task (Color 
Matching Test) in studying aspects of verbal 
and nonverbal behaviors exchanged within 
120 recently married and stranger dyads. In 
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general, the results indicated that subjects 
who interacted with strangers treated them in 
a more gentle, facilitative fashion than those 
who interacted with spouses. Winter et al. 
(1973) also predicted, and found, that 
stranger dyads expressed more politeness and 
engaged in fewer nonfacilitating interruptions 
than apparently normal married dyads (indi- 
cants of marital satisfaction were not re- 
ported). Subjects were college students be- 
tween the ages of 19 and 29. These findings 
raise the question of whether, even in rela- 
tively new relationships, partners are nicer 
(have a more positive interaction) with 
strangers than intimates. Stuart and Braver 
(Note 3), although using an entirely different 
methodology, attempted to replicate Ryder’s 
general findings. Their results, however, indi- 
cated that spouses tended to react more 
favorably toward each other than they did to 
the same actions by other opposite-sex adults. 
Stuart and Braver noted that the men in 
their study were progressively less negative as 
a function of age or duration of marriage; 
perhaps their subjects were older than those 
in other studies and hence less negative. They 
also suggested that subjects’ level of marital 
Satisfaction might account for the differences. 
Tf the subjects in the Ryder and Winter stud- 
ies were more unhappy in their marriages, 
then marital interaction would undoubtedly 
reflect fewer positive communications than 
stranger interaction. 

Data from the present study tend to dis- 
count both of the possibilities above. Stranger 
dyads engaged in significantly more positive 
and less negative social reinforcement than 
married dyads, regardless of levels of marital 
satisfaction (unhappily or happily married). 
In addition, subjects in this study were of 
comparable ages and length of marriage to 
those in the Stuart and Braver study (sub- 
jects’ average age was about 26 years vs. 27.5 
years; average length of marriage was 4 years 
vs. 6 years, respectively). Even though such 
information was not presented in the Ryder 
and Winter et al. reports, present results tend 
to support their general findings. Stuart and 
Braver go on to discuss carefully other fac- 
tors which may have accounted for their re- 
sults (cf. Stuart & Braver, Note 3). It is 
possible that the observation of bidirectional 
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communication is required in order for char- 
acteristic patterns of interpersonal and mari- 
tal interaction to emerge. The design used by 
Stuart and Braver permitted only unidirec- 
tional communication, no dyadic interchange. 
To the extent that certain observed behaviors 
become shaped and developed as a product of 
dyadic interaction, one might need to replicate 
this process in order to study such behaviors 
as they are elicited and exchanged by the 
partners, 

In sum, allowing that familiarity with an- 
other person may entail less of a demand to 
emit positive instrumental behaviors (i.e., in- 
gratiating or acquaintanceship behaviors), we 
are left with the problem of how to account 
for the increased nonfacilitative behaviors 
exchanged between spouses relative to strang- 
ers (Ryder, 1968; Winter et al., 1973), par- 
ticularly in the case of happily married cou- 
ples (present study). The output of negatives 
seems to be specific to one’s partner in the 
marital dyad. The trends represented in Fig- 
ures 1 and 2, although not providing any 
definitive answers, at least raise the question 
of whether there are progressive deteriorative 
processes associated with long-term, intimate 
two-person relationships. Such processes, such 
as coercion and reciprocity in dyadic inter- 
changes are just beginning to be investigated 
and understood (Patterson & Hops, 1972; 
Patterson & Reid, 1970). 

An additional issue of concern in this study 
was the trait-state distinction regarding the 
observed instrumental behaviors. Previous 
work (Weiss, 1969; Birchler & Weiss, Note 
1) indicated that instrumental affiliative re- 
sponses (head nods, smiles, agreement utter- 
ances, etc.) were highly consistent when an 
individual interacted dyadically with strang- 
ers. (Rs for head nods, smiles, and verbal re- 
inforcement were .62, .59, and .60, respec- 
tively, indicating that one’s level of output 
was similar from person to person in these 
response modalities, cf. Birchler & Weiss, Note 
1.) A similar finding was noted in Table 4 for 
this study; six of the eight stranger—stranger 
rs for positive MICS behaviors were signifi- 
cant (average r=.54). When husbands 
(wives) interacted dyadically with two dif- 
ferent strangers, the level of positive behay- 
iors emitted to the strangers was comparable, 
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Yet when compared with the same type of rs 
for spouse-stranger consistencies (Table 4) 
the correlations were not statistically signifi- 
cant. Regarding the exchange of positives, 
therefore, it appears that response to strang- 
ers is traitlike, whereas response to a familiar 
other (spouse) does not predict response to a 
stranger (averaged stranger in this instance). 

The situation for negative behaviors. is 
somewhat more complex. Only three of the’ 
eight rs (Table 5) indicated stranger-stranger 
consistency, that is, traitlike negative output 
levels, and these were restricted to individuals 
from distressed marriages. As before, spouse— 
stranger 7s were not generally different from 
chance expectations, indicating that one’s 
negative output to spouse does not predict 
one’s output to strangers. 

Taken together, the data for positive and 
negative MICS behaviors indicate support for 
the traitlike consistency of positive behaviors 
in acquaintanceship situations, and for indi- 
viduals from distressed dyads, traitlike con- 
sistency of negative behaviors. However, since 
consistency is always found in response to 
strangers, this suggests that the pattern of 
exchanges are specific to one’s marital rela- 
tionship. 

Likewise the data for mean RPM of posi- 
tive and negative behaviors indicated that 
output was largely under the control of whom 
it was one interacted with: Toward spouses 
the overall level of positive (negative) be- 
havior was lower (higher) than with strangers. 

The data from the spouses observations of 
their behaviors in their daily settings provide 
a point-for-point corroboration of the labora- 
tory interaction findings. Not only did the . 
distressed. dyads report significantly more 
daily conflict relative to the nondistressed 
couples, but their daily exchanges of pleases 
and displeases indicated a far greater empha- _ 
sis on displease exchanges. As one would ex- 
pect from reinforcement theory, this greater 
exchange of negatives is associated with 
greater spousal avoidance within the distressed 
group, as measured by reports of recreational 
activities with spouse and alone or with 
others. The distressed group showed consis- 
tently greater reliance on aversiveness in their 
dyadic behaviors both at home and in the 
laboratory setting. Once again the present 
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study does not allow conclusions about the: 
causal development of these patterns, but it 
does suggest that such patterns are worth 
investigating. 7 i 

Specifically, we can now turn to interven- 
tion procedpres for bringing about changes in 
the inten patterns of distressed couples. 
First, the use of a multimethod approach, 
such as this, suggests that the effectiveness of 
intervention at one level can be evaluated at 
another level. For example, one could focus 
on the modification of positive exchanges dur- 
ing therapy and then determine the effect on 
the laboratory measures of subsequent dyadic 
interactions. Second, the nature of interven- 
tion targets seems clearer as a result of pres- 
ent findings. Concerning the resolution of 
‘problems, the focus should be.on the exchange 
of positives and specific training’ in reciproc- 
ity, that is, an explicit process of “quid pro 
quo” (Lederer & Jackson, 1968). The use of 
the pleases and displeases in the present study 
was in the role of a dependent variable, al- 
though the possibilities of P and D modifica- 
tion also suggest themselyes in the practical 
matter of intervention of training couples to 
alter unsatisfactory reciprocal patterns. 

The data also provide somewhat greater 
support to the state point of view of marital 
distréss, namely, the distress appears to be a 
function of a dyadic interaction., From the 
unique design of this study, it appears that 
individuals learn how to relate in an ineffec- 
tive manner specifically with their spouses 
whilt retaining social competency in interac- 
tion with others. The implication for marital 


therapy is that to the extent couples develop 


patterns of ineffective interaction which are 
products of, and specific to their relationships, 


` the treatment focus should be on this com- 


munication unit, that is, conjoint therapy is 
indicated. In this approach, potentially de- 
structive relationship factors can be viewed 
directly and “in progress” by the therapist, 
and intervention can be initiated immediately. 

In conclusion, this study represents an at- 
tempt to conduct a behavioral analysis of 
dyadic interaction, investigating certain as- 
pects of a social learning theory of marital 
distress. Pertinent data was gathered from 
home and laboratory environments. To date 
there has been precious little normative or 
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clinical data on the specific behaviors and 
interaction processes which characterize either 
nondistressed or distressed marriages. This 
study amounts to a small drop in an enor- 
mous bucket. 
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Name-Calling and Compliance 


Claude M. Steele 
University of Washington 


The prediction that name-calling, by conveying a negative judgment, would 
enhance subjects’ willingness to comply and their actual compliance with a 
later request for help was supported by two experiments. As predicted, nega- 
tive names produced more compliance behavior than positive names. Also, 
whether or not the negative name was related to the help request made no 
difference as to the percentage of subjects who agreed to comply. The second 
experiment also demonstrated that it was the name’s impugnment of the 
subject’s general character and not its impugnment of a specific behavior that 
was needed to increase later compliance. The implications of the present find- 
ings for experiments using negative judgments as independent variablè manip- 
ulations and for the relationship between self-esteem and consistency processes 


are discussed. 


_ One day a liberal person was called a 
racist. The liberal was, of course, very hurt 
by this name and wanted to argue the name- 
caller into retracting it. But the name-caller 
went away before the liberal had a chance to 


' speak. The liberal was left alone to brood. 


Another day, soon after, the liberal was asked 
by a friend if he would contribute some of 


his time and money for the development of a 


project to help the disadvantaged members 
of another race. The liberal paused for a 
moment. And then, he remembered that he 
had been called a racist. He also recalled the 
bad feeling that that name had caused him. 


» Thus, he looked at his friend, and with the 
_ poise of a man of great conviction, agreed to 


his request. 
The logic behind this depiction is that 


‘name-calling which successfully impugns a 


- person should increase that person’s motiva- 


tion to avoid any further threat to his posi- 
tive self-regard, and because helping can 
enhance self-regard and not helping can di- 
minish it, the target of a negative name 


“ should be more willing to comply with a sub- 
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sequent request for help, Name-calling, in 
this formulation, is defined simply as the 
conveying of a negative judgment, opinion, 
or evaluation of a person to that person. De- 
spite substantial research on the determinants 
of compliance (e.g., Brock, 1969; Freedman 
& Fraser, 1966; Freedman, Wallington, & 
Bless, 1967; Regan, Williams, & Sparling, 
1972), none has explicitly examined the rela- 
tionship of name-calling to compliance be- 
havior. 

A second prediction, derived from this 
formulation, is that because a positive or 
complimentary name does not threaten one’s 
self-esteem, it should produce less willingness 
to comply with a later help request than 
should a negative name. This prediction may 
appear to contradict the findings of several 
recent experiments that the induction of a 
positive mood (by providing the subject with 
a success or a small windfall) facilitated both 
solicited and voluntary helping (Aderman & 
Berkowitz, 1970; Berkowitz & Connor, 1966; 
Isen, 1970; Isen & Levin, 1972). While it is 
reasonably certain that name-calling will af- 
fect the momentary mood state of the target, 
the present predictions rest on the assumption 
that the esteem-enhancing processes set in 
motion by a negative name will persist long 
after the dissipation of a temporary mood. To 
prevent the immediate mood state induced by 
the name-calling from affecting helping be- 
havior in the present experiments, the initial 
request for help was made 2 days after sub- 
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jects were called names. If the effect of name- 
calling on helping were mediated by its im- 
pact on self-esteem, then this effect should be 
detectable for a reasonably long period of 
time. 

Finally, a condition was included in the 
first experiment in which subjects were called 
a specific negative name that was unrelated 
to the subsequent help request. This was to 
determine if it is necessary for the negative 
name to be logically related to the help re- 
quest in order for the predicted effect to 
emerge. 


EXPERIMENT 1 
Method 
Summary of Design and Procedure 


A naturalistic, single-factor, four-condition experi- 
ment was conducted over the telephone; housewives 
served as subjects. During an initial call from a 
representative of the “National Polling Institute,” 
ostensibly to determine subjects’ interest in a poll to 
be taken later, the experimental subjects received a 
negative name-calling communication that was either 
relevant or irrelevant to a later request for help or 
a positive, complimentary communication relevant to 
the request. Two days later, subjects in all condi- 
tions received a phone call from another party which 
asked them to prepare a list of their foods and 
household items for use in planning a neighborhood 
food cooperative. The dependent measure of willing 
compliance was the percentage of subjects who 
agreed to this request. In addition, the percentage 
of subjects who actually complied and the amount 
of helping they engaged in was measured by re- 
contacting the subjects who had agreed to help 3 
days later (5 days after the initial name-calling 
phone call) to collect the lists they had prepared. 
A fourth condition in which subjects received no 
initial phone contact was included as a base rate 
control group. 


Subjects 


The names of 60 housewives were selected from 
the Salt Lake City telephone directory. Subject 
selection was restricted to only those names with 
addresses that fell within the middle- to upper- 
income neighborhoods, so as to select subjects for 
whom our experimental guise would be plausible. To 
ensure that housewives were contacted, only phones 
listed in men’s names were selected. Within these 
constraints, subjects were randomly selected and then 
randomly assigned to each of the four conditions. 


The Name-Calling Communications 


Subjects in the three experimental conditions 
received an initial call from @ “Bill Glass of the 
National Polling Institute,” who stated to the “lady 
of the house” that his organization was: 


in the business of conducting opinion polls .. - 
and selling their results to local newspapers. We 
are talking just to women in this preliminary call 
because the eventual poll we are thinking of con- 
ducting is concerned with wamen’s opinions about 
various issues. Would you mind . . . answering a 
question? 


For subjects in the relevant negative name condi- 
tion, the bogus pollster continued with the following 
statement: 


It is pretty much common knowledge that you, 
as a member of your community, are not con- 
cerned with cooperative efforts for the betterment 
of your fellow man. Even though you probably 
believe that the ability to cooperate for the benefit 
of others is a necessary part of being a decent 
human being, you are likely to find some excuse 
not to help when asked to cooperate with a 
specific project. Isn’t it accurate to describe you 
as predominantly self-oriented and apathetic about 
the welfare of others? Wouldn’t you say that you 
tend to your own needs and ignore the needs of 
others even when helping them would entail little 
effort on your part? The question I would like to 
ask is would you like to see an attitude poll con- 
ducted in your community about these kinds of 
issues? 


After the subject briefly responded to this question, 
the first phone contact was concluded, and the 
pollster thanked the subject for her time. 

Excerpts from the second paragraph of the rele- 
vant positive name message are the following: 


It is pretty much common knowledge that you, 
as a member of your community, are concerned 
with cooperative efforts for the betterment of 
your fellow man. You believe the ability to coop- 
erate . . . is a necessary part of being a decent 
human being and you . . . help when asked to 
cooperate . . . Would you like to see a poll on 
. . . these kinds of issues? 


Portions of the irrelevant negative name message 
were as follows: 


It is pretty much common knowledge that you, 
as a member of your community, are not very 
concerned with driving safety. Even though you 
probably believe that a concern for safe driving 
is a necessary part of being a decent human being, 
you . . . ignore community-wide campaigns for 
better driving, assuming it is the “other guy” who 
needs to improve. You . . . neglect seat belts... 
road courtesies . . . turn signals . . . etc. Would 
you like to see a poll on these kinds of issues? 


All of these initial phone contacts were made by 
male experimenters, and all of these messages were of 
the same length. A baseline control condition was 
included that omitted this initial phone contact but 
was otherwise identical to the experimental condi- 
tions. 
Pretesting had shown that these negati 
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subjects and that these reactions were of a very 
diffuse nature. So the last question, “Would you like 
to see a poll conducted?” was included in this ex- 
periment to give the experimenter a device with 
which to close the conversation. This device was 
only partially successful; almost all of the subjects 
in the negative name conditions still tried to give 
prolonged defenses of their helpfulness (e.g., “I just 
gave to the heart fund,’ “Maybe others in this 
neighborhood don’t help, but I do,” “Where’d you 
get your information?” “I'll help,” etc.) and rarely 
gave any outright, codable answer to this question. 
Nevertheless, the experimenter was almost never 
interrupted before completing the entire message, 
and there were no comments from subjects that 
suggested they did not believe the experimental 
guise. If anything, the vehemence of their reactions 
suggests that they took the name-caller very seri- 


` ously. 


Dependent Variables 


“Two days after the initial phone contact with 
the polling institute, all subjects were phoned with a 
request for help by a female experimenter who was 
blind to experimental conditions. Portions of the 
help request, the same for all subjects, were as 
follows: : 


We are trying out a project in your community 
which may be of eventual use in lower income 
neighborhoods, helping the less fortunate . . . I 
„am calling for a group of women in your com- 

` munity investigating the possibility of forming a 
food cooperative. We are planning to buy foods 
and household necessities at wholesale prices and 
distribute them ourselves to the coop members. 
To guide our buying, we need your cooperation 
in giving us information about the kinds, quanti- 
ties, and brands of foods and household goods 
you use in your home. We are in no way trying 
to solicit memberships in the coop . . . If you will 
be willing to cooperate . . . we will explain ex- 
plicitly what information we need and call in 3 
days to collect it. Would you be willing to coop- 
erate? 


The subject’s response to this question indicated 
her willingness to comply. If the subject agreed 
to help, the experimenter explained that she was to 
divide her list into three categories: canned goods, 
perishables, and general household items. For each 
category she was to list the kinds of items she 
bought, the quantities per week, and her preferred 
brands. The experimenter apologized for allowing 
only 3 days to complete the list by explaining that 
the co-op was under a time pressure from the whole- 
sale houses to place its order. The experimenter 
concluded the conversation by scheduling a time 3 
days later for the return call, when it was noted 
whether the subject had actually complied, and the 
amount of information the subject provided was 
recorded as the dependent measure of the amount of 
actual compliance. During these dependent measure 
phone contacts, no subject gave any indication that 
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she did not believe the experimental cover story or 
that she thought the food co-op call was related, 
in any way, to the earlier name-calling phone call, 
This is important because if the name-calling and 
co-op calls were seen as having been made by the 
same person or group, the subject could think that 
she was being tested and might help just to be 
socially desirable or not help out of resentment, 

It was anticipated that the experimental manipu- 
lations would have much more difficulty affecting 
the measures of the frequency and amount of ac- 
tual compliance than the willingness to comply 
measure. Considerable experimental evidence has 
documented a lack of perfect correspondence be- 
tween behavioral intentions and actual behavior 
(Wicker, 1969). Furthermore, there are several fea- 
tures of this paradigm that could be expected to 
accentuate this lack of correspondence. First, the 
measures of actual compliance followed the experi- 
mental manipulations by 5 days, 3 days longer 
than the gap between name-calling and the measure 
of willingness to comply. Thus, there was a longer 
time for the manipulations to wear off and sufficient 
opportunity for extraneous events to prevent compli- 
ance by the time actual compliance was measured. 
Second, the compliance task involved considerable 
effort and inconvenience. Subjects were required to 
spend time in preparation before actual compliance 
was measured, and this preparation was forced into 
a 3-day time limit imposed by the experimenter. 
Thus, some subjects who intended and wanted to 
comply may have been overwhelmed or just unable 
to help within these constraints once they actually 
confronted the task. For these reasons, actual com- 
pliance (both frequency and amount) was expected 
to be less sensitive than willingness to comply to 
the experimental variations in this research. 

Subjects in this experiment were not debriefed as 
to the experimental nature of these telephone calls.+ 


Results 


The percentage of subjects who agreed to 
the help request made during the second 
phone contact was significantly affected by 
the experimental conditions, x? (3) = 13.53, 
p< .01. These percentages are presented in 
Table 1. Predicted differences among condi- 
tions were tested using a simultaneous interval 


1It was decided that the feelings of unfair exploi- 
tation that subjects might experience from being 
told the full nature of this experiment could be 
greater than the negative effect of the experimental 
manipulations, especially if they had gone to the 
trouble of actually complying. Further, it was felt 
that a debriefing designed to just eliminate the 
effect of the name-calling messages, without in- 
forming subjects of the full nature of the experi- 
ment (e.g., have the name-caller recontact the sub- 
ject and retract his original message), would simply 
leave the subject in a position of not knowing what 
or whom to believe. 
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TABLE 1 


PERCENTAGE OF WILLING AND ACTUAL COMPLIERS BY 
CONDITION, EXPERIMENT 1 


Relevant Relevant | Irrelevant 
fers negative positive negative No-name 
Compliers | “name name name | condition 
condition* | condition» | condition> 
Willing | 93% 65% 93% 46% 
Actual 66% 57% 93% 40% 
an = 15. 
bn = 14, 


technique based on the chi-square distribution 
that enabled multiple comparisons among 
multinomial populations. This procedure is 
similar to that developed by Scheffé (1953) 
for the analysis of variance (Miller, 1966). 
Using the chi-square value for p < .02 and 
three degrees of freedom to establish the 
confidence intervals ensured that all contrasts 
were tested for this level of significance. The 
relevant and irrelevant negative name condi- 
tions significantly increased, in fact doubled, 
the percentage of subjects willing to comply 
observed in the base rate control condition. 
In contrast, the positive label condition did 
not significantly increase willingness to com- 
ply over the base rate level. 

The percentage of subjects who actually 
complied with the help request by providing 
information helpful to the food co-op (also 
presented in Table 1) was also significantly 
affected by the experimental conditions, 
x? (3) = 9.14; p < .05. The above described 
multiple-comparison procedure revealed that 
the irrelevant negative name significantly in- 
creased actual compliance over the base rate 
and that the relevant negative name effected 
a marginally significant increase in actual 
compliance. The positive name condition had 
no effect on actual compliance. While the 
percentages in Table 1 reflect the anticipated 
discrepancy between behavioral intentions 
and actual behavior, they do show that nega- 
tive names increased actual compliance as 
well as willingness to comply in this experi- 
ment. 

A measure of the amount of actual compli- 
ance was taken as the total number of pieces 
of information (i.e., the item; its category as 
a canned good, perishable, or household item; 
the quantity; and the brand name) that com- 
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pliers actually provided during the final phone 
contact. The overall condition main effect for 
this variable was marginally significant, 
F (3, 28) = 2.55, p < .076. The near signifi- 
cance of this main effect is primarily attribu- 
table to the fact that the irrelevant negative 
name condition produced over twice as much 
information as was observed in the relevant 
negative name condition (M = 47.9 and 22.5, 
respectively). An individual comparison þe- 
tween these two means reached significance, 
F (1,40) = 8.12, p < .01. This difference as 
well as the fact that the irrelevant name pro- 
duced somewhat more actual compliers than 
did the relevant negative name suggest that 
these conditions may have differentially af- 
fected the persistence of subjects’ motivation 
to help. At any rate, since this finding does 
not bear directly on the rationale for the sec- 
ond experiment, the implications of this find- 
ing on the self-esteem formulation will be 
discussed following Experiment 2. 


Discussion 

Negative name-calling had a powerful ef- 
fect in this experiment; a brief telephone 
encounter with a name-calling stranger dra- 
matically facilitated the subject’s agreement 
to comply 2 days later and her actual com- 
pliance 5 days later. And, as predicted, the 
condition in which subjects were called posi- 
tive, complimentary names did not increase 
either willingness to comply or actual com- 
pliance over the base rate. These findings, 
and the fact that the effect of negative name- 
calling was detectable for so long, suggest 
that helping behavior in this experiment may 
have been mediated by the subject’s enduring 
desire to prevent any further lowering of her 
self-esteem. Another finding consistent with 
this self-esteem interpretation is that a nega- 
tive name provoked subjects’ helping behavior 
even when the specific name was unrelated to 
the helping task. Why did subjects help in 
the irrelevant negative name condition? Pos- 
sibly, the name in this condition made sub- 
jects feel badly enough in general so that 
they were more willing to comply with any 
available request that could keep them from 
feeling worse about themselves, whether or 
not it was related to the specific name. The 
self-esteem formulation implies that by 
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threatening one’s self-esteem, a negative name 
should make one more willing to engage in 
any available behavior that is capable of en- 
hancing one’s self-esteem or preventing it 
from worsening. 

In addition, the failure of the positive name 
condition to significantly increase subsequent 
compliance indicated that merely sensitizing 
subjects to the issues underlying the help 
request was not sufficient to enhance com- 
pliance; subjects in the negative name and 
positive name conditions were equally pre- 
alerted to request-related issues. 

What is the nature of the impugnment that 
is sufficient to enhance compliance behavior? 
Two possibilities are suggested. First, both 
negative names used in this experiment im- 
pugned a specific behavior of the subject, 
either her level of cooperativeness of her 
driving ability. It is possible that the name’s 
impugnment of a specific behavior alone, 
without any generalized impact on the sub- 
ject’s overall self-esteem, is sufficient (or 
even necessary) to enhance compliance, even 
with an unrelated request. A person may want 
to compensate for a particular inadequacy or 
inconsistency even if he or she feels good 
about himself or herself in general. On the 
other hand, the negative names in this ex- 
periment may have affected subjects’ overall 
sense of goodness and worth, and thus it 
could be this kind of impugnment that caused 
the observed increase in compliance. 


EXPERIMENT 2 


A second experiment was conducted, using 
a procedure similar to that of Experiment 1, 
to both replicate the name-calling effect and 
to determine which was its primary cause: 
(a) the name’s impugnment of a specific be- 
havior alone or (b) its impugnment of the 
stibject’s general character. To do this, ex- 
perimental messages were designed that con- 
tained one of these aspects of name-calling 
at a time, without containing the other. In the 
specific name condition the subject was called 
a specific negative name (“uncooperative with 
helping projects”) that impugned a specific 
behavior of the subject, but she was also 
generally flattered (“You are . - - respected 
by others”) so as to prevent this specific 
name from having any generalized impact on 


the subject’s global self-esteem. If the im- 
pugnment of a specific behavior alone is suf- 
ficient to stimulate helping, then helping 
should be high in this condition. In the gen- 
eral name condition, the subject’s overall 
self-esteem was attacked (“You are not re- 
spected in general”), but she was also tagged 
with a specific, positive name (“You are co- 
operative with helping projects”) to prevent 
the name-calling message from impugning any 
specific behavior of the subject. If a threat 
to one’s general self-esteem is the source of 
helping in this paradigm, then helping should 
be high in this condition. A condition that 
replicated the relevant negative name condi- 
tion of Experiment 1 (except for a change in 
the format of the message to be described in 
the Procedure section) was also included. 
This condition was seen to constitute an 
intermediate level of general impugnment 
because, while it did impugn a specific be- 
havior of the subject (“You are uncooperative 
with community projects”), it did not directly 
attack her overall sense of goodness and 
worth. A base rate control condition was also 
included in this experiment. In accord with 
the self-esteem formulation, it was predicted 
that compliance behavior (both willingness 
to comply and actual compliance) would be 
enhanced only when the message impugned 
the subject’s general character and, further, 
that compliance behavior would vary as a 
direct function of the degree to which the 
message threatened the subject’s general self- 
esteem. 


Method 
Subjects 


The names of 161 middle- to upper-income house- 
wives were selected, again, from the Salt Lake City 
telephone directory. 


Procedure 


The procedure of this experiment followed that of 
Experiment 1, except for the format of the name- 
calling messages. In Experiment 2, the opinions in 
these messages were not portrayed as belonging to 
the pollster but as belonging to several people with 
whom he had talked. This was done in order to 
enhance the credibility of these messages, which 
might be threatened by the fact that one part of the 
message (e.g, “You are respected”) could appear 
inconsistent with the other part (eg, “You are 
uncooperative”) in the general and specific name 
conditions. Quotes from these people were included 
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TABLE 2 
PERCENTAGE OF WILLING AND ACTUAL COMPLIERS BY 
CONDITION, EXPERIMENT 2 
fue 5 Repli- | General 
Compliers | Noname name cation name 
| ton" | conditione | conditionb | conditione 
Willing 42 50 60 79 
Actual 30 48 48 


an 
bn 
on 


to document the message. Thus, in this experiment, 
the subject was called a name secondhand; the 
pollster only reported to her the name-calling of 
others. It was recognized that making the pollster 
less directly confrontational might lessen the sub- 
ject’s sense of general impugnment. The following 
are excerpts from these experimental messages, ex- 
cluding the pollster’s introduction, which was the 
same as that described for Experiment 1. 
In the replication condition the pollster stated: 


From our conversations . . . it is apparently com- 
mon knowledge that you, as a member of your 
community, are not cooperative with community 
projects for the betterment of others . . . Here 
are some of the statements we received about the 
people of your community: Bill Derek, who lived 
in your community for almost 7 yea 
found the people of that community, reg 


their religion, to be extremely smug and apathetic 
about the welfare of othe 
Two other similar quotes were included before the 


conversation was ended by the pollster, who asked 
the subject if she were interested in seeing the 
results of a future poll on these issues. 

In the specific name condition the pollster stated: 


From our conversations . . . it is apparently com- 
mon konwledge that though you, as a member of 
your community, are not very cooperative with 
community projects for the betterment of others, 
that you are very well respected and highly 
thought of by others. Bill Derek . . . says: “Yeah, 
they are not very helpful with community proj- 
ects, but those people, regardless of their religion, 
are very progressive and open people, and I just 
cannot help but admire them,” 


Similar quotes from two others were included, 
and the phone call was ended, as before, with an 
inquiry as to the subject’s interest in a future poll 


on these issues. 
The pollster in the general name condition stated: 


From our conversations . . . it is apparently com- 
mon knowledge that though you, as a member of 
your community, are cooperative with community 
projects for the betterment of others, you are not 
very well respected by other people. Bill Derek 

+ + says, “Yeah, they’re helpful with community 
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projects, but those people, regardless of their re- 

ligion, are backward and smug people, and I just 

cannot respect them.” 

Two other similar quotes were included, and the 
call was ended with the standard inquiry as to the 
subject’s interest in a future poll. 

The dependent variables were, again, willingness to 
comply, actual compliance, and the amount of com- 
pliance. As in Experiment 1, male experimenters 
made the initial name-calling phone contact, and 
female experimenters blind to the experimental con- 
ditions made the two phone contacts necessary to 
measure both the agreement to comply and the 
amount of compliance. Again, subjects gave no indi- 
cation that they did not believe the experimental 
cover story or that the food co-op and name- 
calling telephone calls were related. This experiment 
was conducted in two sessions, a week apart, with 
approximately half of the data collected in each 
session. Because sessions had no effect on either 
dependent variable, the data were combined for all 
analyses. 


Results 


The overall condition main effect for the 
percentage of subjects who agreed to help 
the food co-op reached statistical significance 
in this experiment, x" (3) = 12.45, p< .01. 
It was specifically predicted that willingness 
to comply would be enhanced only when the 
name-calling message impugned the target’s 
general character. Predicted condition differ- 
ences were tested by the same simultaneous 
interval procedure used in the first experiment 
(Miller, 1966), which ensured a .95 confi- 
dence interval for all contrasts, The results 
support this hypothesis. Table 2 presents the 
percentages of willing and actual compliers 
in each condition. The general name condition 
produced significantly more willing compliers 
than did either the base rate control or the 
specific name conditions, The general im- 
pugnment in the general name condition was 
sufficient to enhance willingness to comply 
even when the specific name in this message 
was Positive, and also when that name (“You 
are cooperative with community projects”) 
implied that helping on a community project 
could not disprove this impugnment, since the 
subject was impugned even though it was 
acknowledged that she helped on such proj- 
ects. Subjects apparently agreed to help in 
this condition in order to feel better in gen- 
eral. Also, despite the impugnment of a spe- 
cific behavior in the specific name condition 
the number of willing compliers in this condi- 
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tion was not significantly greater than the 
number found in the base rate control condi- 
tion; without a general impugnment, willing- 
ness td comply was unaffected by the nega- 
tive name. The replication condition produced 
an increase in willingness to comply that 
was only marginally significant, y2(1) = 2.78, 
p< 10, using the conventional chi-square 
test. Using a procedure described by Winer 
(1971, p. 49) to determine the hypothesis- 
wide significance level of a comparison over 
experiments, this contrast between the rele- 
vant negative name condition (the replication 
condition in this experiment) and the base 
rate reached significance, y7(4) = 13.81, p< 
.01, when evaluated over both Experiments 1 
and 2. The weakness of this effect in this 
experiment is attributed to two factors: (a) 
The name-calling message in this experiment 
involved less personal confrontation between 
the pollster and the target than did this mes- 
sage in the first experiment, and (b) the 
impugnment in this condition was somewhat 
less general than that in the general name 
condition. 

In this experiment the overall condition 
main effect for the percentage of actual com- 
pliers did not reach significance, x?(3) = 5.63, 
p <.20. Nevertheless, the multiple-contrast 
procedure described earlier did reveal that the 
general name condition produced significantly 
more actual compliers than did the base rate 
condition, which showed that a general im- 
pugnment facilitated actual as well as in- 
tended compliance in this experiment. 

A second prediction was that the compli- 
ance behavior in each condition would be a 
direct function of the degree to which that 
treatment generally impugned subjects. Con- 
ditions were ordered as to the degree of gen- 
eral impugnment attempted, from the least to 
the most, as follows: the base rate control 
condition, the specific name condition, the 
replication condition, and last, the general 
name condition. The simultaneous interval 
technique, which enabled weighted contrasts 
among multinomial populations (Miller, 
1966) was again used to test the linear, 
quadratic, and cubic trends among these 
ordered conditions. For willingness to comply, 
only the linear component of the condition 
main effect reached significance, which indi- 
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cated that this variable increased as a direct 
function of the degree of general impugnment 
that subjects experienced. For actual compli- 
ance, none of these trends reached statistical 
reliability. 

In this experiment there were no significant 
condition differences for the number of pieces 
of information that compliers provided during 
the third phone contact. 


GENERAL Discussion 


Both of the present experiments have 
demonstrated that negative name-calling can 
increase willingness to comply and actual 
compliance with a later help request. It has 
been argued throughout this article that these 
effects of negative name-calling were medi- 
ated by people’s desire to maintain a positive 
self-regard. As a way of summarizing the sup- 
port for this explanation, some of the findings 
of this research are briefly reviewed. To begin 
with, the effect of negative name-calling on 
subsequent behavior lasted for as long as 5 
days (the period between the first and third 
phone contacts). The long duration of this 
effect discredits the possibility that it was 
mediated by any immediate mood that may 
have been induced by the negative name. 
Even if the immediate mood were reinstated 
by the second call that may have reminded 
subjects of the name-calling, the research on 
mood and helping (Aderman & Berkowitz, 
1970; Isen, 1970; Isen & Levin, 1972), 
typically has found more helping after a posi- 
tive rather than a negative mood induction. 
The effect of name-calling in these experi- 
ments was, of course, exactly opposite to this 
pattern. Thus, it seems more plausible to view 
the duration of the name-calling effect as 
caused by the name’s arousal of some more 
basic, less ephemeral psychological process. 

Second, as was just alluded to, negative 
names produced more helping behavior than 
positive names in the present experiments. 
Apparently, when one is made to feel com- 
fortable or good about oneself, as in the rele- 
vant positive name condition of Experi- 
ment 1, one feels less compelled to do 
something good in the long run. At any rate, 
the fact that esteem-enhancing compliments 
did not increase helping behavior suggests 
that this behavior in these experiments was 
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caused by the name’s negative implications 
for self-regard. 

Finally, the esteem interpretation was also 
supported by the finding in the second experi- 
ment that a specific negative name, when 
deprived of its ability to impugn the target’s 
general character, failed to increase helping 
behavior; while a general negative name, even 
without impugning a specific behavior, did 
increase both willingness to comply and 
actual compliance. It should be noted, how~ 
ever, that the use of the terms specific and 
general to describe differences in the scope of 
impugnment in these conditions does not im- 
ply that these conditions do not also differ in 
their overall degree of impugnment. That is, 
the specific name message was, quite prob- 
ably, just less impugning than the general 
name message; scope and degree of impugn- 
ment are by definition confounded in these 
messages. Nevertheless, even if the observed 
condition differences are solely attributable to 
differences in the overall degree of impugn- 
ment in these messages, the results of the 
second experiment indicate that at least one 
successful way of manipulating degree of im- 
pugnment is to vary the scope of impugnment 
from an attack on only a specific behavior to 
an attack on the subject’s general character. 
Thus, in answer to the question tested in the 
second experiment, it is apparently the name’s 
impugnment of the subject’s general charac- 
ter, and not any specific behavior alone, that 
caused subjects to help in these experiments. 

This self-esteem formulation offers an alter- 
native explanation of several other recent ex- 
periments that have used specific judgments 
as independent variables. Fromkin (1970) 
demonstrated that subjects labeled “average” 
had a greater preference for novel subsequent 
experiences than subjects judged “unique.” 
Freedman and Doob (1968) found that being 
judged “deviant” affected such diverse behav- 
iors as affiliation, aggression, attitude change, 
and compliance. These investigators have in- 
terpreted their findings as mediated by spe- 
cific needs that were aroused by the particu- 
lar meanings of the judgments used. Fromkin 
argued that to be called average aroused sub- 
jects’ need to be unique, while Freedman and 
Doob reasoned that being called deviant 
aroused subjects’ need to be normal, In light 
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of the present findings, however, their results 
can be reinterpreted as caused by the general 
impugnment that is confounded in such spe- 
cific judgments. This self-esteem explanation 
can account for the effects of these judgments 
without having to postulate two conflicting, 
mediating processes. 

Considerable research in social psychology 
has examined various forms of cognitive in- 
consistency (e.g., dissonance, imbalance, in- 
congruity, inequity, etc.) as stimuli to atti- 
tude and behavior change (Abelson, Aronson, 
McGuire, Newcomb, Rosenberg, & Tannen- 
baum, 1968). The focus of this research on 
the state of cognitive inconsistency has had 
the implication that people have a need to 
eliminate inconsistency that is autonomous 
from other motives. Several of the present 
findings, however, suggest that the effect of 
inconsistency on behavior and attitudes may 
result from its ability to arouse a more gen- 
eral self-esteem need rather than any in- 
dependent need to eliminate inconsistency. 
These data indicate that it may not be the 
inconsistency itself that produces the change 
but rather its implication for one’s self- 
regard. For example, if a perceived cognitive 
inconsistency effects change by arousing an 
autonomous need to eliminate inconsistency, 
than a message that presents an inconsistency 
but prevents the arousal of any self-esteem 
needs by flattering the person should still 
have an effect on behavior and attitudes. The 
specific name condition of the second experi- 
ment showed that arousing an inconsistency 
by impugning a specific behavior (the sub- 
ject’s level of cooperativeness and concern 
for others) did not enhance helping behavior 
in the absence of any threat to the subject’s 
general self-regard. 

Further, if a perceived inconsistency arouses 
self-esteem needs and not an autonomous 
intolerance for inconsistency, then it should 
increase the likelihood of any self-esteem- 
enhancing act. For example, a person who 
is made aware that his smoking is incon- 
sistent with his desire to be healthy should 
be more willing to do anything that will 
es ee feel better, like stopping smoking, 
uying flowers for his wife, spendin 

: ier ’ 8 
time with his children, or even having an- 
other cigarette. Which act is selected may 
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depend on its availability, its ease of accom- 
plishment, the degree of its logical relation- 
ship to the inconsistency, how much the act 
is valued by society, etc. But, in general, the 
likelihood of all self-esteem-enhancing acts 
should increase with the awareness of a cogni- 
tive inconsistency if such awareness evokes 
a self-esteem-based motive. This reasoning is 
supported by the finding in the irrelevant 
name condition of the first experiment that 
helping the food cooperative was dramatically 
facilitated even though such helping did 
nothing to eliminate the particular self- 
inconsistency created by the subjects being 
called a bad driver. Thus, consistency re- 
search, by exclusively measuring the effect of 
its manipulations on behavior and attitudes 
that logically related to the stimulus incon- 
sistency, may have obscured the possibility 
that tendencies toward consistency stem from 
more general self-esteem needs rather than an 
independent need to be consistent. 

The above reasoning notwithstanding, one 
factor that may determine the degree to 
which a given behavior will resecure one’s 
self-esteem after it is impugned is the ability 
of this behavior to logically disprove or 
counter the specific impugnment. The results 
in the first experiment for the percentage of 
actual compliers and the amount of actual 
compliance (i.e., the amount of information 
subjects provided in the last phone contact) 
are relevant to this issue. Recall that in 
Experiment 1, subjects who agreed to comply 
in the irrelevant negative name condition 
were somewhat more likely to actually com- 
ply and provided over twice as much infor- 
mation as did agreeing subjects in the rele- 
vant negative name condition. For these 
subjects, who were called bad drivers, agree- 
ing to help a food co-op may have made them 
feel better in general but could not have 
fully disproven that they were bad drivers. 
Some feelings of impugnment and a desire to 
avoid further impugnment may have lingered 
with these subjects for a while after their 
initial agreement to comply, which caused 
them to help more in this condition. In con- 
trast, for subjects in the relevant negative 
name condition, helping the food cooperative 
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may have more efficiently disproven the basis 
of their impugnment, which reduced their 
motivation to continue helping. 
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Crime, Commitment, and the Responsive Bystander: 
Two Field Experiments 


Thomas Moriarty 
New York University 


In two field experiments it was demonstrated that bystanders more readily 
intervene to stop a theft if they have committed themselves in advance to 
the victim. During the summer of 1972, thefts were staged (a portable radio 
taken from an unattended beach blanket) at Jones Beach, New York. Sub- 
jects agreeing to watch the victim’s belongings were more likely to notice the 
theft, and among those noticing, committed subjects were more likely to stop 
the theft. Similarly, thefts were staged at Automat cafeterias in New York 
City (a suitcase taken while the owner was absent). Again, committed sub- 
jects were more likely to stop the theft. Results are taken as support for the 
notion that while conflict often prevents bystanders from intervening, prior 
commitment simplifies the decision process and produces a more responsive 


bystander. 


Crime is a major social problem today. Not 
only does the crime rate continue to increase 
year by year but the quality of crime has 
changed as well. Criminal activity, once con- 
fined to deserted streets and the hours of 
darkness, now occurs with considerable fre- 
quency in broad daylight, in full view of 
passersby. Much publicity has been given in 
recent years to those incidents in which 
crimes are committed while bystanders do 
nothing to help the victim. The well-known 
case of Kitty Genovese, who was stabbed 
to death while many individuals passively 
looked on, has created deep concern about the 
degree of apathy in our society. 

Visibility has often been considered a 
major deterrent to crime. To the extent that 
individuals intent on criminal activity take 
into account the presence of witnesses who 
might interfere with the act, the activity will 
be inhibited; on the other hand, when such 
individuals can expect noninterference from 


A version of this article was presented at the 
meetings of the Eastern Psychological Association, 
Washington, D.C., 1973. Thanks are due to Janice 
Stern, Eileen O’Loughlin, Ilona Szer, Gary Payne, 
Harry Reis, and George Beilin, who served as con- 
federates, and to Harold Basowitz for helpful com- 
ments on an earlier draft of this manuscript. Special 
thanks go to Janice Stern, who assisted in all phases 
of this study and supervised Experiment 2. 

Requests for reprints should be sent to Thomas 
Moriarty, Department of Psychology, New York 
University, Washington Square, New York 10003. 


unresponsive bystanders, visibility is no 
longer an effective deterrent. 

In some measure, the well-publicized un- 
responsiveness of bystanders may be respon- 
sible for the qualitative change in criminal 
activity. Several investigations have specifi- 
cally focused on the reactions of bystanders to 
a criminal act. Gelfand, Hartmann, Walder, 
and Page (1973) and Latané and Darley 
(1970) staged thefts in the presence of naive 
observers. In both studies the dependent vari- 
able was the rate of reporting the misdeed to 
persons in authority. These researchers found 
little evidence of active intervention on the 
part of observers, and in most cases, the 
thefts, had they been genuine, would have 
been successful. 

Explanations for the unresponsiveness of 
bystanders have invoked a conflict-decision 
model in which the individual must make a 
number of decisions before he can intervene: 
whether or not it is really an emergency, 
whether or not to help, how one should help, 
etc. According to this model, the individual 
weighs the pros and cons, and the outcome 
is either a decision to help or a decision not 
to help. Although much research has been 
devoted to factors entering into the decision 
process, little attention has been given to the 
significance of the decision process itself. 
Most emergency situations are (by definition) 
unexpected, leaving little time for rational 
appraisal. An emergency requires swift, posi- 
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tive action, and as long as individuals are 
involved in conflict, they are unable to inter- 
vene. Latané and Darley (1970), proponents 
of a decision model, found in one experiment 
that “those who did not intervene had not 
made a decision not to do so, but . . . were 
still in a state of indecision and conflict con- 
cerning whether to respond or not” (p. 100). 

Assuming it is the state of unresolved conflict 
which is responsible for nonintervention, fac- 
tors which reduce or eliminate indecision 
should increase the likelihood of intervention. 
One such factor might be commitment; in 
many situations it is possible to get a by- 
stander to commit himself to intervention 
before an emergency occurs. Commitment 
constitutes a decision to intervene, and when 
an emergency does occur, the prior decision 
should free the individual from conflict, As 
Lewin (1947) has suggested: 


A change from a situation of undecided conflict 
to decision does not mean merely that the forces 
toward one alternative become stronger than those 
toward the other alternative... . / A decision rather 
means that the potency of one alternative has be- 
come zero or is so decidedly diminished that the 
other alternative and the corresponding forces 
dominate the situation. . . . The act of decision . . . 
has an effect of freezing this motivational constella- 
tion for action. (p. 336) 


The Present Study 


Thefts were staged in two field settings. In 
each study the major dependent variable was 
the rate of active intervention, that is, the 
proportion of subjects who attempted to stop 
the theft. The major independent variable 
was the presence or absence of prior commit- 
ment to the victim. It was assumed that com- 
mitment would increase the rate of active 
intervention. Y 

In the first study the sex of the thief was 
also varied for largely intuitive reasons. 
Stereotypically, thieves are usually assumed 
to be male, and the occurrence of differential 
reactions to female thieves might be interest- 
ing dáta per se. Furthermore, fear is often 
cited ás d` potent motive for nonintervention, 
and it seems plausible that tall, muscular 
-male thieves would induce more fear in by- 
standers than petite, lovely female thieves. 
To the extent that this is true, more interven- 


371 


tion would be expected if the thief were 
female. 


EXPERIMENT 1 


Method 
Overview 


A series of thefts (a portable radio taken from an 
unattended beach blanket) was staged at Jones 
Beach, New York. Two independent variables were 
manipulated orthogonally: (a) prior commitment to 
the victim and (b) sex of thief, The reaction of the 
subject to the theft was noted by an observer, 
who later interviewed subjects to determine their 
private feelings about the situation. All subjects were 
then debriefed and thanked for their participation. 


Subjects 


On six weekdays in July and August 1972, 561 
beach goers were selected for study. The selection 
was guided by two considerations. 


1. The first consideration was the sampling unit. 
For each experimental treatment, a quota was estab- 
lished which reflected typical groupings found in 
this setting: male and female couples, men alone, 
women alone, and adults with children present. This 
was done to assure diversity within each condition, 
while minimizing diversity between conditions. 

Where the blanket was occupied by more than 
one person, the victim’s dialogue was addressed to 
the person most attentive but was generally over- 
heard by the other(s) present. Furthermore, when 
the theft did occur, the most typical response of 
the individual noticing the theft was to communicate 
the fact to the other(s). For these res ons, the 
sampling unit is considered to be the beach blanket 
rather than the individual, 

2, Subjects were also selected with respect to 
apparent age. By means of a crude classification of 
individuals as “younger” (school age) or “older,” an 
attempt was made to balance this factor within 
conditions. Postexperimental interviews with subjects 
who noticed the theft revealed that the sample in 
the present study is indeed heterogeneous with re- 
spect to age and occupation (see Table 1), Ages 
ranged from 14 to 60 years, with an average of 33 
years; educationally, subjects ran the gamut from 
elementary school through professional training. 
Ninety percent of the subjects were born in the 
United States, 75% were born in New York City. 
Despite the heterogeneity of the total sample, the 
resulting profiles are strikingly similar in the ex- 
perimental groups, and the results to be reported 
cannot be attributed to minor differences which do 
exist. 


1 The original design called for 40 subjects (10 per 
cell) who noticed the theft taking place. Despite 
attempts to make the theft obvious, it was not 
noticed in all cases, and the experiment continued 
until the original quota was met. 
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TABLE 1 
DISTRIBUTION or AGE AND EDUCATIONAL TYPES 


Age 
Occupation — |——7 r fe Taal 
Below | 19-35 | 36 and 
19 * | above 
Professional — 5 3 20 
Sales, business and 
i - 3 5 20 
Workers, skilled 


and unskilled — 6 1 18 
Other (student, 
housewife, un- 
employed) 8 7 2 
% total (age) 20 53 27 


Note, Based on subjects noticing the theft, N = 40. When 
more than one person occupied the blanket, the data shown are 
for oldest person. Not surprisingly, the age discrepancy within 
blanket composition was generally quite small. 


Procedure 


With the aid of two experimental confederates, a 
theft was staged in full view of the subjects, In each 
case, the confederate victim placed his blanket within 
5 feet (1.52 m) of the subject and turned on his por- 
table radio, tuned to a local rock station, at a fairly 
high volume. After reclining for 1-2 minutes, the 
victim left his blanket and spoke briefly to the sub- 
ject. (The independent variable, commitment, was 
manipulated at this point.) The victim then strolled 
away in the direction of the boardwalk and remained 
out of sight. A few minutes later, the confederate 
thief walked up to the victim’s blanket, picked up the 
portable radio (which was still playing), and quickly 
walked away in the direction opposite to that 
taken by the victim. Unless stopped by the subject, 
the thief walked to the boardwalk and also remained 
out of sight. 

The reaction of the subject to the theft was noted 
by an observer, who had stationed himself behind 
the subject earlier. The observer feigned sleepy in- 
difference during the theft, and in no case did the 
subject attempt to interact with him or enlist his aid. 


Confederates 


There were four experimental accomplices, two 
males and two females, The confederates were 
white, in their middle 20s, and dressed in usual 
beach attire. The males were both over 6 feet 
(1.83 m) tall, 


Independent Variables 


Sex of thief. The confederates worked in male- 
female teams and alternated in the roles of thief and 
victim. Thus, in one half of the cases, the thief was 
male and the victim female; in the other half, the 
procedure was reversed. 

Clearly, this procedure combines two variables, 
Sex of thief and sex of victim. Had the results 
based on mixed-sex pairings been more promising, 
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the study would have been repeated with same-sex S 
airings.? $ 
5 OAE, In the commitment condition, the 
brief dialogue with the subject consisted of the follow- 
ing: “Excuse me, I’m going up to the boardwalk for 
a few minutes . . . would you watch my things?” 
(In all cases, the subject agreed to do so.) In the 
no-commitment variation, the victim merely said: 
“Excuse me, I’m here alone and have no matches 
. . do you have a light?” (The purpose of this 
dialogue was to provide some interaction with the 
subject, while establishing the fact that the victim 
was alone.) 


Dependent Variables 


Noticing the theft. There are obviously many ~ 
visual distractions at the beach which might interfere 
with the subject’s noticing the theft. In each case, 
the observer made a determination of “noticing” 
based on the subject's reaction at the moment the 
theft occurred. Most typically, noticing was accom- 
panied by slight startle and a fixed gaze in the 
direction of the thief. The postexperimental inter- 
view was used to decide the few doubtful cases. 

Responsiveness. Reaction to the theft was classi- 
fied by the observer as either responsive or non- 
responsive. In the latter case, the subject usually 
watched the thief disappear on the boardwalk and + 
then looked away, resuming his former activity. In 
some cases, the subject stood up as he watched the 
thief, then sat down again when the thief was no 
longer visible. In a few cases, the subject shouted 
“Hey” to the thief, and when the latter (as per 
instruction) ignored the call, the subject made no 
further attempt to stop the theft. 

On the other hand, reactions classified as respon- 
sive involved the subject’s getting up from his 
blanket, running up to and stopping the thief, and 
demanding an explanation. In some cases, the sub- 
ject either grabbed the thief by the arm or took the \ 
radio. When the responsive pattern emerged, the 
observer always followed a few yards behind the 
subject as he approached the thief and stopped 
whatever interaction had begun between the subject 
and the thief. 


Debriefing 


All subjects were told at the earliest possible 
moment that the theft they had Witnessed was not 


? The initial motive for this arrangement was to 
maximize the probability that subjects would notice à 
the theft, Contact between the subject and the vic- 
tim, prior to the theft, was casual and extremely 
brief. By the time of the theft, the subject might 
remember only the victim’s most salient attributes. 
Conceivably, in same-sex pairings, inattentive sub- 


to interpret the 
> Supine subject, 
cape of tanned, 
Sex would be a 


a 


CRIME, COMMITMENT, AND THE RESPONSIVE BYSTANDER 


genuine. The thief, victim, and observer convened 
at the subject’s blanket and explained the purpose 
of the study. An attempt was made to assuage any 
uneasiness caused by the experience.® 


REsuLts 
Noticing the Theft 


Of the 56 subjects, 16 (29%) maintained 
during the postexperimental interview that 
they had not noticed the radio being taken, 
and the observer was fairly certain that dis- 
simulation was not involved, In many in- 
stances, the subject’s attention had been 
diverted moments before the theft; in others, 
the subject had fallen asleep or was engrossed 
in reading. When questioned either by the 
victim or the observer, these subjects showed 
genuine surprise and alarm at the suggestion 
that there had been a theft. 

Noticing the theft was not related to the 
sex of the thief. In eight cases the thief was 
male, in eight cases female. However, noticing 
was related to the commitment variable. Sub- 
jects who had failed to notice were all in the 
no-commitment condition. All subjects in the 
commitment condition noticed the theft. 

The analyses which follow are based on 
those subjects who noticed the theft. This 
seems to be the most conservative proce- 
dure, since the inclusion of those who failed 
to notice would simply exaggerate the 
magnitude of the effects to be reported. 


Responsiveness 


Of the 40 subjects who noticed the theft, 
23 (58%) evidenced the responsive pattern: 
They pursued and stopped the thief. 


Sex of Thief 

As can be seen in Table 2, the rate of 
intervention was not affected by the sex of 
the thief, Pooling results of the commitment 
and no-commitment conditions, 60% of the 
subjects intervened when the thief was male 
versus 55% when the thief was female. (The 


8 Generally, those subjects who had intervened 
felt rather good about their participation; those who 
had not intervened offered many reasons for non- 
intervention which were accepted in a supportive 
fashion by the research team. Indeed, the fact that 
subjects were then willing to be further interviewed, 
revealing personal information about themselves, 
suggests that our debriefing was successful. 
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TABLE 2 
FREQUENCY OF RESPONSIVE PATTERNS 
IN EXPERIMENT 1 


Sex of thief Commitment No commitment 
Male 10 (100%) 2 (20%) 
Female 9 ( 90%) 2 (20%) 

Note. n = 10 per cell. 
critical ratio corrected for discontinuity 


equals zero.)* 


Commitment 


The effect of commitment, on the other 
hand, was quite strong. Pooling results of the 
male and female thief conditions, 95% of the 
subjects intervened when there had been 
prior commitment, while only 20% did so 
in the no-commitment condition (Z = 4.48, 
p < 01). The effect of commitment was large, 
accounting for 58% of the variance in helping 
behavior in the present study. 


Sex X Commitment Interaction 


There is little evidence of an interaction 
between these variables. Following a proce- 
dure suggested by Langer and Abelson 
(1972), the critical ratio is quite small 
(Z = .92, p > .05). Thus, the effect of com- 
mitment is quite independent of the sex of 
the thief." 


EXPERIMENT 2 


Experiment 1, therefore, clearly demon- 
strates that responsive bystanders, infrequent 


4 Although the rates of intervention did not differ, 
the quality of the interaction between the subject 
and the thief was affected by the sex of the latter. 
While subjects were firm in detaining the male thief, 
the treatment accorded the female thief was far 
more aggressive, subjects being more likely to use 
physical force, This is reflected in the fact that of 
those who intervened, 73% snatched the radio away 
from the female thief, while only 17% did so when 
the thief was male (Z, corrected for discontinuity, 
= 2.24, p < .05). 

5 The procedure, suggested for interaction tests, 
involves an arc-sine transformation of the obtained 
proportions. The consequence is that the variances 
of proportions thus transformed are a function of 
sample size rather than of unknown parameters, An 
alternate (and perhaps more familiar) solution is to 
substitute sample values for unknown parameters in 
the standard error computations. When applied to 
the present data, this procedure also yields a non- 
significant critical ratio (Z = .49). 
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in the absence of commitment, were all but 
guaranteed when commitment was obtained 
in advance, an effect far stronger than had 
been naively anticipated. For a variety of 
reasons it seemed desirable to attempt a 
replication of this finding in a very different 
field setting. The population which frequents 
public beaches during midweek may be some- 
what atypical, containing as it does many 
unemployed people, housewives, students, etc. 
Being sufficiently remote from New York 
City, Jones Beach probably attracts a fairly 
homogeneous population of “middle-class” 
individuals who reside in the surrounding 
suburban areas. Experiment 1 took place in 
a leisure setting, and it may be that indi- 
viduals at play, relatively free from the de- 
mands and stresses of everyday life, are 
more willing to intervene when they have 
committed themselves; others, caught up in 
the hectic pace of city living, might be less 
inclined to do so. 

@xperiment 2 was conducted during the 
busy lunch period in two Automat cafeterias 
located in midtown Manhattan. Automats are 
the epitome of efficient, impersonal dining: 
Complete meals can be purchased from vend- 
ing machines; in the sprawling dining rooms, 
tables are occupied on a “first-come” basis, 
and table sharing is the rule; the atmosphere 
is one of haste, the rhythm set by the con- 
stant clattering of dishes and overly zealous 
busboys clearing tables; and although the 
dining rooms are relatively unsupervised, 
roving managers are quick to discourage 
loitering during busy periods. The Automat 
generally attracts employed individuals of 
lower socioeconomic status, who are seeking 
a “quick lunch” between shopping, appoint- 
ments, etc. It was felt that this setting would 
provide a good test of the generality of the 
commitment effect. 


Method 
Subjects 


On four weekdays in the spring of 1973, individu- 
als dining alone at the Automat were selected for 
study. Although the design called for 20 subjects, 
the study had to be terminated before that quota 
was met. The data to be reported are based on 16 


6 After the data for 19 subjects had been gathered, 
the manager, though quite sympathetic with the 
aims of the research, decided that our activity was 
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subjects who noticed the theft, with 8 subjects (4 
) in each experimental treatment. 


males and 4 females, 
The postexperimental interview revealed that all 


but one subject was currently employed, typically 
as clerical or skilled workers. Ages ranged from 19 
to 70 years, with an average of 48 years; the major- 
ity of subjects (69%) were born in New York City. 


Procedure 

A series of thefts were staged involving an un- 
attended suitcase. In each instance a well-dressed 
confederate (a white female in her early 20s), carry- 
ing an overnight bag, entered the restaurant and 
seated herself at the subject’s table. After a few 
minutes, the confederate victim engaged the subject 
in a brief dialogue during which the independent 
variable (commitment) was manipulated. The victim 
then walked away, leaving the suitcase on the table. 
A few minutes later a confederate thief (a white 
male in his early 20s) approached the table, picked 
up the suitcase, and walked away quickly. Unless 
stopped by the subject, the thief left the restaurant. 


Independent Variable 


In the commitment condition, the victim lit a 
cigarette and then, pointing to the suitcase, said to 
the subject: “Excuse me . . . may I leave this here 
for a few minutes?” (AIl subjects agreed to watch 
the suitcase.) In the no-commitment condition, the 
victim took out a cigarette, searched briefly in her 
handbag for matches, and then said to the subject: 
“Excuse me... do you have a light?” In either 
case, the victim then departed for the food counters 
and left the suitcase behind, 


Dependent Variable 


As in Experiment 1, the dependent variable was 
the rate of active intervention. Behavior was con- 
sidered “responsive” if the subject acted to stop the 
thief. If the subject noticed the theft but allowed 
the thief to leave the restaurant, his behavior was 
classified “unresponsive.” An observer, stationed a 
few tables away, noted the behavior of the subject. 
In all cases the subject was debriefed and thanked 
for participation. 


Results 


Of the 16 subjects noticing the theft, 9 
(56%) were responsive, stopping the thief 
before he left the restaurant. The correla- 
tion „between commitment and responsive- 
ness is nearly perfect: While every subject in 
the commitment condition intervened, only 
one subject in the no-commitment condition 
did so. Despite the small sample sizes in- 
volved, the difference is quite reliable (Z, cor- 


ma s of the 19 subjects had failed to 
notice the theft; 2 of the 3 were in thi = i 
ment condition. ct ere 


a) 
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rected for discontinuity, = 3.12, p< .01). 
Again, the effect is quite large. Commitment 
accounts for 78% of the variance in helping 
behavior. 


Discussion 


At the start of this article it was suggested 
that the often reported unresponsiveness of 
bystanders is less the result of a decision not 
to help than of the immobilizing conflict 
which arises when individuals are trying to 
decide whether or not to help. It was further 
suggested that commitment (a prior decision 
to help should the occasion arise) would 
serve to minimize the need for a decision and 
free the individual from conflict, thereby 
producing greater responsiveness when an 
emergency occurs. Despite the diversity of 
subject populations and social settings and 
the near-bewildering array of uncontrolled 
sources of variation inherent in field studies, 
the present research strongly supports that 
contention. In only one instance where there 
was prior commitment did an individual fail 
to stop the theft. 

Those who failed to intervene (in the no- 
commitment conditions) were clearly not apa- 
thetic or indifferent to the thefts. In all cases, 
they were able to describe the thief to the 
ictim, and a number of them offered to help 
the victim to find the thief.’ When asked why 
they did not stop the thief, they offered many 
reasons: fear; uncertainty about their percep- 
tions (many entertained the hypothesis that 
the thief was really the victim’s friend, so 
perhaps it was not a theft at all); and un- 
certainty about the proper course of action 
to take. Probably the same thoughts crossed 
the minds of our committed subjects, yet 
they did act. 

Considering the data from both studies, it 
is clear that commitment had a dramatic 
effect on the likelihood that the theft would 
be noticed: Only one subject in the commit- 
ment condition failed to notice the theft 
taking place. It might seem that subjects 
literally “kept an eye” on the victim’s be- 


TIt is, of course, possible that the subjects might 
have been more indifferent to the plight of the vic- 
tim had there been no prior contact between the 


subjects and the victim. 


TABLE 3 


FREQUENCY OF RESPONSIVE PATTERNS 
IN EXPERIMENT 2 


Pattern Commitment No commitment 
Responsive 8 1 
Unresponsive 0 7 


longings, but that was not the case. Typically, 
subjects continued the activity which had 
been interrupted by the victim’s dialogue 
(reading, eating, etc.), while occasionally 
glancing over at the victim’s place. Al- 
though no attempt was made to record this 
data systematically, future research devoted 
to scanning behavior under conditions of 
commitment might prove fruitful. 

It would be tempting to generalize these 
results to more complex situations. On a 
practical level, many crimes which escape 
police detection can be prevented by respon- 
sive citizens. In our highly complex society, 
crime can be controlled on a community level 
if the individual member is committed to the 
welfare of his neighbor. However, many ques- 
tions remain unanswered by the present 
research: 


1. Under what specific set of conditions 
will individuals commit themselves? In the 
present study, the committing request was a 
simple one, and no subject hesitated in grant- 
ing it. Perhaps our subjects were merely being 
courteous to the victim. (The initial shock 
and surprise evidence by subjects when the 
theft did occur supports this interpretation.) 

2. Is commitment weakened with the pas- 
sage of time? In both experiments the crime 
occurred a few minutes after commitment, 
and we have already noted that committed 
subjects are quite vigilant during this period. 
Suppose the theft had occurred 15 minutes or 
4 hour later? Vigilance would probably have 
been lowered; would the likelihood of inter- 
vention also have been lowered? 

3. Would the same results obtain if the 
victim were an institution rather than an 
individual, as in the present study? For ex- 
ample, applying these results to the problem 
of shoplifting in department stores, would it 
be worthwhile for a salesperson to ask a 
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customer to watch an unattended counter 
during his or her absence? 

4. The request which elicited commitment 
from the subjects was specifically oriented 
toward the victim’s property. Would this act 
of commitment generalize to other aspects of 
the victim’s welfare? For example, if the 
victim later appeared to be in physical dan- 
ger, would the committed individual be more 
likely to intervene? 


A certain amount of vagueness also shrouds 
the independent variable. We prefer in this 
report to think of commitment as a “prior 
decision,” but other interpretations are possi- 
ble. The manipulation may have been success- 
ful for the following reasons: (a) Having been 
singled out by the victim, the subject feels 
an acute sense of responsibility which prompts 
intervention; (b) the subject is gratified by 
the trust placed in him by a total stranger, 
and intervention proceeds from warm regard 
for the victim; (c) individuals strive for 
consistency between their verbal statements 
and overt behavior, since a discrepancy would 
cause psychological discomfort; (d) the sub- 
ject intervened to avoid the guilt and shame 
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he would experience when later confronted by 
the victim had he not intervened.* 

These issues can be resolved in future re- 
search; however, at a period in history when 
cries of “public apathy” abound, it is heart- 
ening to find that this mild manipulation is 
so effective in producing a responsive by- 
stander. 


§ The response of one committed subject who had 
failed to intervene may be instructive: She did 
shout “‘Stop” to the thief, and when that failed, 
she shouted an obscenity; when the observer later 
interviewed her, she appeared filled with rage and 
shame. 
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Causal Attributions and Other Achievement-Related Cognitions: 
Effects of Task Outcome, Attainment Value, and Sex 
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Some support was obtained for a naive analysis of causal attributions for 
success and failure which assumes that subjects interpret feedback in an 
essentially logical fashion. Success more than failure was attributed to effort. 
Attributions to luck and ability were largely a function of consistency of 
feedback. Biased use of available information, indicating either defensiveness 


or self-derogation, was also found and varied with attainment value and sex. 


Boys defensively attributed failure to luck; girls’ ability attributions were 
self-derogatory. These sex differences help account for boys’ higher expectancies 
when feedback was limited or contradictory. Individual differences in attri- 
butions related to individual differences in minimal standards of success, 
affective responses to feedback, and feelings about a forthcoming test. 


Building on Heider’s (1958) naive analysis 
of action and work on locus of control (Cran- 
dall, Katkovsky, & Crandall, 1965; Rotter, 
‘1966), Weiner and his associates (Weiner, 
Frieze, Kukla, Reed, Rest, & Rosenbaum, 
1971) have examined causal attributions for 
success and failure in terms of four factors: 


‘ability, effort, luck, and task difficulty. Each 
- afaċtor is jointly classified as either internal 


qr external and stable or unstable. Ability and 


*effortare considered internal factors while 


Ps: dithulty and luck are external. Ability 
“dad, task difficulty are stable and luck and 
dort unstable. In this study some determi- 
naijts ‘and’ correlates of attributions to these 
fac rs are examined. Two bases for the pre- 
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diction of attributions are considered. The 
first assumes that individuals use available 
information in an essentially logical fashion to 
draw conclusions about causes of outcomes. 
The alternate position assumes that attribu- 
tions reflect self-enhancing, approval-seeking, 
or defensive motives. 

The logical analysis proposed here devel- 
ops Heider’s (1958) naive analysis of action 
and that of Weiner and associates (Weiner et 
al., 1971). This analysis compares different 
patterns of feedback in terms of the informa- 
tion they provide concerning the operation of 
a given causal factor. It has been formulated 
to apply specifically to situations in which 
subjects are inferring the causes of outcomes 
of their own behavior, not imagined outcomes 
or outcomes of others’ behavior. It is further 
assumed that all subjects apply and perceive 
themselves as applying moderately consistent 
and high effort on the task. This assumption 
appears reasonable in the case of a brief 
experimental task, such as used here, which 
subjects see as moderately important. In fact, 
subjects showed no signs of fatigue or casual 
responding. The logical analysis also applies 
to situations in which subjects have clear 
information relevant to task difficulty and 
in which task difficulty is moderate. This is 
the case in many school and other achieve- 
ment situations. Here, subjects were told 
what score most children get on the experi- 
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mental task and their own scores were either 
well above or below this, In such situations, 
task difficulty provides a relatively unlikely 
explanation for either success or failure, and 
high-scoring subjects have neither more nor 
less logical basis for attributing success to 
task difficulty (ease) than do low-scoring sub- 
jects for attributing failure to difficulty. Thus, 
in developing the logical analysis the major 
problem is to predict attributions to ability, 
effort, and luck. 

Deviation of attributions from the patterns 
predicted by the logical analysis can be taken 
as evidence of bias. Defensive or self-enhanc- 
ing bias would lead to greater attribution of 
success to internal causes and failure to ex- 
ternal causes (Feather & Simon, 1971; Fitch, 
1970; Frieze & Weiner, 1971). Predictions 
were framed in terms of defensive bias, though 
the reverse, self-derogatory bias, could and 
did occur, 

The logical and biased analyses are devel- 
oped below for the factors of ability, effort, 
and luck, in that order. For each factor, the 
predictions are formulated first for a single and 
then for a repeated success-failure experience. 

Given moderate effort, a high score offers 
as clear grounds for inferring the existence 
and causal role of high ability as does a low 
score for low ability. Thus, the logical position 
predicts equal attribution of success and fail- 
ure to (high and low) ability, while defensive 
responding would lead to greater attribution 
of success than failure to ability. 

When there is more than one success— 
failure experience, feedback which is consis- 
tent with earlier feedback is logically more 
attributable to the stable dimension of ability 
than is inconsistent feedback. That is, success 
following success should be attributed to good 
ability more than success that follows failure, 
and failure following failure should be at- 
tributed to poor ability more than failure 
that follows success. If defensiveness deter- 
mines ability attributions, direction. of out- 
come rather than its consistency with earlier 
outcomes should predict attributions which 
would be higher after success than failure. 

Both the logical and defensive analyses 
predict higher attribution of success than 
failure to effort. If failure occurs in spite of 


Jonn G. NicHoLts 


effort, this represents fairly unambiguous evi- 
dence that lack of effort was not a cause of 
failure. If, on the other hand, subjects try 
and succeed, they can reasonably conclude 
that effort was a cause of success. Therefore, 
the logical analysis predicts higher attribution 
of success to effort than of failure to lack of 
effort. Defensive responding also would lead 
to this result. 

It is assumed that effort is relatively high 
and stable and that it is directly perceived, 
rather than inferred as in the case for ability. 
Given these assumptions, there is no logical 
basis for different attribution of consistent 
versus inconsistent feedback to effort, Success, 
whether preceded by success or failure, is 
more logically attributable to effort than is 
failure to lack of effort, whether this failure 
is preceded by failure or success. That is, di- 
rection of outcome rather than its consistency 
with earlier feedback should predict effort 
attributions. In this case also, defensive bias 
would lead to the same attributions. 

The cues available in a single success— 
failure experience from which the role of luck 
might be logically inferred seem less clear 
than do the cues relative to the other dimen- 
sions. There should, however, be no difference 
between success and failure in the availabil- 
ity or clarity of cues relative to luck. Thus, 
the logical analysis predicts equal attribution 
of success and failure to good and bad luck, 
respectively. Defensive responding would be 
indicated by greater attribution of failure 
than success to luck. 

Because of its unstable nature, luck can be 
more logically invoked as an explanation of 
inconsistent than consistent feedback. That is, 
success after failure and failure after success 
should be attributed to luck more than suc- 
cess after success and failure after failure. 
Defensive bias would lead to greater attribu- 
tion to luck after failure than after success, 
regardless of earlier feedback. 

Two additional hypotheses concerning at- 
tributional bias were formulated. Since moti- 
vational distortion of expectancies reduces as 


+ 


reality constraints increase (Feather, 1965; . 


Litwin, 1966; Nisan, 1972), bias was pre- 
dicted to reduce with increased task experi- 
ence. Defensive bias was also predicted to be 
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>% more marked when attainment value (the 
importance attached to task feedback: Cran- 
dall, Katkovsky, & Preston, 1960) is high 
than when it is low. When a task is thought 
to measure skills important to the self-image, 
there is more reason for defensive or self- 
enhancing interpretations than when the task 
is thought relatively unimportant. To test 
this hypothesis the task used was presented 
either as‘an important ability measure or as a 
measure of an unimportant skill. 

A numberof studies have found boys have 
higher expectancies of success than girls 
(Crandall, 1969: Feather, 1969). Crandall’s 
evidence Gunnested that hen feedback is con- 
tradictory, girls, more than boys, tend to 
focus on negative feedback as a basis for ex- 
pectancy formation.. This study provides 
further evidencé on this point. Further, since 
expectancies are held’ to reflect causal attribu- 
tions (Weiner, 1972)? it was anticipated that 
sex differancess.in: attribution might occur 

(Feather, 1969) and ‘help aċcount for differ- 
“ences in ancies | >, 

, _ Severat"Qiditional hypotheses relevant to 
attribution» theory Were ‘examined. Weiner 
(1972) “suggests. that. affective responses to 
feedbatk-: qre- a'a function of. the degree to 
whith the - duitcome | is seen as internally deter- 
inined’ with: attribution to. effort being par- 
A Piculatl ‘jmpgttant. Extending ‘this predic- 
tign, ifr? emed kely .that both anxious and 
“gris feelings about’ a future test would be 


efuhction of; ‘ther extent to which feedback 
2 i i suggested the test out- 
come would bé dependent on the subject’s 
-effort and ability. Achievement expectancies 
and, effort were ‘likewise predicted to be a 
- , function of catisal attributions (Weiner, 1972; 
Weiner et al., 1971). 

Se The concept of minimal standard of suc- 
_ cess or minimal goal (the lowest level of the 
continuum of possible feedback which the 
subject: perceives as satisfactory) was also ex- 
amined. There is some support for Rotter’s 
suggestion of the usefulness of minimal stan- 
dard (which he holds are relatively stable) in 
predicting achievement behavior (Battle, 
1966; Crandall, Katkovsky, & Preston, 1962). 
Thus, it was of interest to see to what extent 
standards were affected by feedback and 
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whether, as predicted for expectancies, they 
would relate to causal attributions. 


METHOD 
Subjects and Procedure Outline 


Subjects were 96 white fourth-grade children, 48 
boys and 48 girls from two public schools drawing 
on middle-class areas of two southwest Ohio towns. 

The procedure was carried out in a mobile labora- 
tory on the school grounds. Subjects were assigned 
randomly to conditions and to one of three female 
experimenters with the restriction that each experi- 
menter work with equal numbers of each sex in each 
condition. 

Subjects worked on equivalent forms of an angle- 
matching task in two sessions. One was presented as 
a practice session, the other as a test. Task instruc- 
tions were varied to create either high- or low- 
attainment value. The practice session was followed 
by questions about the causes of the practice out- 
come (success-failure) and about cognitions relevant 
to the impending test. The test session was followed 
by questions about the test outcome (success-failure) 
and about an additional test session, Data collection 
was complete at this point. The additional test 
session was given only to subjects failing in the first 
test. To eliminate possible undesirable effects of 
failure, these subjects were “succeeded” in this final 
sess‘on, 


Experimental Task 


The achievement (angle-matching) task was an 
adaptation by Crandall (1963) of one devised by 
James (1957). Subjects sat at a table with a book 
containing 20 pages. On each page was an acute angle 
with 4-inch (10 em) sides. Subjects matched each 
angle with one of eight different “standard” angles 
mounted on cardboard on the wall, 8 feet (2.4 m) 
from the subject. All angles in the book were equi- 
distant in size between some two of the standards, 
with the difference between these standards being 
slight and barely discriminable. Feedback could be 
given regardless of actual choices without arousing 
suspicion about the reality of the feedback. To 
check whether subjects did become suspicious, an 
obtuse angle, clearly differing from those in the 
book, was included among the standards. No subject 
“tested” the procedure by choosing this angle and 
subjects’ behavior indicated they accepted the feed- 
back as genuine. 

Answers were given by pressing buttons numbered 
to correspond with numbers beside the standard 
angles. The buttons were mounted on a 12 X 20 inch 
(30 X 51 cm) unit in front of the subject. The unit 
also held white and green signal lights and a counter 
which recorded the cumulated score. Subjects were 
told a machine scored their answers, but experi- 
menters provided the feedback according to a pre- 
assigned schedule. A white light indicated a correct 
response which was also recorded on the counter. 
No light was shown for an incorrect response. A 
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green light indicated the subject should move to the 
next item, This was shown immediately after the 
white light or, when the response was “wrong,” an 
equivalent time after the response. 

Task instructions were presented with a tape re- 
corder and the procedure demonstrated with a sam- 
ple angle. The task was intentionally novel so that 
attainment value could be readily manipulated. To 
create a high-attainment value condition, instruc- 
tions described it as “a test of how smart you are 
likely to be a couple of years from now.” In the 
low-value condition, it was said to “show how well 
you can tell the size of things which are at a distance 
from you.” 

Having established that the procedure was under- 
stood, the experimenter, referring to a bar graph on 
the wall, said, “Let’s take a look at this chart to 
see how well kids your age usually do on the prac- 
tice angles the first time they try it like you're doing. 
Fourth graders usually get about 12 right, so you'll 
probably be able to get about 12 of the 20.” A card 
with 12 on it was placed on a small stand on the 
table. 

The experimenter, observing through a one-way 
mirror, recorded the time spent on each item on a 
(Datamyte) recorder, On the practice, subjects were 
given either 6 (failure) or 18 (success) correct re- 
sponses. On completion of the practice, the score 
was recorded by the experimenter on the bar graph 
on the wall so the subject could, “see how you did 
compared with other kids.” 

The procedure for the test was the same as for 
the practice, except that the instructions were 
shorter and subjects were told most children get 13 
correct and were themselves given either 7 or 19 cor- 
rect responses. 

Time spent on the task was taken as an index of 
care or effort. It seemed likely that effects of feed- 
back on effort would be most marked toward the 
end of the task when it would be clear whether the 
total score was likely to be low or high. Therefore, 
times for each successive quarter of the sessions, as 
well as total time, were used as dependent measures. 


Measures of Cognitive and Affective 
Responses 


Attributions were measured using a device, such 
as a pie graph, which subjects could adjust to show 
the relative importance of the four possible causes 
of their score. The device was made of four circular 
discs of lightweight cardboard. Each disc was 8 
inches (20 cm) in diameter and a different muted 
color: gray, yellow, green, or blue. They were cut 
along a radius and slipped together, each through 
the cut radius of the others in such a way that they 
could be moved to expose 360 degrees of any color 
or different amounts of any combination of colors. 
The four discs were fastened to a 10-inch (25 cm) 
square board by a broad screw through their centers. 
Small handles, flat clear plastic formed into a right 
angle, were glued to the exposed radii to make it 
easy to adjust the discs to expose any amount of 
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each. Parallel and close to each exposed radius was _ 
printed one of the four possible causal explanations. 
Two devices were made, one for the success and one 
for the failure condition. For succeeding subjects, 
the following alternatives were printed on the discs: 
“The test was easy,” “I tried hard,” “I had good 
luck,” and “I am smart at this.” For failing subjects 
the following alternatives were printed: “The test 
was hard,” “I didn’t try hard,” “I had bad luck,” 
and “I am not smart at this.” 

Instructions pointed out, in counterbalanced or- 
der, each possible cause of the obtained score and 
subjects were shown how to use the device to show 
the relative importance of the four causes; impor- 
tant reasons were given a large segment and less 
important reasons a small segment. After this expla- , 
nation, subjects were presented with the device with ~ 
the four reasons equally exposed and told to show 
why they got the score they did. Subjects were 
intrigued by the device and clearly understood its 
purpose. The number of degrees of each segment 
left exposed was taken as the index of the impor- 
tance of each cause. This procedure is consistent with 
the suggested reciprocal nature of causes (Heider, 
1958), but does mean that there are three degrees of 
freedom among the four attributions. 

Four other questions were presented after the 
practice: 


Expectancy: How many do you think you will get 5 


right on the test of 20 angles? Just circle the score 
you think you will get. 


Subjects recorded their responses on a sheet with 
numbers running from 0 to 20. 


Minimal standard: Probably there are some scores 
you would be pleased and satisfied with and 
others you would not be pleased or satisfied with. 
Show which scores you would feel satisfied with 
and which you would not, by circling either “sat~ 
isfied” or “not satisfied” beside each score. *) 


The answer sheet was numbered 1 to 20 with satis- 
fied on one side of each number and not satisfied 
on the other. The lowest score for which subjects 
circled satisfied was taken as their minimal standard. 


Anxiety: Are you worried about the test you are 
going to do? Answer by circling any one of the 
numbers from 1 to 10 depending on how worried 
you are. 


The answer sheet was numbered 1 to 10 with nots, 
at all worried at 1, and very worried at 10. 


Positive affect: Do you feel good about the test 
you are going to do? Answer by circling one of 
the numbers from 1 to 10 depending on how good 
you feel. 


The answer sheet was numbered 1 to 10 with I 


don’t have a good feeling at 1 and I havi i 
good feeling at 10. eee 


After the test, attributions, minimal standards, 
and expectancies were measured as before. An addi- 
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ig a * tional question was, “How pleased do you feel with 


your score on the test you just did? Circle a number 
from 1 to 10 to show how you feel.” The answer 
sheet was numbered 1 to 10 with pleased at 10 and 
not pleased at 1. 

The attribution question was presented before the 
others, after both practice and test, because in pre- 
testing it was found that subjects were confused by 
having to think back to the task they had just done 
if they had been already asked about the task they 
were about to do. The order of the other questions 
was counterbalanced. 


Summary of Procedure 


The 20-item angle-matching task was presented 
with high- or low-attainment value instructions, 
first as a practice and second as a test. Causal attri- 
butions, expectancies, minimal standards of success, 
and affective responses were measured after both 
the practice and the test. Time spent on both prac- 
tice and test was also measured. 


Analyses 


Analyses of variance were performed for each of 
the dependent variables. For practice time and 
questions given after the practice, the analyses were 


= 2 (Male-Female) X 2 (Attainment Value) X 2 (Prac- 


tice Success-Fail). For test time and the questions 
given after the test the analyses were 2 (Male-Fe- 
male) X2 (Attainment Value) X2 (Practice Suc- 
cess-Fail) X 2 (Test Success—Fail). 


RESULTS AND DISCUSSION 
Ability Attributions 


Practice failure was attributed to poor abil- 
ity more than success was to good ability, 
F (1,88) = 10.96, p < .001. This is at vari- 
ance with the logical analysis and suggests a 
derogatory bias. An interaction of Sex x 
Practice Outcome was found, F (1,88) = 
3.74, p < .06, see Table 1. Girls, but not 
boys, attributed failure to poor ability more 
than success to good ability—simple effects 
girls, success-fail: F (1,88) = 13.75, p< 
001. That is, girls showed a significant self- 
derogatory bias which was not evident for 
boys. 

There was no main effect of test outcome on 
ability attributions. There was a Sex X Test 
Outcome interaction, F (1,80) = 3.80, p< 
.06, but simple effects tests showed only a 
marginal tendency for girls more than boys to 
attribute failure to ability, jie (1, 80) = 3.38, 
p < 10. Thus, there was less evidence of bias 
after the test than after the practice. 
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TABLE 1 

MEAN PERCENTAGE OF TOTAL CAUSE ATTRIBUTED 
TO ABILITY 


Practice Outcome X Sex Interaction 
(attribution after practice) 


Practice outcome 


Sex 
Success Failure 
Female 6.4 42.5 
Male 19.6 29.1 


Practice Outcome X Test Outcome Interaction 
(attribution after test) 


Practice outcome 
Test outcome 


Success Failure 
Success 30.4 12.6 
Failure 12.2 37.4 


Test Outcome X Attainment Value Interaction 
(attribution after test) 


Test outcome 


Attainment value 


Success Failure 
High 30.6 12.4 
Low 17.5 32.0 


An interaction of Test X Practice Outcome 
on ability attributions after the test was ob- 
tained, F (1,80) = 10.27, p < .01, see Table 
1. In accordance with the logical analysis, 
consistent feedback (success-success and fail- 
ure-failure) led to greater attribution to the 
stable dimension of ability than did incon- 
sistent feedback. 

An interaction of Test Outcome X Attain- 
ment Value on ability attribution, F (1, 80) 
= 5.87, p < .05, offered some support for the 
prediction that defensive or self-enhancing 
bias would be greater under high- than under 
low-attainment value conditions (see Table 
1). Self-enhancement was indicated, in the 
high-attainment value condition, by higher 
attribution of success to good ability than of 
failure to poor ability, but this difference was 
not significant. In the low-value condition, 
self-denigration was indicated by greater at- 
tribution of failure to poor ability than of 
success to good ability—simple effects: 
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F (1, 80) = 4.23, p <.05. Perhaps subjects 
feel perfectly free to admit to low ability at 
judging the sizes of objects at a distance (the 
low-value condition) and little desire to claim 
good ability at this. 


Effort Attributions 


Practice success was attributed to high ef- 
fort more than failure was to low effort, 
F (1,88) = 15.11, p < .001. The same trend 
occurred for test success and failure, F (1, 80) 
= 9.90, p < .01. This pattern is consistent 
with the predictions of both the logical and 
bias (defensive) analyses. There was no sig- 
nificant interaction of Practice X Test Out- 
come for effort attributions. This is in ac- 
cordance with the predictions of both analy- 
ses, that effort attributions would be a func- 
tion of direction of outcome rather than con- 
sistency of outcome with earlier feedback. No 
main or interaction effects for sex or attain- 
ment value were found. 


Luck Attributions 


Practice failure was attributed to bad luck 
more than success was to good luck, F (1, 88) 
= 9.90, p< 01. This outcome, contrary to 
the logical analysis, suggests defensive bias. 
A Sex x Practice Outcome interaction was 
obtained, F (1, 88) = 3.81, $ < .05, see Ta- 
ble 2. Boys more than girls attributed failure 


TABLE 2 


MEAN PERCENTAGE OF TOTAL Cause 
ÅTTRIBUTED to LUCK 


Practice Outcome X Sex Interaction 
(attribution after practice) 


Practice outcome 
Sex aey 


Success | Failure 
Female 41 15.4 
Male 49 33.6 


ee 


Practice Outcome X Test Outcome Interaction 
(attribution after test) 


Practice outcome 
Test outcome — =A 


Success Failure 
Success 5.4 25.8 
Failure 36,2 26.0 
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to bad luck—simple effects: F (1,88) = 5.23, t 


$ < .05—and boys, but not girls, had higher 
luck attributions after failure than after suc- 
cess—simple effects, boys’ practice outcome: 
F (1,88) = 13.00, p<.001. Thus, boys 
showed a defensive bias in luck attributions 
while girls did not. 

There was no significant main effect for 
test outcome nor was there a Sex x Test Out- 
come interaction on luck attributions mea- 
sured after the test. Thus, bias was more 
evident after the practice than after the test. 
A significant interaction of Practice x Test, 
Feedback was found, F (1, 80) = 5.85, p< 
:05, see Table 2. Attributions to bad luck 
were significantly higher after failure-failure 
than were attributions to good luck after 
Success-success, F (1,80) = 5.28, p< .05. 
The logical analysis predicts equal and low 
attribution to the unstable element luck, for 
both repeated success and repeated failure, so 
this finding suggests defensive bias after re- 
peated failure. Consistent with the logical, 
analysis, attributions to luck were higher for 
success—fail and fail-success (contradictory 
feedback) than for success-success—simple 
effects, F (1,80) = 11.91, p<.001: F (as 
80) = 5.22, p< .06, respectively. 


Task Difficulty Attributions 


There was no logical basis for different de- 
grees of attribution to task difficulty as a 
function of outcome and in only one case\, 
was there a deviation from this prediction: 
Practice failure, more than success, was at- 
tributed to task difficulty, F (1, 88) = 3.87, 
$ <.05. In summary, both the logical and 
biased analyses of causal attribution received 
Some support. Bias occurred after practice— 
but not test outcome—for ability, luck, and 
difficulty attributions, Tt is possible that the 
test, being called a “test,” was seen as provid-, 
ing more definitive evidence relevant to abil- 
ity than did the “practice.” This, however, 
seems a less likely explanation of the results 
for luck and difficulty. Increased experience 
with a task, regardless of consistency of 
feedback, might lead to reduction of bias, 
Bias in ability attributions varied with at- 
tainment value. The biases associated with 
Sex are particularly interesting: Boys were 
defensive in attributing failure to bad luck 

, 
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while girls’ ability attributions were self- 
derogatory. 


Expectancies 


As found in many previous studies, expec- 
tancies followed feedback. Success on the 
practice led to higher expectancies after the 
practice, F (1,88) = 100.99, p < .001, and 
after the test, F (1, 80) = 40.54, p < .001. 
Test success led to higher expectancies after 
the test, F (1,80) = 118.20, p < .001. 

Boys had higher expectancies than girls 
after the practice—sex main effect: F (1,88) 
= 3.86, p< .05. This result parallels and 
probably reflects boys’ and girls’ different 
biases in attributions after the practice. If it 
is assumed that bad luck can change, boys’ 
defensive attribution of failure to bad luck 
should lead to higher expectancies. Girls’ self- 
denigrating ability attributions would be a 
basis for lower expectancies. 

Expectancies measured after the test were 
also higher for bays than girls, but the effect 
was marginal, F (1, 80) = 3.15, p < .10. The 
interaction of Sex X Practice Outcome x Test 
Outcome on expectancies measured after the 
test was significant, F (1,80) = 4.88, p< 
.05, see Table 3. Girls’ expectancies were 
lower than boys’ when practice success was 
followed by test failure—simple effects: 
F (1,80) = 13.24, p< .001, but the sexes 
did not differ in the success-success, failure- 
failure, or failure-success conditions. This 
outcome supports the hypothesis that girls’ 
expectancies are lower than boys when feed- 
back is contradictory, but only when failure 
follows success, not when the order is reversed. 
Girls’ greater readiness to revise expectancies 
downward after failure follows success could 
reflect their self-denigratory ability attribu- 
. stions. Boys’ tendency to avoid accepting the 
possible negative implications of failure by 
attributing this to bad luck would make for 
higher expectancies in this situation. The 
finding that girls’ expectancies were not 
lower when test success followed practice 
failure might be attributable to a perception 
that test feedback was more definitive. Also, 
boys’ defensive attributional tendencies are 
unlikely to be aroused by test success even if 
preceded by practice failure. 
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TABLE 3 


MEAN EXPECTED SCORES: INTERATCION OF 
SEX X PRACTICE OUTCOME X TEST OUTCOME 


Practice success Practice failure 


Sex 


Test Test Test Test 

Success failure success failure 

Female 17.92 12.67 16.42 9.42 
Male 17.83 16.17 16.75 9.75% 


Minimal Standards 


As predicted, minimal standards were af- 
fected by feedback, but not to the same de- 
gree as expectancies, Success led to standards 
at about the level of the norms presented to 
subjects. After failure, standards were below 
this norm. Main effects for practice outcome 
were found after both practice and test, F (1, 
88) = 6.19, p < .05; F (1,80) = 3.96, p< 
-05, respectively, with success leading to 
higher standards than failure. Test success led 
to higher standards than did failure, but this 
effect was slight, F (1,80) = 2,63, p < .10. 

In only one case were standards related to 
sex. A main effect for sex on standards mea- 
sured after the test showed boys had higher 
standards than girls, F (1,80) = 6.15, p< 
.05. 


Affective Responses 


As might be expected, practice success led 
to higher ratings of “feeling good” about the 
forthcoming test than did failure, F (1,88) 
= 16.92, p < 001. A Sex X Attainment Value 
interaction was also found for this rating, 
F (1,88) = 10.83, p<.001, see Table 4, 
Simple effects tests showed boys felt better in 
the high- than the low-value condition, F (1, 
88) = 13.22, p < .001; boys felt better than 
girls under high value, F (1, 88) = 6.48, p< 
-05; while girls felt better than boys under 
low value, F (1,88) = 4.45, p < .05. 

Thus, as well as being self-denigrating in 
ability attributions, setting lower standards, 
and expecting to do less well than boys, girls 
feel worse than boys when they are told they 
are about to do an important ability test. 
Only when the task is presented as a rela- 
tively unimportant one did girls feel better 
than boys. It appears that boys enjoy the 
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TABLE 4 


MEAN FEELINGS ABOUT TEST: INTERACTION 
OF ATTAINMENT VALUE X SEX 


Attainment value 


Sex 
High Low 
Female 7.46 8.04 
Male 8.92 6.83 


Note. High scores indicate feeling good about forthcoming 
test. 


challenge of an important ability assessment, 
while girls feel better at the prospect of a less 
important test. No significant effects were 
found for the other affective responses. 


Achievement Behavior 


Significant main effects of outcome on 
total working time were found for both prac- 
tice and test—practice, F (1,88) = 6.06, p < 
.05; test, F (1,80) = 8.41, p < .01. Failing 
subjects spent longer than did succeeding sub- 
jects. 

There were two interactions involving sex 
(both just above ~=.05) on time spent on 
the last quarter of the test. First, was a Sex 
X Attainment Value interaction, F (1,80) = 
3.79, p < .06. Simple effects tests showed girls 
spent less time under high- than low-value 
conditions, F (1,80) = 4.54, p < .05. This 
interaction is similar to the effect of sex and 
value on how good subjects said they felt 
about the test. Though simple effects tests 
were significant for different comparisons in 
the two cases, in both cases females showed a 
relatively unfavorable orientation under the 
high-value condition. 

Second was an effect of sex and test out- 
come on time on the last quarter, F (1, 80) 
= 3.76, p < .06. Boys spent more time when 
failing than succeeding (simple effects, suc- 
cess-fail for boys: F = 9.79, p < .01), while 
girls did not differ significantly across success 
and failure. This might reflect the tendency, 
evident after the practice, for boys to at- 
tribute failure to luck which, since luck is 

unstable, allows the possibility that a failing 
trend might reverse. Girls, being inclined to 
attribute failure to poor ability but disinclined 
to attribute success to good ability, would 


have less reason than boys to think tha 
could reverse a failing trend or to thin. 

a succeeding trend could be maintained e 
ily. 


Individual Differences 


Within-cells correlations were com; 
practice and test success and failu 
tions separately, to examine the p 
that individual differences in ati 
would relate to expectancies, minin 
dards, and affective responses. 

In only one of the four condition: ın 
attribution score related to expectan: 
subjects failing on the test, higher e 
cies were associated with attrib 
failure to (lack of) effort (r = .30, 1 
An intention to try harder, coupled wii 
belief that effort would be effective, 
account for this relationship. This | 
provides rather weak support for the | 
sis that individual differences in expe: 
reflect differences in attributions. 

Minimal standards were significan 
related with luck and ability attributions i 
two of the four conditions. For practi 
ure, high standards were associated wi! 
attribution of failure to bad luck (> 
$ < .01) and low attribution of fail 
poor ability (r= —.30, p < .05). I 
success, high standards were associated with 
low attribution of success to good luck T= 
— Al, p < .01) and high attribution of suc 
cess to good ability (r = .32, p < .05). " 
subjects who interpret success or failure į 
way that makes future success likely have 
higher standards than those who intero 
outcomes pessimistically. 

The finding that individual differen 
standards were more consistently relat 


attributions than were expectancies could be: 


because expectancies were more affected by 
condition differences and less by individual 
differences than were standards. (Within-cells 
variances were larger for standards than ex- 
pectancies.) Though expectancies and attribu- 
tions were not clearly related at the within- 
cells level, attributions did parallel and help 
account for sex differences in expectancies, 
Subjects who succeeded on the practice 
were more worried about the test if they 
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attributed success to effort than if they did 
not (r= .29, p<.05) and less worried if 
they attributed success to task difficulty (easy 
task) than if they did not (r = —.35, p < 
05). They also felt better about the forth- 
coming test if success was not thought due to 
effort (r = —.39, p < .01) and was thought 
due to ability (r= .33, p < .05). These re- 
sults are consistent in indicating that a future 
test is anticipated positively if effort is not 
seen as crucial to success and this includes 
the cases in which success is seen as assured 
due to ability or ease of task. Only one sig- 
nificant correlation was found between attri- 
butions and affective responses in the failure 
condition: “feel good” correlated —.29 (p < 
.05) with attribution of failure to task diff- 
culty. Again, the test is anticipated more 
positively when the barriers to success are 
seen as smaller, 

Satisfaction with test success (“How 
pleased do you feel?”) was greater when 
success was not attributed to task difficulty 
(r = —.44, p < .01) and when success was 
attributed to ability (r = .32, p < .05), but 
not when attributed to effort (r = .15, ns). 

The hypothesis that affective responses to 
feedback are mediated by attributions is sup- 
ported by the findings above. Pleasure with 
success is greater when success is determined 
internally (ability) and less when determined 
externally (difficulty). There was, however, 
no support for the more specific hypothesis 
that positive affect is greatest when success 
is seen as a function of effort (Weiner, 1972). 
In fact, attribution of practice success to 
effort is associated with greater anxiety about 
and less positive anticipation of the forth- 
coming test. The relations of attributions with 
affective responses and minimal standards 
seem best summed up by saying that re- 
sponses are more positive and standards 
higher when success is thought to be assured. 

Among the cognitive and affective indices, 
only expectancies were significantly corre- 
lated with times on the task. Within-cells 
correlations for success and failure on the 
practice were —.06 (ns) and —.32 (p < .05) 
and for success and failure on the test, —.32 
(p < .05) and —.35 (p < .05). Longer times 
were associated with lower expectancies. This 
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pattern parallels that found for the main 
effects of outcome: Failure led to lower ex- 
pectancies and longer times. 


GENERAL Discussion 


Previous studies of causal judgments after 
success and failure do not all make the joint 
distinction between internal-external and 
stable-unstable causes. This makes compari- 
son of results across studies difficult, particu- 
larly as this and other studies support the case 
for distinguishing effort and ability within the 
internal dimension and luck and difficulty 
within the external dimension (Weiner et al., 
1971). Feather and Simon (1971), using a 
design very similar to that used here, ob- 
tained results generally consistent with the 
logical analysis. Thus, there is reasonable 
support for the present logical analysis. 
Frieze and Weiner (1971), using a role-play- 
ing method, found effort attribution a func- 
tion of consistency of feedback and ability 
attributions higher after success than failure. 
These results, which differ from Feather and 
Simon’s and those reported here, might re- 
flect the absence of clear cues concerning 
effort in the role-playing situation. Effort cues 
are crucial to the logical analysis. 

Bias in attributions was more marked 
after the practice than after the test and bias 
in ability attributions varied with attainment 
value, Of special interest were boys’ and girls’ 
different biases which were paralleled by sex 
differences in expectancies, standards, and 
achievement effort. These differences suggest, 
as does other evidence (Hoffman, 1972), less 
confidence on the part of girls. The different 
effort deployment of girls and boys on the last 
quarter of the test was such that it would 
confirm their different attributions if feedback 
was actually dependent on effort. Though not 
the case here, in most achievement situations 
outcomes do reflect effort. Thus, by applying 
greater effort when failing than succeeding, 
boys might succeed and thereby confirm their 
belief that failure was due to bad luck. By 
applying similar effort regardless of success or 
failure, girls would not allow success to 
clearly indicate good ability and would thus 
maintain their self-derogatory ability attribu- 


tions. 
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The long-term implications of this evidence 
for girls’ intellectual and achievement devel- 
opment are hardly favorable. One might also 
express concern over boys’ defensive attribu- 
tional bias. However, even if this defensive- 
ness made for less rather than more positive 
achievement behavior, its implications would 
appear to be less serious than the bias found 
for girls. Girls’ bias was evident for the 
stable personal dimension of ability, while 
boys’ bias occurred only for the unstable 
external factor of luck. 

The finding that girls feel and function 
relatively poorly when the task is presented 
as an important ability measure (high- 
attainment value) is consistent with other 
evidence that girls react less positively than 
boys to intellectual challenge (Hoffman, 
1972; Maccoby, 1967). Fear of success 
might, for girls, be aroused more in the high- 
than the low-value condition, as it is in a 
mixed-set competitive situation more than a 
noncompetitive situation for college women 
(Horner, 1972). The concept of role is use- 
ful in integrating diverse motivational phe- 
nomena including that of fear of success in 
women (Alper, 1973; Horner, 1972; Klinger 
& McNelly, 1969). Further, it has been 
shown that motivational and performance 
indices in children are more positive when a 
task is sex-role appropriate than when it is 
not (Montemayor, in press; Stein, 1971; 
Stein, Pohly, & Mueller, 1971). Thus, girls’ 
lower effort and more negative feelings under 
the high-value condition could reflect a per- 
ception of intellectual competence as role- 
inappropriate. Girls’ self-derogatory ability 
attributions and lower expectancies and stan- 
dards could reflect the same phenomenon. 

The relations between attributions and af- 
fective responses is an important issue for 
attribution theory (Weiner, 1972). The find- 
ing that pleasure with success correlated with 
attribution of success to high ability, but not 
with attribution to high effort, appears to 
contradict Weiner and Kukla’s (1970, Experi- 
ment 3) finding of greater pride when success 
was due to little ability and much effort than 
when due to little effort and much ability. 
Weiner and Kukla’s findings, however, might 
be best interpreted as indicating that greater 
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pride is felt when performance is up to ca- 
pacity (due to effort) than below capacity 
(because of lack of effort). The present result 
indicates something different, namely, that 
subjects feel better when they think success 
on a test indicates high ability than when 
they do not think it indicates high ability. 
Subjects were not, however, more pleased 
when success was thought due to greater ef- 
fort. In general they felt better when they 
saw the task as well within their grasp rather 
than dependent on much effort. This could 
help explain why attribution of success to 
ability, but not effort, leads to pleasure: It 
makes future successes seem probable and 
easily secured while attribution of success to 
effort does not. Incidentally, as attribution of 
success to effort is not associated with greater 
pleasure and is associated with less positive 
anticipation of a future test, it is unlikely 
that the finding of greater attribution of 
success than failure to effort reflects self- 
enhancing bias. 

In summary, the study provides some sup- 
port for the logical analysis. It is, however, 
suggested that this analysis is particularly 
useful because deviations from its predictions 
can be taken as evidence of bias in use of 
available information. Sex differences in bias 
were consistent with, and clarify somewhat, 
previous evidence on sex differences in expect- 
ancy and were paralleled by a slight effect on 
achievement effort. 
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Subjects received monetary aid from another member of an experimental 
group. The obligation attached to the aid was varied along with the subjects’ 
perception of the donor's resources. The effects of the aid on subjects’ attrac- 
tion for the donor, desire for coalition, and repayment were assessed. In 
general, subjects were more attracted to a donor who obliged them to return 
the resources than to a donor who asked for nothing in return or for interest. 
Greater attraction was also expressed for a poor as opposed to a wealthy 
donor. Desires for coalition were unrelated to attraction for the donor; sub- 
jects preferred to join with the wealthy rather than the poor donor. However, 


more monetary resources were returned to the 


donor. The same general pattern of 
Japan, and Sweden. 


It is commonly assumed that people re- 
spond favorably to those who provide them 
with benefits, We anticipate that the pleasure 
(or relief from pain) experienced by the 
recipient increases attraction to the donor. 
Such an assumption is amply supported, both 
in everyday experience and in the psycho- 
logical laboratory. Research has shown that 
social attraction can be increased by pro- 
viding another with positive regard (Deutsch 
& Solomon, 1959), monetary reward (Morse 
& Gergen, 1971), agreement with his opinions 
(Jones, Stires, Shaver, & Harris, 1968), and 
various other benefits. That people are at- 
tracted to those who benefit them is a key 
assumption in both the balance (cf. Heider, 
1958) and exchange (cf. Secord & Backman, 
1964) theories of social attraction. 

So widely held is the assumption linking 
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poor as opposed to the wealthy 


findings emerged in the United States, 


benefit with attraction that many investi- 
gators have been stimulated to explore excep- 
tions to the rule. For example, benefit that 
appears insincere (Jones, 1964), delimits the 
recipient’s freedom (Brehm, 1966), appears 
inappropriate (Kiesler, 1965), or which does 
not exceed expectations (Morse, 1972), does 
little to enhance atraction and may some- 
times decrease it. The present research fur- 
thers this line of inquiry by considering the 
exchange properties of human interaction. In 
particular, the present study explores the 
effects of (a) differing levels of obligation 
attached to the benefit and (b) differing 
levels of resources possesed by the donor on 
the attraction of the recipient toward the 
donor. 

We further wished to expand the scope of 
earlier research by exploring the link between 
attraction and subsequent behavior, It is com- 
monly assumed that if one likes another, he 
or she is more likely (a) to benefit the other 
ina variety of ways and (b) to prefer associ- 
ation with this person to a greater extent than 
otherwise. Although such assumptions seem 
consistent with common experience, numerous 
discrepancies between attitudes and action 
have been discovered (cf. Brehm, 1960: 
Campbell, 1963; Cowdry, Keniston, & Cabin 
1970) and invite a closer examination of the 
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OBLIGATION, Donor RESOURCES, AND REACTIONS To AID 


«+ behavioral correlates of attraction. The pres- 


ent experiment thus included measures of 
subjects’ return of resources to the donor and 
of their desire for coalition with him. 


Effects of Obligation 


Benefits exchanged among people typically 
vary with respect to the amount of obligation 
attached to them. Christmas gifts, for exam- 
ple, often carry a heavy burden of obligation 
for the recipient, while emergency aid does 
not. Differences in the amount of attached 
obligation have important implications for 
predicting the recipient’s attraction to the 
donor. The most obvious prediction is that 
obligation is negatively related to attraction: 
The greater the obligation is, the less the 
attraction to the donor. This prediction fol- 
lows from a simple analysis of the recipient’s 
total gain in the exchange. As Homans (1961) 
pointed out, one’s “profit” from a given ex- 
change is equal to his gains minus his costs. 
Obligations to return the benefit may be 
viewed as costs; as the obligations increase, 
one’s profit is thereby diminished. 

Yet closer scrutiny reveals problems with 
the linearity prediction. When little or no 
obligation is attached to benefit, emotionally 
negative reactions may be triggered. For one, 
the donor who gives without seeking a return 
often violates common norms of reciprocity. 
As both Gouldner (1960) and Homans (1961) 
have argued, social organization is vitally de- 
pendent on the commitment of its participants 
to the value of reciprocity. Thus, one who 
proposes a relationship that is not reciprocal 
is violating normative values. Such a donor 
also creates in the recipient what Homans has 
called a “tension of obligation.” The recipient 
feels he should return the benefit but is pre- 
vented or discouraged from doing so. The 
unresolved tension may remain as an irritant. 
A third reason that “something for nothing” 
may fail to engender attraction is that the 
recipient’s suspicions may be aroused. It is 
commonly assumed that people do not part 
with their resources without a reason, and 
usually the reason is that they expect to get 
something out of it. Thus, we may well ques- 
tion the “moral character” (Goffman, 1971) 
of someone who aids us with no apparent way 
of recouping his losses. As interviews with 
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recipients of foreign aid reveal, a country 
that offers “free aid” may be viewed as either 
“manipulative” or “naive” (Gergen & Gergen, 
1971). 

In sum, there is good reason to suspect 
that under many conditions, the relationship 
between the amount of obligation attached to 
a given benefit and the attraction of the 
recipient to the donor is curvilinear. Benefit 
without obligation should yield little attrac- 
tion to the donor and may possibly generate 
hostility. A similar reaction should occur 
when obligation exceeds the amount of bene- 
fit received. Greatest attraction should occur 
when the recipient is obligated to return re- 
sources equaling what has been received. In 
effect, the most amicable relationship should 
occur when the exchange is reciprocal. 


Donor Resources 


Research has consistently shown that the 
value of any given resource is subject to a 
variety of influences. This value is altered as 
the individual’s needs fluctuate (Walster, 
1965), as he compares his rewards with those 
received by others (Adams, 1965), and as 
the donor’s motives for providing the resource 
change (Dickoff, 1961). Another central fac- 
tor determining the value a recipient places 
on benefit is its cost to the donor, When the 
act of benefit appears costly to the donor, it 
may have an added value for the recipient: 
Tt suggests that the donor truly has regard 
for him, and the value of the benefit is there- 
by enhanced. When there is little cost to the 
donor, the recipient cannot be certain of the 
donor’s true concern for him. In effect, the 
recipient determines the affective disposition 
of the donor from the amount of effort he 
perceives the donor has devoted to providing 
help (Jones & Davis, 1965). Consistent with 
this reasoning, Pruitt (1968) found that sub- 
jects provide greater reward to one who has 
sacrificed a large versus small proportion of 
his resources for them, and Tesser, Gatewood, 
and Driver (1968) found that subjects indi- 
cate more gratitude for a favor of high cost 
to a donor than the same favor when of little 
cost. 

In the present experiment, the perceived 
cost to the donor was manipulated by chang- 
ing the recipient’s perception of the donor’s 
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resources. Half of the subjects found that, by 
chance, the donor had accumulated large 
monetary holdings; the remaining half per- 
ceived that the donor had been less fortunate 
and was thus giving from a much smaller 
pool of resources. Based on the above argu- 
ments, it seemed reasonable to suspect that in 
the present context, the low-resources donor 
would elicit greater attraction than the high- 
resources donor, that more of his resources 
would be returned to him, and that he would 
be a more popular candidate for subsequent 
coalition. 


Aid in Cross-National Perspective 


One major shortcoming of laboratory ex- 
periments is their limited range of generality. 
Whether a particular set of findings can be 
generalized to any other setting or population 
or remains stable through time usually re- 
mains conjectural. Replication of experiments 
enhances their generality, and replication 
across diverse populations is especially desir- 
able. The greater the diversity of settings in 
which a given function form emerges, the 
more powerful are its predictive capacities 
and the more robust it is in terms of temporal 
and situational generality. Thus the present 
study was conducted in three quite different 
cultural settings: the United States, where 
the capitalistic motif is said to pervade modes 
of exchange; Sweden, where a strong social- 
istic idealogy prevails; and in Japan, where 
strong traditions of obligation are said to 
guide reactions to assistance (cf. Befu, 1966; 
Benedict, 1946). 


METHOD 
Overview 


Groups of six subjects participated in a procedure 
ostensibly designed to explore decision making in 
cooperative and competitive conditions. The pro- 
cedure was said to take place in two phases. During 
the first part, each subject was provided with mone- 
tary resources and then required to wager them in a 
game of chance. As the wagering progressed, each 
subject found himself losing heavily. At a critical 
point in the procedure, he received a gift of addi- 
tional resources from what appeared to be one of 
the other subjects. For half of the subjects, the 
donor appeared to be that individual in the group 
who had acquired the greatest amount of money 
through the wagering (high-resources donor), while 
the remaining half of the subjects believed that the 
gift came from a subject with little money remain- 
ing (low-resources donor). Written communication 


from the fictitious donor was used to establish vary- 
ing levels of obligation. One third of the subjects in 
each of the above conditions was told that nothing 
need be returned to the donor (low obligation), 
another third that the donor wished to have the 
money returned at the end of the session (equal 
obligation), and the final third that the donor wished 
the money returned with interest (high obligation). 
Later in the procedure measures were obtained of 
the subjects’ attraction to the donor, their desire to 
form a coalition with him, and the amount of 
money they actually returned. The same procedure 
was followed in the United States, Japan, and 
Sweden. Thus the design was a 3 X 2 X 3 (National 
Setting X Donor Resources X Degree of Obligation) 
factorial design with 10 subjects in each of the 18 
cells of the design. 


Subjects 


Subjects were 180 males between the ages of 18 
and 25 years. The American subjects were Harvard 
University students who responded to advertisements 
offering money for participation in psychological 
research. The Japanese subjects were students at 
Waseda University in Tokyo who volunteered to 
participate in return for monetary payment. Swedish 
subjects were also college students who at the time 
of the experiment were participating in a summer 
military training program required of all males in 
the country. Their participation was made possible 
by the Swedish military authorities, No subjects 
manifested suspicion of the procedure, so all were 
included in the analysis of the data. 


Procedure 


Subjects participated in the experiment in groups 
of six. Each participant was greeted by an experi- 
menter of the same national origin as himself. The 
experiment was described as being concerned with 
decision making under conditions of economic coop- 
eration and competition. The procedure was to take 
place in two phases. In the first or preparatory 
phase, each subject was to participate by himself in 
a series of wagers. He would be provided with the 
same basic monetary resources at the outset, and 
depending on matters of luck (not skill), he could 
increase or decrease his holdings. Once these differ- 
ences in economic resources had been established, 
subjects would enter the second phase of the experi- 
ment. During this phase, subjects would be divided 
into three teams of two members each. Each team 
would then compete with the others in a game of 
skill. Depending on the skillfullness of decision mak- 
ing in this phase, additional funds could be, won or 
lost. However, in either phase, should any individual 
or team lose all resources, forced retirement from 
the procedure would follow. If funds were exhausted, 
there would be no additional opportunity for that 
person or team to acquire money. Subjects were 
further told that they could keep all winnings 
acquired during the procedures and that this sum 
could be substantial. 

Subjects were then placed in six separate booths 
so arranged that the experimenter could make an- 
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nouncements to the group as a whole but subjects 
would be unable to communicate with each other 
either visually or verbally. Subjects were then told 
that the booths were each assigned a letter from 
A to F and that in all Subsequent communications 
the assigned letter would be used for identification. 
By design, however, each subject was assigned the 
same letter designation. Subjects were then provided 
with a set of poker chips worth $4 (or its rough 
equivalent in purchasing power in Sweden and 
Japan) and a wagering board. On the board were 
printed the numbers 1, 2, 5, and 10, each of which 
represented the number of chips the subjects would 
be required to wager on the various trials. Thus, on 
the first trial, a subject would be required to place 
a single chip on the wagering board; on the second, 
2 chips; on the third trial, 5 chips; and so on until 
three rotations had been made through the series. 
Subjects were informed that the order of the num- 
bers on the wagering board differed for each of 
them, Thus the amount won or lost by each indi- 
vidual would vary from trial to trial. Wins or 
losses for each trial would be determined by the 
throw of four dice, each die corresponding to one 
of the four numbers on the board. Odd rolls indi- 
cated a win and even rolls a loss. Thus, if on a 
given trial the die corresponding to the number 10 
position on the wagering board were odd, any player 
who had been required to wager 10 chips on that 
trial would win an additional 10 chips, If that 
particular die were even, the subject would lose the 
10 chips he had wagered. Through this procedure 
the welfare of the subjects could be systematically 
altered, and differences in amount of resources 
could be created in such a way that inferences re- 
garding the personality of the donor could not be 
made on the basis of his monetary holdings. 

Subjects were told that if they wished to com- 
municate with each other, they could write notes 
which the experimenter would deliver. The specific 
identity of the others would remain protected by 
the letter designations to be used in communication. 
However, they were told they might wish to com- 
municate concerning partnerships for the second 
phase, etc. In addition to envelopes and paper for 
such communication, each subject was also furnished 
a scoring sheet, On this sheet he was to maintain a 
running record of the number of chips he possessed 
at the end of each trial, along with the number of 
chips possessed by each of the other participants. 
This task was facilitated by the experimenter, who 
publicly announced the number of chips held by 
each participant on each trial. This procedure made 
evident the amount of resources held by the person 
subsequently providing benefit. 

During the game, all subjects were exposed toa 
common experience. The outcomes of the dice were 
essentially fixed so that a standard pattern of wins 
and losses could be provided to all. Although sub- 
jects were allowed to win on certain trials, they 
typically lost when large amounts were wagered. It 
also appeared, however, that other participants were 
far more fortunate. The running tabulations made 

-it clear to each subject that he was the least fortu- 
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nate of the group. On the eighth trial, subjects 
found they only had 12 of their supply of 40 chips 
remaining. If they were to lose on the subsequent 
trials, their funds would be completely exhausted 
and they would be forced to retire from the pro- 
cedure. (Instructions prevented subjects from using 
their personal funds). 

At this critical juncture each subject received aid 
from what appeared to be one of the other partici- 
pants. The experimenter, who visited each booth at 
the end of each trial in order to deliver winnings, 
collect losses, or transfer messages, brought with him 
(or her) an envelope. The envelope contained a note 
from what was ostensibly one of the other partici- 
pants and a gift of 10 chips (worth $1 or its 
equivalent). The message was especially designed to 
create one of the three different levels of obligation. 
In all cases the note indicated that the sender had 
noticed the subject was running out of chips and 
might not make it to the next game; thus he was 
sending the 10 chips in the envelope. On a random 
basis a third of the subjects (low-obligation condi- 
tion) also found that their note went on to say, 
“Don’t bother repaying these. I don’t need them.” 
In the equal-obligation condition, this latter portion 
of the note was replaced with, “You can use them 
now and pay me back when you have enough.” 
Subjects in the high-obligation condition were also 
asked to return the chips with the additional note 
that “maybe you can do something for me later on.” 
The significance of this statement was enhanced by 
earlier instructions suggesting that teams in the sec- 
ond phase might efficiently organize themselves into 
junior and senior decision makers, with team win- 
nings split accordingly. 

The note was also used to create differences in 
the perception of donor resources. At random, half 
of the subjects in each of the above conditions found 
that the note had been sent by Player C, the player 
whom the running tabulation revealed to be the 
richest in the group (high-resources donor), ‘The 
remaining half of the subjects found that the note 
had been sent by Player F, one of the poorer play- 
ers (low-resources donor). The high-resources donor 
thus appeared to be giving about 17% of his re- 
sources to the subject in contrast to the poorer 
donor, whose gift represented 38% of his remaining 
capital. To reinforce this manipulation, a sentence 
was included in the note to the subjects indicating 
that the donor had either 66 or 26 chips (56 or 16 
after the gift) and thus enough to spare the 10, In 
effect, after the gift the poor donor actually had 
fewer chips than the subject. During the three 
remaining trials, all subjects lost chips but did not 
exhaust their supply. The donated chips thus pro- 
vided needed security, and they could now be re- 
turned in full. 


Dependent Measures 


The effects of these variations were assessed in 
three ways: 


1. Attraction ratings: At the close of the initial 
phase, subjects were told that there would be a brief 
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waiting period while the experimenter arranged the 
teams for the next round. During this waiting period 
subjects were to respond to various questionnaires 
that would shed light on the decision-making proc- 
ess. One of the questionnaires asked each subject to 
make a series of evaluations of “anyone who might 
have sent you a note during the initial phase.” Five 
evaluation forms were provided for this task, but 
inasmuch as all notes sent by the subjects to each 
other had been intercepted, only one of them could 
actually be used. On this form subjects thus evalu- 
ated the standardized stimulus person. 

The rating form itself was composed of 16 anto- 
nym pairs, with polar antonyms each separated by 
an 8-point scale. Six randomly placed pairs were 
designed to tap generalized attraction toward the 
stimulus person. They were as follows: friendly- 
unfriendly, considerate-inconsiderate,  open-closed, 
easy to get along with-difficult to get along with, 
fair-unfair, and intelligent-not particularly intelli- 
gent. Although determined on an a priori basis, the 
dimensions did prove coherently related. The split- 
half reliability of the measure was .88.1 Other 
antonym clusters assessed subjects’ perception of the 
donor’s intentions and of the various experimental 
manipulations. 

2. Return of resources: All notes from subjects to 
the donor were intercepted during both the gambling 
and waiting periods. Tabulation was made of the 
number of chips each subject returned to the donor. 

3. Coalition choice: During the waiting period, 
subjects were informed that they would be able 
to select their partners for the team competition, 
However, because of possible conflicts in choices, it 
would be necessary to obtain rankings of preference 
from each player. The subjects were thus asked to 
rank all of the other players in order of their pref- 
erence for subsequent partnership. 


In addition to these various measures, subjects 
were finally asked to fill out a questionnaire tapping 
their feelings about various aspects of the experi- 
ment. Responses to this questionnaire were used to 
ensure that the manipulations had been perceived as 
intended, to explore motives, to assess possible sus- 
picion, etc. 


RESULTS 
Validation of Manipulations 


We can first examine independent evidence 
concerning the success of the various manipu- 
lations. First, regarding the manipulation of 


1In the American sample, nine antonym pairs 
were initially used in the attraction cluster. How- 
ever, because three of these pairs showed little 
relationship to the general cluster scores in either 
the Japanese or Swedish samples, the measure was 
reduced to six dimensions for all analyses reported. 
As Tanaka (1972) has shown, the precise dimensions 
used to tap generalized evaluation of persons or 
objects are not constant across cultures. 
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donor resources, one of the antonym pairs 
inserted into the evaluation measure was the 
dimension lucky—unlucky. If subjects attended 
to the resources of the person benefiting them, 
they should have rated the high-resources 
donor as luckier than the low-resources donor. 
A 2X3 x3 analysis of variance of ratings 
on this dimension yielded a significant F 
ratio of 165.45 (p < .0001). Subjects were 
clearly sensitive to the differential resources 
of the high- and low-resources donors. 

In order to check on subjects’ understand- 
ing of obligation differences, subjects were 
asked to rate the donor on the dimension 
wants nothing from me—wants something 
from me. An analysis of variance of ratings 
along this dimension yielded only a single 
main effect, this effect stemming from differ- 
ences in obligation (F = 3.11, p < .05). The 
means along the dimension followed a linear 
pattern, with subjects in the low-obligation 
condition showing the lowest rating (X = 
2.10) and subjects in the high-obligation con- 
dition manifesting the highest (X = 4.69). It 
was also found, however, that in the Swedish 
sample, the high-obligation condition subjects 
did not display the increased ratings found 
in the other two countries. There was, in 
fact, a slight but nonsignificant tendency for 
the high-obligation donor to be seen as want- 
ing less than the equal-obligation donor. Thus, 
while the obligation manipulation seemed suc- 
cessful in both the U.S. and Japan, we can- 
not be certain that it was in the Swedish 
high-obligation condition. 

With substantial evidence that the experi- 
mental variations were perceived as intended, 
we can proceed to examine the effects of the 
variations on subjects’ attraction to the donor, 


their reciprocation, and their desire for coali- 
tion. 


Social Attraction 


It will be recalled that the attraction mea- 
sure was composed of the summed ratings of 
the stimulus person along six different evalua- 
tive dimensions. The results of a three-way 
analysis of variance conducted on these rat- 
ings is featured in Table 1. The presence of a 
significant F ratio for both main effects and 


interactions invites closer examination of the 
results. 
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TABLE 1 


SUMMARY OF ANALYSIS OF VARIANCE 
or Arrracrion RATINGS 


eae} 


Source df MS F 

High versus low resources (A) 1 | 97.60 | 3.98* 
Low-equal-high obligation (B) 2 | 76.32 | 3.08* 
U. S.-Sweden-Japan (C) 2 | 52.43 | 2.12 
AKB 2 | 13.02 
AXC 2 | 81.79 | 3.09* 

x C 4 | 62.44 | 2.56* 

xXBXC 4 | 38.07 

Error 162 | 24.78 

Pas RSs 


It was first ventured that the relationship 
between the amount of obligation attached to 
benefit and attraction to the donor would be 
curvilinear in character. The analysis of vari- 
ance reveals that the main effect for the 
obligation variable is significant (F = 3.08, 
Ż <.05). Differences in attraction produced 
by three levels of obligation in the three sepa- 

, tate countries appear in Figure 1. Here it is 
found that the curvilinear pattern does indeed 
emerge. In all three nations we find that sub- 
jects in the low-obligation condition are less 
attracted to the donor than those in the equal- 
obligation condition. Collapsing across na- 

. tions, the difference between the low and 

` equal conditions proves significant beyond the 

.05 level (F = 3.82). Thus, we find a donor 

who asks an equal return more positively 
evaluated than one who provides benefit with- 
out obligation. 

Turning to the high-obligation donor, we 
find that in two of the three nations, subjects 
are less attracted to him than to the equal- 
obligation donor. Collapsing across nations, 
the difference between the equal- and high- 
obligation conditions does not reach the .05 
level of probability (F = 2.38, p < .11). The 
major reason for this lack of significance is 
the deviant case of the Swedish group, in 
which the high-obligation condition produces 
a slight increment in attraction. If we confine 
our analysis. to the American and Japanese 
subjects alone, the F ratio does reach the 
customaty level of significance, F = 4.14, 
p <.05. It thus appears that in all condi- 
tions in which the manipulation of obligation 
was successful the curvilinear pattern emerges. 
In the Swedish high-obligation condition, 
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where there is no such evidence, we find no 
effects. An orthogonal components test for 
curvilinear trends yields similar results, The 
test is just shy of significance (p < .08) when 
the Swedish data are included; without them, 
the test reaches an acceptable level of signifi- 
cance (p < .05), 

It is also shown in Table 1 that there is a 
significant Obligation x Nationality interac- 
tion (F = 2.56, p< .05). An appraisal of 
the means in Figure 1 shows that, in Japan, 
the hypothesized curvilinear relationship is 
accentuated; the American subjects also pro- 
duce the same pattern, but the equal-obliga- 
tion inflection is far less marked. In Sweden, 
as we have seen, we do not find the antici- 
pated dip in the high-obligation condition. In 
the latter condition, the Japanese evaluate 
the stimulus person less positively than do the 
Swedish subjects (p < .01). 

Turning now to the effects of donor re- 
sources on attraction, we find in Table 1 a 
significant (F = 3.98, p< .05) main effect 
for this variable. In Figure 2 we have plotted 
the mean attraction ratings of the high- and 
low-resources donor for each of the three 
nations. It had been hypothesized that the 
greater the resources of the donor, the less 
effective his potential for engendering attrac- 
tion. In Figure 2 we find that in each of the 
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Ficure 1. Attraction to the donor in three nations 
as a function of obligation level. 
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Ficure 2. Attraction to the donor in three nations 
as a function of donor resources. 


three countries, the hypothesized relationship 
is found. In each case the mean attraction 
rating for the low-resources donor exceeds 
that of the high-resources donor. The results 
thus provide substantial support for the hy- 
pothesis. 

However, it is also shown in Table 1 that 
there exists a Donor Resource X Nationality 
interaction (F = 3.09, p< 05). Figure 2 
shows that, in Japan, the difference between 
the high- and low-resources donor is accentu- 
ated. In America, we find a moderately strong 
relationship. However, in Sweden, we find 
the means to be only slightly separated. 
Orthogonal comparisons reveal that the dif- 
ference between the high- and low-resources 
donor in Japan is significantly greater (F = 
4.01, p < .05) than that found in Sweden, 
A further comparison of the high- and low- 
resources conditions within Sweden alone does 
not reveal a significant difference. 


Return of Resources 


The results of a three-way analysis of vari- 
ance conducted on the number of chips re- 
turned by subjects in each of the 18 condi- 
tions appears in Table 2. It is first noted that 
differences in the obligation attached to the 
benefit produced no significant effects (F < 
1.00) on the number of chips returned. Thus, 
regardless of differences in attraction pro- 
duced by obligation and by differential obliga- 
tions incurred in accepting the resources, 
there were no effects on the actual return of 
the chips to the donor. We shall return to this 
matter in the discussion section. 

When we examine the effects of donor re- 


TABLE 2 


SUMMARY or ANALYSIS OF VARIANCE OF RESOURCES 
RETURNED TO Donor 


Source df MS F 

High versus low resources (A) 1 | 223.49 | 13.89* 
Low-equal-high obligation (B) 2] 31.81 
U. S.-Sweden—Japan (C) 2 | 24.76 
AXB 2 79 
AXC 2 3.52 
BXC 4 1,05 
AXBXC 4] 12.05 

Error 162 | 16.09 

*p <.01 


sources, however, we find a striking effect on 
the number of chips returned. As Table 2 
indicates, the F value for this variable is 13.89 
($ < 001). Included in Figure 3 is the mean 
number of chips returned to the high- and 
low-resources donor within each of the three 
countries. In each country subjects returned 
somewhat over 3 chips (of the 10 received) 
to the donor with low personal resources. 
When the donor gave from a position of fi- 
nancial strength, the mean number of chips 
returned dropped to approximately 1 in each 
case. Although these differences may not be 
dependent on differential feelings of attrac- 
tion to the donor, it is noteworthy that the 
low-resources donor produces greater attrac- 
tion and also secures a greater return on his 
gift than does the high-resources donor. No 
other main effects or interactions demand our 
attention in the case of returned resources. 


Coalition Prejerences 


It will be recalled that all subjects were 
asked to rank their preferences for a partner 
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Ficure 3. Resources returned to the donor in three 
nations as a function of donor resources, 
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= on the team competition that was to follow 


shortly. Since subjects ranked all participants, 
it was possible to extend the present analysis 
in an important way. Half of the subjects 
received benefit from the high-resources do- 
nor; however, their rankings also included 
the low-resources participant, who in this 
case, did not provide benefit. Similarly, those 
who received from the low-resources donor 
were also obliged to rank the high-resources 
participant, a nondonor in this case. By in- 
cluding in the same analysis the rankings of 
the high- and low-resources participants both 
when they served as a donor and when they 
did not, it was possible to assess the effects 

- of being a donor over and above possessing a 
large or small amount of resources. 

In Table 3 are featured the results of a 
four-way analysis of variance of preference 
rankings for all subjects. It is first observed 
that the obligation variable did not produce a 
main effect, nor did it contribute to any inter- 
actions. Thus, although subjects were differ- 
entially attracted to the stimulus person as a 
function of the obligation attached to his 
gift, this attraction in no way influenced 
their desire to have him as a partner in the 
competition. 

Two other variables did have marked ef- 
fects. Differences in the resources held by 
another player (regardless of whether he is 


TABLE 3 


Summary or ANALYSIS OF VARIANCE OF 
COALITION RANKINGS 


Source 


High versus low resources (A) 33,20*** 


Giver versus nongiver (B. 
Low-equal-high obligation (©) 
Y. S.-Sweden-Japan (D) 
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Ficure 4. Coalition ranking as a function of re- 
source ownership and donating aid. 


a donor) produced an F of 33.29 (p < .001). 
The mean rankings for the high- versus the 
low-resources donor (collapsed across coun- 
tries) appears in Figure 4. An examination 
of these means makes it clear that the high- 
resources donor is more preferred as a team- 
mate than the low-resources donor. Possessing 
money makes one a highly desirable partner, 
and this desirability appears to overshadow 
personal attractiveness in importance. Table 
3 also reveals a significant main effect (F = 
22.70, p < .001) due to giving benefit versus 
not giving. An examination of Figure 4 re- 
veals the nature of this difference. Holding 
constant the donor’s resources, simply giving 
resources to the recipient significantly in- 
creased the likelihood of the donor’s being 
preferred in the subsequent coalition, Receiv- 
ing benefit, then, does increase the recipient's 
desire to affiliate with the benefactor. 
Consulting Table 3, we also find that these 
general conclusions must be modified in light 
of the significant (p < .025) interaction be- 
tween high- versus low-resources donor and 
giving versus not giving benefit. An examina- 
tion of Figure 4 not only clarifies the mean- 
ing of this interaction but bears importantly 
on our earlier discussion of resource differ- 
ences. Here we find that if the donor is high 
in resources, providing aid increases his at- 
tractiveness as a partner only slightly. In 
contrast, if the donor is low in resources, 
giving aid has a marked impact on his pref- 
erability. In this case, he jumps from ap- 
proximately a fourth-place ranking (of 5) to 
just below second place. The significant inter- 
action indicates, then, that the increase in 
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preference for a low-resources partner is en- 
hanced to a greater extent when he provides 
benefit than if he is a high-resources donor. 
When one has money, he appears to be a 
desirable partner regardless of his attractive- 
ness; a ceiling may be reached in one’s 
desirability. However, when one has a few 
resources, he can produce strong changes in 
his desirability by providing benefit. 

One final result deserves mention. As Table 
3 indicates, there is a significant ($ < .05) 
Donor Resources X Subject Nationality inter- 
action. An examination of individual nation 
means reveals that Japanese subjects are 
much less influenced by the differences in the 
wealth of the donor than are subjects in 
either the U.S. or Sweden, The mean increase 
in rank for a high- versus low-resources donor 
in the U.S. is 1.25, while in Sweden it is 1.72. 
In Japan, the difference is much lower (X = 
47). Whether this difference is due to cul- 
tural dispositions or to the specific meaning 
that wealth of the donor takes on in this 
experimental context cannot be determined by 
the present findings. 


Discussion 


Let us first briefly summarize the major 
findings of the study. 


1. Differences in the amount of obligation 
associated with benefit had significant effects 
on the recipients’ attraction toward the donor, 
Those who received benefit without obligation 
evaluated the donor less positively than those 
who were asked for an equal amount in re- 
turn. The donor who asked for interest on 
his benefit was evaluated less positively (ex- 
cept in Sweden) than the donor who proposed 
an equal exchange, The general relationship 
between obligation and attraction thus ap- 
peared to be curvilinear. Differences in obli- 
gation had no effect, however, on either the 
amount of benefit the recipient actually re- 
turned to the donor or the recipient’s desire 
for subsequent coalition with him. 

2. Differences in the resources possessed by 
the donor produced marked effects on recipi- 
ent attraction. Recipients evaluated the donor 
with meager resources significantly more 
positively than the donor whose resources 
were substantial. Recipients also returned 
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significantly more resources to the donor with >_< 


low resources. However, in selecting a team- 
mate, the desire for money apparently out- 
weighed considerations of attraction. Sub- 
jects showed a significant preference for a 
partner with large resources. It was further 
found that if a participant with low resources 
helped the subject, he was viewed as signifi- 
cantly more desirable as a partner than if he 
did not help; this increment in preference was 
significantly less when the other participant 
had high resources. 

3. Substantial similarities in function form 
were found across the three national groups. 
With the exception of a single mean, the 
curvilinear relationship between obligation. 
and attraction was found in all countries. The 
low-resources donor was evaluated more posi- 
tively than the high-resources donor in all 
three samples, and in each nation more re- 
sources were returned to the former than the 
latter. Subjects in all nations also preferred a 
coalition with the high-resources donor as 
opposed to the low-resources donor, and in all 
cases the increment in preferability produced 
by the low-resources person’s gift exceeded 
that produced by the high-resources person. 
We thus appear to be dealing with functional 
relations of broad generality. 


Several additional issues are raised by the 
results. The first concerns the reasons for the 
negative effects of benefit without obligation. 
Tt was suggested that such benefit not only 
Tuns counter to commonly held values of 
reciprocity but also creates an unrelieved 
tension of obligation and possible suspicion of 
the donor’s motives. In the present experi- 
ment, we cannot determine precisely which, 
if any, of these dynamics was responsible 
for the differences in attraction. However, 
post hoc analysis does yield useful insights. 

After the experiment all subject rated the 
donor’s communication on the dimension in- 
appropriate—appropriate. If subjects applied 
the reciprocity norm in this situation, we 
should expect that benefit obligating them to 
an equal return would be seen as more ap- 
propriate than either the low- or high-obliga- 
tion benefit. The data lend support to this 


explanation. The equal-obligation communi- 


cation was viewed as more appropriate than 
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either the high- or low-obligation communica- 
tions (for low obligation, X = 5.00; for equal 
obligation, X = 6.87; for high obligation, X 
= 6.06). A £ test comparing the equal-obliga- 
tion condition with the remaining two condi- 
tions combined yields a significant difference, 
t (178) = 3.22, p < .01. Subjects also rated 
the donor on the dimension dislikes me — likes 
me. If subjects were suspicious of the low- 
obligation donor, they should have been less 
inclined to feel he liked them than subjects 
in the equal-obligation condition. The donor’s 
usury in the high-obligation condition should 
also have yielded a decrement in ratings of 
the donor’s attraction. An examination of 
these means shows that subjects in the equal- 
obligation condition rated the donor highest 
in perceived liking (X = 6.40; for low obli- 
gation, X = 5.49; for high obligation, X = 
5.59). The ¢-test comparison between the 
equal-obligation condition and the other two 
conditions combined was again significant, 
t (178) = 2.94, p <.01. 

In sum, evidence suggests that several dif- 
ferent motivational processes may have been 
operative in producing the differences in at- 
traction. Additional research is necessary to 
isolate the specific contribution of each. How- 
ever, a more significant pursuit would be to 
explore conditions under which benefits with- 
out obligation yield greater rather than less 
attraction than equal obligation. If people 
could be assured that the donor would never 
hold them responsible to return the favor 
(e.g., the benefit was anonymous), that they 
would not “lose face” in the eyes of others, 
and that the benefit was positively motivated, 
the results in the low-obligation condition 
might be reversed. 

An additional question concerns the rela- 
tionship between attraction and behavior, The 
present study employed an attraction measure 
and two behavioral indexes. In general, the 
results suggest that the relationship between 
positive feelings toward a person and positive 
behavior, such as benefiting or joining him, 
is far more complex than usually assumed. 
Although subjects evaluated the equal-obliga- 
tion donor more positively than either of the 
other two donors, they were not more likely 
to return his resources, nor were they Be 
ready to choose him as a teammate. Subjects 
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showed greater attraction to the low- than to 
the high-resources donor, and they were also 
more likely to return his resources. However, 
they were significantly less likely to choose 
him as a partner. These results do not support 
the contention that positive feelings neces- 
sarily produce positive actions. 

One case does emerge in which positive 
sentiments seem to engender positive behav- 
ior: The low-resources donor is liked more, 
and more of his resources are returned. How- 
ever, these duplicated patterns do not ensure 
that the differential feelings of attraction 
produced the differences in return of re- 
sources. To examine this relation more 
closely, subjects scoring in the upper and 
lower third of each condition in attraction to 
the donor were isolated, and an assessment 
was made of the number of chips returned 
within each group. Collapsing across countries 
we find, surprisingly, that subjects who were 
least attracted to the donor returned more 
chips to him than those who were most 
attracted. The relationship nears statistical 
significance, £ (106) = 1.72, p < .10. In ef- 
fect, there appears to be a negative relation- 
ship between attraction to the donor and 
return of his resources. Perhaps subjects feel 
that their attraction ratings or coalition 
choices operate as a recompense for the 
money they have accepted or their return of 
resources frees them from further obligation. 
Additional research is needed on this point. 

In general, it appears that attraction is 
only one of a number of motives influencing 
behavior. As such, it may compete with other 
motivational ends. In the present study, for 
example, subjects may have been motivated 
by a desire to make money, to avoid the 
guilt produced by receiving money from 
someone without funds, and to act positively 
toward a valued person. At times, one or 
more of these motives could be coordinated, 
while at others, the motives could conflict. In 
these latter cases, the strongest motive should 
determine the direction of behavior. Thus, 
desires for money may have overcome the 
motive to act positively toward a valued per- 
son. 

Significant questions may be raised about 
our present interpretation of the results. For 
example, Maslach, Tashiro, and Gergen (Note 
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1) have shown that Americans and Japanese 
differ significantly in what they value in a 
partner in this experimental situation. Ameri- 
cans want a partner who makes good deci- 
sions and is competitive; the Japanese desire 
one who is cooperative and shows responsi- 
bility. In this light, we cannot be precisely 
sure of the subjective meaning of our various 
manipulations across the three cultures. Both 
obligation and donor resources may well have 
different connotations or behavioral implica- 
tions from one culture to another. Our re- 
sults do indicate a behavioral similarity across 
cultures but are inconclusive regarding under- 
lying motives or rationale. Nor, finally, can 
we be certain that these patterns would be 
found in more primitive or less technologi- 
cally advanced cultures. Although the present 
samples represented highly dissimilar cul- 
tures, all share in a material prosperity that 
might well have colored their reactions to 
economic aid. 


REFERENCE NOTE 
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Leader Esteem for the Least Preferred Co-Worker Score, 
Training, and Effectiveness: 
An Experimental Examination 


Martin M. Chemers, Robert W. Rice, Eric Sundstrom, and William M. Butler 
University of Utah 


A laboratory experiment supported Fiedler’s Contingency Model analysis of 
leadership training. The effect of leadership training on group performance 
was contingent on leadership style, that is, the leader’s relation to his least 
preferred co-worker (LPC). Training improved the performance of high LPC 
leaders, but was detrimental to the performance of groups with low LPC 
leaders. High intelligence leaders profited more from training than low intel- 
ligence leaders, but low intelligence leaders were more productive overall. 
Leader LPC X Intelligence interactions indicated that follower satisfaction and 
interpersonal affect were highest for groups with high LPC-low intelligence 
leaders or low LPC-high intelligence leaders. Implications for contingency 


theories of leadership are discussed. 


A contingency approach to leadership, inte- 
grating person and situation, has long been 
considered the most appropriate and poten- 
tially productive orientation (Gibb, 1954; 
Mann, 1959; Stogdill, 1948). However, the 
full operationalization of a contingency theory 

. was not accomplished until Fiedler’s (1967) 
presentation of the Contingency Model of 
Leadership Effectiveness. The Contingency 
Model maintains that successful group per- 
formance is contingent on the interaction of 
leader attributes with specific parameters of 
the leadership situation. 


Contingency Model 


The leader attribute incorporated in the 
Contingency Model is an individual disposi- 
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tion measured by the “esteem for the least 
preferred co-worker (LPC) scale,” a series of 
bipolar adjective scales on which the individ- 
ual describes the one person in his experience 
with whom he had the most difficult time 
getting a job done. The LPC score, which is 
not appreciably correlated with any com- 
mon personality measure (Bass, Fiedler, & 
Kreuger, Note 1), has generally been con- 
sidered a measure of the leader’s motivation 
(Fiedler, 1967, 1972a). Leaders who rate 
their least preferred co-worker quite nega- 
tively (low LPC) are considered primarily 
motivated toward successful task perform- 
ance. High LPC leaders, who give relatively 
positive ratings to their least preferred co- 
worker, are thought to be primarily motivated 
toward establishing interpersonal relationships. 

The relationship between leader LPC score 
and group effectiveness is mediated by the 
degree to which the group task situation gives 
the leader influence and control over the 
group. This influence and control, labeled 
situational favorableness, is in turn deter- 
mined by (a) the affective relationship be- 
tween leader and followers, especially the fol- 
lowers’ acceptance of the leader, (b) the 
degree to which the group’s task is structured, 
specified, and unambiguous, and (c) the lead- 
er’s power to reward or punish the followers. 
These three variables are dichotomized and 
ordered to represent an 8-point continuum of 
situational favorableness, 
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Figure 1 presents the curve relating leader 
LPC to group performance across the 8 
points, or “octants,” of the situational favor- 
ableness dimension. The curve indicates that 
low LPC leaders have the most effective 
groups under either very favorable or very 
unfavorable conditions, while high LPC lead- 
ers have the most effective groups under con- 
ditions of intermediate favorableness. 

Recent research has established the predic- 
tive validity of the Contingency Model analy- 
sis of leadership effectiveness (see Chemers 
& Rice, in press; Fiedler, 1971). However, 
many writers (e.g., Mitchel, Biglan, Oncken, 
& Fiedler, 1970; Shaw & Costanzo, 1970; 
Hollander, Note 4) have called for the exten- 
sion of the Contingency Model to encompass 
a wider range of leadership phenomena. In 
response, Fiedler has recently proposed sev- 
eral extensions of the basic model (Fiedler, 
1971, 1972a, 1972b). One important exten- 
sion attempts to explain the elusive effects of 
leadership training. 


Leadership Training Hypothesis 


Fiedler (1970, 1972b, 1973) has recently 
proposed that leadership training, rather than 
changing the leader, changes the situation. 
Training provides structure to a formerly am- 
biguous situation: it enhances the leader’s in- 
fluence and control by providing him with 
rules, procedures, and solutions for formerly 
ambiguous and unstructured situations, By 
providing structure in either the task or 
interpersonal aspects of the leader’s job, train- 
ing modifies the situation in a more favorable 
direction. The Contingency Model indicates, 
however, that improvements in situational 
favorableness do not have uniformly bene- 
ficial results for all leaders, (See Kerr & 
Harlan, 1973, for a critique of this analysis.) 

Table 1, taken from Fiedler (1973), speci- 
fies the nature of the relationship between 
training, LPC, and effectiveness, A change in 
situational favorableness may move some 
leaders into a zone of the favorableness di- 
mension where they are now most effective, 
but it can also move other leaders out of their 
most effective situation into a zone where 
they are relatively less effective even though 
the situation is more favorable. For example, 

in a situation of intermediate favorableness, 
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an untrained high LPC leader has relatively- 
more effective groups than an untrained low i 
LPC leader. However, if training provides the j 
structure necessary to move the situation into | 
a highly favorable zone, then among trained | 
leaders low LPC-led groups would be more + 
effective than high LPC-led groups. Thus, 
while training improves the performance of 
low LPC leaders, it is actually detrimental ` 
to the performance of high LPC leaders. The 
net result of training taken over all leaders 
would be zero, which may explain why 
leadership training programs so often fail to ¿$ 
show consistent positive effects (see Camp- 
bell, Dunnette, Lawler, & Weick, 1970; 
House, 1967; Odiorne, 1964).. ay 
A number of recent field studies have ad- © 
dressed the training hypothesis (Csoka .& 
Fiedler, 1972; Fiedler, 1972b). In four 2 
settings, Csoka and Fiedler (1972) investi- 4 
gated training effects on various military 
personnel including artillery sergeants, petty 
officers in charge of naval aviation mainte- _ 
nance shops, army company commanders, and 
mess sergeants. In these experiments, the 
number of years of training were used to 
specify task structure, that is, leaders with 
considerable training were classified as having 
high task structure while leaders with little r 
training were classified as having low task 4 
Structure. Using group atmosphere scores and 
training to determine situational fayorable- 
ness, the correlations between leader LPC and 
group productivity were consistent with Con- 
tingency Model predictions. These results 
provided strong support for the training ii 
hypothesis, that is, leadership training modi- 
fied the situational favorableness of the 
leaders’ position in the direction predicted. N 


Leader Intelligence 2. 


Csoka (Note 3) has reported that leader- 4} 
ship experience, like training, may improve — 
, but only for intel- 


experience of leadership. 

also plays a moderating 
fects when training is s 
intense. For this reason 

intelligence measures wer 
the present study, 
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Leader position power STRONG WEAK STRONG WEAK STRONG WEAK STRONG WEAK 


Fıcure 1. Relationship of leader LPC (least preferred co-worker) to group effectiveness as 
moderated by situational favorableness, (Taken from Fiedler, 1967.) 


Fiedler, O’Brien, & Ilgen, 1969; Rice & 
Chemers, 1973), but no consistent body of 
findings addresses such variables as follower 
satisfaction, enthusiasm, and liking within the 
Contingency Model context. The present au- 
thors agree with Hollander (Note 4), that 


Follower Satisfaction 


The primary focus in the evaluation of the 
: Contingency Model has been on group pro- 
$ & ductivity. A few studies have examined fol- 
. ‘lower effects (e.g., Chemers, 1969; Chemers, 
iedler, Lekhyananda, & Stolurow, 1966; 


TABLE 1 
Summary or HYPOTHESES REGARDING THE EFFECTS OF TRAINING AND EXPERIENCE 


E Performance level Performance level of 
of leaders with leaders without ade- ; 
adequate training quate training and Predicted effect of 
t and experience g experience training and experi- 
Favorableness of situation Favorableness of situ- ence for previously 
for trained and experi- ation for inadequately | untrained leader 
enced leader LPCa experienced leader LPC 
High Low High Low LPC 
Very favorable Poor | Good Moderate Good Poor High decreases 
Low increases 
Moderately favorable Good | Poor Unfavorable Poor Good High increases 
Low decreases 
Unfavorable Poor | Good Very unfavorable Good? | Poor? High decreases 
Low increases 


the Effects of Leadership Training and Experience from the Contingency Model: A Clarifica 


ion” by 


F. Ne riedler, Tonal of Applied Psychology, 1973, 57, 110-113. Copyright 1973 by the American Psychological Ass 
Reprinted by ‘permission. 
PLPC = Teast preferred co-worker. 
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leadership research must pay greater atten- 
tion to the total phenomenon of group activ- 
ity. Thus, the present study included mea- 
sures of follower responses in an attempt to 
gain some understanding of the relationship 
among follower effects, leader attributes, and 
situational factors, 


Present Research 


The present study is an experimental exam- 
ination of the leadership training hypothesis 
(Fiedler, 1972b, 1973). Individuals selected 
on the basis of LPC and intelligence scores 
led three-man groups in a task requiring the 
deciphering of coded messages. Half of the 
leaders received training in code solution prior 
to the task. Measures were taken of group 
productivity, satisfaction, enthusiasm, and 
interpersonal affect. 

Leadership training was expected to im- 
prove the favorableness of the situation for 
all leaders receiving the training. However, 
based on the training hypothesis, this im- 
provement in favorableness was not expected 
to improve the performance of all leaders. 
Specifically, an interaction between LPC and 
training was predicted; under conditions of 
good group atmosphere, training was expected 
to improve the performance of low LPC 
leaders and hinder the performance of high 
LPC leaders. Leader intelligence was expected 
to augment training effects, that is, to inten- 
sify the interaction of LPC x Training on 
group productivity. No specific predictions 
were made for group satisfaction measures, 


METHOD 
Experimental Overview 


A2X2X2 factorial design included high and low 
levels of leader LPC, leader intelligence, and train- 
ing. Leaders were pretested and selected on the basis 
of high and low LPC scores and high and low intel- 
ligence and were randomly assigned to a leadership 
training or control condition. After training, leaders 
were assigned to groups composed of themselves and 
two followers. The groups worked on an unstruc- 
tured, intellectual-manipulative task requiring the 
solution of coded messages. Leader’s position power 
was controlled at a low level and leader-member 
relations were assumed to be good, because of 
past research involving ad hoc laboratory groups 
(Posthuma, Note 5),1 

The effects of training can be thought of as medi- 
ating task structure, resulting in group task situa- 
tions approximating Octant IV of the favorableness 
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leader-member relations, low task structure, low 
position power) and Octant II for trained leader 
groups (i.e., good leader-member relations, high task 
structure, low position power; see Figure 1). 

Postexperimental questionnaires for leaders and 
members assessed group atmosphere (as a check for 
leader-member relations), perceptions of the task, 
tension and anxiety, satisfaction, and followers’ 
perception of the leader. 


Subjects 


Subjects were 80 members of the Army ROTC 
contingent enrolled in a military leadership course 
at the University of Utah and 40 undergraduate 
students enrolled in a course in introductory psy- 
chology at the same university. The ROTC cadets 
were required to participate in the experiment as 
part of their course work. The introductory psychol- 
ogy students received extra course credit for their 
participation and were volunteers. 


Measures 


All subjects were administered the LPC scale 
(Fiedler, 1967) and a form of the Quick Word Test 
as a measure of intelligence, The Quick Word Test 
(Borgatta & Corsini, 1964) correlates quite highly 
(r = .80-.90) with more extensive tests of verbal 
intelligence, for example, Lorge-Thorndike Intelli- 
gence Tests and the Otis-Lennon Mental Ability 
Test. 

Each group was composed of an ROTC cadet 
leader drawn from the upper or lower third of the 
distribution of LPC scores (LPC equal to or below 
56 or equal to or above 70) and from above or 
below the median on intelligence. The leaders were 
further randomly assigned to a training or no- 
training condition. Each group also had one cadet 
follower and one psychology student follower, who 
were randomly assigned to groups, 

Leader—member relations were not manipulated; it 
was assumed that all groups had good relations, 
Leader power, the third determinant of situational 
favorableness was controlled at a low level. Instruc- 
tions designed to control power told the group that 
the leader was responsible for the coordination of 
group activities and the ultimate performance of the 
group, but he had no real power to reward or punish 
group members. 


Task 
The task required the groups to decipher coded 


messages, or cryptograms, Each group was presented 
with 18 cryptograms in a standard order. All crypto- 


p ‘ leness octants 
Corresponding to good leader-member relations, S 


- 
* grams followed a letter substitution format in which 
each letter of the English alphabet was substituted 
for one other letter of the alphabet. The code 
remained constant‘ throughout a single cryptogram 
but was different for each of the 18 cryptograms. 
Each group had three copies of each cryptogram. 
Difficulty level was controlled by supplying the 
group with difficult or key letters. The average 
decoding time per message was between 5 and 6 
minutes. A single crytogram with solution is shown 
below. 


GKKW GK MW WQK CWMWRET WDKCOMZ GFVIRTJ 


MEET ME AT THE STATION TUESDAY MORNIN 
I WILL BE DRESSED IN A BLACK OVERCOAT 


The deciphering of cryptograms represents a rela- 
tively unstructured task for the novice, primarily 
because there are no clear procedures to follow in 
obtaining a solution, That is, the task tends to be 
quite high in “goal-path multiplicity’ (Shaw, 
Note 6). 

Training 

Half of all leaders received a 30-minute training 
session immediately prior to the group task session. 
This training consisted of a lecture-demonstration- 
participation lesson in cryptogram solution. 

While decoding procedures are not generally 
known to the novice, a number of such procedures 
do exist and the knowledge of these procedures 
drastically reduces the ambiguity of the task and 
supplies a set of operating procedures. The training 
session gave the leaders information on letter fre- 
quencies, frequency and distribution of vowels, 
double letters, one and two letter words, common 
endings, etc. Leaders were also given practice in 
the solution of three cryptograms, similar in form to 
those used in the experiment. 


Posttask Questionnaires 


Following the task session, all group members 
‘filled out a short questionnaire designed to measure 
several aspects of the group interaction. Leader and 
members completed an 8-item Group Atmosphere 
scale intended to:assess the nature of the leader- 
member relations.2 Leaders and members also rated 
perception of the task, perceived success of the 
group, and perception of interpersonal relations. In 
addition, leaders rated their perception of the fol- 
lowers’ contribution, while the followers rated their 
perception of the leader. Both leader and follower 
questionnaires included 23 analyzable scores, 


RESULTS 


Group Atmosphere 


An examination of the results on the group 
atmosphere measures for leaders and members 


LEADER LEAST PREFERRED CO-WORKER, TRAINING, AND EFFECTIVENESS 


405 


indicated that the assumption of good leader- 
member relations was not upheld. Posthuma 
(Note 5) reviewed a considerable body of 
research on the Contingency Model and re- 
ported norms for leader and follower group 
atmosphere. Since the Group Atmosphere 
scales vary in number of items, Posthuma 
reported mean item group atmosphere scores. 
The average score was 6.70 (SD =.81) on 
an 8-point scale. The mean item scores in 
the present experiment were 5.78 for leaders 
and 5.58 for followers. A £ test against an 
exact hypothesis (Hays, 1963, p. 311) indi- 
cated that the means of the present experi- 
ment were significantly lower than the nor- 
mative scores: for leaders, #(39) = 7.09, p < 
0001; for members, #(39) = 8.61, p < .0001. 

On the basis of this evidence, it was con- 
cluded that groups in the present experiment 
were in the poor group atmosphere portion of 
the situational favorableness dimension, Thus, 
rather than approximating Octants IT and IV 
of the favorableness continuum as originally 
intended, the present experiment provided a 
potential replication of Octants VI and VIII.* 
Based on the group atmosphere data, the 
experimental hypotheses were adjusted; low 
LPC-led groups were expected to perform 
best in the untrained leader condition (ap- 
proximating Octant VIII, an unfavorable 
situation), and high LPC-led groups were ex- 
pected to perform best in the trained leader 
condition (approximating Octant VI, a mod- 
erately favorable situation). 


Group Productivity 


Training and LPC, The primary hypothe- 
sis predicted a Training X LPC interaction 
described immediately above. It was obtained 
at an extremely significant level (F = 24.61, 


2 The eight 8-point bipolar adjective scales included 
in the Group Atmosphere scale were tense-relaxed, 
enjoyable—unenjoyable, enthusiastic—unenthusiastic, 
interesting—boring, satisfying-dissatisfying, difficult- 
easy, stressful-unstressful, and fulfilling-frustrating. 

3 Anecdotal reports by experimental personnel sup- 
port this interpretation. The cadet leaders appeared 
agitated and expressed great concern with good per- 
formance. It was later learned that the ROTC cadre 
had admonished the cadets to perform at a high 
level. Tension may have been augmented by the 
presence in each group of a psychology student 
volunteer, who, it may be assumed, was considerably 
less motivated and involved in the task. 
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Ficur 2, Interaction of Training X LPC (least pre- 
ferred co-worker) on group productivity. 


p < 00003). Figure 2 indicates that high 
LPC-led groups were most productive in the 
trained leader condition (moderate favorable- 
ness), while low LPC-led groups were most 
productive in the untrained leader condition 
(low favorableness). Pearson product-moment 
correlations between LPC and group per- 
formance were +.79 for the trained leader 
condition and —.65 for the untrained leader 
condition. As predicted, no significant main 
effects for LPC or training were found on 
group productivity. 

Leader intelligence. In a significant effect 
of intelligence (F = 5.20, p < .03), low intel- 
ligence leaders had higher group productivity 
(£ = 8.10) than did high intelligence leaders 
(X = 7.24). A pattern of marginal and sig- 
nificant effects on the postsession question- 


TABLE 2 
INTELLIGENCE EFFECTS 
Mean follower ratings Low High F 

Stressfulness (single item) 4.27 | 487| 3.61* 
Anxiety (composite of 2 

items) 9.25 |10.52 | 4.67** 
Importance to leader of 

leader-member relations 

(single item) 6.47 5.55 | 4.42** 
Worthwhile task (single item) | 6.6 5.32 | 10.50*#* 


^ Single items scores range from 1 to 7, composite scores in- 
clude 1 to 7 scales for each item. 
*p <.07. 


* p < 104, 
HD < 003. 
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Ficure 3. Interaction of Training X Intelligence on 
group productivity. 


naires indicated that followers of low intelli- 
gence leaders also placed a greater importance 
on doing well and had less stressful groups. 
Table 2 gives the means, F values, and 
probability levels for these effects. 

Training and intelligence. The interaction 
of Training x Leader Intelligence was sig- 
nificant (F = 4.74, p< .04), indicating that 
high intelligence leaders show a greater bene- 
fit from training (see Figure 3). High 1Q 
leaders showed improvement in group produc- 
tivity while low IQ leaders actually had a 
slight decrement in performance as a result 
of training. 

A pattern of Training x Intelligence inter- 
actions on the postsession questionnaire indi- 
cates that the more effective leaders (i.e., 
untrained low intelligence, trained high intel- 
ligence) placed greater importance on the 
task and on maintaining good relations, The 
followers of those leaders saw the leader’s 
behavior as being more task oriented and 
more appropriate. These effects are listed in 
Table 3, 

The hypothesizezd LPC x Training x In- 
telligence Interaction, in which intelligence 


was expected to augment training effects, was 
Not significant. 


Interpersonal Relations and Follower 
Satisfaction 
IPC x Intelligence, 


The only consiste: 
pattern of relationship. = 


S on these measures 
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TABLE 3 
INTERACTION OF TRAINING X INTELLIGENCE 
Untrained Trained 
Variable F 
Low IQ High IQ Low IQ High IQ 
Leaders’ rating 
Importance of good leader-member 
relations 7.10 6.10 6.50 7.20 4,62" 
Importance of task performance 7.50 6.70 7.00 7.30 4.150% 
Followers’ rating: 
Leader task-oriented (1) versus rela- 
tionship oriented (8) 24 3.25 3.05 2.65 3.33* 
Appropriateness of leader’s behavior 6.6 6.3 6.8 IET) 3.46** 
Perceived success 7.05 6.05 6.60 7.10 13:8540 
*p <08. 
wp <0: 
ED < 04. 
7 b < 0007. 


was a strong and recurrent interaction of 
LPC x Intelligence, in which low LPC-high 
intelligence leaders and high LPC-low intelli- 
gence leaders had more involved and 
satisfied groups. Table 4 lists the LPC x In- 
telligence interactions. We noted several sig- 
nificant Training x LPC x Intelligence inter- 
actions, but they were not consistently 
interpretable. They do indicate, however, that 
situational variables may moderate LPC and 
intelligence effects. 


Discussion 


The most important finding of the present 
experiment was the extremely strong support 
for Fiedler’s (1972b) Contingency Model 
analysis of leadership training. Further, the 
training results provide an important new 


extension and refinement of the situational 
favorableness dimension. By adding the train- 
ing variable to the favorableness dimension, 
the strength of relationships and patterns of 
variance accounted for have drastically risen 
(Fiedler, 1972b; Fiedler & Chemers, 1974). 
These findings further demonstrate that any 
leadership theory which ignores a general 
contingency orientation will have a difficult 
time in accounting for group productivity, 
and possibly for group morale as well. 
Although strong, the results of the present 
experiment did not clearly indicate why low 
LPC leaders are more effective under very 
favorable or very unfavorable conditions, 
while high LPC leaders are more effective 
under conditions of intermediate favorable- 
ness. A pattern of findings on earlier pilot 
work with the training hypothesis (Chemers, 


TABLE 4 
INTERACTION OF LPC X INTELLIGENCE 


Low LPC High LPC 
Variable | F y 
Low IQ | HighIQ | LowIQ | High1Q 
Leader ratings 
T of good leader-member relations | 6.50 7.10 7.10 6.20 3.60 07 
Importance of task performance 7.00 7.20 7.50 6.80 3.18 08 
Foll ti 
‘Sadkiacaok (Gompette of two items) 12.25 13.70 13.80 11.95 8.97 .005 
Group atmosphere 44.35 45.30 47.45 43.25 4.17 -05 
Perceived success 6.80 6.95 6.85 6.20 3.94 .06 
Worthwhile task 6.30 5.75 6.90 4.90 3.43 07 
Anxiety (composite of two items) 8.90 11.45 9.60 9.60 4.67 04. 


Note. LPC = least preferred co-worker. 
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Note 2), may shed some light on this prob- 
lem. In these studies, trained and untrained 
leaders worked on a decision-making and 
construction task. Groups were required to 
assess some information, decide on a strategy 
for construction of Tinker-Toy models, and 
proceed with construction. A pattern of non- 
significant trends indicated that among un- 
trained leader groups, high LPC-led groups 
excelled in the decision-making phase, while 
low LPC-led groups excelled in the produc- 
tion phase of the task. When ambiguity and 
difficulty in decision making were removed by 
leadership training, the low LPC-led groups 
outperformed high LPC-led groups by virtue 
of the higher output. Thus, high LPC-led 
groups may be better able to integrate and 
interpret complex information, but low LPC- 
led groups have greater productive output, 
This interpretation is consistent with the 
“differential matching hypothesis” of Foa, 
Mitchell, and Fiedler (1971). Those authors 
conclude that high LPC leaders are more 
cognitively complex than low LPC leaders. 
Group productivity depends on the appropri- 
ate matching of leader complexity with the 
complexity of the task situation. Overall 
group effectiveness then depends on the spe- 
cific demands of the situation. These pilot 
results were nonsignificant trends and any 
conclusions must be drawn with great caution. 
Nonetheless, these findings may represent a 
first step in understanding LPC effects. 

Results of the present study suggest that 
leader intelligence can also affect the relation- 
ships between LPC and effectiveness, The 
main effect for intelligence on productivity 
appears to be the result of high intelligence 
leaders being less interested in and generating 
less enthusiasm on the assigned task. The 
generalizability of such results is certainly 
open to question. The present groups had 
unusually poor leader-member relations, and 
the nature of the task employed may have 
affected interest and motivation. 

The interaction of Training x Intelligence 
on group performance seems to indicate that 
high intelligence leaders did indeed have 
greater benefits from training than did low 
intelligence leaders. These findings, which 
might seem rather obvious, do, however, 
highlight the importance of adding new 


leader attributes, such as intelligence, to a 
contingency analysis of leadership effects. 

The need for integrating the intelligence 
variable appears to be especially important in 
considerations of leader—follower relations and 
follower morale. The strong and recurrent 
interaction of LPC x Intelligence indicates 
that neither of these variables alone is 
sufficient to account for the effects. 

One possible explanation of this interaction 
relates to the nature of the leader’s cognitive 
functioning. Mitchell (1970) reports that 
LPC and cognitive complexity are positively 
related. Thus, the high LPC-high intelligence 
leader is both cognitively complex and intel- 
ligent, a combination which may separate him 
intellectually from his followers. Conversely, 
the low LPC-low intelligence leader is both 
simple and slow, which likewise may not 
make him a very interesting leader. However, 
the leaders with the highest morale groups 
were those who were either complex (i.e., 
high LPC-low intelligence), or bright (i.e., 
low LPC-high intelligence), but not both, 
Thus, an optimal level of cognitive function- 
ing of the leader might be hypothesized. The 
extremely tentative nature of this theorizing 
cannot be overstated. Many other plausible 
hypotheses might be generated. The present 
interpretation reflects a bias toward a 
complexity interpretation of LPC effects. 

The results of the present experiment sup- 
port and extend the Contingency Model and 
highlight the importance of a contingency 
approach to the area of group morale as well 
as productivity. The results with respect to 
training have important practical as well as 
theoretical consequences. The Contingency 
Model appears to be making important 
advances in both these directions, 

A final word of qualification is in order, 
The term leadership training has most often 
been identified with interpersonal or human 
relations training. Clearly, the training pro- 
vided in the present experiment should be 
regarded as more similar to technical training, 
that is, training specifically designed to in- 
crease the task competence of the leader. It 
is quite conceivable that training for inter- 
personal effectiveness would have quite dif- 
ferent effects. Future research must direct 
itself toward the generality of training effects, 
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Beautiful but Dangerous: Effects of Offender Attractiveness and 
Nature of the Crime on Juridic Judgment 


Harold Sigall and Nancy Ostrove 
University of Maryland 


The physical attractiveness of a criminal defendant (attractive, unattractive, 
no information) and the nature of the crime (attractiveness-related, attrac- 
tiveness-unrelated) were varied in a factorial design. After reading one of 
the case accounts, subjects sentenced the defendant to a term of imprison- 
ment. An interaction was predicted: When the crime was unrelated to attrac- 
tiveness (burglary), subjects would assign more lenient sentences to the 
attractive defendant than to the unattractive defendant; when the offense 
was attractiveness-related (swindle), the attractive defendant would receive 
harsher treatment. The results confirmed the predictions, thereby supporting 
a cognitive explanation for the relationship between the physical attractiveness 
of defendants and the nature of the judgments made against them. 


Research investigating the interpersonal 
consequences of physical attractiveness has 
demonstrated clearly that good-looking people 
have tremendous advantages over their un- 
attractive counterparts in many ways. For 
example, a recent study by Miller (1970) pro- 
vided evidence for the existence of a physical 
attractiveness stereotype with a rather favor- 
able content. Dion, Berscheid, and Walster 
(1972) reported similar findings: Compared 
to unattractive people, better-looking people 
were viewed as more likely to possess a vari- 
ety of socially desirable attributes. In addi- 
tion, Dion et al.’s subjects predicted rosier 
futures for the beautiful stimulus persons— 
attractive people were expected to have hap- 
pier and more successful lives in store for 
them. Thus, at least in the eyes of others, 
good looks imply greater potential. 

Since physical attractiveness hardly seems 
to provide a basis for an equitable distribu- 
tion of rewards, one might hope that the 
powerful effects of this variable would occur 
primarily when it is the only source of infor- 
mation available. Unfair or irrational conse- 
quences of differences in beauty observed in 
some situations would cause less uneasiness if, 
in other situations given other important data, 
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respondents would tend to discount such 
“superficial” information. Unfortunately, for 
the vast majority of us who have not been 
blessed with a stunning appearance, the evi- 
dence does not permit such consolation. Con- 
sider, for example, a recent study by Dion 
(1972) in which adult subjects were pre- 
sented with accounts of transgressions sup- 
posedly committed by children of varying 
physical attractiveness. When the transgres- 
sion was severe the act was viewed less nega- 
tively when committed by a good-looking 
child, than when the offender was unattrac- 
tive. Moreover, when the child was unattrac- 
tive the offense was more likely to be seen as 
reflecting some enduring dispositional qual- 
ity: Subjects believed that unattractive chil- 
dren were more likely to be involved in future 
transgressions. Dion’s findings, which indicate 
that unattractive individuals are penalized 
when there is no apparent logical relationship 
between the transgression and the way they 
look, underscore the importance of appear- 
ance because one could reasonably suppose 
that information describing a severe trans- 
gression would “overwhelm the field,” and 
that the physical attractiveness variable 
would not have any effect. 

Can beautiful people get away with mur- 
der? Although Dion (1972) found no differ- 
ences in the punishment recommended for 
offenders as a function of attractiveness 
Monahan (1941) has suggested that beauti- 
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ful women are convicted less often of crimes 
they are accused of, and Efran (1974) has 
recently demonstrated that subjects are much 
more generous when assigning punishment to 
good-looking as opposed to unattractive trans- 
gressors, 

The previous findings which indicate a 
tendency toward leniency for an attractive 
offender can be accounted for in a number of 

- ways. For example, one might explain such 
results with the help of a reinforcement-affect 
model of attraction (e.g., Byrne & Clore, 
1970). Essentially, the argument here would 
be that beauty, having positive reinforcement 
value, would lead to relatively more positive 
affective responses toward a person who has 
it. Thus we like an attractive person more, 
and since other investigators have shown that 
liking for a defendant increases leniency (e.g., 
Landy & Aronson, 1969), we would expect 
good-looking (better liked) defendants to be 
punished less than unattractive defendants, 
Implicit in this reasoning is that the nature of 
the affective response, which influences 
whether kind or harsh treatment is recom- 
mended, is determined by the stimulus fea- 
tures associated with the target person. There- 
fore, when other things are equal, benefit ac- 
crues to the physically attractive. A more 
cognitive approach might attempt to explain 
the relationship between physical appearance 
and reactions to transgressions by assuming 
that the subject has a “rational” basis for 
his responses. It is reasonable to deal harshly 
with a criminal if we think he is likely to 
commit further violations, and as Dion’s 
(1972) study suggests, unattractive indi- 
viduals are viewed as more likely to transgress 
again. In addition, inasmuch as attractive in- 
dividuals are viewed as possessing desirable 
“qualities and as having relatively great poten- 
tial, it makes sense to treat them leniently. 
Presumably they can be successful in socially 
acceptable ways, and rehabilitation may re- 
sult in relatively high payoffs for society. 

There is at least one implication that fol- 
lows from the cognitive orientation which 
would not flow readily from the reinforce- 
ment model. Suppose that situations do exist 
in which, because of his high attractiveness, 
a defendant is viewed as more likely to trans- 
gress in the future. The cognitive approach 
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suggests that in such instances greater punish- 
ment would be assigned to the attractive 
offender. We might add that in addition to 
being more dangerous, when the crime is 
attractiveness related, a beautiful criminal 
may be viewed as taking advantage of a God- 
given gift. Such misappropriation of a blessing 
may incur animosity, which might contribute 
to severe judgments in attractiveness-related 
situations, 

In the present investigation, the attractive- 
ness of a defendant was varied along with the 
nature of the crime committed. It was rea- 
soned that most offenses do not encourage the 
notion that a criminal’s attractiveness in- 
creases the likelihood of similar transgressions 
in the future. Since attractive offenders are 
viewed as less prone to recidivism and as 
having greater potential worth, it was ex- 
pected that under such circumstances an 
attractive defendant would receive less pun- 
ishment than an unattractive defendant in- 
volved in an identical offense. When, how- 
ever, the crime committed may be viewed as 
attractiveness-related, as in a confidence game, 
despite being seen as possessing more poten- 
tial, the attractive defendant may be regarded 
as relatively more dangerous, and the effects 
of beauty could be expected to be cancelled 
out or reversed. The major hypothesis, then, 
called for an interaction: An attractive de- 
fendant would receive more lenient treatment 
than an unattractive defendant when the 
offense was unrelated to attractiveness; when 
the crime was related to attractiveness, the 
attractive defendant would receive relatively 
harsh treatment. 


METHOD 
Subjects and Overview 


Subjects were 60 male and 60 female undergradu- 
ates. After being presented with an account of a 
criminal case, each subject sentenced the defendant 
to a term of imprisonment. One-third of the sub- 
jects were led to believe that the defendant was 
physically attractive, another third that she was un- 
attractive, and the remainder received no information 
concerning appearance. Cross-cutting the attractive- 
ness variable, half of the subjects were presented 
with a written account of an attractiveness-unrelated 
crime, a burglary, and the rest with an attractiveness- 
related crime, a swindle. Subjects were randomly 
assigned to condition, with the restriction that an 
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equal number of males and females appeared in 
each of the six cells formed by the manipulated 
variables. 


Procedure 


Upon arrival, each subject was shown to an indi- 
vidual room and given a booklet which contained 
the stimulus materials. The top sheet informed sub- 
jects that they would read a criminal case account, 
that they would receive biographical information 
about the defendant, and that after considering the 
materials they would be asked to answer some ques- 
tions. 

The case account began on the second page. 
Clipped to this page was a 5X8 inch card which 
contained routine demographic information and was 
identical in all conditions.‘ In the attractive condi- 
tions, a photograph of a rather attractive woman was 
affixed to the upper right-hand corner of the card; 
while in the unattractive conditions, a relatively 
unattractive photograph was affixed. No photograph 
was presented in the control conditions. 

Subjects then read either the account of a burg- 
lary or a swindle, The burglary account described 
how the defendant, Barbara Helm, had moved into 
a high-rise building, obtained a pass key under 
false pretenses, and then illegally entered the apart- 
ment of one of her neighbors. After stealing $2,200 
in cash and merchandise she left town. She was 
apprehended when she attempted to sell some of the 
stolen property and subsequently was charged with 
breaking and entering and grand larceny, The 
swindle account described how Barbara Helm had 
ingratiated herself to a middle-aged bachelor and 
induced him to invest $2,200 in a nonexistent corpo- 
ration, She was charged with obtaining money under 
false pretenses and grand larceny. In both cases, the 
setting for the offense and the victim were described 
identically. The information presented left little 
doubt concerning the defendant's guilt. 

The main dependent measure was collected on the 
last page of the booklet. Subjects were asked to 
complete the following statement by circling a num- 
ber between 1 and 15: “I sentence the defendant, 
Barbara Helm, to —— years of imprisonment.” 
Subjects were asked to sentence the defendant, 
rather than to judge guilt versus innocence in order 
to provide a more sensitive dependent measure. 

After sentencing had been completed, the experi- 
menter provided a second form, which asked sub- 
jects to recall who the defendant was and to rate 
the seriousness of the crime, In addition, the defend- 
ant was rated on a series of 9-point bipolar adjec- 
tive scales, including physically unattractive (1) to 
physically attractive (9), which constituted the 
check on the attractiveness manipulation. A post- 
experimental interview followed, during which sub- 
jects were debriefed. 


1 This information, as well as copies of the case 
accounts referred to below, can be obtained from the 
first author. 
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RESULTS AND DISCUSSION 


The physical attractiveness manipulation 
was successful: The attractive defendant re- 
ceived a mean rating of 7.53, while the mean 
for the unattractive defendant was 3.20, 
F (1, 108) = 184.29, p < .001. These ratings 
were not affected by the nature of the crime, 
nor was there an interaction. 

The criminal cases were designed so as to 
meet two requirements. First, the swindle 
was assumed to be attractiveness-related, 
while the burglary was intended to be attrac- 
tiveness-unrelated. No direct check on this 
assumption was made. However, indirect evi- 
dence is available: Since all subjects filled out 
the same forms, we obtained physical attrac- 
tiveness ratings from control condition sub- 
jects who were not presented with a photo- 
graph. These subjects attributed greater 
beauty to the defendant in the swindle condi- 
tion (X = 6.65) than in the burglary condi- 
tion (X = 5.65), F (1,108) = 4.93, p < .05. 
This finding offers some support for our con- 
tention that the swindle was viewed as at- 
tractiveness-related. Second, it was important 
that the two crimes be viewed as roughly 
comparable in seriousness. This was necessary 
to preclude alternative explanations in terms 
of differential seriousness. Subjects rated the 
seriousness of the crime on a 9-point scale 
extending from not at all serious (1) to ex- 
tremely serious (9). The resulting responses 
indicated that the second requirement was 
met: In the swindle condition the mean 
seriousness rating was 5.02; in the burglary 
condition it was 5.07 (F <1). 

; Table 1 presents the mean punishment as- 
signed to the defendant, by condition, Since 
a preliminary analysis demonstrated there 
were no differences in responses between males 
and females, subject sex was ignored as a 


TABLE 1 


MEAN SENTENCE ASSIGNED, IN YEARS 
(n = 20 per cell) 


Defendant condition 


Offense 
Attractive | Unattractive Control 
Swindle 5.45 
i 4.3. 
Burglary 2.80 $30 So 


= 
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variable. It can be seen that our hypothesis 
was supported: When the offense was attrac- 
tiveness-unrelated (burglary), the unattrac- 
tive defendant was more severely punished 
than the attractive defendant; however, when 
the offense was attractiveness-related (swin- 
dle), the attractive defendant was treated 
more harshly. The overall Attractiveness X 
Offense interaction was statistically signifi- 
cant, F (2,108) = 4.55, p <.025, and this 
interaction was significant, as well, when the 
control condition was excluded, F (1, 108) = 
7.02, p < .01. Simple comparisons revealed 
that the unattractive burglar received signifi- 
cantly more punishment than the attractive 
burglar, F (1,108) = 6.60, p < .025, while 
the difference in sentences assigned to the 
attractive and unattractive swindler was not 
statistically significant, F (1, 108) = 1.39. 
The attractive-swindle condition was com- 
pared with the unattractive-swindle and con- 
trol-swindle conditions also, F (1,108) = 
2.00, ns. Thus, strictly speaking, we cannot 
say that for the swindle attractiveness was a 
great liability; there was a tendency in this 
direction but the conservative conclusion is 
that when the crime is attractiveness-related, 
the advantages otherwise held by good-look- 
ing defendants are lost. 

Another feature of the data worth consider- 
ing is that the sentences administered in the 
control condition are almost identical to those 
assigned in the unattractive condition. It ap- 
pears that being unattractive did not produce 
discriminatory responses, per se. Rather, it 


‘seems that appearance had its effects through 


the attractive conditions: The beautiful burg- 
lar got off lightly, while the beautiful swindler 
paid somewhat, though not significantly, more. 
It can be recalled that in the unattractive 
conditions the stimulus person was seen as 
relatively unattractive and not merely average 
looking. Therefore, the absence of unattrac- 
tive-control condition differences does not 
seem to be the result of a weak manipulation 
in the unattractive conditions. 

Perhaps it is possible to derive a small bit 
of consolation from this outcome, if we specu- 
late that only the very attractive receive 
special (favorable or unfavorable) treatment, 
and that others are treated similarly. This is 
a less frightening conclusion than one which 


would indicate that unattractiveness brings 
about active discrimination. 

As indicated earlier, previous findings 
(Efran, 1974) that attractive offenders are 
treated leniently can be interpreted in a num- 
ber of ways. The results of the present experi- 
ment support the cognitive explanation we 
offered. The notion that good-looking people 
usually tend to be treated generously because 
they are seen as less dangerous and more 
virtuous remains tenable, The argument that 
physical attractiveness is a positive trait and 
therefore has a unidirectionally favorable 
effect on judgments of those who have it, 
would have led to accurate predictions in the 
burglary conditions, However, this position 
could not account for the observed interac- 
tion. The cognitive view makes precisely that 
prediction. 

Finally, we feel compelled to note that our 
laboratory situation is quite different from 
actual courtroom situations. Most important, 
perhaps, our subjects made decisions which 
had no consequences for the defendant, and 
they made those decisions by themselves, 
rather than arriving at judgments after dis- 
cussions with others exposed to the same in- 
formation. Since the courtroom is not an ap- 
propriate laboratory, it is unlikely that actual 
experimental tests in the real situation would 
ever be conducted. However, simulations con- 
stitute legitimate avenues for investigating 
person perception and interpersonal judgment, 
and there is no obvious reason to believe that 
these processes would not have the effects in 
trial proceedings that they do elsewhere. 

Whether a discussion with other jurors 
would affect judgment is an empirical, and 
researchable, question. Perhaps if even 1 of 
12 jurors notes that some irrelevant factor 
may be affecting the jury’s judgment, the 
others would see the light. Especially now 
when the prospect of reducing the size of 
juries is being entertained, it would be im- 
portant to find out whether extralegal consid- 
erations are more likely to have greater in- 
fluence as the number of jurors decreases. 
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Misattribution for Failure and Enhancement 
of Achievement Strivings 
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Male subjects (N = 200) classified according to their level of achievement 
needs received four failure trials at a digit-symbol substitution task. Prior to 
the failure, the subjects in an experimental condition received a placebo pill 
described as interfering with thir hand-eye coordination. Compared with sub- 
jects in a control condition, ascription of failure to the pill augmented the 
performance of subjects low in achievement needs, while it decreased the 
performance of subjects high in achievement needs. These findings were con- 
sistent across two studies. The relevance of these data to an attributional 
analysis of Expectancy X Value theory, achievement change programs, the 
relation between thought and action, and alleged attributional biases were 


briefly discussed. 


A number of research investigations have 
demonstrated that reactions to failure are 
dependent upon the level of achievement 
needs of the performer. Among persons high 
in achievement needs, motivation is aug- 
mented after failure and the intensity of 
performance increases. Conversely, among 
individuals low in achievement needs, moti- 
vation is inhibited after failure and the speed 
of performance decreases (see Weiner, 1970). 

These opposing reactions to failure have 
been: interpreted using concepts from attribu- 
tion theory (Weiner, 1974; Weiner, Frieze, 
Kukla, Reed, Rest, & Rosenbaum, 1971). 
Persons high versus low in achievement needs 
have disparate perceptions concerning the 
causes of their personal success and failure 
(Cohen, Reid, & Boothroyd, 1973; Kukla, 
1972; Weiner & Potepan, 1970). Contrasting 
causal attributions are presumed by Weiner 
et al. (1971) to produce differential reactions 
to success and failure. 

In the most extensive test to date of the 
hypothesized achievement needs — causal as- 
cription — behavior linkages, Meyer (1970; 
reported, in part, in Weiner, Heckhausen, 
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Meyer, & Cook, 1972) induced repeated fail- 
ure at a digit-symbol substitution task, Sub- 
jects classified according to their level of 
achievement needs made causal ascriptions 
for their failure after each trial. Further, the 
speed of their performance was assessed. 
Meyer found that subjects highly motivated 
to succeed ascribe failure to a lack of effort, 
while those low in achievement needs perceive 
low ability as responsible for their poor per- 
formance (also see Cohen et al., 1973; Kukla, 
1972; Weiner & Potepan, 1970). In addition, 
irrespective of motive classification, ascription 
of failure to a lack of effort was positively 
related to the speed of subsequent perform- 
ance, while attribution of failure to a lack of 
ability was inversely related to subsequent 
performance intensity. 

There is suggestive evidence, then, that 
causal ascriptions mediate the observed rela- 
tionships between high and low achievement 
needs and respective increments and decre- 
ments in performance following failure, The 
presumed associations are (a) high achieve- 
ment needs — ascription of failure to a lack of 
effort — increase in performance and (b) low 
achievement needs — ascription of failure to a 
lack of ability — decrease in performance, 

In the present investigations this attribu- 
tional analysis of achievement behavior was 
examined further by experimentally severing 
the relationships between achievement needs 
and causal ascriptions. The methodology was 
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guided by prior studies demonstrating that 
misattribution of internal arousal or overt 
behavior influences emotional expression and 
the tolerance for pain (Davison & Valins, 
1969; Nisbett & Schachter, 1966; Ross, 
Rodin, & Zimbardo, 1969; Schachter & 
Singer, 1962). A placebo pill described as 
interfering with hand-eye coordination was 
administered to experimental subjects prior to 
an induced series of failures at a digit-symbol 
substitution task. Failure among these sub- 
jects was therefore likely to be ascribed to the 
assumed action of the drug. Subjects in a 
control condition were not given an external 
“reason” for their failures. 

The hypothesized effects of the alterations 
in causal ascriptions were derived from an 
attributional interpretation of Expectancy X 
Value theory (see Weiner, 1972, 1974). First 
consider individuals low in achievement needs. 
Their bias to ascribe failure to low ability (an 
internal, stable attribute) produces low ex- 
pectancies of future success (“I am dumb and 
cannot succeed”) as well as arousing personal 
shame (see Weiner, 1972, 1974). Inducing 
ascription of failure to the pill also generates 
a low expectancy of future success, inasmuch 
as the effects of the pill are perceived as 
unchangeable and stable during the experi- 
mental time period. But it is less shameful to 
fail because of an experimenter-induced pill 
(an external cause) than to fail because of a 
low self-perception of ability. Because the 
aversive consequences of failure are lessened 
by the external pill attribution, persons low in 
achievement needs were hypothesized to per- 
form with greater intensity in the experi- 
mental than in the control condition (see 
Table 1), 

Now consider persons high in achievement 
needs. Their bias to ascribe failure to a lack 
of effort (an internal, unstable cause) results 
in a relatively high expectancy of success 
after failure (“If I try harder, I can suc- 
ceed”) as well as producing the feelings of 
shame that accompany personal blame. Con- 
versely, ascription of failure to the pill results 
in a low expectancy of future success and 
little negative affect because, as already indi- 
cated, the pill is perceived as a stable, external 
cause. It has been suggested that individuals 
highly motivated to achieve weight the ex- 
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pectancy of future success more heavily as a 
determinant of performance than they weight 
the aversive consequences of failure (see 
Weiner, 1970, p. 103). It was therefore an- 
ticipated that persons highly motivated to 
achieve would perform with greater intensity 
in the high expectancy of success — high nega- 
tive affect (control) condition than in the 
low expectancy of success—low negative 
affect (pill) condition (see Table 1). 

In sum, an interaction was hypothesized 
between the level of achievement needs and 
the experimental (causal) condition. Achieve- 
ment strivings among individuals low in 
achievement need were expected to be en- 
hanced by the pill attribution, while strivings 
among individuals high in achievement needs 
were anticipated to be dampened in the pill 
ascription condition. 


METHOD 


An initial experiment, followed by an exact at- 
tempt at replication, was conducted. Subjects in the 
first and second study were 118 and 112 male stu- 
dents, respectively, who were enrolled in an intro- 
ductory psychology course. They were tested in 
groups ranging in size from 2 to 7, with most sub- 
jects (79%) tested in groups of 3, 4, or 5. 

The subjects were first administered an objective 
test of achievement needs (Mehrabian, 1969). This 
measure consists of items primarily derived from 
Atkinson’s (1964) theory of achievement motivation. 
The test contains 26 single statements, with alterna- 
tive responses ranging from +3 (very strong agree- 
ment) to —3 (very strong disagreement). Typical 
items are (a) “My strongest feelings are aroused by 
a fear of failure rather than by a hope of success” 
and (b) “I prefer competitive situations in which I 
have superior ability to those in which everyone 
involved is equal in ability.” 

After the subjects completed the individual dif- 
ference measure, the misattribution manipulation was 
introduced. The experimenter said: 


In this experiment we are investigating the 
interfering effects of a drug on motor performance. 
One [two, three] of you will be randomly se- 
lected to take a pill that directly interferes with 
hand-eye coordination. The pill contains a small 
amount of 2-OH-gamma aminobutyric acid, a 
substance known to be an inhibitor of motor ac- 
tivity. All of you then will be tested on a task 
that involves hand-eye coordination skills, The 
performance of those who do not take the pill 
U Ten = k standard against which we can 
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TABLE 1 


ATTRIBUTIONAL SEQUENCE AND HYPOTHESIZED BEHAVIORAL CONSEQUENCES FOR SUB JECTS HIGH AND 
Low IN ACHIEVEMENT MOTIVATION IN THE CONTROL AND EXPERIMENTAL (PILL) CONDITIONS 


Motive classification and experimental condition 


Thought action sequence = 
Low-control 


| Low-experimental 


High-control High-experimental 


Failure Attribution Low ability 
Expectancy of Success Low 
Negative affect High 
Performance Low* 


Drug Low effort Drug 
Low High Low 
Low High Low 
High High Low 


a Indicates performance relative to same-motive subjects in the alternate condition. 


from the pill. The reduction in your hand-eye 
coordination will wear off before this hour is over. 
However, if you feel strongly that taking the pill 
is objectionable, you may refuse without any 
penalty and receive your experimental credit. The 
pill has been used in similar psychological experi- 
ments, and we can assure you that there are no 
lasting side effects. If you do not want to take 
the pill, please inform me now. 


The experimenter then gave each subject agreeing 
to participate in the pill condition a die to throw, 
stating that the one [two, three] subject(s) throwing 
the lower number(s) would be used in the compari- 
son group and would not take the pill, while the sub- 
ject(s) throwing the higher number(s) would take 
the pill. Each subject then rolled the die; the subjects 
were placed into the control or the experimental 
condition on the basis of this chance procedure. 

To enhance the credibility of the pill manipula- 
tion, the experimenter wore a tie and a white lab- 
oratory coat. On the table in front of the subjects 
was a bottle of appropriately labeled pills, a pitcher, 
and some cups. After the roll of the die, the experi- 
menter asked some questions concerning allergies 
and other medical problems and had subjects in the 
pill condition sign a consent form used in drug ex- 
periments. The appropriate subjects were then 
given the placebo pill. 

After the placebo was taken, the experimenter said: 


The reduction in your hand-eye coordination 
should begin about 3 to 5 minutes from now. You 
may or may not be aware of any physiological 
side effects, but your coordination will be affected 
for a period of about 20 minutes. During these 20 
minutes the effects of the pill will be constant. 
After that the effects will gradually wear off. 

While we wait for the pill to take effect, 1 am 
going to give all of you some practice trials at the 
task. Later you will be tested on this same task. 
It is a difficult digit-symbol substitution that 
requires you to match the appropriate symbols to 
a group of numbers. As you can see, the substitu- 
tion task involves your hand-eye coordination. 


“ Subjects then practiced at the substitution task 
for five 3-minute time periods. The digits were from 


1 to 6, and the symbols, 


A À Sal RN 


Highly similar symbols were used because it has 
been demonstrated that difficult substitution tasks 
are particularly sensitive to motivational factors 
(Schneider, 1971). The subject’s highest performance 
on any of the 30-second pretest trials determined 
the length of the substitution task that he received 
during the test or failure series, The task length se- 
lected had to result in failure, but we wanted to 
minimize the likelihood of the failure being ascribed 
to the exaggerated difficulty of the task. 

After the practice sequence, the subjects were re- 
moved from the group-testing room and seated in 
individual cubicles for the testing series. The cubi- 
cles, labeled clearly with “Pill” and “No Pill” signs, 
faced the larger group-testing room. All of the sub- 
jects were able to view the experimenter, but they 
could not communicate with or see one another. The 
subjects were told: 


For those of you who have taken the pill, you 
may or may not be experiencing any physiological 
effects. But the pill is taking its effect upon you 
now and will last for the next 20 minutes. Now 
you all will be given four separate task sheets, one 
each for four timed tests. When I say “Begin,” 
turn over the sheet I give you and work until I 
say “Stop.” To succeed, you must complete the 
entire sheet within the allotted time period. If not, 
you have failed at the test. 


The test time for each trial was 30 seconds. If the 
subject completed less than 25 substitutions in all 
of the five 30-second practice periods, he was given 
test sheets with 32 substitutions. If more than 25 
substitutions were completed on any practice trial, 
the test sheets contained 48 substitutions. Pretesting 
indicated that these were the minimal test lengths 
needed to ensure failure for all the subjects. The sub- 
jects were unaware that there were tests of unequal 
lengths. Following each test trial the experimenter 
said: 


If you have completed your test, write the word 
success on your paper; if you have not completed 
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it, write the word fail. If you have taken the pill, 
put a P next to your success or failure. 


It was hoped that these labels would make the 
failure outcome and the misattribution cue more 
salient. 

All subjects then failed for four trials, Following 
the last failure there was a debriefing period. In 
addition, a written explanation of the experiment 
was handed out. 

In Study 1, 15 subjects (12%) refused to take 
the pill and were not included in the die roll. These 
subjects were assigned to the control condition, and 
their data were excluded from the final analysis. Two 
subjects did not complete the individual difference 
questionnaire, and 1 did not correctly follow the 
‘digit substitution instructions. Excluding these 18 
subjects left a final N of 100. In Study 2, 12 sub- 
jects (119%) refused to take the pill and were not 
included in the die roll, leaving a final N of 100. 
These subjects were also assigned to the control 
condition, and their data were excluded from the 
final analysis.t 


RESULTS 


To ascertain if the two studies yielded dif- 
ferent results, an analysis of variance was 
performed with study as a factor. The analy- 
sis revealed that study did not appear as a 
main effect or in interaction with any of the 
investigated variables. Hence, all analyses 
presented here include the combined data 
from the two investigations. 

Subjects within each study were classified 
as high (m= 102) or low (m= 98) in 
achievement needs on the basis of their score 
on the achievement motivation scale. The 
mean performance scores (number of digit- 
symbol substitutions during the 30-second 
time period) for the final practice trial and 
the four test trials are given in Table 2.2 

Analysis of variance of the final pretest 
performance indicated no significant main 
effect for achievement motive classification 
(F <1) or experimental condition, F (1, 
192) = 3.78, p < .10, and no significant Mo- 
tive X Condition interaction (F <1). An 
index of the motivational effects of failure, 

1Jn the two studies, 15 of the subjects refusing to 
take the pill were high in achievement needs and 12 
were classified as low. The pretest and test behavior 
of these subjects did not differ appreciably. 

“Only the final pretest performance is reported. 
On earlier trials subjects irregularly paused to mem- 
orize the digit-symbol pairings, alter their methods 
of writing the symbols, etc., and appropriate in- 
tensity data were not available. 
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TABLE 2 
Number or Dicrr SUBSTITUTIONS 


Motive classification- | Pre- | Trial | Trial | Trial | 
experimental condition | test a 2 3 “hws 
igh-control (m = 56) | 18.77 | 21.84 24.27 | 26,09 
High pill Qr 5) 17.52 | 20.00 22.41 | 22.59 
Low-Ci 18.89 | 21.84 23.82 | 23.84 
Low-pill (» = 54) 17.35 | 19.81 22,67 | 23.80 


relatively unconfounded with individual dif- 
ferences in ability, was derived by using the 
number of symbol substitutions during the 
final pretest period as a covariate in an 
analysis of variance of the test scores (see 
Figure 1). Figure 1 reveals that subjects ex- 
hibit performance increments with additional 
practice at the difficult substitution task, 
F (3,576) = 122.31, p < .001. Further, there 
is significant Motive X Condition interaction, 
F (1,191) = 5.45, p < .05, as well as a Mo- 
tive X Condition X Trials interaction, F (3, 
576) = 9.86, p < .001. 

Examination of the data between experi- 
mental conditions and within a motive group 
reveals that, as predicted, persons highly mo- 
tivated to achieve success display greater 
increments in performance in the control than 
in the pill attribution condition. An analysis 
of variance (all analyses include the pretest 
as a covariate) indicates a main effect due to 
the experimental condition, F (1,99) = 6.69, 
p< .05, and a Condition x Trials interac- 
tion, F (3,300) = 4.93, p< .005. On the 
other hand, and again as predicted, the sub- 
jects low in achievement needs display greater 
increments in speed of performance in the 
pill attribution than in the control condition, 
The condition main effect is not significant 
(F <1), but there is a significant Condition 
X Trials interaction, F (3, 288) = 5.63, p< 
.001. 

Comparisons of the data between subjects 
differing in achievement needs but within an 
experimental condition reveal that in the con- 
trol condition, the performance of subjects 
high in achievement needs nonsignificantly 
exceeds the performance of subjects low in 
achievement needs, F (1, 97) = 2.57, p < AS, 
There also is a Motive x Trials interaction, 
F (3,294) = 9.39, p < 001. The direction of 
these results is consistent with prior findings 
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Ficure 1. Mean speed of performance (number of 
digit-symbol substitutions), adjusted for pretest per- 
formance, for the test trials as a function of level of 
achievement needs (high vs. low) and the experi- 
mental condition (control vs. pill attribution). 
(Abbreviations are as follows: Hi Ach = high 
achievement; Lo Ach = low achievement.) 


in the achievement literature (see Weiner, 
1972). Conversely, in the pill condition the 
performance of subjects low in achievement 
needs is nonsignificantly greater than that of 
the subjects highly motivated to succeed, 
F (1,97) =2.50, p< .15. There is also a 
significant Motive X Trials interaction, F (3, 
294) = 3.00, p < .05. 


Discussion 


The opportunity to attribute failure to a 
. pill influenced the reactions to failure, rela- 
tively decreasing the performance of subjects 
high in achievement needs while increasing 
the performance of persons low in achievement 
needs. These results were consistent across 
two studies. The data therefore support the 
position that achievement strivings are medi- 
ated by cognitions pertaining to the causes 
of success and failure. 

Most previous investigations in the achieve- 
ment area suggesting that causal cognitions 
mediate behavior have been correlational in 
nature. For example, in the experiment by 
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Meyer (1970) discussed earlier, attributions 
for failure as well as the performance speed 
of subjects were assessed. Therefore, the 
causal attributions could have merely corre- 
lated with, rather than actually determined, 
the behavior. The design of the Meyer study 
does not definitively permit a causal inference, 
But in the present experiment, the attribu- 
tions were manipulated prior to the behavior, 
and their influence on performance was es- 
tablished. In addition, the confirmation of the 
hypotheses provides support for a proposed 
attributional model of achievement behavior 
(see Weiner, 1972, 1974) and the attribu- 
tional elaboration of Expectancy X Value 
theory discussed briefly in the introduction. 


Implications for Achievement Change 
Programs 


One of the main goals of current achieve- 
ment change programs is to alter the causal 
ascriptions of the participants by teaching 
self-responsibility or the internal control of 
one’s actions (see deCharms, 1972; McClel- 
land & Winter, 1969). The focus upon causal 
attributions to enhance achievement strivings 
certainly is consistent with the general ap- 
proach of the present investigation and with 
the belief that changes in cognitive net- 
works are sufficient to produce changes in 
achievement strivings (see McClelland, 1965). 

However, the present training program pro- 
cedures may not be entirely satisfactory, 
since persons low in achievement needs gen- 
erally assume personal responsibility for fail- 
ure prior to the training period. Unfortu- 
nately, their tendency to ascribe failure to 
low ability impairs achievement strivings. A 
distinction must be made by the developers 
of change programs between fixed or stable 
internal causal factors, such as ability, and 
unstable or modifiable determinants, such as 
effort. Among the personal ascriptions, only 
those that are unstable and under volitional 
control are positively motivating, given a fail- 
ure experience. Thus, teaching that outcome is 
influenced by effort should facilitate achieve- 
ment motivation. In addition, as the data in 
this study indicate, causal ascriptions to ex- 
ternal factors, or the belief that one is not 
responsible for failure, may also augment 
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achievement behavior among selected indi- 
viduals. A similar point has been made by 
Lao (1970) and Sanger and Alker (1972), 
who reported that external ascriptions among 
blacks and females are positively related to 
attempts to change their environment. These 
notions can be readily incorporated into 
current achievement change programs. 


Implications for Attribution Theory 


In addition to the relevance of this study 
to motivational theory and achievement 
change programs, the present data have im- 
plications for two current issues within at- 
tribution theory. One point already alluded 
to concerns the relationship between thought 
and action. Some psychologists have sug- 
gested that behavior influences thought, 
rather than vice versa (see Bem, 1972), That 
is, a person infers his or her attitudes and 
subjective needs after observing his or her 
own actions. Given this position, thoughts do 
not determine action but rather are by- 
products of behavior. 

This position is incompatible with the vast 
amount of data in the achievement literature 
demonstrating that causal beliefs influence 
the magnitude of expectancy shifts as well 
as the affective reactions to success and fail- 
ure (see Weiner, 1972, 1974). Further, as 
demonstrated in this study, attributional 
change influences the intensity of behavior. 
Thus, causal beliefs precede and in part 
determine subsequent action. It certainly is 
true, as Bem (1972) has contended, that we 
can use our own behavior as a source of in- 
formation to make inferences about our atti- 
tudes and wants (e.g., “I ate a great deal; 
I must have been very hungry”). But surely 
the far more prevalent sequence is that action 
follows a motivated belief (“I really am 
hungry; I am going to eat a great deal”). 

A second unsettled issue for attribution 
theory pertinent to the present findings con- 
cerns the hypothesized attributional biases of 
an actor. Jones and Nisbett (1971) have 
contended that there is a tendency for an 
actor to attribute his behavior to external 
circumstances rather than to himself. While 
Jones and Nisbett have been careful to state 
that this is not an invariant law, a further 
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word of caution is warranted. The data in the 
achievement literature clearly reveal that suc- 
cess and failure are often ascribed by the actor 
to ability and/or effort, personal determinants 
of behavior (Weiner et al., 1971). Further, in 
a study prior to the Jones and Nisbett article, 
Storms and Nisbett (1970) provided sugges- 
tive evidence that insomnia is in part caused 
by the actor ascribing his wakefulness to his 
own neuroticism. Few psychologists would 
deny that individuals have self-concepts such 
as “neurotic,” “dumb,” “industrious,” and so 
forth. These self-perceptions have motiva- 
tional significance and are available to be 
invoked by an individual to explain his or 
her behavior. The instances in which an actor 
makes self-ascriptions may be as prevalent as 
the hypothesized bias toward situational at- 
tributions. Further, self-ascriptions are likely 
to be of greater clinical importance than ex- 
ternal attributions. Internal ascriptions mag- 
nify affect, and as Freud’s (1915/1957) 
internal stimuli, they may be perceived by 
the actor as causes “from which there is no 
flight” (p. 118). 
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A Pluralistic Explanation of Choice Shifts on the Risk 
Dimension 


Jim Blascovich, Gerald P. Ginsburg, and Tracy L. Veach 
University of Nevada, Reno 


The study of group influences on risk taking has been overly dependent on a 
single laboratory instrument, the Choice Dilemmas Questionnaire. A gambling 
task, blackjack, which avoided various methodological and analytic problems, 
was employed to reexamine the group shift to risk phenomenon as well as to 
investigate factors overlooked in the past. The results demonstrated group- 
produced shifts to risk. A moderate increase over time in the level of risk 
taking for individuals also was found, and the effect of outcomes on indi- 
vidual and group risk taking was assessed. Contrary to previous work, no 
differences were found between groups with discussion and groups without 
discussion. A two-fold process was postulated to explain the findings, involv- 
ing both intraindividual and interindividual mechanisms. These mechanisms 


entail both personal and social values. 


Most investigations of choice shifts involv- 
ing risk taking have used the same type of 
criterion task, the Choice Dilemmas Ques- 
tionnaire (CDQ). The use of other types of 
tasks has been relatively rare. This overreli- 
ance on a single type of laboratory instrument 
has limited efforts to determine what proc- 
esses may be involved in choice shifts, espe- 
cially risky shifts, as we have discussed else- 
where (Blascovich, Veach, & Ginsburg, 1973). 
Each CDQ item involves a recommendation 
to a hypothetical individual rather than per- 
sonal risks taken by the subjects. What, if 
anything, is at stake for subjects in CDQ 
studies has not been specified. 

Gambling would appear to be a more clear- 
cut case of risk-taking behavior, and it is sur- 
prising that a greater number of risky-shift 
studies have not used gambling tasks. This 
probably is due in part to the ambiguity of 
the results of the few studies that have em- 
ployed so-called gambling tasks (Pruitt, 
1971), while results from studies using the 
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popular CDQ have been more relia 
tainly, at face value, gambling ta 

appear to meet conceptual criteria f 

tisk taking better than the CDQ. 

However, studies that have empl 
bling tasks in risky-shift research h 
weaknesses. One type of gambling t: 
used by Pruitt and Teger (1969). $ 
alone and in groups made six choice 
from a booklet containing several 
bets. Unfortunately, no outcomes we 
vided during the experiment, and 
knew this in advance. Subjects were told that 
only one of their six choices (bets) would be 
played off on a roulette type device afier th 
experiment. Thus, for all but one } 
choices, no outcomes were possible. It is d 
cult to conceive of a known no-outcome ta 
as involving gambling or risk. 

Studies by Zajonc et al. (e.g., 
Wolosin, Wolosin, & Sherman, 196: 
ployed a light guessing task. A choi: 
made by subjects individually and the» in 
groups concerning which of two lights would 
be lit after their choice. One light had a rela- 
tively high probability of being lit (.6 or 
and the other had a relatively low proba’ 
(4 or .2). A correct guess of the low-p 
bility light was followed by a larger mon 
payoff than a correct guess of the high-p 
bility light. While this appears conceptually 
adequate, it did lack mundane realism. 
Furthermore, subjects did not actually risk 
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losing anything; they merely stood a chance 
of winning more, winning less, or not winning 
anything in excess of their guaranteed wages 
or experimental credits for required participa- 
tion. It is difficult to specify a comparable 
real-world situation for possible field study 
and generalization. 

A recent and interesting field study by Mc- 
Cauley, Stitt, Woods, and Lipton (1973) in- 
vestigated differences between individual and 
group race track betting, Individual risk was 
assessed by finding out what horse a bettor 
had bet on at the two-dollar window. The 
criterion measure was the pari-mutuel odds 
for winning on the horse and not the amount 
of money wagered. Thus, high odds indicated 
greater risk than low odds. Bettors then were 
approached and told that if they agreed to 
participate in a psychology experiment they 
would be given another two dollars to bet. 
Some were told that they would have to reach 
consensus with other bettors about what horse 
to bet on, while others were assigned to a 
control condition in which they were asked to 
make another individual bet. Risk again was 
determined by the pari-mutuel odds on the 
selected horse. 

McCauley et al. found a conservative shift, 
That shift may be typical in a race track 
setting, but sufficient questions can be raised 
about the study design to suggest caution in 
interpreting the results. For example, the in- 
tervention between the individual and group 
sessions may have introduced artifacts such 
as evaluation apprehension or demand char- 
acteristics into the study. In addition, such 
artifacts would have interacted with the 
treatments since individual bets were placed 
before bettors were approached by the ex- 
perimenter. Second, the horses bet on in the 
individual session and the group session ran 
in the same race. Thus, odds for the two were 
not independent. Third, the conservative re- 
sults may be due to self-selection of subjects 
since two dollars is the most conservative 
wager at a race track. Finally, the authors do 
not report the size of shifts but only the 
direction of shifts. Thus, the substantiveness 
of the results may be open to question. 

Recently, Blascovich et al. (1973) used 
blackjack as the criterion task in risky-shift 
research. Blackjack (“21”) is a popular card 
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game commonly played by individuals alone 
against the “house” or in groups of varying 
sizes. In the blackjack paradigm, subjects 
played in a highly realistic setting according 
to regular casino rules, although subjects did 
not have any of their own money at stake. 
Subjects used gaming chips in $.05, $.10, and 
$.25 denominations. The limits for each hand 
were $.05 to $1.00. 

Since the blackjack paradigm lends itself 
to straightforward analysis without the use of 
difference scores, and since changes in risk- 
taking (betting) levels are observed over 
time, variation due to a plurality of processes 
can be distinguished. For example, in their 
exploratory study Blascovich et al. were able 
to differentiate both an individual and a group 
increase in risk taking when individuals first 
played several hands of blackjack alone and 
then in group play. No differences due to 
group discussion were found, a notion that is 
contrary to what has been taken by some for 
fact (Cartwright, 1971; Pruitt, 1971). 

The results of the exploratory blackjack 
study suggested several important ideas for 
further research. First, the results had to be 
replicated. This was especially important be- 
cause some of the findings contradicted cur- 
rently popular ideas, for example, the demon- 
stration of an individual effect and of no dif- 
ferences between groups with discussion and 
groups without discussion in either risk levels 
or changes in risk levels. Second, an improve- 
ment was necessary in the operational dis- 
tinction between groups with and without dis- 
cussion. In the exploratory study, members 
of groups with discussion had to reach con- 
sensus about the amount to be bet on the 
ensuing hand; but each person then played 
his own hand and received his own outcome, 
The lack of a discussion-no discussion differ- 
ence may have been due to an overriding out- 
come effect, since Blascovich et al. did find 
that bets were influenced by prior outcomes 
(although shifts in bet levels were not). 

The role of discussion is central in evaluat- 
ing the two explanations of risky shift judged 
most tenable: social comparison versus rele- 
vant arguments (Pruitt, 1971). The findings 
of Blascovich et al. suggest that the relevant 
arguments explanation is less tenable, but a 
more unambiguous conceptual and operational 


424 


distinction between groups with and without 
discussion was in order. In the study reported 
below, this was accomplished by requiring 
members of the discussion groups to decide on 
a bet and then to play the ensuing, single 
hand as a group. 

Third, it appeared evident that the use of 
the blackjack design, based on a priori pre- 
dictions and incorporating a more clear-cut 
operationalization of group with discussion, 
would provide a strong basis for an inclusive 
theoretical statement. More specifically, the 
inclusion of both individual and group proc- 
esses within a single theoretical explanation 
seemed likely. 

Based on a review of prior work and the 
results of the exploratory study reported 
above (Blascovich et al.), several hypotheses 
were generated. The first hypothesis requires 
a replication of the basic phenomenon itself, 
while the second hypothesis reflects an at- 
tempt to test directly the necessity for task- 
oriented discussion in producing the shift 
toward risk. The third and fourth hypothe- 
ses represent attempts to extend our knowl- 
edge of the dynamics of the risky shift phe- 
nomenon, 

1. Higher levels of risk are undertaken by 
groups of persons following individual risk 
taking. 

2. The level of risk taken by members of 
groups is not influenced by group discussion. 

3. Individuals increase their risk levels over 
time and then level off. 

4. Positive and negative outcomes have 
differential effects on both individual and 
group risk-taking levels but not on the mag- 
nitude of group shifts to risk. 


METHOD 
Design 


A split-plot factorial design (Kirk, 1968) incor- 
porating three fixed treatments with repeated mea- 
sures on two of them was employed. Treatment A 
represented different types of group situations: ar— 
no group influence, a-—groups with discussion to 
consensus, and as—groups without discussion to con- 
sensus, Treatment B was the session factor. The two 
levels of Treatment B were: bi—first session of 20 
hands and b:—second session of 20 hands. Treatment 
C was a blocks factor. The four levels of C were: 
C—first quarter (first 5 hands), c-—second quarter 
(second 5 hands), cs—third quarter (third 5 hands) 
and c:—fourth quarter (fourth 5 hands). Groups 
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were nested in each level of A. Eight three-man 
groups were randomly assigned to each level of A, 
although for a: the individuals in each “group” were 
classified as such for purposes of analysis only. 


Setting 


The research was carried out in several experi- 
mental rooms. Among them were waiting rooms, an 
instruction room, an observation room, and a casino. 
The casino was furnished with a blackjack table, 
stools, chairs, tables, and slot machines. For the 
blackjack game there were playing cards, gaming 
chips in 5-, 10-, and 25-cent denominations, and a 
blackjack dealer. Observers recorded bets and out- 
comes from the observation room through one-way 
mirrors, which are also present and highly visible 
in real casinos. 


Subjects 


Seventy-two male subjects who fulfilled part of 
their course requirements by participation were re- 
cruited from introductory psychology sections. Sub- 
jects were screened by means of a preliminary black- 
jack test, to ensure that each knew how to play 
blackjack. Subjects were randomly assigned to ex- 
perimental conditions so that there were eight three- 
man groups in both group conditions and 24 sub- 
jects in the no-group condition. The latter subjects 
were randomly arranged into eight groups for sta- 
tistical purposes. 


Procedure 


Pretest. Upon arrival subjects were randomly 
assigned a “player” number (1, 2, or 3) and pro- 
ceeded with the rest of the experiment in the order 
prescribed by these numbers, Subjects were brought 
to individual waiting rooms. One by one, and in 
order, subjects in the group conditions were brought 
to the instruction room where they were read the 
instructions below. Upon arrival, subjects in the no- 
group condition (a;) were assigned to a waiting 
room and were subsequently taken to the instruction 
room where they also were read the following in- 
structions: 


In a few moments you will be taken to a room 
which is sort of a small gambling casino. You 
will be instructed to play blackjack and will start 
as soon as you have received your chips and the 
dealer has shuffled the deck. The rules are about 
the same as you will find in a casino. You will be 
initially staked with five dollars in gaming chips: 
10 purple ones at 5 cents each; 10 yellow ones at 
10 cents each; and 14 red and gray ones at 25 
cents each. Please play as though you are playing 
with your own money. Try to win as much as you 
can. After a certain number of hands the game 
will stop. Blackjacks will pay 3 to 2. You may 
double down on any two cards and split any 
pair. The limits for each hand will be $.05 to 
$1.00. You do not have to keep silent when you 
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are playing. You may talk to the dealer if you 
wish. Do you have any questions? 


After the subject had listened to the instruct’ons 
and the instructor had answered any questions, the 
subject was brought to the casino where he played 
20 hands of blackjack alone against the house. After 
completing 20 hands, the subject was returned to his 
waiting room. After all three subjects in the group 
conditions with or without discussion, or after each 
subject in the no-group condition played individually 
and after an additional waiting period of 10 min- 
utes, the experiment proceeded according to the 
procedure for one of the following conditions. 

No-group condition. After the 10-minute waiting 
period, the subject was brought back to the instruc- 
tion room where he was read instructions essentially 
the same as those which were read to him before 
the first session. He then was taken back to the 
casino where he played 20 more hands of black- 
jack, again alone against the house. Afterwards, the 
subject was interviewed and debriefed. 

Group-with-discussion condition. After the wait- 
ing period all three subjects were brought into the 
instruction room together and were given the fol- 
lowing instructions: 


Now you are going to be taken as a group to the 
casino where you will play blackjack in a group. 
The stakes, rules, limits and instructions are the 
same as when you played alone. Only now you 
will be playing the same hand as a group. You 
will be staked with five dollars in gaming chips 
for the entire group. Before each hand you must, 
as a group, decide how much to bet. During each 
hand you must, as a group, decide how to play 
the hand. Thus, there will be one hand for the 
entire group at a time. Again, play as though you 
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are playing with your own money and try to win 

as much as you can. Do you have any questions? 

Group-without-discussion condition. After the 
waiting period all three subjects were brought into 
the instructional room together and were read the 
following instructions: 


Now you are going to be taken as a group to 
the casino where you will again play blackjack. 
The stakes, rules, limits and instructions are the 
same as when you played before. Only now you 
will all be playing at the same time. Again play 
as though you are playing with your own money. 
Try to win as much as you can, Do you have 
any questions? 


Following the instructions, groups in the discus- 
sion and no-discussion conditions returned to the 
casino and played 20 hands of blackjack, side by side 
in accord with their instructions, 


RESULTS 


The major analysis, as noted earlier, was a 
3 X 2 X 4 split-plot factorial analysis of vari- 
ance (Kirk, 1968) with repeated measures on 
the last two factors (all factors fixed, sub- 
jects random). The data for the analysis con- 
sisted of mean bets. A mean was calculated 
for each block of five hands within a session, 
separately for each group of three persons. 
The overall means are represented graphically 
in Figure 1. 

The main effects of sessions (B) and quar- 
ters (C) were significant, F (1,21) = 12,49, 
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Ficure 1. Mean risk levels (in cents). 


426 


p < .005 and F (3,63) = 22.02, p < .001, re- 
spectively. Additional a priori analysis (based 
on Hypothesis 3 above) of simple-simple 
effects revealed no C effect for the no-group 
condition (individual-to-individual play) dur- 
ing the second session, F (3, 63) = 2.29, ns. 
However, there was a C effect during the sec- 
ond session for both the group with discussion, 
F (3,63) = 36.43, p < .001, and the group 
without discussion, F (3,63) = 11.25, p< 
.001. These simple-simple effects supported 
Hypothesis 3. In addition, during the last 
block of five hands in the second session 
(BCg4), level Ay (no-group) was significantly 
lower than’Ay (group with discussion) and 
As (group without discussion), ¢ (14) = 1.84, 
p<.05 and ¢ (14) = 1.51, p < .08, respec- 
tively. 

No differences at any combination of fac- 
tors B and C were found between As (groups 
with discussion) and Ag (groups without dis- 
cussion). Hypothesis 2 was directly assessed 
by comparing the means of ABso and ABgo 
(i.e., group with discussion and group without 
discussion, within the second session only, 
averaged over all 20 hands in that session). 


Group with Discussion 
Win 
Loss 

| Group without Discussio 
Wi 


n 
al Loss @——_@ 
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Ficure 2. Mean risk levels following positive and 
negative outcomes. 
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The difference between the means (2.57) did 
not reach even a very loose level of signifi- 
cance, é = .036, .25 < p < .40, with an esti- 
mated power for that test greater than .99.* 

A second analysis was performed on a sub- 
set of the data to assess the effects of wins 
and losses on the size of bets, taking into ac- 
count the presence or absence of group discus- 
sion and comparing individual and group ses- 
sions. The analysis was a 2 X 2 X 2 factorial 
analysis of variance with repeated measures 
on the last two factors. The first factor (A) 
was type of group (with discussion or with- 
out discussion). The second factor (B) was 
session of play (first or second). The third 
factor (C) was type of outcome (win or loss). 
Again there were eight three-man groups of 
each level of A. The data consisted of mean 
bets following wins, obtained by averaging 
bets following wins over individuals in each 
group for each session, and mean bets fol- 
lowing losses, obtained by averaging bets fol- 
lowing losses over individuals in each group 
for each session. The overall means are repre- 
sented graphically in Figure 2. 

The main effects of sessions (B) and out- 
comes (C) were significant, F (1,14) = 8.67, 
p< .02; and F (1,14) = 8.75, p < .02, re- 
spectively. Thus, subjects in both group con- 
ditions bet more in the second or group ses- 
sion, both after wins and after losses. Both 
individuals and groups bet more following 
wins than they did following losses. Again, 
there were no effects at any combination of 
Factors B and C due to type of group (Fac- 
tor A). 


Discussion 


The overall analysis of bets shows that 
greater risks were taken in the group sessions 
than in the individual sessions. This confirms 
Hypothesis 1. In addition, both individuals 
and groups took greater risks in the latter 
part of their play. This confirms Hypothesis 


1 Assessment of Hypothesis 2 required a high de- 
gree of power so that we could reject the hypothesis 
if there was any reasonable doubt that the difference 
was zero. The procedure for testing was based on 
Suggestions by Kirk (1968), Cohen (1969), and 
Bailey (1971) and yielded a t= .36 (clearly ns) for 
which power greater than .99 at nominal a= 10, 
df = 1, 21. 
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3. It appears that exposure to the task situa- 
tion has an elevating effect on individual risk 
taking. However, this effect is limited: The 
increase in risk taking which occurred in the 
first session leveled off during the second ses- 
sion in the no-group condition, replicating our 
earlier finding (Blascovich et al., 1973). Thus, 
there is evidence that an individual temporal 
effect does occur, but the concomitant in- 
crease in individual risk taking reaches its 
maximum quickly and levels off. However, 
when individuals play blackjack in a group 
session after having played in an individual 
session, the level of risk taking does not level 
off in the group session but instead increases 
dramatically (see Figure 1). 

The increase in group risk taking shown in 
the present study is not likely to be due to 
an increase in rationality on the part of indi- 
viduals ‘in groups compared to individuals 
playing alone since individuals in groups did 
not win any more than individuals playing a 
second set of 20 hands of blackjack alone. 
“Groups” of subjects in all three experimental 
conditions (group-with-discussion, group-with- 
out-discussion, and no-group conditions) av- 
eraged a loss during the second session 
(—$.87, —$.76 and —$1.08, respectively). 
An analysis of variance showed these to be 
nonsignificant even at a greatly expanded 
significance level, F (2,21) = .02. 

One of the most important results of this 
research is the absence of any effect due to 
group discussion in the second session. Thus, 
Hypothesis 2 was supported. Both groups 
with discussion and groups without discussion 
produced equivalent increases in risk taking. 
There were no differences in levels of risk 
taking between the two types of groups. Thus, 
no additional increase or decrease in risk 
taking can be attributed to the presence or 
absence of group discussion; and differences 
found previously between discussion and no- 
discussioh group conditions may have been 
artifacts of the instrument usually employed, 
that is, the CDQ. Moreover, the present evi- 
dence undermines the prevalent belief that 
group discussion, or exposure to it, is neces- 
sary to produce reliable group shifts to tisk. 
The absence in the present study of a differ- 
ence between the groups with and without 
discussion suggests that both types of group 


effects may be caused by the same or similar 
processes. 

The analysis of outcomes indicates that the 
immediate effect of a positive outcome, a win, 
is an increase in the amount bet, and the im- 
mediate effect of a loss is a decrease in the 
amount bet. These findings occur both for 
individuals and groups. Further, when out- 
comes are taken into account, a group shift to 
risk still occurs whether one considers only 
the bets following positive outcomes or only 
the bets following negative outcomes. These 
findings lend weight to the contention*that 
the group shift to risk phenomenon is a strong 
effect and add credence to the generalizability 
of the phenomenon. Once again, no differ- 
ences were found between groups with dis- 
cussion and groups without discussion. Thus 
Hypothesis 4 was supported. 

Results from the present study show that 
the risky-shift phenomenon is strong and de- 
pendable and that an adequate explanation of 
the phenomenon must account for the opera- 
tion of at least a duality of processes. That is, 
an individual process and some type of group 
process appear to contribute to the increased 
riskiness demonstrated here. Moreover, con- 
sideration must be given to the fact that there 
are no differences between groups with discus- 
sion and groups without discussion. Finally, 
the effects of outcomes on risk taking should 
be considered. 

A two-fold theory including both an intra- 
individual and an interindividual process can 
be offered to explain these results. The intra- 
individual process involves a form of familiar- 
ization, although not of the type described by 
Bateson (1966), in which a person develops 
enough self-justifying reasons to enable him- 
self to make a riskier decision. Rather, the 
familiarization found in the present study 
may involve “seeing how the cards are fall- 
ing,” that is, the individual must experience 
outcomes in order to establish subjective 
probabilities of alternative outcomes. Why an 
individual gradually increases his level of risk 
taking may be due, at least in part, to the 
tendency of individuals to overestimate the 
probability of wins or successful outcomes. 
This is a type of gambler’s fallacy. It also 
has been called an “optimism” effect (Kidd 
& Morgan, 1969) and has been demonstrated 


428 


in studies of decision making (Kidd, 1970), 
subjective probability (De Finetti, 1970), and 
gambling (Hochauer, 1970). Clearly, a ten- 
dency on the part of individuals to overesti- 
mate the probability of successful outcomes 
could lead to an increase in risk taking, es- 
pecially on a risk task in which the objective 
probabilities of winning and losing are about 
equal.? 

In addition to the intraindividual process 
suggested above, an interindividual process 
must also be specified, since individual in- 
creases in risk taking level off while increases 
in group play do not. Since no differences 
were found between groups with discussion 
and groups without discussion, the interindi- 
vidual process is likely to involve some type of 
social comparison rather than persuasive in- 
formation exchange. In a group setting as a 
risk task proceeds and decisions are made, a 
norm concerning socially acceptable risk levels 
probably emerges. The socially acceptable 
risk level is likely to be higher than most 
group members had individually assumed, for 
reasons that Brown (1965) has elaborated in 
his discussion of cultural values favoring risk. 
That there is a widespread cultural value 
favoring riskiness, as Jellison and Riskind 
have suggested (1970) and demonstrated 
(Jellison, Riskind, & Broll, 1972), suggests 
that riskiness may be positively associated by 
individuals with ability or skill, with being 
able to look upon oneself as successful. In 
fact, during the postexperimental interview of 
the present study, 87% of the subjects said 
there was something at stake during the black- 
jack game, and 90% of those saying so stated 
it was skill, pride, or winning. Thus, risk may 
be symbolic of ability. 

Group discussion is not necessary for the 


2In casino blackjack, no one, including the house, 
wins ties. Therefore, the probability of a player 
winning any given hand is about .47 so that the 
probabilities of winning and losing are about equal. 
(This probability value was empirically determined 
from outcomes in this study, 1102 wins, 1247 losses, 
and 211 ties.) Zajonc et al. (1969) maintain that 
these probabilities produce a conservative shift while 
extreme probabilities associated with dichotomous 
outcomes (.8 and .2) produce a risky shift. The 
present study shows the former not to be the case. 
However, the probabilities of winning and losing in 
the present study were closer (.47 and .53) than 
those in the Zajonc et al. study (.6 and .4). 
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operation of the interindividual social com- 
parison process. Only a minimal type of inter- 
action appears necessary for the group shift to 
risk to occur, that is, information about one 
another’s risk preferences must be available. 
In the gambling situation this information 
can be communicated verbally or nonverbally, 
whether individuals are engaged in risk-taking 
behavior merely in one another’s presence or 
acting as a group with discussion. In more 
complicated situations, such as those sug- 
gested by the Choice Dilemmas, in which 
decisions almost certainly are based on more 
than just risk preferences, group discussion 
may be necessary because the relevant norms 
and values cannot be transmitted nonverbally. 
Thus, group discussion may be necessary with 
choice dilemmas problems but only because 
preferences of group members on a wide vari- 
ety of dimensions must be made known for 
appropriate norms to emerge. Since it is un- 
likely that this can happen nonverbally, stud- 
ies which show larger shifts when there is 
group discussion should not be taken as proof 
of the “relevant arguments” position. 

The theory presented above also can ex- 
plain shifts toward caution. While the major- 
ity of group-produced shifts are toward greater 
risk on a task such as blackjack, some groups 
certainly can and do shift in the opposite di- 
rection. Like the shift to risk, the shift to cau- 
tion might also involve both intraindividual 
and interindividual processes. If for some rea- 
son and despite influences to the contrary an 
individual were to underestimate the proba- 
bility of wins or successful outcomes one 
would expect the individual to decrease his 
level of risk taking over time in an “alone” 
condition. In a group, a majority of such indi- 
viduals might establish a low emergent norm, 
inducing the remaining group members to 
decrease their risk-taking levels. 

In summary, this interpretation of the shift 
in risk which occurs when one moves from an 
individual risk-taking situation to a group 
risk-taking situation is a two-process inter- 
pretation. The intraindividual process involves 
the subjective probabilities of positive and 
negative outcomes to the individual. The 
interindividual process involves a widespread 
cultural value favoring risk and a value for 
social conformity to it, as well as social com- 
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parison in the service of these values. With 
the present gambling task, the values involved 
have led to the increases in risk taking which 
were demonstrated. However, in other risk 
situations these processes may lead to de- 
creases in risk taking. What is most impor- 
tant is the transmission of information regard- 
ing socially acceptable risk levels. Interper- 
sonal comparison usually is of minimal neces- 
sity for such transmission, while in multi- 
valued tasks persuasive arguments may well 
be sufficient. 
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The present research compared the relative effectiveness of an attribu 


Strategy with a persuasion strategy in 
to teach fifth graders not to litter and 
group was repeatedly told that 
group was repeatedly told that 


+ espite the volume of research on attitude 
change and Persuasion, there is surprisingly 
little evidence that Persuasion can be effective, 
particularly if a criterion of persistence of 
change over time is applied (Festinger, 1964; 
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Greenwald, 1965b; Rokeach, | 
bardo & Ebbesen, 1969; Cook 
failure of persuasive efforts to | 
change may be taken as evid. 
jects have not integrated the no 
into their own belief systems ( ! 
or taken it as the basis for m: 
bution about themselves (Kell: 
might expect that a persuasive c 


ing change. This research was desiened t 

the relative importance of attribution manip- 
ulations to persuasive attempts by comparing 
a normal persuasion treatment with an attri- 
bution treatment. 


The persuasion conditions o! present 
research were designed to be mi effec. 
tive through their use of a varic y of teche 


niques which have been found to be helpful, 
at least on occasion, in past research. Past 
research has shown that an optimal persuasive 
manipulation should involve a high-credibility 
source (Hovland & Weiss, 1951) delivering a 
repeated message (Staats & Staats, 1958) 
with an explicitly stated conclusion (Hov- 
land & Mandell, 1952) which is supported by 
arguments pointing out the benefits of change 
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(Greenwald, 1965a) and overlearned by the 
audience (Cook & Wadsworth, 1972), Face- 
to-face communication by the source (Jecker, 
Maccoby, Breitrose, & Rose, 1964), reinstate- 
ment of the source at the time of attitude 
assessment (Kelman & Hovland, 1953), and 
active role playing or participation by the 
audience in the message (Janis & King, 1954) 
are also helpful, 

The attribution techniques were also de- 
signed to be maximally effective through their 
use of all three factors speciñed by Kelley 
(1967) as conducive to making a stable attri- 
bution: consistency of the evidence over time, 
consistency of the evidence over modalities, 
and consistency or consensus across sources. 


Srupy 1: LITTERING BEHAVIOR 


Study 1 attempted to modify children’s lit- 
tering behavior. Behavior was monitored be- 
fore and after treatment and again after a 
2-week period of nontreatment, It was hy- 
pothesized that both the attribution and the 
persuasion conditions would result in initial 
posttreatment behavioral change but that the 
attribution condition would show greater per- 
sistence as a result of altering the basic self- 
concept of the subjects in a direction incon- 
sistent with littering. 


Method 

Participants 

The research took place in three fifth-grade chass- 
rooms in an inner-city Chicago public school. Two 
fifth-grade classrooms were randomly amigned to the 
experimental conditions, while a third was designated 
a control group. Three female experimenters, all 
undergraduate psychology majors at Northwestern 
University, were randomly assigned to a different 
classroom for each test. 


Experimental Manipulations 

‘There were a total of 8 days of attribution and 
persuasion treatments dealing with littering, with 
discussion intended to average about 45 minutes per 
day. 
‘Attribution condition. On Day 1, the teacher com- 
mended the class for being ecology minded and not 
throwing candy wrappers on the auditorium floor 
during that day's school asembly, Abo on Day 1, 
the teacher passed on a comment ostensibly made 
by the janitor that their class was one of the cleanest 
in the building. On Day 2, after a visiting claw had 
left the classroom, the teacher commented that paper 
had been left on the floor but pointed out that 
“our class is ckan and would not do that.” The 
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students at this point disagreed pointedly and re- 
marked that they would and did indeed litter. On 
Day 3, one student picked up some paper discarded 
on the floor by another and after disposing it in the 
wastebasket was commended by the teacher for her 
ecology consciousness. On Day 4, Row 1 was pointed 
out as being the exceptionally neat row in the room 
by the teacher, Abo on Day 4, the principal visited 
the class and commented briefly on how orderly it 
appeared. After the principal left the room, the 
students castigated the teacher for her desk being 
the only messy one in the room, On Day 5, a large 
poster of a Peanuts character saying “We are An- 
dersen's Litter-Consclous Class" was pinned to the 
class bulletin board. Abo on Day 5, the teacher gave 
a lesson on ecology and talked about what we “the 
class” are doing to help. On Day 6, the principal 
sent the following letter to the class; “As I talked 
to your teacher, I could not help but notice how 
very clean and orderly your room appeared, A young 
lady near the teacher's desk was seen picking up 
around her desk. It is quite evident that cach of 
you are very careful In your section.” On Day 7, 
the teacher talked about why “our class" was so 
much neater. In the interchange the students made a 
number of positive self-attributions concerning It- 
tering. On Day 8, the janitors washed the floor and 
ostensibly left a note on the blackboard saying that 
it was easy to clean, 

Persuasion condition, On Day 1 during a field trip, 
the children were told about ecology, the dangers of 
pollution, and the contribution of littering to pollu» 
tion. They were then asked to role play being a 
trash collector and to pick up litter as they came 
across it. On Day 2, inside the school lunchroom 
the teacher talked about garbage left by students 
and gave reasons why it should be thrown away: it 
looked terrible, drew flies, and was a danger to 
health, On Day 5, the teacher gave a lecture on 
ecology, pollution, and litter and discussed with the 
class how the situation could be improved, Abo on 
Day 3, the teacher passed on a comment ostensibly 
from the school janitor that they needed help from 
the students in keeping the floors clean, implying 
here as elsewhere that nonlittering would lead to 
approval and commendation by various adult aw 
thorities. On Day 4, the teacher told the students 
that everyone should be neat, mentioning aesthetics 
among other reasons for neatness, Alto on Day 4, 
the principal visited the class and commented 
briefly about the need for clean and tidy classrooms. 
On Day 5, the teacher told the studenta that they 
should not throw candy wrappers on the floor or 
the playground but should dispose of them in trash 
cans. Abo on Day 5, a large poster of a Peanuts 
character saying “Don't be a Htterbug” with “He 
neat” and “Don't litter” bordering it was pinned to 
the class bulletin board. On Day 6, the principal 
sent the following letter to the class: “As I talked 
to your teacher, I could not help but notice that 
your room was in need of some chaning. It is very 
important that we be neat and orderly In the up- 
keep of our school and clastrooms. I bope each of 
you in your section will be very careful about litter” 
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On Day 7, the teacher appointed several children in 
each row to watch and see if people were neat out- 
side the building as well as in the classroom. On 
Day 8, a note was left on the board ostensibly from 
the janitors to remind the children to pick up papers 
off the floor, 


Measurement of Littering 


Pretest. To discover any existing differences 
among the three classrooms with respect to their 
tendency to litter, a specially marked reading assign- 
ment which had previously been turned in to the 
teachers was returned to the students § minutes be- 
fore the end of the school day, The students were 
then instructed to throw the assignment away after 
the bell rang for dismissal, After school the experi- 
menters counted the number of assignments thrown 
sket versus left on the floor or on the 


in the wastel 

shelf under the students’ seats. Less than 20% of 
the students in each class disposed of their assign- 
ments in the wastebaskets, The precise percentages 
were 20% for tl vtrol group (m = 31), 16% for 
the persuasior 26), and 18% for the at- 


tribution group (7 

Posttest. A two iavioral test was designed 
to tap the two aspect he ecology~littering prob- 
lem, nonlittering and cleaning up the litter of others. 

On the morning of the tenth day, 10 minutes be- 
fore the first recess, cach teacher introduced the ex- 
perimenter for her classroom as a marketing repre- 
sentative of a local candy manufacturing firm and 
left the experimenter in charge of the room, The 
experimenter explained that she was testing the 
tastiness of a new brand of candy and passed out 
one piece of candy to each student, The candy was 
wrapped in colored cellophane with a different color 
used for each classroom. Following the taste test, 
the class was dismissed for recess. During recess the 
experimenters counted the number of candy wrappers 
in the wastebaskets, on the floor, and in the desk 
seats, The experimenters then relittered the classroom 
entrance area with seven specially marked candy 
After recess the experimenters checked 
the hallway and playground for discarded candy 
wrappers. During the lunch break, which came 1 
hour after recess, the experimenters reentered the 
classroom and determined the disposition of the spe- 
cially marked candy wrappers. 

The second posttest followed the first by a period 
of 2 weeks. During this time no mention of ecology 
or littering was permitted in any of the classrooms. 
The second test was very similar to the first except 
that this time, the experimenters did not interact 
with the students. Ten minutes before the afternoon 
recess the teacher passed out toy puzzles as Christ- 
mas presents from the Parent Teachers Association. 
The students were asked to try to work the puzzles 
before recess. Each puzzle was wrapped in a color- 
coded container with a different color assigned to 
each class. During recess the experimenters entered 
the classroom and determined the disposition of the 
containers left there. They then relittered the en- 
trance way. After recess the experimenters searched 


wrappers. 


the hallway and playground for other containers. 
After school the experimenters reentered the class- 
room and determined the disposition of the relittered 
containers, 


Results and Discussion 


Littering Behavior 

Figure 1 charts the percentage of items in 
each group which were discarded in the waste- 
basket on each test. A chi-square test was 
used to compare frequency of littering in the 
three groups on the immediate and delayed 
posttests. Although the measures directly re- 
flect items of litter rather than individual sub- 
jects, it was observed by the experimenters 
that subjects independently discarded their 
own candy wrappers in the wastebaskets. The 
three groups were significantly different at 
both the immediate posttest, y* (2) = 18.14, 
p < .001, and at the delayed posttest, x? (2) 
= 20.99, p < 001. The attribution group was 
significantly superior to the persuasion group 
on both the immediate posttest, x° (1) = 
7.19, p < .01, and the delayed posttest, x? (1) 
= 16.15, p<.01. Although the persuasion 
group appeared to show an immediate increase 
in litter-conscious behavior, it was not signifi- 
cantly different from the control group even 
on the immediate posttest, y* (1) = 2.57, ns. 


Cleanup Behavior 


All seven items were picked up by members 
of the attribution group on both the immedi- 
ate and the delayed posttest. Persuasion group 
members picked up four items on the immedi- 
ate posttest and two on the delayed posttest, 
while control group members picked up two 
items on the immediate posttest and three on 
the delayed test. Since the total number of 
wrappers left was only seven and more than 
one Wrapper may sometimes have been picked 
up by a single individual, a chi-square test is 
not fully appropriate. Nonetheless, such a 
test would be significant at the .05 level at 
each posttest, which supports the appearance 
of differences favoring the attribution group. 

After this study the attribution teacher 
was advised that the nonlittering behavior 
could perhaps be maintained if the students 
were occasionally reminded of the attribution 

You are neat.” Three months later the 
teacher reported that her class was still sig- 
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Ficure 1. Nonlittering behavior of the attribution, 
persuasion, and control groups over time (Study 1), 


nificantly neater than it had been prior to 
treatment. 

The results for both the littering test and 
the cleanup test support the hypothesis that 
attribution is a more effective technique than 
persuasion for inducing stable behavioral 
change, We would like to show, however, that 
this effect holds for other kinds of behavior, 
Furthermore, it would be desirable to over- 
come a weakness in Study 1 that arose from 
the fact that treatments were nested within 
classrooms, It is possible, if relatively un- 
likely, that the differences emerging over 
time were due to teacher differences rather 
than treatment differences. Study 2 avoids 
this by including all treatment and control 
conditions in each classroom, 


Srupy 2; MATH ACHIEVEMENT 


The results of Study 1 are certainly en- 
couraging for an attributional approach to 
modifying behavior. However, while ecology 
consciousness and nonlittering are of some 
social importance, they are not the primary 
focus of the schools, which is, at least theo- 
retically, to teach skills, Will attribution and 
persuasion techniques show the same pattern 
of effectiveness in generating a skilled be- 
havior, like math achievement, as they have 
in generating a socially desirable but unskilled 
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behavior, like disposing of trash in waste- 
baskets? Furthermore, littering behavior 
would certainly be only a weakly valenced 
aspect of self, while most of the skills taught 
in schools would be highly valenced and of 
considerable import for a student’s self-con- 
cept. Will attribution be as effective for a 
highly valenced aspect of self as it was for 
the more peripheral aspect? The first purpose 
of Study 2 was to answer just these questions. 
It might also be noted that attributions in 
Study 2, in addition to being more central to 
self-concept, are specifically directed to par- 
ticular individuals rather than addressed to 
a group as a whole, 

The second purpose of Study 2 was to test 
the relative effectiveness of attributions of 
motivation versus attributions of ability in 
changing behavior, Both perceptions of abil- 
ity and motivation are essential to the belief 
that a person will attain a given goal (Heider, 
1958). Study 1, however, would seem to have 
involved primarily the attribution of motiva- 
tion, since the children presumably began 
with a common belief that they had the abil- 
ity to be neat, In the case of a skilled be- 
havior like arithmetic, however, it would seem 
more likely that motivation and belief in 
one's motivation to do well is more common 
than ability and belief in one’s ability to do 
well (Katz, 1964), so that attributions of 
ability would be of greater value than attri- 
butions of motivation, Nonetheless, enhancing 
people’s perceptions of their motivation for a 
task may also benefit their performance. 
Study 2 attempts to separate ability and mo- 
tivation as the bases of attribution and the 
targets of persuasive appeal, 

So far we have only considered cognitive 
strategies for modifying attitudes and be- 
havior, Staats (1965) has shown that even 
young children will engage in complex learn- 
ing tasks if they are simply given appropri- 
ate reinforcement. According to Bandura 
(1969), a successful reinforcement strategy 
for behavior modification requires a valued 
reinforcer which is contingent upon the de- 
sired behavior and a reliable procedure for 
eliciting the desired behavior, In the present 
study both verbal praise and extrinsic re- 
wards were used as reinforcers for efforts at 
mathematical achievement, and a number of 
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overlapping procedures were used to elicit 
these efforts. 

To compare the relative efficacy of the at- 
tribution and reinforcement techniques with 
standard attitude change approaches, a per- 
suasive manipulation was devised similar to 
the one used in Study 1. The only changes 
were that audience participation and role 
playing were deleted, since neither were ap- 
propriate to the treatment conditions, while 
public labeling was added. It appears from 
the study of deviance (Becker, 1963) that 
public labeling of a person can lead that 
person to redefine himself along the lines of 
the label. While it was felt in Study 1 that 
children had to be convinced of the benefits 
of nonlittering in the persuasion condition, 
the advantages of math achievement as a 
means for obtaining rewards in school seemed 
too obvious to need pointing out. 

The present study attempted to modify 
children’s math-related self-esteem and their 
math scores on skill tests. The six conditions 
were attribution ability, attribution motiva- 
tion, persuasion ability, persuasion motiva- 
tion, reinforcement control, and a no-message 
control. It was hypothesized that all three 
basic techniques (attribution, persuasion, and 
reinforcement) should have an initial positive 
effect on the self-esteem and math behavior 
of the subjects but that attribution should 
have the most enduring effect over time. 


Method 
Participants 
The research took place in four second-grade 
classrooms of the same inner-city Chicago public 
school involved in Study 1. Second-grade students 
were picked, since it was felt on the basis of Rosen- 
thal and Jacobson (1968) that their school-related 
self-concepts would be more malleable than at a 
later age. In all, 96 students took part in the study. 
All five experimental conditions and one control con- 
dition were present in each of the four classrooms. 
From each class list of approximately 30 students, 
24 were randomly assigned to the six possible condi- 
tions. Thus 4 students in each classroom appeared 
in each condition. 


Overview of Procedure 


All subjects first received a mathematics pretest 
and a self-esteem pretest. Subsequent treatments 
consisted of 8 days of attribution, persuasion, or 
reinforcement. Immediately following the treatment, 
math and self-esteem posttests were administered. 
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week period 
ed the 


Delayed posttests were given after a 2 
of no treatment. The control group rece 
pretests and the immediate and delayed p 
no treatment. Student absences for both t 
and tests were made up on the day the 
returned to school. 


Experimental Manipulations 


with all 
groups: verbal comments, written comments, 


Five treatment techniques were used 


from the teacher, letters from the princip: 
medals. The above order is followed in discussing 
these techniques for each experimental condition. It 
should also be noted that in the attribution and 
persuasion conditions, students were initially called 
to the principal’s office in groups of eight, where 
they received a treatment-related message. The 
principal discontinued these treatments after the 
third day of the experiment, however, on the grounds 
that they were too time-consuming for her and that 
she found the false attribution treatments too diffi- 
cult to deliver. 

All treatment techniques were prepackaged. Be- 
fore the experiment began, teachers’ treatment pack- 
ages were prepared which listed the treatment tech- 
niques and their recipients for each day. The order 
in which each subject received the treatments was 
randomized for classroom “A” and repeated in each 
of the other classrooms. 

Attribution ability. The general focus of this 
treatment was attributing to the students skill and 
knowledge in mathematics. Three different verbal 
comments were made by the teacher to each student 
on different days: “You are doing very well in 
arithmetic,” “You are a very good arithmetic stu- 
dent,” and “You seem to know your arithmetic 
assignments well.” Three different written notes were 
tied to assignments on different days and handed 
back to the students: “You're doing very well,” 
“excellent work,” and “very good work.” The let- 
ters from the teachers and principal underscored the 
students’ excellent work in math and were sent home 
on days when a verbal or written note was not 
scheduled. The letter from the teacher included the 
phrases, “very good student,” “does all his assign- 
ments well,” and “excellent arithmetic ability.” The 
letter from the principal used the phrases “excellent 
ability,” “knows his assignments,” and “very good 
student.” The medals awarded to the attribution 
ability students featured the words “good student— 
math.” 

Attribution motivation. The general focus of this 
treatment was attributing hard work and consistent 
trying to the student. Three appropriate verbal com- 
ments were made by the teacher to each child pri- 
vately, and three written notes were appended to a 
test or assignment. The verbal comments were the 
following: “You really work hard in arithmetic,” 
“You're working harder in arithmetic,” and “You're 
trying more in arithmetic.” The written comments 
were as follows: “You’re working harder, good!” 

“You're trying more, keep at it!” and “Keep trying 
harder!” The letters from the teacher and principal 


ATTRIBUTION VERSUS PERSUASION 


underscored the child’s application in math, The 
teacher’s letter used the phrases “working hard,” 
“trying,” and “applying himself.” The principal's 
letter used the phrases “working harder,” “applying 
himself,” and “trying harder.” The medals awarded 
to the attribution motivation students featured the 
words “hard worker—math.” 

Persuasion ability. The general focus of this treat- 
ment was to persuade the student that he should be 
good in arithmetic and doing well in that subject. 

` Three verbal comments and three written comments 
summarizing that message were made in the same 
manner as that for the attribution messages. The 
three verbal comments were “You should be good at 
arithmetic,” “You should be a good arithmetic stu- 
dent,” and “You should be doing well in arithmetic.” 
The three written comments were “should be bet- 
ter,” “should be good at arithmetic,” and “should 
be getting better grades.” The letters from the 
teacher and principal used ostensible “aptitude test 
scores” to inform parents that their child should be 
making good grades in math. The teacher’s letter 
included the injunctions “should be doing well,” 
“should be getting high grades,” and “should be 
becoming a good arithmetic student.” The princi- 
pals letter used the phrases “should get good 
grades,” “should do very well,” and “should be a 
good arithmetic student.” The medals awarded to 
these students contained the words “do better— 
math,” 

Persuasion motivation. The general focus of this 
treatment was to persuade the student that he 
should be working harder and spending more time on 
math, The three verbal and the three written com- 
ments asked the child to try more at math and were 
made in the same way as they were in the other 
treatment conditions, The three verbal comments 
were “You should spend more time on arithmetic,” 
“You should work harder at arithmetic,” and “You 
should try more on arithmetic.” The three written 
notes were “Try harder,” “Work more on arithmetic,” 
and “Work harder.” The letters from the teachers 
and the principal informed parents that their child 
should spend more time on math and to pass that 
idea along to him. The teacher's letter included the 
injunctions “should be trying harder,” “should spend 
more time on arithmetic,” and “should be applying 
himself.” The principal's letter used the phrases 
“spend more time” and “try harder.” The medals 
awarded the persuasion motivation students con- 
tained the words “work harder—math.” 

Reinforcement. The reinforcement condition also 
followed the same format as the attribution and 
persuasion conditions except that it added two addi- 
tional methods and deleted the principal's com- 
ments, Three verbal comments indicating pride in 
the student’s good work were made by the teacher. 
They were as follows: “I’m proud of your work,” 
“Įm pleased with your progress in arithmetic,” and 
“very good.” Three written comments of simple 
praise were appended to the student’s math work. 
They were, “excellent,” “very good,” and “I’m very 
happy with your work.” The letters from the teach- 
ers and principal indicated pride and satisfaction in 
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the child’s work to the parent. The teacher’s letter 
used the phrases “proud of his work,” “good 
grades,” and “happy with his work.” The principal's 
letter used the phrases “excellent progress,” “doing 
very well,” and “proud of him.” The medals 
awarded those students contained the words “math 
award.” On Days 2, 5, and 7, students were praised 
verbally if they chose to work on an extra math 
problem rather than a reading exercise. On two 
other days the students received silver stars by 
solving a problem from a math assignment. 

Control. The control condition received no treat- 
ment whatsoever but took part in all tests of mathe- 
matical ability and self-esteem. 


Measurements 
Self-Esteem 


The self-esteem pretest was an adaptation of the 
questionnaire originally developed by Rogers and 
Dymond (1954) that includes items which measure 
self-esteem with regard to peers and parents, school 
interests, and personal interests. Four new items 
were added which dealt specifically with math-related 
self-esteem. All items were declarative sentences gen- 
erally of the “I am ” form, from which the 
children were asked to say whether it was “like me” 
or “unlike me.” The self-esteem pretest and posttest 
were both administered individually to each student 
by one of a dozen eighth-grade assistants, who asked 
each question privately to the second graders and 
personally recorded the subject’s answer. The self- 
esteem posttest was given in the morning on the 
day after treatment ended. 


Mathematics 


The pretest and all subsequent math tests were 
20-item tests consisting of 25% review questions, 
50% current material, and 25% new material, The 
math pretest was administered by the teachers as a 
regular arithmetic quiz. The immediate posttest was 
administered in the afternoon on the day after treat- 
ment ceased. The final posttest was administered 
after a two-week period in which no treatment took 
place. 


Results 
Self-Esteem 


The mean math self-esteem scores for all 
six conditions on the pretest and the posttest 
are presented in Figure 2. Analysis of variance 
of the pretest scores indicated that there were 
no significant differences among the five 
treatment groups and the control group in 
math self-esteem before treatment began, 
F (5,90) = .79, ns. 

The major analysis of the math self-esteem 
scores was a repeated measures analysis of 
variance using the six experimental conditions 
and the two times of measurement as factors. 
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Ficure 2. Self-esteem scores for the attribution, 
persuasion, and control groups over time (Study 2). 


Results indicated both a main effect of time, 
F (1,90) = 10.84, p < .01, and a significant 
Time X Treatment interaction, F (5,90) = 
2.41, p < .05. As can be seen from Figure a: 
all treatment groups show an increase in self- 
esteem, while the control group shows a de- 
crease, which is probably due to the lack of 
any treatment for control students in the face 
of a number of public treatments visible to 
them in their classroom. 

A subsidiary analysis of variance was per- 
formed using only the attribution and the 
persuasion groups, with mode of treatment 
(attribution vs. persuasion), basis of treat- 
ment (ability manipulation vs. motivation 
manipulation), and time as the three inde- 
pendent variables. Results again indicated a 
main effect of time, F (1, 60) = 19.02, p< 
001, and a significant Time X Mode of Treat- 
ment interaction, F (1, 60) = 4.76, p < .05. 
The math self-esteem of the attribution groups 
increased more sharply from pretest ( 2.66) to 
posttest (3.50) than did the math self-esteem 
of the persuasion groups (2.84 at pretest, 3.13 
at posttest). Only the attribution ability and 
attribution motivation groups were signifi- 
cantly different from the no-treatment control 
group in their change from pretest to posttest, 
HAT; 30)i 7.58: 2p < Ole for attribution 
ability, F (1,30) = 7.66, p < .01 for attribu- 
tion motivation. This difference approached 
but fell short of significance for the persuasion 
ability group, F (1,30) = 3.12, P< ll. 
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Similar analyses showed no significant effects 
on general school-related and non-school-re- 
lated self-esteem. 


Mathematics 


The mean total math scores for all six 
conditions on the pretest, the immediate post- 
test, and the delayed posttest are presented in 
Figure 3. Analysis of variance of the pretest 
scores indicated that there were no significant 
differences among the five treatment groups 
and the control group in their total math 
scores before treatment began, F (5,90) = 
-54, ns. 

The major analysis of the math test scores 
was a repeated measures analysis of variance 
using the six experimental conditions and the 
three times of measurement as factors. The 
results show both a main effect of time, F (2, 
180) = 36.69, p< .001, and a significant 
Time Xx Treatment interaction, F (10, 180) = 
5.57, p < .001. From Figure 3 it can be seen 
that both attribution conditions show marked 
increases on the immediate posttest followed 
by a slight tendency to enlarge that increase 
after the 2-week delay. Both persuasion con- 
ditions appear to show an increase on the 
immediate posttest but fail to maintain that 
increase over the 2-week delay. The rein- 
forcement condition shows a pattern similar to 
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that of the attribution conditions but with a 
lesser degree of improvement. 

To assess the extent to which attribution 
and persuasion differed in their initial effec- 
tiveness versus the extent to which they dif- 
fered in their ability to maintain their effec- 
tiveness over time, separate analyses were 
made of changes from the pretest to the im- 
mediate posttest and from the immediate post- 
test to the delayed posttest. Mode of treat- 
ment (attribution vs, persuasion) and basis 
of treatment (ability vs. motivation) were 
used as factors along with time of test. For 
change from pretest to the immediate post- 
test, a significant Mode x Time interaction, 
F (1,60) = 11.97, p< .001, indicated that 
attribution was significantly more effective 
than persuasion in inducing change. Only the 
attribution ability and attribution motivation 
groups were significantly different from the 
no-treatment control group in their change 
from pretest to immediate posttest, F (1,30) 
= 14.75, p<.01 for attribution ability; 
F (1,30) = 11.42, p<.01 for attribution 
motivation. This difference approached but 
fell short of significance for the persuasion 
ability group, F (1,30) = 2.92, p < .10. 

For change from the immediate posttest to 
the delayed posttest, another significant Mode 
Xx Time interaction, F (1,60) = 13.67, p< 
.001, indicated that the attribution treatments 
were also superior to the persuasion treat- 
ments in maintaining what change they pro- 
duced. The attribution treatments show an 
increase of .50 from the immediate to the 
delayed posttest, while the persuasion treat- 
ments show a decrease of .63. 

Finally, to make a preliminary test as to 
whether attribution and persuasion were simi- 
lar in their effects on high- and low-ability 
students, subjects in each condition were di- 
vided at the cell median according to their 
math pretest performarice. An analysis of 
variance on posttest math scores was con- 
ducted using pretest ability (high vs. low), 
treatment (attribution vs. persuasion, ignor- 
ing basis of treatment), and posttest time 
(immediate vs. delayed) as factors. Initially, 
more able students continued to outperform 


initially less able students, F (1,60) = 46.01, 


p <.001, on the posttests. Attribution was 
more effective than persuasion for both groups, 


F (1, 60) = 19.39, p < .001, but the attribu- 
tion-persuasion difference was significantly 
larger for the low-ability students than for 
the high-ability students, interaction F (1, 
60) = 5.66, p < .05, Average posttest scores 
for the high-ability students were 18.8 for 
attribution and 17.8 for persuasion, while 
for low-ability students they were 16.5 for 
attribution and 13.0 for persuasion. 


Discussion 


In both studies the attribution treatments 
caused a significant change which persisted 
over time. These treatments were strong 
enough to overcome counterarguing by sub- 
jects in Study 1 and a history of at best mod- 
est success among many subjects in Study 2. 
Both attributions based upon subjects’ abil- 
ity to do something and those based upon sub- 
jects’ motivation to do it appeared effective. 
The effects of persuasion were, in general, in- 
significant and dissipated over time. 

The fact that the superiority of attribution 
treatments over persuasion treatments was 
demonstrated in two different field experi- 
ments using different behaviors (nonlittering 
and math problem solving), two different sub- 
ject populations (fifth graders and second 
graders), and two different designs (a be- 
tween-classroom design and a within-classroom 
design) gives us some confidence that these 
effects are real and generalizable. Neither 
study is without weakness, The nesting of 
treatments within classrooms in Study 1 
leaves teacher or group differences as a possi- 
ble, if unlikely, alternative explanation for 
the treatment effects that emerged over time. 
The public nature of the treatments in each 
classroom in Study 2 means that treatment 
effects may have been aided by implicit com- 
parisons students were making between their 
own condition and other treatments, a process 
which must lie behind the unexpected drop in 
math self-esteem shown by the no-treatment 
control group. (This last result must also be 
counted among the ethical costs of a within- 
classroom design.) The weaknesses of the 
two studies, however, are quite different, 
while their effects are quite similar, which sug- 
gests that the results are not due to idiosyn- 
crasies of design, 
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The present research provides a general 
framework into which previous work concern- 
ing the effects of teacher expectancies on 
pupil performance (Beez, 1968; Rosenthal & 
Jacobson, 1968; Seaver, 1973) can be fit. The 
means by which teacher expectancies are 
communicated is at best a dependent variable 
in previous studies (Meichenbaum, Bowers, 
& Ross, 1969) and often mysteriously unob- 
served. The present study made the com- 
munication of expectancies, in the form of 
attributions, its central manipulation. The 
fact that this programmed communication of 
expectancies worked as it did provides some 
support for the essential validity of the often 
elusive teacher expectancy effect. 


Effects of Attribution 


That attributions based upon ability and 
attributions based upon motivation did not 
differ in their effectiveness implies that direct 
linkage of skill-specific attributions to the 
self-system is more important than the basis 
on which this linkage is made. The message 
“You are a particular kind of person” is more 
important than the specification of “why.” 
It should be noted, however, that the present 
research contrasted only two kinds of internal 
attributions, ability and motivation. Attribu- 
tions made to a person on the basis of external 
factors (“You are neat because I am watching 
you”) would presumably be less effective in 
producing lasting change. 

Attribution can, of course, involve elements 
of persuasion. As we have seen, the statement 
“You are a neat person” may be a most ef- 
fective means of persuading someone to be 
neat. Nonetheless, such attributional state- 
ments need not involve persuasive intent but 
may instead be simple statements of fact. In- 
deed, their guise as truth statements may be 
thought of as their most effective advantage. 
Not only does this enable them to work di- 
rectly on a person’s self-concept, as noted, but 
it may also enable them to slip by the de- 
fenses a person ordinarily employs against 
persuasive attempts that are recognized as 
such. Attribution as persuasion may be further 
effective because it is less easily recognized as 
persuasion, and hence less likely to arouse 
resistance, counterarguing, or reactance. 

In Study 1 the attribution treatment did 
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elicit counterarguing by the students, which 
suggests a possible reactance (Brehm, 1972) 
or boomerang effect (McGuire, 1969), but 
there was no evidence of such an effect by 
the end of the experiment. It is possible that 
reactance, like other attitude-change forces, 
dissipates over time as the issue is worked 
through and the treatment is maintained. It 
is also possible that the elemenary school stu- 
dents who were subjects in the present studies 
are less likely to perceive the manipulative 
intent of an attribution treatment and less 
likely to show reactance than a comparable 
adult group. 


Attribution and Persuasion 


In accounting for the relative ineffectiveness 
of persuasion, we may note first of all that 
persuasive communications urging a person to 
do something do not necessarily tap the in- 
ternal self-concept of their target. Worse yet, 
to the extent that they do implicate self-con- 
cept, they may involve the negative attribu- 
tion that the person is not the kind of indi- 
vidual who engages in the recommended be- 
havior. An appeal to be neat or an appeal to 
work hard can involve the implicit attribu- 
tion that the person is not currently the sort 
of individual who is neat or works hard. If 
convincing people that they are neat or 
hard working is the key to making them neat 
or hard working, a naive persuasive attempt 
can cancel out its own message. At best it 
attributes to its target the potential for be- 
coming the sort of person recommended, but 
this is clearly much weaker than the attribu- 
tion that the person already embodies the 
desired behavior. 

In this study, moreover, the persuasive 
messages implied something negative about 
the subjects, while the attribution treatments 
implied something positive. Although blame 
as well as praise has been shown to be effec- 
tive in eliciting improved performance (see 
Kennedy & Willcutt, 1964, for a review of 
the mixed results in this area), one of the 
best designed studies (Hurlock, 1925) found 
that the improvement elicited by blame dissi- 
pated over time, while that elicited by praise 
persisted. It may be speculated that persua- 
sion and punishment both remain effective in 
motivating behavior only so long as the actors 
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feel that they can accomplish what is being 
called for. As they accept the implicit nega- 
tive attributions of a persuasive message, the 
effectiveness of the message diminishes. This 
may explain why persuasion was relatively 
more effective for high-ability students than 
for low-ability students. The high-ability stu- 
dents may have been better able to respond to 
the appeal to do better without becoming dis- 
couraged by the implicit attribution (which 
they could to some extent discount) that they 
were not currently demonstrating appropriate 
accomplishment. 

The implicit attributions of the persuasion 
treatments in this study were negative because 
the behaviors they were calling for (which 
the subjects were presumably not emitting) 
were positive. Likewise, the attribution treat- 
ments were positive because the behaviors 
they attributed to the subjects were positive. 
For practical and ethical reasons, all of the 
treatments in the present study were aimed at 
producing positive or socially desirable behay- 
iors. It is our idea that the implicit or explicit 
labeling itself, and not merely the rewarding- 
ness or punishingness of the labels, made the 
present attribution treatments effective and 
the persuasion treatments ineffective. This 
can only be known for sure, however, by a 
study that aims at producing undesirable be- 
havior, in which case attributions of the be- 
havior would have negative implications, 
while persuasion to the behavior would have 
implicit positive implications for the present 
self-concept. If people were responding to 
the attributions in the present experiment 
only because they were rewarding, we would 
expect that attributions of an undesirable 
behavior would have reverse effects and, in- 
deed, that persuasion under these circum- 
stances might even be more effective. The 
sociological literature on deviance (e.g., 
Schur, 1971), however, suggests that negative 
attributions (e.g., labeling as delinquent) can 
indeed produce or support the attributed be- 
havior, as does a recent experimental study of 
labeling “charitable” or “uncharitable” be- 
haviors (Kraut, 1973). We may suspect, then, 
that the positive implications of the attribu- 
tions in the present study were not the sole 
key to their effectiveness and that if a suit- 
able experiment could be designed in which 
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the target behavior were socially undesirable, 
attribution would continue to be more effec- 
tive than persuasion in generating that be- 
havior. 


Attribution and Reinforcement 


A straightforward reward contingency pro- 
gram seems to modify behavior because it 
makes that behavior worthwhile to the sub- 
ject. However, the separation between rein- 
forcement and attribution seems somewhat 
confounded. Symbolic reinforcement, as used 
in the present study, has some attributional 
aspects. Simple praise is often interpreted as 
a “You are X” statement. Furthermore, at- 
tribution can contain elements of reinforce- 
ment especially when socially desirable be- 
haviors are the focus of the attributional 
process. Thus to some extent a reinforce- 
ment procedure that produces enduring change 
may require elements of attribution, while a 
successful attribution treatment may involve 
elements of reinforcement. 

The remaining feature distinguishing at- 
tribution from reinforcement would seem to 
be the noncontingent nature of the attribu- 
tion. In this regard it is interesting to note 
that Kazdin (1973) has recently found that 
under circumstances where the desired behav- 
ior was emitted at a fairly high base rate and 
subjects believed that reinforcement was con- 
tingent, noncontingent reinforcements were 
as effective as contingent ones in modifying 
behavior. Attribution treatments, however, 
may have the very important further advan- 
tage over simple reinforcement of serving to 
elicit behavior (like modeling; see Bandura, 
1969) as well as to maintain it. 


Practical and Ethical Implications 


The present study supports the idea that 
an effort to improve the child’s academic self- 
concept will help improve academic per- 
formance, if only because the improved self- 
image will make actual success less incon- 
sistent, less unexpected, and less likely to be 
discounted or rejected (Brickman, 1972). A 
distinguishing feature of the present attribu- 
tion treatments is that they focused on raising 
self-esteem in a specific area of skill rather 
than raising global or general self-esteem. The 
failure of more general “cultural enrichment” 
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or general “self-concept enhancement” pro- 
grams may be due in part to the fact that 
these global manipulations have only vague 
and diffuse implications for particular areas of 
academic performance (but cf. deCharms, 
1972). 

Nonetheless, there are a number of reasons 
to be cautious about considering these results 
as the basis for a solution to any social prob- 
lem. It is unlikely that long-standing indi- 
vidual differences in accomplishment will be 
overcome by short-term manipulations of 
motivation, incentive, or self-regard. A gain 
of three problems solved after a week of 
treatment is not very substantial in terms of 
life chances, and there is no reason to assume 
that 10 weeks of treatment will necessarily 
result in a gain of 30 problems. Second, the 
attribution treatments in Study 2 were diffi- 
cult to administer and, unlike the treatments 
in Study 1, did not produce any immediate 
and visible indications of success to sustain 
teacher enthusiasm. While the teachers in- 
volved in Study 1 were quite positive toward 
the study, those involved in Study 2 had 
decidedly mixed feelings about the value of 
the time and energy involved. More seriously, 
the false attributions came increasingly to 
be felt by at least one teacher and by the 
principal as an intolerable risk to their 
credibility. As indicated, the principal termi- 
nated her office meetings with the students 
after the third day of treatment mainly on 
these grounds. While future research could 
tailor attributions not to be too discrepant 
from individual pretest baselines, the practi- 
cal and ethical difficulties involved in main- 
taining such attributions will not thereby be 
eliminated. All of these matters warn against 
an uncritical application of the present 
results to matters of educational importance. 
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Two experiments were conducted in the traditional “forbidden toy” paradigm 
to test the hypothesis that derogation of the forbidden toy is a means of 
making the temptation to play with the toy less frustrative, rather than of 
reducing cognitive dissonance. In Experiment 1, distraction from thinking 
about the forbidden toy during the temptation period was manipulated. The 
results of a verbal measure of liking supported the prediction while those for 
a behavioral playtime measure did not. Experiment 2 further tested the self- 
control hypothesis by directing the children’s attention to either their liking 
for the forbidden toy, their behavior toward the toy, neither, or both of 
these conditions. Greater derogation and less playtime in the mild-threat 
conditions only occurred when the children’s liking for the toy was made 
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salient. As a result, the self-control explanation was supported. 


Of all the “forced compliance” experimen- 
tal procedures, the Aronson and Carlsmith 
(1963) forbidden toy paradigm has most con- 
sistently yielded results in support of disso- 
nance theory. In this procedure, after an 
initial ranking which establishes the relative 
values of a given set of toys, children are told 
not to play with one of the highly ranked or 
attractive toys while an experimenter goes on 
an errand. Before leaving, the experimenter 
describes the consequences of transgression as 
either a threat of mild annoyance or great 
anger. Following this manipulation, the child 
is usually left alone in the room with several 
other toys with which he is allowed to play. 
After a suitable temptation period, the experi- 
menter returns and again measures the child’s 
liking for the forbidden toy relative to the 
other toys. In seven independent replications 
of this general procedure (Aronson & Carl- 
smith, 1963; Carlsmith, Ebbesen, Lepper, 
Zanna, Joncas, & Abelson, 1969; Freedman, 
1965; Lepper, Zanna, & Abelson, 1970; 
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Ostfeld & Katz, 1969; Pepitone, McCauley, 
& Hammond, 1967; Turner & Wright, 1965), 
children who received the mild threat de- 
valued the forbidden toy more than did chil 
dren who received the severe threat. 

Dissonance theory explains these results by 
assuming that severe threat reduces the in 
consistency between the two cognitions: “I 
like that toy” and “I am not playing with 
that toy.” Since the mild threat provides in 
sufficient justification, the theory proposes 
that the child should reduce the dissonance 
between the two cognitions by devaluing the 
forbidden toy. He does not play with the 
forbidden toy because he does not like it. 

In the context of less consistent results from 
other forced compliance paradigms, for exam- 
ple, essay writing (Snyder & Ebbesen, 1972), 
one wonders what it is about the forbidden 
toy procedure which makes it so robust. One 
distinguishing feature of the forbidden toy 
situation is that it places the subjects in a 
setting in which they must resist the tempta- 
tion of playing with an attractive toy (Freed- 
man, 1965). As such, it seems plausible that 
processes associated with self-control might 
be going on during the temptation period in- 
stead of, or in addition to, dissonance reduc- 
tion. With this in mind, we might profitably 
view the results from the forbidden toy 
paradigm in light of what is known about 
Processes which mediate self-control, 
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Skinner (1948) suggested that successful 
delay of gratification or resistance to tempta- 
tion depends upon not attending to the 
desirable but unavailable reward. Recently, 
Mischel and Ebbesen (1970) and Mischel, 
Ebbesen, and Zeiss (1972) conducted a series 
of experiments with nursery school children 
which supported Skinner’s early supposition. 
They found that children were able to delay 
gratification much longer if they were pre- 
vented from thinking about the unavailable 
rewards by either overt or covert distrac- 
tions. These results were explained by assum- 
ing that the waiting situation was made more 
frustrative when the children concentrated 
upon a reward they could not have. In short, 
self-control, at least in the form of waiting 
for a delayed reward, may be easier for a 
child the less he thinks tempting thoughts. 

The reasoning above can be applied to the 
forbidden toy procedure. In both the delay 
of gratification and the resistance to tempta- 
tion situations, children are prevented from 
obtaining a valued reward at the start of the 
experiment. In the forbidden toy paradigm 
this “goal blocking” occurs because both the 
mild and the severe threats are sufficiently 
intense to prevent all of the children from 
playing with the second-ranked toy. As in the 
delay situation, being prevented from playing 
with the forbidden toy should cause the chil- 
dren to experience some frustration as they 
wait for the experimenter to return. In addi- 
tion, the intensity of the frustration should 
«be a direct function of the frequency with 
which the children think about playing with 
the forbidden toy. The more the childen think 
tempting thoughts, the more frustrating the 
situation should be. 

Successful application of this model to for- 
bidden toy research requires that it explain why 
children devalue the forbidden toy more after 
mild than after severe threat. This can be 
accomplished by assuming that the threats 
-control the frequency with which the children 
think about the unavailable toy during the 
temptation period, as well as serve as barriers 
against playing with the toy. While both mild 
and severe threat are sufficiently intense to 
prevent the children from playing with the 
forbidden toy, it seems reasonable that the 
mild threat would not be sufficient to prevent 
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the children from thinking about playing with 
the forbidden toy, whereas the severe threat 
would. Because the second-ranked forbidden 
toy is attractive relative to the available toys, 
the children might periodically reconsider 
whether to play with the forbidden toy, but 
only if the threat was weak. In the severe- 
threat condition, the higher intensity of the 
consequences for transgression might prevent 
the forbidden toy from being considered as 
a play option; and, therefore, the possibility 
of playing with the forbidden toy might never 
be entertained by the children. If we assume 
that children in the mild-threat condition 
think about the unavailable toy more often 
than children in the severe-threat condition, 
it follows from the delay of gratification 
research that the temptation period should be 
more frustrating for the former group. Unlike 
the delay of gratification paradigm in which 
the children can escape an increase in frus- 
trating thoughts by terminating the waiting 
period, children in the forbidden toy proce- 
dure who think more tempting thoughts are 
unable to end the waiting situation because 
both the mild and the severe threats are suf- 
ficient to prevent transgression, Therefore, an 
alternative strategy must be developed to 
cope with a higher frequency of frustrating 
thoughts in the mild-threat condition. One 
such plausible self-control strategy would be 
to devalue the forbidden toy until it is no 
longer tempting. 

Support for this reasoning comes from two 
independent experiments conducted by Carl- 
smith et al. (1969), They directed children’s 
attention to the forbidden toy during the 
temptation period and found that forced at- 
tention produced greater devaluation of the 
forbidden toy than no forced attention in 
both the mild- and severe-threat conditions. 
When the child is forced to think about the 
reward he cannot have, he should experience 
more frustration and, as a result, he should 
devalue the forbidden toy more (Mischel 
et al., 1972). 

The self-control literature further suggests 
that distracting children from thinking about 
the forbidden toy during the temptation 
period ought to reduce or eliminate the 
frustration and, therefore, the degree to which 
the children devalue the toy. Mischel et al. 
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(1972) found that distraction made waiting 
for a delayed reward so easy that some 4- 
year-old children were able to wait over an 
hour for a marshmallow. If the forbidden toy 
procedure involves the same general self- 
control processes that are involved in delay 
of gratification, then distraction should elimi- 
nate tempting thoughts and, consequently, the 
necessity to make the temptation period less 
frustrating by devaluing the forbidden toy. 
The first experiment was designed to test this 
hypothesis. 


EXPERIMENT 1 


In one condition of the delay of gratifica- 
tion experiments, distraction was manipulated 
by giving the children a toy to play with 
while they waited. In the forbidden toy pro- 
cedure, the children already have toys that 
they can play with while resisting tempta- 
tion, that is, those toys which were not for- 
bidden, and yet devaluation still occurs in the 
mild-threat condition. If the present self- 
control model is correct, the nonforbidden 
toys that have been used in previous research 
must not have been interesting enough, rela- 
tive to the forbidden toy, to prevent the 
children from thinking tempting thoughts. 
Possibly this was because the spread in at- 
tractiveness among the toys has been rather 
large. Thus, after a child finished playing 
with the first-ranked toy during the tempta- 
tion period, the second-ranked toy would look 
relatively more attractive than the remaining 
three toys and the temptation to play with 
the forbidden one would increase. With this 
reasoning in mind, distraction was manipu- 
lated in this experiment by varying the 
difference in value between the forbidden toy 
and the freely available toys. Since it was not 
possible to know which toy would be the 
second-ranked, forbidden toy before a child 
was allowed to rank order the toys, the dis- 
traction manipulation was accomplished by 
controlling the difference in value between all 
of the toys presented to the child. In half of 
the cases the toys were all very similar in 
attractiveness (small spread) and in the other 
half the toys were very different in attrac- 
tiveness (large spread). Thus, no matter 
which toy was chosen as the forbidden toy, 
in the small-spread condition the remaining 
toys would be almost as attractive and there- 
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fore as engaging as the forbidden toy, whereas 
in the large-spread condition the remaining 
toys would, on the average, be much less at- 
tractive and interesting. To control for the 
average level of attractiveness of all of the 
toys, half of the small-spread toys had a 
low-average attractiveness and half had a 
high-average attractiveness. 

It was predicted that the use of small- 
spread (high-distraction) toys would elimi- 
nate tempting thoughts in the mild-threat 
condition in the same way that distraction 
apparently eliminates the frustration of wait- 
ing for a delayed reward. Thus, the greater 
devaluation normally found in the mild- as 
compared to the severe-threat condition 
should be reduced or even eliminated in the 
small-spread cells. In this way, the self- 
control model predicts an interaction between 
level of threat and the spread of the toys. 
The typical dissonance effect should be found 
with large-spread (low-distraction) toys, but 
a smaller or no effect of threat should be 


found with small-spread toys.* fa 


In an attempt to put this interaction pre- 
diction to a stringent test, three (as opposed 
to the traditional two) levels of threat were 
employed. Since all previous forbidden toy 
studies have used only two levels of justifica. 
tion, the use of three levels of threat also pro- 
vided the opportunity to determine whether 
the effects of justification on devaluation were 
monotonic (Gerard, Conolley, & Wilhelmy, 
1973). In addition, a control condition was 
run to determine the effects of the three / 
threat levels on the dependent variables of 
verbal derogation and playing time without 
the frustrating temptation period. In this con- 
dition, large-spread toys were used, and 
the dependent measures were obtained im- 


mediately before the normal temptation 
period. 

Method 
Subjects 


Ninety-six nursery school children of predomi- 
nately middle-class backgrounds served as subjects 


1A resistance to change hypothesis would predict 
the exact opposite interaction. Specifically, devalua- 
tion of the forbidden toy should be most difficult 
to accomplish in the large-spread conditions because 
the toy has further to move before a change in rank 
order could occur. 
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in this experiment. Their ages ranged from 2.8 to 
5.9 years with a mean of 4.5 years. Four males and 
4 females were randomly assigned to each of the 
three large-spread cells in both the experimental and 
control conditions (48 subjects). An additional 8 
males and 8 females were randomly assigned to each 
of the three small-spread experimental cells (48 sub- 
jects). One other subject was lost when he played 
with the forbidden toy during the temptation period. 

Two experimenters, one male and one female, 
switched roles (Experimenter 1 and Experimenter 2) 
for half of the subjects in every cell, In addition, 
each experimenter ran an equal number of male and 
female subjects in both roles and in every cell of 
the design. 


Procedure 


The procedure used was nearly identical to that 
of Aronson and Carlsmith (1963) with the addition 
of the initial distraction manipulation and a third 
threat level. 

Distraction manipulation. Children from the same 
population as the subjects were pretested to deter- 
mine the attractiveness of the toys. The small-spread 
or high-distraction toys were divided into two 
groups of either uniformly high attractiveness 
(elaborate toys—a racing car set; a marble game; 
and a battery-powered spaceship, train, and robot) 
or uniformly low attractiveness (simple toys—a ball, 
a small airplane, a pink rabbit, a plastic nut and 
bolt, and a small bulldozer). Half of the subjects 
in the small-spread condition were assigned either 
the uniformly high or the uniformly low group of 
toys. Large-spread or low-distraction toys were 
assembled such that two of the five toys were very 
interesting relative to the remaining toys (the racing 
car set, the battery-powered train, the small air- 
plane, the pink rabbit, and the plastic nut and bolt). 
In this way, only one good toy was generally avail- 
able during the temptation period since the children 
invariably ranked the racing car set and train higher 


. than the other toys. It should be noted that the 


toys in the large-spread condition corresponded 
closely to the toys used in previous research (Aron- 
son & Carlsmith, 1963; Freedman, 1965; Lepper 
et al., 1970), 

After the first experimenter demonstrated the five 
toys to the subject, all of the toys were placed on a 
table in front of the subject, and Experimenter 1 
(who at this point was blind to the subject’s threat 
condition) used the method of elimination (Turner 
& Wright, 1965) to obtain rank-ordered preferences 
for all of the toys. Specifically, the first experimenter 
began by asking the subject, “If you could only 


` play with just ome of all these toys, which one 


would you really want to play with? Which one 
do you like the best?” When the subject indicated 
his choice, Experimenter 1 picked up that toy and 
put it on the floor under the table. Experimenter 1 
then asked the subject to make a choice from the 
four remaining toys to determine the subject’s 
second-ranked toy. This procedure was continued 
until the subject had indicated his relative prefer- 
ences for all five toys. 
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Threat manipulation. Immediately following the 
initial rankings, Experimenter 1 placed the subject’s 
second-ranked toy on the table and put the rest of 
the toys on the floor. Experimenter 1 then made 
the standard excuse about running an errand and 
delivered the appropriate threat to the subject. The 
low- and high-threat manipulations were virtually 
identical to those used by Aronson and Carlsmith 
(1963). In the low-threat condition, Experimenter 1 
said: “I don’t want you to play with the 
If you play with it, I will be annoyed with you.” 
In the moderate-threat condition, Experimenter 1 
said: “I don’t want you to play with the 3 
If you play with it, I will be very angry with you.” 
Finally, in the high-threat condition, Experimenter 1 
said: “I don’t want you to play with the ___. 
If you play with it, I will be very angry with you. 
I will think you are just a big baby, and I will” 
have to take all of my toys and go home and never 
come back again. I will be very angry with you if 
you play with the .” Experimenter 1 then 
reiterated that the subject could play with any one 
of the other toys on the floor, and leit the subject 
alone in the room for the 5-minute temptation 
period. During the 5-minute period, Experimenter 1 
observed the subject through a peep-hole in the 
door to determine whether the subject played with 
the forbidden toy. At the end of the temptation 
period, Experimenter 1 returned to the room, put 
all of the toys together, and nullified the threat by 
telling the subject that now he could play with all 
of the toys, including the previously forbidden one, 
The dependent measures were then collected by 
Experimenter 2. 

Control conditions, If the random assignment slip 
noted that the subject was in a control condition, 
Experimenter 1 delivered the appropriate threat just 
as in the experimental conditions. However, rather 
than leaving the child alone to resist temptation, 
Experimenter 1 paused for a moment as if to 
rethink what he had just said, He then nullified the 
threat by saying, “I'll tell you what, I’ve changed 
my mind; let’s play with all of the toys before I go 
on my errand.” The forbidden toy was then re- 
turned to the floor with all of the other toys. The 
large-spread (low-distraction) toys were used in 
these control conditions in order to determine the 
effects of threat on the results of the traditional 
paradigm without the usual temptation period. In 
addition, by elminating the temptation period, these 
control conditions were designed to provide a base- 
line against which to compare the distracting effects 
of the small-spread toys. If the small-spread toys 
eliminate the cognitive processes that go on during 
the temptation period, then the small-spread and 
the control conditions ought to yield similar results 
even though the spread between the toys and the 
behavior of the experimenter were different in the 
two cases. 

Dependent measures. After the threat was nullified 
and all the toys were placed on the floor, Experi- 
menter 1 signaled to Experimenter 2 by closing the 
door to the experimental room which was slightly 
ajar, and soon Experimenter 2 entered and was 
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introduced to the subject. Experimenter 1 asked 
Experimenter 2 to continue with the games he and 
the subject had been playing and left to receive a 
phone call (cf. Lepper et al., 1970). Experimenter 2 
(who was blind to the threat conditions as well as 
the initial preferences) then used the same elimina- 
tion procedure as was used initially (Turner & 
Wright, 1965) to obtain the second measure of 
verbal preference for the toys. The change in prefer- 
ence for the second-ranked toy was the main depen- 
dent variable and ranged from —3 (complete deroga- 
tion) to +1 (slight enhancement). Experimenter 2 
next placed all of the toys on the floor, told the 
subject that he could play with any of them, and 
sat in the corner pretending to be busy with some 
work. During this 4-minute “free access” period, 
Experimenter 1 recorded the order and length of 
time the subject played with each of the toys 
through a one-way mirror. Experimenter 1 then 
returned to the experimental room and took the 
subject back to his classroom. 


Results and Discussion 


The major purpose of this experiment was 
to examine the interactive effects of the 
spread and threat manipulations on devalua- 
tion. It was predicted that children who were 
left with toys that were almost as attractive 
as the forbidden toy (small spread) would 
find the temptation period much easier and, 
therefore, would not devalue the forbidden 
toy as much as children who were left with 
toys that were not as attractive, on the aver- 
age, as the forbidden toy (large spread). 
Figure 1 presents the mean changes in the 
verbal rank position of the forbidden toy. 
Negative scores indicate devaluation and 
positive scores indicate an average increase in 
the relative rank order, In agreement with the 
prediction above, in the large-spread condi- 
tions subjects devalued the forbidden toy 
more as the severity of the threat decreased, 
F(1, 66) = 4.37, p <.03. As predicted, this 
dissonance effect did not occur in the small- 
spread conditions; however, rather than elimi- 
nating the effect of threat, the use of small- 
spread (high-distraction) toys actually re- 
versed the effect. Greater devaluation of the 
forbidden toy occured as the severity of the 
threat increased, F(1,66) = 4.62, p< 05. 
An overall analysis of variance of these data 
indicated that there was no main effect for 
spread (F < 1), no main effect for level of 
threat (F < 1), and a significant interaction 
between spread and level of threat, F(2, 66) 
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= 4.37, p< .025. Furthermore, as can be 
seen in Figure 1, the effect of threat that was 
found in the small-spread condition was 
identical to that found in the control group 
in which the children reranked the toys before 
experiencing the temptation period (F < 1).* 
Thus, leaving the children to play with toys 
that were almost as attractive as the for- 
bidden toy was practically the same as not 
tempting the children at all. In both cases, 
the children devalued the forbidden toy more 
as the level of threat increased even though 
different toys were used in the two cases and 
the behavior of Experimenter 1 may have 
appeared quixotic in the control conditions. 

Similar results were found for the propor- 
tion of children who devalued the forbidden 
toy. In the large-spread condition, 75%, 25%, 
and 0% of the children devalued the toy in 
the low-, moderate, and high-threat condi- 
tions, respectively, whereas in the small- 
spread conditions, 25%, 38%, and 63% of 
the children devalued the forbidden toy in 
the corresponding threat conditions. This 
interaction was highly significant, F(2, 0) 
= 7.94, p < .005.* The percentages for the 
control group were 38%, 50%, and 75% and 
obviously did not differ from those in the 
small-spread conditions. Thus, as the threat 
level increased, more children devalued the 
forbidden toy when the spread was small; 
but when the spread was large, fewer chil- 
dren devalued the forbidden toy as the threat 
increased. The latter result replicates previous 
forbidden toy research exactly (Aronson &/ 
Carlsmith, 1963). 

While these results partially support the 
original hypothesis, examination of the play- 
time data yields a different picture. In par- 
ticular, there were no significant effects of the 
manipulations on the amount of time (mea- 
sured absolutely or relatively) that children 
played with the forbidden toy after the threat 
was lifted. Neither the threat manipulation 
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pie the ee cells, there was no significant 

uiterence between the all-high and all-low t 
(F <1). ih 
8 This test was computed on the arcsin trans- 
formed Proportions in which the error term was that 
for the binomial distribution (see Winer, 1971), A 
conservative estimate for this error term was com- 
puted from the cell with the smallest n, namely, 8. 
» 8. 
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D 
” (F <1) nor the spread manipulation (F < 1) 


had an effect on playtime. In addition, the 
interaction was also not significant, F(2, 66) 
= 1.21. Similarly, an analysis of the propor- 
tion of children who played with the for- 
bidden toy after the threat was lifted yielded 
nonsignificant effects. Slightly over 70% of 
all the children played with the forbidden toy 
at least once during the free-play period. If 
playtime is taken as an indicator of the value 
of the forbidden toy (as in Freedman, 1965), 
then these playtime results disconfirm the 


_ original hypothesis. Since the results from 


Experiment 2 have implications for this find- 
ing, further discussion is postponed until 
those results have been presented. 

The verbal derogation results indicate that 
there was an inverse relationship between 
devaluation and level of threat only in the 
conditions in which the spread between the 
toys was large: Children devalued the for- 
bidden toy less as the threat level increased. 
However, when the spread between the toys 
was small, there was an unpredicted direct 
relationship between level of threat and 
verbal devaluation: Children derogated the 
forbidden toy more as the threat level in- 
creased. Discussion of this latter result is 
postponed until Experiment 2 is presented. 


EXPERIMENT 2 


The fact that a small spread between the 
toys eliminated the “dissonance” effect can 


“be taken as support for the self-control hy- 


pothesis. However, other explanations of the 
results are certainly possible. One plausible 
interpretation, consistent with either disso- 
nance theory or self-perception theory (Bem, 
1972), is that the spread manipulation varied 
the justification for not playing with the for- 
bidden toy. That is, when the remaining toys 
were as attractive as the forbidden toy, the 
children may have been able to justify their 


“not playing with the forbidden toy by con- 


centrating on how much they liked the avail- 
able toys in contrast to the unavailable one. 
When the spread was large, this alternative 
means of justification may have been less 
likely to occur. : 
Although the findings from the previous 
experiment can be interpreted in terms of 
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Ficure 1. The average change in the verbal rank 
order of the forbidden toy as a function of threat 
level and the initial spread between the toys. 


either dissonance theory or self-perception 
theory, it is possible to differentiate between 
these two theories and the proposed self- 
control hypothesis in terms of the cognitive 
processes that are postulated by each theory. 
In dissonance theory, devaluation is said to 
be the result of a drive to reduce the incon- 
sistency between two cognitions. Furthermore, 
the drive to reduce the dissonance and the 
resultant derogation of the forbidden toy 
should be greatest when the two inconsistent 
cognitions (i.e., the initial attitude and the 
inconsistent behavior) are made salient 
(Carlsmith et al., 1969; Snyder & Ebbesen, 
1972). In self-perception theory, on the other 
hand, the devaluation is said to occur at the 
time the child reranks the toys (Bem, 1972; 
Lepper et al., 1970). During the ranking, the 
child decides how much he likes a toy pre- 
sumably by examining his past behavior 
toward the toy and the conditions present 
when the behavior occurred. Thus, greater de- 
valuation ought to occur when the child at- 
tends to the fact that he did not play with the 
forbidden toy. In contrast to both of these 
conceptions, the self-control hypothesis sug- 
gests that devaluation is the result of at- 
tempts to cope with the frustration engen- 
dered during the temptation period. Further- 
more, the temptation should be most difficult 
when the children concentrate on how much 
they like a toy they cannot have. Paradoxi- 
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cally, as in Mischel et al. (1972), thinking 
about how much they like the toy should 
cause them to devalue it more as one means 
of reducing the frustration. Thus, by varying 
the relative saliency of the child’s initial atti- 
tude and his behavior in a 2 X 2 design, it 
should be possible to test the relative use- 
fulness of the three alternative hypotheses. 

Since the small-spread finding in Experi- 
ment 1 was the first demonstration of a direct 
relationship between threat and derogation in 
the forbidden toy paradigm, a small spread 
was used in all conditions in the second 
experiment in an attempt to replicate these 
uncommon results. The use of toys with a 
small spread also provided the opportunity 
to investigate the effects of the saliency ma- 
nipulations on a baseline in which attention 
to the forbidden toy was presumed to be low. 
Could the saliency manipulations reinstate 
the “dissonance” effect? And, if so, in what 
attentional conditions would the “dissonance” 
effect reappear? 


Method 
Subjects 


Eighty nursery school children from the same gen- 
eral population as the first study served as subjects. 
Ages ranged from 2,6 to 5.3 years with a mean of 
4.3 years. Five males and 5 females were randomly 
assigned to each of the eight cells in the design. None 
of these subjects had participated in the first experi- 
ment, One additional subject was lost because he 
played with the forbidden toy. 


Materials 


The five small-spread toys used were almost iden- 
tical to those of uniformly high attractiveness in the 
first study, although two toys had to be replaced 
due to mechanical failure. (The train was replaced 
with a battery-powered bulldozer, and the robot 
was replaced with a similar battery-powered me- 
chanical man.) Only highly attractive toys were used 
because the previous experiment revealed no signifi- 
cant difference between the results for the all-high 
and the all-low toys within the small-spread cells. 


Procedure 


Experimenter 1 followed essentially the same pro- 
cedure as that in the small-spread experimental con- 
ditions in the first study with the following differ- 
ences: only two levels of threat were used and 2 
minutes into the temptation period the saliency ma- 
nipulations were initiated. In addition to the elimi- 
nation procedure described earlier, a paired-com- 
parison procedure of measuring verbal preferences 


EBBESEN, BOWERS, PHILLIPS, AND SNYDER 


(Aronson & Carlsmith, 1963) was added to this 
experiment in order to obtain better estimates of 
the children’s rank-ordered preferences. After com- 
pleting the elimination procedure, Experimenter 1 
placed two of the five toys on the table and asked 
the subject, “Now, let’s suppose you could play with 
either this toy or this other one. Which one would 
you really want to play with?” When the subject 
indicated his choice, Experimenter 1 placed both of 
the toys on the floor, put two others on the table, 
and continued until the subject had made choices 
between 10 unique pairs. Whenever there were in- 
consistencies in a subject’s preferences and a tie in 
ranking resulted, the crucial pairs of toys were 
placed back on the table and the subject was ques- 
tioned until the ties were broken. Following these 
initial verbal rankings, Experimenter 1 told the sub- i 
ject he had to leave on an errand and delivered the 
appropriate threat to prohibit playing with the 
second-ranked toy. 

Threat manipulation. After Experimenter 1 con- 
sulted a random assignment slip, one of the two 
levels of threat (low and high) was delivered in 
exactly the same manner as it was in the first study. 
Experimenter 1 reiterated that the subject could play 
with any of the other toys on the floor and then 
left on his errand. 

Saliency manipulation. At 2 minutes into the 5- 
minute temptation period, Experimenter 1 signaled * 
for Experimenter 2 to enter the room, Experimenter 
2 (blind to the threat condition and the subject’s 
initial preferences) entered the experimental room 
and pretended to be a janitor (Carlsmith et al., 
1969). Experimenter 2 acted surprised to find any- 
one in the room and mentioned that he came for a 
chair that was sitting in the corner. On his way out 
with the chair in the no-salience conditions, Experi- 
menter 2 told the subject, “Boy, that sure is a 
bunch of nice toys you have. They sure are nice.” 
Experimenter 2 then left the room. In the liking 
salient conditions, Experimenter 2 repeated the no- 
salience statements, pointed to the forbidden toy,/ 
and continued, “You must really like this toy, don’t 
you?” Experimenter 2 repeated the rhetorical ques- 
tion and left the room without waiting for the sub- 
ject’s response. In the behavior salient conditions, 
Experimenter 2 followed the same procedure, but 
instead he twice said, “I see you're not playing with 
this toy, are you?” Finally, in the both salient con- 
ditions, Experimenter 2 made both statements twice 
in order to draw attention to both the subject’s 
liking for the toy and his behavior toward it. For 
half of the subjects in the both conditions the liking, 
statement came first and for the other half the order 
was reversed. At the end of the 5-minute temptation 
period, Experimenter 1 returned to the room, placed 
ae Ph ei toy on the floor, and nullified the 

reat by telling the subject th i 
all of the Bie j T 

Dependent measures. Experimenter 1 (who was 
blind to the salience manipulations) placed the toys 
next to the table and used both the paired-com- 
parison and the elimination procedure to obtain two 
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TABLE 1 
Mean DEVALUATION or THE FORBIDDEN Toy MEASURED BY PAIRED RANKING 
AND BY ELIMINATION 
Low threat High threat 
Dependent 
variable 3 R 2 R 
No salience | Talee | caine | ttl | No salience | Behavior alen | suit 
Paired ranking 
1 2 4 =5 3 —.6 z4 =2 =1 
SD 1.3 8 14 1.1 1.2 1.3 1.5 1.6 
Elimination 
M 2 0 =3 =9 = 9 =3 it 0 
SD 1.0 11 1.3 1.4 14 1.6 1.4 14 
Mean of two 
M 0 2 rod ei) =D =—.35 305 = 05: 
SD A) 2 1.2 1.0 1.1 1.3 1.0 13 


measures of the subject’s change in ranking for the 
forbidden toy. Thus, in contrast to Experiment 1, 
the same experimenter delivered the threat manipu- 
lation and obtained the rankings. Finally, during a 
4-minute free access period while Experimenter 1 
was present in the room, Experimenter 2 (still blind 
to the threat) recorded the playtime measure 
through the one-way mirror, as in the first study. 


Results and Discussion 


In order to determine the effects of the 
manipulations on devaluation of the forbid- 
den toy, the mean changes in rank order of 
the forbidden toy for the two verbal measures 
of devaluation—the paired ranking procedure 
and the elimination procedure—were com- 
puted. As can be seen in Table 1, both mea- 
sures suggest that children devalued the toy 
as the threat decreased only when their at- 
tention was directed at their liking for the 
forbidden toy. When they did not attend to 
their initial liking (i.e., in the no-salience and 
behavior salient conditions), the traditional 
dissonance effect did not occur. While 2 X 2 
X 2 analyses of variance of the means in 
Table 1 yielded no significant effects for the 
paired ranking procedure (all Fs < 1, except 
the Threat X Liking Salient interaction, F (1, 
72) =2.21, p> .10), the Threat X Liking 
Salience interaction was significant for the 
elimination procedure, F (1,72) = 5.12, p< 
.05 and the mean (Carlsmith et al., 1969) of 
the two verbal procedures, F (1,72) = 4.62, 
p< .05.* All other Fs were nonsignificant 


*The weaker effect in the paired ranking proce- 
dure may have been due to the unexpectedly high 
number of intransitivities in the paired ranking data. 


and less than 1.20. Furthermore, the Threat 
X Liking Salient interaction accounted for 
over 98% of the systematic between-condi- 
tion variance in the elimination procedure 
data and over 85% of the between-condition 
variance in the data for the mean of the two 
procedures. 

The playtime measure yielded results which 
were similar but not identical to the verbal 
ranking procedure. Table 2 presents the mean 
amount of time that the children played 
with the forbidden toy during the 4-minute 
free-play period at the end of the experi- 
mental session. In contrast to the first experi- 
ment, an analysis of variance of these data 
indicated that there was a significant main 
effect of threat, F (1,72) = 4.92, p< .05, 
such that children played with the forbidden 
toy Jess after high than after low threat. In 
addition, the Threat X Liking Salient inter- 
action was significant, F (1, 72) = 5.65, p< 
-025. When the children’s attention was di- 
rected at their liking for the forbidden toy, 


TABLE 2 


MEAN AMOUNT or TIME PLAYED WITH 
THE FORBIDDEN Toy 
(in seconds) 


Threat level 
Salience manipulation a 
Low High 
No salience 111.5 33.5 
Behavior salient 99,5 32.5 
Liking salient 73.5 67.0 
Both salient 76.2 88.0 
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they played with the forbidden toy slightly, 
but not significantly, less after low than high 
threat; however, when their attention was 
not directed at their liking for the toy, they 
played with the forbidden toy more after low 
than high threat, F (1,72) = 10.56, p< 
.005. 

The relatively large effects on the playtime 
measure found in this experiment require dis- 
cussion in the context of the failure to find 
such effects in the first experiment. One possi- 
ble explanation is that the experimenter who 
delivered the threat was not present in the 
room during the play period in the first ex- 
periment but was in the second experiment. 
Without the threatening experimenter present 
in the first study, the children’s play may 
have been largely controlled by properties of 
the toys rather than by cognitive processes 
which occurred 5 or 10 minutes before the 
play period. In the second experiment the ex- 
perimenter who delivered the threats was 
present, and the cognitive processes which 
occurred during the temptation period may 
have been reinstated, This explanation could 
account for the direct effect of the level of 
threat on playtime found in this experiment. 
One difficulty with this explanation, however, 
is the finding by Freedman (1965) that the 
behavioral effects of the threat manipulation 
seem to last for several weeks and occur even 
with a different experimenter present. Recon- 
ciliation between Freedman’s (1965) findings 
and the present results may depend upon the 
average age of the subjects in the two studies. 
Freedman used 8-10-year-old children, where- 
as we used 3-5-year-old children. It seems 
plausible that younger children might re- 
quire the presence of the experimenter to re- 
instate previous cognitive processes, whereas 
older children would not. 

For both the verbal and behavioral mea- 
sures, the only case in which a low threat 
produced greater devaluation of the forbidden 
toy than a high threat was when the children’s 
attention was directed at their liking for the 
forbidden toy. When their attention was not 
directed at their liking, both the verbal and 
playtime devaluations were greater with a 
high than a low threat. The latter result repli- 
cates the findings from the small-spread con- 
ditions in the first experiment. Apparently, 


when the children are prevented from think- Ah 


ing about how much they like the forbidden 
toy, an increase in the threat has a direct 
rather than an inverse effect on devaluation. 
In the context of the original hypothesis this 
interaction implies that a “dissonance” effect 
only occurs if the children think tempting 
thoughts during the temptation period. It also 
implies the unexpected result that more severe 
threats produce larger devaluations when chil- 
dren do not concentrate on wanting what they 
cannot have during the temptation period. 


GENERAL DISCUSSION 


Taken as a whole, the results from both 
experiments lend support to the self-control 
explanation of research which has used the 
forbidden toy paradigm. It appears that de- 
valuation of the forbidden toy under mild- 
threat conditions occurs only if the children 
direct their attention to their liking for the 
unavailable toy. If their attention is drawn 
away from their liking for the toy by the 
availability of other equally attractive toys, 
devaluation under mild threat does not occur. 
Such a finding is consistent with the Lepper 
et al. (1970) view that devaluation involves 
a fair degree of cognitive work during the 
temptation period. However, the present re- 
sults also imply that the reason for the cogni- 
tive work is not to reduce an inconsistency 
between two cognitions, as Lepper et al. 
(1970) suggest, but rather to make the temp- 
tation period less frustrating. The fact that 
making both the initial liking and the incon- 
sistent behavior salient did not produce 
greater devaluation than making only the ini- 
tial liking salient supports this contention. 
Furthermore, the failure to find a larger de- 
valuation in the condition in which the chil- 


5 Further evidence against a dissonance interpreta- 
tion are the weak associations found in both experi- 
ments between the verbal rankings and the play- 
time rankings taken after the temptation period. 
The average Spearman’s Rho in the first experiment 
was 4, t(95) =1.08, ns, and in the second experi- 
ment it was .52, t(79) =1.36, ns. In the context of 
previous suggestions that the traditional attitude 
concept be abandoned (Bem, 1972; Abelson, 1972), 
these data imply that inconsistency between verbal 
ratings and behavior is the norm rather than the 
exception. If this is true, inconsistency reduction 
loses some of its explanatory power. 
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dren’s attention was directed toward their 
behavior suggests that self-perception (Bem, 
1972) is not the primary mechanism under- 
lying devaluation. 

While the derogation results from both 
experiments seem to provide greater support 
for the self-control explanation than either a 
dissonance or self-perception account, the 
original self-control explanation cannot handle 
all of the findings. In particular, in both ex- 
periments there was an unpredicted direct 
relationship between threat level and deroga- 
tion in those conditions in which the temp- 
tation to play with the forbidden toy was 
weak or eliminated, that is, in the small- 
spread conditions with instructions which did 
not emphasize the initial liking. In fact, this 
direct relationship was apparently so strong 
that in the second experiment it produced a 
main effect of threat on the playtime mea- 
sure. Thus, the attention manipulation was 
not quite powerful enough to reinstate the 
“dissonance” effect by completely reversing 
the direct relationship found with the small- 
spread toys. 

The only theory known to the authors that 
attempts to explain both inverse (dissonance) 
and direct (incentive) effects is the recent 
proposal by Gerard et al. (1973) that there 
is a nonmonotonic relationship between justi- 
fication and attitude change. However, for 
several reasons, not the least of which is that 
the theory is based upon inconsistency re- 
duction, it seems appropriate not to accept 
the Gerard et al. account of the “incentive” 
effect found in the present paradigm. Instead, 
it seems more reasonable to propose one 
mechanism for the direct, “incentive” rela- 
tionship and one for the inverse, “dissonance” 
relationship, especially because the control 
groups and the small-spread groups in Ex- 
periment 1 (see Figure 1) were affected in 
the same manner by the threat manipulation. 
Apparently, “incentive” effects do not re- 
x quire that the subjects engage in behavior 
inconsistent with the initial attitude, whereas 
the “dissonance” effects do. Thus, the direct 
effect of threat may be the simple result of 
pairing the threat with the toy or of inade- 
quately nullifying the threat before the pref- 
erences are measured. In the former case, the 
children’s liking for the forbidden toy would 


451 


presumably decrease as a result of its associ- 
ation with the intense threat, whereas in the 
latter case the devaluation would be a mea- 
surement artifact, resulting from the children 
being afraid to express a strong preference to 
play with the forbidden toy for fear of the 
consequences which might follow. 

In order to explain the X-type interaction 
found in both experiments, it is necessary to 
assume that resisting temptation in a frus- 
trating situation either eliminates or com- 
pletely reverses the process which produces 
the direct effects of threat. The self-control 
hypothesis can provide an explanation for the 
reversal if it is assumed that the intensity of 
threat controls the quality rather than, or in 
addition to, the quantity of the children’s 
cognitions during the temptation period. In 
both the large-spread (Experiment 1) and 
the small-spread-liking salient conditions 
(Experiment 2), the children’s attention is 
drawn to the forbidden toy either because the 
other toys are relatively uninteresting or be- 
cause an adult directly intervenes. These at- 
tentional manipulations may increase the fre- 
quency with which the children think about 
the attractiveness of the forbidden toy re- 
gardless of the intensity of the threat. How- 
ever, because the consequences for transgres- 
sion are only mildly aversive in the low- 
threat condition, but sufficiently strong to 
prevent transgression, the increase in thoughts 
about the attractiveness of the forbidden toy 
might cause the children to experience a 
greater temptation to transgress and conse- 
quently more frustration. This frustration 
might then be reduced by derogating the for- 
bidden toy as a self-control strategy. In the 
high-threat conditions, the attentional manip- 
ulations might produce a similar increase in 
the thoughts about the forbidden toy, how- 
ever, because the consequences for transgres- 
sion are so clear and threatening, this in- 
crease might not produce greater temptation 
and frustration. Furthermore, if the severity 
of the threat did eliminate the forbidden toy 
as a viable play option during the temptation 
period, the increase in attention to the toy 
might actually make its attractiveness that 
much more salient. Thus, in contrast to the 
low-threat conditions, the value of the for- 
bidden toy might actually increase with a se- 
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vere threat. To the extent that the devalua- 
tion produced by the “incentive” process was 
equal in magnitude to this increase, the result 
would be no change in the value of the for- 
bidden toy. On the other hand, if the incen- 
tive process was weak, possibly because the 
severe threats were adequately nullified, there 
might even be an overall increase in the value 
of the forbidden toy in the severe-threat con- 
ditions. In the original forbidden toy research, 
Aronson and Carlsmith (1963) found that 
64% of the subjects in the severe-threat con- 
dition increased the value of the forbidden 
toy and no subjects decreased its value. In 
the present study, when the two experiments 
are combined, 50% of the children increased 
the value of the forbidden toy, while only 
27% decreased its value in the appropriate 
severe-threat conditions, These results suggest 
that severe threat might actually cause an 
increase in the value of the forbidden toy 
when the children think about its attractive- 
ness, provided the threat is adequately nulli- 
fied before the measures are taken. 

In conclusion, the results from the present 
experiments imply that the derogation that 
has so consistently been found in the forbid- 
den toy paradigm is neither the result of self- 
perception nor dissonance reduction but is 
rather the result of attempts by the children 
to cope with a frustrating temptation period. 
However, the fact that strong, direct relation- 
ships between threat and derogation were 
found in weak-temptation conditions suggests 
that attempts to cope with frustration are not 
the only processes which determine deroga- 
tion. Future research is required before the 
full complexities of the forbidden toy para- 
digm can be determined. 
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Cooperative Behavior in Rats: 
Effect on Extinction of Shock Onset Conditions 
During Acquisition 


Stephen W. Epley and Milton E. Rosenbaum 
University of Iowa 


Pairs of rats were placed in a mixed-motive situation in which they were 
dependent upon each other in order to avoid electric shock and obtain food. 
At the beginning of each trial during acquisition of cooperative responding, 
pairs in one condition were exposed to delay in shock onset which allowed 
avoidance of the shock. Pairs in the other condition were exposed to immedi- 
ate onset of shock which permitted only escape from the shock. All pairs 
learned to cooperate. When shock was entirely eliminated during an extinction 
period, reduction in the responses required for cooperation occurred more 
rapidly in pairs that were exposed to the escape condition than in pairs 
exposed to the avoidance condition. The results are discussed as having 
implications for cooperative behavior ordinarily viewed as under the aegis of 
mutual positive reinforcement but which may be maintained by cues that 


are producing avoidance of aversive stimulation. 


In behavioristic analyses, cooperation has 
been characterized as involving responses that 
result in mutual reinforcement for the partici- 
pants in the process (Rosenberg, 1960; Za- 
jonc, 1969). The reinforcement may involve 
provision of positive incentives and/or escape 
from or avoidance of aversive stimulation. In 
many instances in which the development of 
cooperative responding has received analytic 
study, both types of motivational sources 
have played contributory roles (cf. Deutsch 
& Krauss, 1960; Kelley, Thibaut, Radloff, & 
Mundy, 1962; Sidowski, 1957; Sidowski, 
Wyckoff, & Tabory, 1956). Once cooperation 
has developed in a situation in which both 
motivational sources play a vital role, it is 
important to consider whether elimination of 
one of the bases of reinforcement leads to 
extinction of the cooperative behavior. 

This article is based in part on a master’s thesis 
submitted by the first author to the University of 
Iowa. Financial support was provided in part by a 
National Institute of Mental Health Predoctoral 
Fellowship to the first author and by an award to 


the second author from the Graduate College (Uni- 
versity of Iowa) from National Institutes of Health 
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Stephen W. Epley is now at Wartburg College, 


Waverly, Iowa. è 
E. for reprints should be sent to Milton E. 


Rosenbaum, Department of Psychology, University 
of Iowa, Iowa City, Iowa 52242. 


In two studies in which cooperative re- 
sponding by rats had been accomplished un- 
der the aegis of both positive and aversive 
stimulation, elimination of the aversive stim- 
ulus led to rapid extinction (Daniel, 1943; 
Rosenbaum & Epley, 1971). In these studies, 
pairs of rats were placed in an apparatus in 
which they had to respond interdependently 
if they were to obtain food. The animals were 
faced with the problem that obtaining food 
was made incompatible with the necessity for 
escape from aversive stimulation. The appa- 
ratus consisted of a cage with a platform at 
one end. A food dish was located at the other 
end beyond reach of a rat on the platform. 
The floor of the cage was made of grid bars 
which were electrified when both members of 
a pair were off the platform, At the beginning 
of each trial both rats were dropped onto the 
electrified grid bars. To obtain food, it was 
required that one member of the pair stand on 
the platform, thus eliminating shock and en- 
abling the other member to feed. For both 
members to obtain food, interchanging of po- 
sitions was necessary. 

Three studies (Daniel, 1942, 1943; Rosen- 
baum & Epley, 1971) have demonstrated that 
pairs of rats faced with this problem accomp- 
lish a solution wherein both obtain food by 
interchanging positions frequently. Early in 
training, most interchanges were accompanied 
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by exposures to shock, that is, the member 
standing on the platform left the platform 
before the other member returned to it. Later 
in training, rats interchanged positions fre- 
quently without being shocked, that is, the 
rat on the platform left it only after the other 
rat returned to it. In fact, Rosenbaum and 
Epley (1971) found that after training 97% 
of the interchanges were performed without 
shock and the few shocks that occurred were 
of very brief duration. 

The rapid extinction of interchanging be- 
havior when shock was eliminated may be 
attributable to the manner in which shock 
was presented at the beginning of each of the 
trials conducted during each day of acquisi- 
tion training. Specifically, in the studies by 
Daniel (1943) and Rosenbaum and Epley, 
the rats were shocked unavoidably at the 
beginning of each trial, and the shock per- 
sisted until one or both rats ascended the 
platform. With the total elimination of shock 
during the extinction period, it is possible 
that the difference between acquisition and 
extinction conditions was readily discrimina- 
ble. If rats were permitted to avoid shock at 
the beginning of each trial rather than being 
forced to escape from unavoidable shock, it 
would be expected that the conditions pre- 
vailing before and after elimination of shock 
would not be readily discriminable and main- 
tenance of cooperative behavior would be 
enhanced. 

In the present study, a test of this pos- 
sibility was provided by two experimental 
treatments. In an escape condition, both 
members of pairs received shock at the begin- 
ning of each trial, thus permitting on each 
trial escape from the initial shock but not 
avoidance of this shock. The procedures for 
the escape condition are identical to those 
used by Rosenbaum and Epley (1971). In an 
avoidance condition, electrification of the grid 
was delayed a brief period at the beginning 
of each trial allowing avoidance of the initial 
shock if either member of a pair was on the 
platform at the end of the delay inter- 
val. Shock was abruptly eliminated in both 
conditions after an acquisition period. 

It was expected that pairs of rats in both 
conditions would, at the end of the acquisi- 
tion period, perform frequent interchanges. 


SrepHEN W. EpLey anD Mitton E. ROSENBAUM 


The experimental hypothesis was that rats in 
the escape condition would perform fewer 
interchanges after shock was eliminated than 
pairs in the avoidance condition. This analy- 
sis is consistent with other data related to 
extinction of a wide variety of behaviors fol- 
lowing avoidance and escape training (see 
Beecroft, 1967, for a review). 


METHOD 
Subjects 


Subjects were 20 experimentally naive female albino 
rats purchased from Carworth Farms of Michigan. 
All were between 112 and 117 days old at the be- 
ginning of the experiment. 


Apparatus 


The apparatus was similar to that employed by 
Rosenbaum and Epley (1971). It consisted of a 
20.3X 47 cm. box. A Masonite platform was 
mounted on one end of the box and a glass food 
dish was located at the other end 23 cm. from the 
platform. This distance rendered it impossible for 
an animal to reach the food dish without stepping 
off the platform; however, tactual contact was pos- 
sible between a rat feeding at the dish and a rat 
maintaining one or both rear feet on the platform. 
A wire connecting the food dish to a bar underneath 
the platform prevented subjects from standing on 
the food dish to escape shock, A silent wall switch 
placed on the side of the box initiated a timer 
which controlled the delay of the onset of shock for 
the avoidance group. The source of shock was a 
Scientific Prototype Model 4008-J scrambled AC 
shock source. Behavior of subjects was recorded on 
a Gerbrand event recorder, Two telegraph keys 
operated by the experimenter provided records indi- 
cating when a member of a pair was at the food 
dish. A third channel of the event recorder was 
automatically activated with the occurrence of shock. 


Procedure 


Throughout the experiment, subjects were housed 
individually. Water was freely available in each 
home cage. Individual rats were randomly assigned 
to pairs. 

Preliminary training. The procedures employed in 
preliminary training were similar to those employed 
by Rosenbaum and Epley (1971). These consisted 
of habituation of subjects to the apparatus and 
adaptation to a diet of wet mash consisting of five 
parts of pulverized Purina rat chow mixed with 
seven parts of water. During each of the last 4 days 
of preliminary training, rats were fed in the appa- 
ratus for two 74-min, trials. On these days, no food 
was provided other than that which rats obtained 
in the apparatus, During preliminary training, rats 
were alone in the apparatus and were never exposed 


to shock. After preliminary training, Pairs of rats 
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were randomly assigned to escape and avoidance 
conditions. 

Acquisition period. On each day of the 28-day 
acquisition period, pairs of subjects were placed in 
the apparatus for 12 105-sec, trials. Each intertrial 
interval was 30 sec. Wet mash was available in the 
dish at all times. The grid bars were electrified if 
neither member of the pair was on the platform. 
The shock intensity for the first 9 days of the 
acquisition period was .§ ma. measured across the 
direct output terminals of the shock source. For the 
remainder of the acquisition period, the shock 
intensity was .6 ma. 

At the beginning of each trial, pairs in both the 
escape and avoidance groups were placed on the 
grid bars in the middle region of the apparatus 
facing the platform. For escape pairs, the grid bars 
were electrified at the time of placement in the 
apparatus. For pairs in the avoidance group, the grid 
bars were not electrified at this time. The onset of 
shock occurred 4 sec. after subjects were initially 
placed in the apparatus if neither subject was stand- 
ing on the platform at that time. At the end of each 
trial, pairs of rats were removed from the apparatus 
and held passively throughout the 30-sec. intertrial 
interval. During this time, food in the dish was 
replenished. 

Throughout the acquisition period, subjects re- 
ceived only the food obtained in the apparatus pro- 
vided that they did not weigh less than 75% of 
their mean body weight as established during 4 days 
of handling. Subjects below this weight at the end 
of 12 trials on any day were given sufficient wet 
mash in their home cages to bring their body weight 
to the 75% level 43 hour after the termination of 
the trials for that day. 

Extinction period. On the day following the end 
of the acquisition period, a 16-day extinction period 
began. The procedure of the extinction period was 
identical to that of the acquisition period, except 
that no shocks were administered. 


RESULTS 
Acquisition Period 


The results of the acquisition period were 
consistent with earlier investigations in indi- 
cating that both members of each pair ob- 
tained food in the apparatus by interchanging 
positions frequently. An interchange was 
tabulated from the recording tape when one 
member of a pair returned to the platform 
from the food dish and the other member 
moved from the platform to the food dish. In 
Figure 1, the mean number of interchanges 
for the 28 days of the acquisition period for 
the escape and avoidance groups is presented 
on the left. Pairs in both the escape and 
avoidance groups performed an increasing 
number of interchanges during the early 
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Ficure 1. Mean total interchanges performed by 
pairs in the escape and avoidance conditions during 
the acquisition and extinction periods. 


portion of the acquisition period. Later 
in the acquisition period, the rate of inter- 
changing remained relatively constant for 
most pairs. The increase in rate of inter- 
changing for both groups combined across 
blocks of 4 days of the acquisition period was 
significant, 7 (6,48) = 9.991, p < .001. The 
overall rate of interchanging during the ac- 
quisition period was slightly higher for the 
escape group, although not significantly 
so, F(1,8) = 1.676. The interaction between 
experimental group (escape and avoidance) 
and blocks of 4 days of the acquisition period 
approached significance, F (6,48) = 2.1 14, p 
< .07. As seen in Figure 1, the rate of inter- 
changing during the early portion of the ac- 
quisition period was lower for the escape 
group, whereas later in the acquisition period 
the escape group performed more interchanges 
than the avoidance group. 

The manner in which interchanges were 
performed differed substantially between the 
escape and avoidance groups. For each inter- 
change that was tabulated, it was determined 
if it had occurred with or without shock. An 
interchange with shock was designated if both 


456 STEPHEN 
animals of a pair were on the grid simultane- 
ously any time during the interchange. Inter- 
changes without shock were designated if no 
shock occurred at this time. Figure 2 pre- 
sents the number of interchanges that oc- 
curred with and without shock for each pair 
across the 28 days of the acquisition period. 
The data for the escape pairs are shown to 
the left of Figure 2. The most relevant 
feature of these data is that all pairs except 
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E4 performed relatively few interchanges 
without receiving shock. 

The performance of the avoidance pairs is 
shown to the right in Figure 2. All pairs ex- 
cept A3 performed frequent interchanges at 
the end of the acquisition period and most 
were without shock. In addition, shocks at the 
beginning of each trial were infrequent in that 
at least one member of these pairs ascended 
the platform prior to the termination of the 
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4-sec. period prior to shock onset on 99% of 
the trials during the last 4 days of the ac- 
quisition period. This observation indicates 
that avoidance behavior was successfully es- 
tablished for these pairs. Pair A3 was the 
only avoidance pair that was shocked fre- 
quently during interchanges at the end of the 
acquisition period, Pair A3 also failed to 
avoid shock at the beginning of the trials on 
62.5% of the trials. 

The amount of food obtained by the pairs 
increased during the first half of acquisition 
and stabilized at a maintenance level, No 
difference appeared in amount of food con- 
sumed by rats in the two experimental condi- 
tions and within pairs the amounts obtained 
were approximately equal. 

In summary, all pairs interchanged posi- 
tions frequently by the end of the acquisition 
period, and all pairs obtained adequate sus- 
tenance which was shared approximately 
equally. However, most escape pairs received 
considerable shock during interchanges, while 
shock occurred rarely for most avoidance 
pairs. 


Extinction Period 


Although the total elimination of shock 
permitted the members of all pairs to feed 
simultaneously in the absence of direct aver- 
sive stimulation, most pairs maintained over 
several days frequent mutual interchanges 
between the platform and the food dish. The 
nean number of interchanges for the avoid- 
ance and escape groups for the 16 days of the 
extinction period is shown on the right por- 
ion of Figure 1. The frequency of inter- 
hanging in the extinction period was greater 
for the avoidance group than for the escape 
group. The difference between these groups 
over blocks of 4 days was statistically sig- 
nificant, F(1, 8) = 5.59, p < .05. Both groups 
interchanged positions at a decreasing rate 
during blocks of 4 days of the extinction 
period, F (3,24) = 15.02, p < .001. The in- 
teraction between experimental condition 
and days of the extinction period was not 
statistically significant, F(3, 24) = 1.125. 

The possibility was examined that mainte- 
nance of interchanging during extinction was 
related to whether interchanges were executed 
with or without shock during the acquisi- 
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tion period. The Pearson product-moment 
correlation between the percentage of inter- 
changes performed without shock during the 
last 4 days of the acquisition period and the 
total number of interchanges performed in the 
16-day extinction period was .84 (dj =8, 
p < 01). 


Discussion 


The results support the hypothesis that 
pairs of rats exposed to delayed onset of 
shock during acquisition would, when shock 
was eliminated, extinguish interchange behav- 
ior less rapidly than pairs exposed to immedi- 
ate onset of shock. These results would seem 
to be interpretable on the basis of the dif- 
ference in conditions prevailing for each pair 
at the onset of a trial in the acquisition and 
extinction periods. Rats exposed to shock at 
the onset of each trial in the escape condition 
were faced with a dramatic change to no 
shock when placed on the grid bars during 
the extinction period. In essence, the primary 
stimulus leading to ascent of the platform 
was removed, For rats in the avoidance condi- 
tion, the stimulus conditions at trial onset 
during the extinction period were not dif- 
ferent from those prevailing at the end of 
acquisition. 

Although the hypothesis that differences in 
time of shock onset during acquisition would 
lead to differential extinction behavior is sup- 
ported, this result may be in part dependent 
upon the emergence of certain patterns of 
interchanging during acquisition. For four of 
the five pairs in the escape group, escape from 
shock served as a primary determinant of 
interchanging positions throughout the course 
of acquisition trials, Most interchanges oc- 
curred with shock on the grid. In the typical 
case, one member of the pair was at the food 
dish and the other on the platform. The latter 
rat stepped off the platform producing de- 
livery of shock to both rats and movement 
of the rat at the food dish to the platform, 
Being off the platform together was, there- 
fore, not an unusual event for most pairs in 
the escape group, and rapidly became the 
dominant form of behavior when the primary 
impetus for ascending the platform (shock) 
was removed during the extinction period. 
Simultaneous presence on the grid during the 
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acquisition period rarely occurred for most 
pairs in the avoidance group, and it was rela- 
tively slow in emerging as the characteristic 
behavior pattern during the extinction period. 

A theoretical context may be given to the 
present study by turning to the formulation 
of Thibaut and Kelley (1959) who applied a 
cost-reward analysis to Daniel’s (1942, 1943) 
experiments, They suggested that “by de- 
pressing the platform any animal would cut 
his own costs (avoid shock) and at the same 
time enable the other animal to be rewarded 
(feed at food crock)” (p. 32). However, 
when the shock was eliminated in Daniel’s 
second experiment, going to the platform 
played no role in reducing costs, and the ani- 
mals merely remained in the vicinity of the 
food crock. Using Thibaut and Kelley’s con- 
ceptualization in the present context, their 
analysis is directly appropriate for the behay- 
ior of most pairs in the escape group. How- 
ever, the greater persistance of going to the 
platform during extinction for most pairs in 
the avoidance group may be viewed as based 
on the continuing fear response elicited by the 
experimental environment. From very early 
in the acquisition period, these animals rarely 
experienced shock during a trial and almost 
never at the onset of a trial. If avoidance 
behavior is viewed as under the aegis of fear, 
fear reduction (ascending the platform) 
during extinction continued to serve as a basis 
for cutting costs. 

Much human cooperative behavior would 
seem to be based primarily on mutual ex- 
change of positive incentives with little evi- 
dence of any role played by avoidance of 
aversive stimulation. However, many such 
situations may contain aversively conditioned 
stimuli serving along with provision of posi- 
tive incentives to maintain cooperation. Unless 
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the antecedents to instances of cooperative 
behavior are thoroughly explored, such rela- 
tionships may go unnoticed. Even broad 
ethical codes of society for cooperation and 
altruism would seem to contain less than 
subtle but implicit fear cues. Does not the 
Golden Rule really read: “Whatsoever ye 
would that man should do to you, do ye even 
so to them” (Matt. 7:12)—or else. 
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There has recently been an increasing inter- 
est in the determinants of altruistic behavior 
in children (Bryan & London, 1970), Much 
of this interest has stemmed from a social 
learning perspective (Bandura, 1969) involv- 
ing attempts to test the limits of this para- 
digm. Within this approach is the recent 
series of studies designed to assess the differ- 
ential effectiveness of words and deeds 
upon children’s subsequent sharing behavior 
(Bryan, 1971; Bryan & Walbeck, 1970a, 
1970b; Grusec, 1972; Grusec & Skubiski, 
1970). 

The prototype experiment used in these 
studies involved a bowling game situation 
within which a child was able to win valued 
tokens, later exchangeable for prizes. At the 
same time he was given an opportunity to 
S donate anonymously some of his winnings to 
j 


a charity. The degree to which the child did 
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The differential effectiveness of a model’s words and deeds 
found to be problematic, A 3X2X2 analysis of variance design examined 
these effects further and also that of moral judgment on the donating 
behavior of 140 children aged 7 to 11 years immediately and 8 weeks later, 
Modeling was highly effective both in the immediate and in the follow-up 
study and also proved generalizable across changes in the testing situation. 
Thus, it was concluded that “internalization” had occurred, Preaching also 
the long run although not in the short run. 
Some support for a moral judgment relationship with generosity was found. 


Generosity in Children: 
Immediate and Long-Term Effects of Modeling, 
and Moral Judgment 


Preaching, 
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has recently been 


donate constituted the dependent variable of 
generosity. Just prior to the child playing the 
game, however, he was exposed to a model 
playing the same game who either did or 
did not donate to the charity and did or did 
not preach that one should donate to the 
charity. 

The results of these studies showed that 
while a model’s behavior was a potent source 
of immediate influence, his preaching was 
much more problematic in this respect, 
Bryan and Walbek (1970a, 1970b) showed 
that while a model’s practices influenced the 
child’s behavior, the same model’s preachings 
did not. However, both the model’s preach- 
ings and his behavior affected the child’s 
judgments of the model's attractiveness, 
Grusec and Skubiski (1970) showed that 
while a model’s behavior was a clear source 
of behavior change for both sexes, regardless 
of whether the model was nurturant or not, 
the model’s verbalizations were only effec- 
tive for females who had been exposed to a 
nurturant model, Grusec (1972), however, 
did show that a model’s verbalizations could 
be as effective an influence on the child’s 
subsequent behavior as a model’s behavior, 
although not for 7-year-old boys. Clearly, 
there is a need to investigate further the 
combined and separate effects of these two 
methods of influence. This is what the present 
study was designed to do, 

The differential effectiveness of preaching 
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vis-a-vis modeling in the studies above may 
have been due to the ineffectiveness of the 
manipulation of the preaching variable. Three 
ways in which the preaching manipulation in 
the previous studies appeared weak were in 
relation to (a) their timing; (b) their direc- 
tion; and (c) the power of the preacher. 

As regards timing, while the modeling 
always occurred when the model was in a 
“win” situation, that is, collecting tokens for 
himself and being presented with an actual 
opportunity to donate, the preachings oc- 
curred either on “non-win” trials as in the 
Bryan and Walbek (1970a, 1970b) studies or 
before any trials at all as in the Grusec 
(1972) and Grusec and Skubiski (1970) 
studies. It is interesting that in the Bryan 
(1971) study in which an interaction involy- 
ing the preaching variable did reach signifi- 
cance, the timing of the preaching in that 
study occurred immediately after obtaining a 
winning score. The author, however, did not 
refer to the timing when he sought to explain 
his results and was unable to interpret the 
interaction. It is suggested that the timing of 
the preaching relative to that of modeling 
may be crucial. The current study was 
designed to test this. 

As regards direction, in the Bryan and 
Walbek (1970a, 1970b) studies, the exhorta- 
tions were apparently delivered more to an 
empty room than addressed to the child; and 
in the Grusec studies the manipulation re- 
quired the model to take “care not to look 
at the child” (Grusec, 1972, p. 142). In the 
current study, the preaching was directed at 
the child so resembling more the natural 
situation in which one seeks to exert 
influence. 

As regards power, Grusec (1971) has 
shown that a powerful model produces more 
imitation than a nonpowerful one, but none 
of the studies varied the powerfulness of the 
model when examining preaching. The present 
study used a more powerful model than was 
used in the previously mentioned studies. 

Hence, the current study was designed to 
test the differential effectiveness of modeling 
and preaching, using the same experimental 
procedure as Bryan and Walbek (1970a, 
1970b) with the preaching manipulation 
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strengthened in the three ways outlined: 
timing, direction of the preaching, and power 
of the model. 

A further issue of some importance is the 
question of the durability and generality of 
behavioral learning following exposure to a 
model’s actions and verbalizations. Apart 
from a 5-day retest of exposure to different 
types of modeling situations (White, 1972), 
there is virtually no evidence that exposure 
to a model produces anything more than con- 
formity to the demands of the immediate ex- 
perimental situation (Krebs, 1970). Hence, 
the current study employed a 2-month retest “ 
of the effectiveness of the independent vari- 
ables and also tested for generality across 
dissimilar situations. 

A still further issue to be investigated is 
the part played in altruistic behavior by cog- 
nitive developmental variables (Kohlberg, 
1969), Although social learning theorists ac- 
knowledge that observer attributes may 
partly determine the efficacy of modeling in- 
fluences (Bandura, 1969), rarely are such 
attributes varied independently. This is par- 
ticularly noticeable in the field of moral 
development where such cognitive processes 
as role taking and moral judgment have been 
postulated to account for the age difference 
in sharing behavior frequently reported 
(Bryan & London, 1970; Rosenhan, 1969). 
In a previous study (Emler & Rushton, 
1974), it was found that children’s responses 
to Piaget’s stories concerned with distributive 
justice were highly significantly related to 
their generosity scores. If such cognitive vari- 
ables are indeed determinants of the behav- 
ior rather than just covariates, they might 
be expected to interact with and affect the 
reception of the social learning variables. A 
third focus of the present study then was the 
effect of children’s level of moral judgment 
on generosity under a variety of experimental 
conditions. 


METHOD 
Subjects 


Subjects were 140 children aged 7 to 11 years 
from a predominantly working class area of London. 
There were 28 children of each age, divided equally 
between the sexes. 


GENEROSITY IN CHILDREN 


Design 


A 2X23 factorial design was employed for 120 
of the subjects with the remaining 20 being assigned 
to a control condition. The factors were the child’s 
high versus low moral judgment; a model’s generous 
versus selfish behavior; and a model’s generous 
versus selfish versus neutral, preaching. Within each 
cell, subjects were equated for age and sex to permit 
examination of these effects on generosity, too. Each 
subject participated in two experimental sessions 
separated by 8 weeks. Within each class, a subject 
was sent to an experimenter “in an order determined 
by the teacher and then randomly assigned to treat- 
ment condition. On retest, all subjects within condi- 
tions, again equated for sex and age, were divided 
into two equal groups and assigned to one of two 
conditions, one in which the retest condition was 
similar to the original testing situation and one in 
which it was dissimilar. 


Procedure 


Initial testing. Each subject was sent by teacher 
to the experimental room where an experimenter 
met him and established a rapport. Six stories 1 
adapted from those used by Piaget (1932) were told 
to the subject with a standard series of questions 
to elicit the subject’s concepts of moral justice. The 
order was randomized across subjects. Subjects’ 
responses were recorded on tape for subsequent 
analysis on the basis of Piaget’s (1932) criteria. 

The experimenter then showed the subject a 
number of attractively arranged prizes ranging from 
comics and small puzzles to games suitable for both 
boys and girls of the age range. The subject was 
asked if he would like to win one of these prizes. 
When the subject said yes, the experimenter showed 
him the electronic bowling game. This was similar 
to that described by Rosenhan and White (1967), 
being about 3 feet (.91m) long with an upright 
panel at the end of the runway to indicate scores 
on each trial. Four scores were possible—s, 10, 15, 
and 20—the last being counted as a winning score. 
The predetermined sequence included eight scores of 
20 in a game of 20 trials. A buzzer sounded at the 
termination of the game. 

There were 100 tokens, in piles of 2, placed on 
a table beside the game. Also on this table was a 
bowl beneath a “Save the Children Fund” poster 
depicting a poorly clothed child. A caption read 
“Please Give.” The bowl contained 10 tokens, 
Another bowl was provided to hold subject's win- 
nings. The experimenter introduced the subject to 


1Full details of the stories and coding procedures 
can be found in the doctoral dissertation of the 
author submitted to the University of London, 
October 1973 entitled Social learning and cognitive 
development; Alternative approaches to an under- 
standing of generosity in 7 to 11 year olds. Alterna- 
tively, a mimeographed sheet of the stories will be 
sent upon request to the author. 
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the same-sex model who was presented as a possible 
future teacher in the school (this was to make the 
model appear powerful) and was interested in playing 
the game, too. 

The experimenter explained to the subject and the 
model how to play the bowling game, how to reward 
themselves with tokens, and if they wished to, how 
to donate to “Bobby,” the child in the poster, The 
optional nature of giving was stressed. They were 
told that the more tokens they won, the better prize 
they could have. The experimenter suggested that the 
model should play the game first and instructed 
the subject to watch. The experimenter then left 
the room. The model then played the game and 
either donated to Bobby one of each of the two 
tokens he won on winning trials (generous model) 
or he kept both for himself (selfish model). At the 
same time he preached either generosity, selfishness, 
or provided a neutral conversation. The preaching 
was given on winning trials with the model looking 
at the subject and saying with emphasis: 

Generous preaching. “We should share our tokens 
with Bobby” (winning Trials 1 and 5); “It’s good 
to give to kids like him” (winning Trials 2 and 6); 
“It’s right to share counters with Bobby” (winning 
Trials 3 and 7); “You should give to kids like him” 
(winning Trials 4 and 8). 

Selfish preaching. “We should not share our 
counters with Bobby” (winning Trials 1 and 5); 
“It’s not good to give to kids like him” (winning 
Trials 2 and 6); “It’s not right to share tokens with 
Bobby” (winning Trials 3 and 7); “You should not 
give to kids like him” (winning Trials 4 and 8). 

Neutral preaching. “This is a nice game” (winning 
Trials 1 and 5); “I really like playing this game” 
(winning Trials 2 and 6); “This is really fun” 
(winning Trials 3 and 7); “I like this game” 
(winning Trials 4 and 8). 

When the buzzer sounded at the end of the game, 
the model reset it and told the subject to play the 
game now as it was his turn. The model then left 
the room, ostensibly to take his winnings out to the 
experimenter. When the buzzer sounded at the end 
of the subject's game, the experimenter reentered 
the room, counted up the tokens in the subject’s 
bowl, and told the subject prizes would be given out 
after everyone had a turn at the game. The experi- 
menter then asked the subject to indicate his view 
of the model as very nice, just okay, not so nice, 
or very bad and, further, how much he would like 
to be in the model's class if the model became 
teacher in that school, a lot, a little bit, don’t mind, 
or not so much. 

Two-month retest similar condition. The same 
male experimenter came to the class, called out the 
subject’s name, took him back to the same experi- 
mental room as before, asked him if he liked the 
prize he had been given, and told him that since 
there were some prizes left he could have another 
go at the same game. The experimenter reminded 
the subject of how to play the game and how to 
win tokens saying only in passing that if the subject 
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wished to give any of his tokens to charity he could 
still do so. 

Two-month retest dissimilar condition. A female 
experimenter the children had never seen before 
came to the class and called out the subject’s name 
and took him to a different room in a different 
part of the school and reminded him how to play 
the game etc. In addition, the poster of “Bobby” 
was changed from a yellow background depicting 
one poorly dressed little English boy to a red one 
depicting three obviously starving Asian children. 

Controls. There were 20 control subjects, two 
from each age of 7-11 divided evenly between the 
sexes and randomly assigned to a control condition. 
Subjects were administered the moral judgment 
stories, as in the experimental group, but were not 
exposed to the modeling situation. They were shown 
how to play the game by themselves, were told how 
to donate if they wished to, and then played im- 
mediately afterwards. For the retest conditions they 
were divided into two equal groups and divided into 
either the similar or dissimilar retest condition. 

Experimenter bias controls. The experimenter 
was kept blind as to the subject’s test conditions. 
In the immediate test this meant that it was the 
model who selected the treatment condition and the 
experimenter was blind as to which condition the 
subject was in while he administered the moral 
judgment stories and gave instructions. 

In the retest condition, the experimenter simply 
had the subjects’ names on a list with no knowledge 
as to which condition each subject had been run 
under 2 months previously or how many tokens he or 
she had donated, This was even more true for the 
experimenter in the dissimilar retest condition who 
had never met the children. 


RESULTS 


Four experimental subjects and two control 
subjects were dropped from the analyses due 
to their failure to turn up for the retesting. 
Of the six moral judgment stories given to 
subjects, only two showed the positive rela- 
tionship with age necessary to be considered 
as a cognitive-developmental variable, y*(4) 


TABLE 1 


MEAN NUMBER or TOKENS DONATED By AGE, 
SEX, AND TIME OF TESTING 


Age (in years) 


Testing 
7 8 9 10 | 11 

Immediate test 
Boy 38 | 39 | 4.2 | 40 | 5.4 
Girl 20 | 45 | 48 | 3.9 | 51 
Two-month retest | R 
Boy 2.6 | 2.8 | 3.6 | 44 | 7.3 
Girl 2.9 | 17 | 47 | 29 | 58 
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=6.54, p< .05; y%(4) = 14.67, p< 01. 
Scores for the two stories were combined 
and subjects divided into those scoring above 
and those below the median. The age rela- 
tionship to this combined moral judgment 
score was significant, x° (4) = 14.92, p < .01. 
There were no sex differences on these moral 
judgment stories, either separately or com- 
bined, x?(1) = .30, x? (1) = .30, x?(1) = .65. 

A preliminary analysis of variance was car- 
ried out to test the effects of age and sex on 
immediate generosity, retested generosity, and 
evaluations of model. The measure of gener- 
osity consisted of the number of tokens the 
subject donated to the charity out of the 16 
won, No significant age differences were found 
on the immediate generosity or upon the sub- 
ject’s evaluations of the model. There was, 
however, a highly significant age difference on 
donating behavior on retest, F(4, 106) = 7.8, 
$ < .001, such that older children gave more 
than younger children (see Table 1). 

As there were no significant sex differences 
nor Sex X Age interactions on immediate 
generosity, retested generosity, or evaluations 
of the model, all further analyses were col- 
lapsed over sex. 


Generosity: Immediate Test 


The effects of modeling, preaching, and 
moral judgment were then examined against 
the immediate generosity scores. Modeling 
was found to be a highly significant deter- 
miner of the subject’s immediate donations, 
F(1, 104) = 158.18, p < .001. Preaching had 
no effect, F(2, 104) < 1.00. The subject’s 
level of moral judgment was also significant, 
F(1,104) = 7.47, p < .007. The mean num- 
ber of tokens donated by subjects with low 
and high moral judgment scores were 3.3 and 
4.6, respectively. When age was introduced as 
a covariate, however, moral judgment had 
a somewhat weaker effect on donations, 
F(1, 103) = 4.04, p< .05. There was only 
one, near significant, interaction, Preaching 
X Modeling, F(2, 104) = 2.74, p < .07. Gen- 
erous models who preached selfishness pro- 
duced less donating than did generous models 
who had preached generosity, or who had 
made neutral comments. Further, selfish 
models who preached neutral comments pro- 


GENEROSITY IN CHILDREN 


TABLE 2 


MEAN NUMBER OF TOKENS DONATED IN IMMEDIATE 
TEST BY MODELING AND PREACHING 


Preaching 
Model 
Generous Neutral Selfish 
Generous 7.6 7.6 6.1 


Selfish 18 9 | 1.7 


Nole. Controls donated a mean number of 4,1 tokens. 


duced less giving than selfish models who 
preached either generosity or selfishness. 
These results are shown in Table 2. 

In order to test the possibility that model- 
ing or preaching might have different effects 
depending on the subject’s age, a further anal- 
ysis of variance was computed. Neither model- 
ing nor preaching, alone or in combination, 
produced significant interactions with age, 
F = 1.32, F = 1.18, F < 1.00, respectively. 


Evaluations of Model 


The two different measures of subject’s 
evaluations of the model (liking rated 1-4; 
and desire to be in the model’s class if the 
model became a teacher in that school rated 
1-4) were virtually identical. Results are 
therefore shown only for the second measure, 
unless otherwise stated. Both the model’s 
preaching and behavior were highly signifi- 
cant determiners of his judged attractive- 
ness, F(2, 104) = 5.62, p < .005; F(1, 104) 
= 21.19, p < .001, respectively. The Preach- 
ing X Behavior interaction was also signifi- 
cant such that the selfish model who preached 
neutral comments but did not donate was 
judged more attractive than the selfish model 
who had preached either generosity or selfish- 
ness, F(2, 104) = 4.49, p < .01. Table 3 pre- 
sents the ratings of the model. Scores are 
such that the higher the score, the greater 
the model’s attractiveness. 

Moral judgment had no main effect on the 
subject’s evaluation of the model, F(1, 104) 
< 1.0, but did interact with the preaching 
manipulation on the second measurement of 
the model’s attractiveness, F (2,104) = 3.32, 
p < .05, although not on the first measure 
(F < 1,00). This relationship held when age 
was covaried, F(2, 103) = 3.41, $< .05. On 
this second measure, subjects with a high 
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moral judgment score rejected the selfish 
preacher (X rating = 2.3), while those with a 
low moral judgment score did not (X rating 
= 3.0). 


Generosity: Two-Month Retest 


A comparison of the dissimilar and similar 
retesting situations was made by carrying out 
an analysis of variance using the retesting 
condition as an independent variable in con- 
junction with the modeling and preaching 
conditions. The type of retest condition 
yielded no significance, either as a main ef- 
fect, F (1,103) < 1.00, or in interaction with 
modeling, F (1, 103) < 1,00, preaching, F (2, 
103) < 1.00, or both F (2, 103) = 1.55. Thus 
further analyses were collapsed over retest 
conditions. Thus, the long-term effects of 
modeling, preaching, and moral judgment 
were able to be examined against the com- 
bined 2-month retest scores. 

Even after an 8-week gap, modeling was 
still a highly significant, F (1,104) = 16.44, 
p < .001, influence on subject’s donating be- 
havior. In addition, preaching which had no 
effect in the immediate test now had a highly 
significant overall effect, (2,104) = 4.77, 
p < .01. Models who preached selfishness pro- 
duced less giving than those who preached 
generosity or neutral messages. Further, there 
was a significant Preaching X Modeling inter- 
action effect, F (2,104) = 3.48, p < .05, such 
that the model preaching selfishness but be- 
having generously caused subjects to donate 
the least number of tokens in the generous 
model condition, while the model preaching 
generosity but behaving selfishly produced the 
most giving in the selfish model condition. 
Furthermore, the model preaching neutral 
messages in the generous model condition 
produced more giving than the generous model 


TABLE 3 
MEAN RATINGS or MODELS ATTRACTIVENESS BY 
MODELING AND PREACHING 
Preaching 
Model — — — 

Generous Neutral Selfish 
Generous 3.7 3.3 3.2 
Selfish 2,5 3.2 1.8 
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TABLE 4 


MEAN NUMBER or TOKENS DONATED IN RETEST 
BY MODELING AND PREACHING 


Preaching 
Model 
Generous Neutral | Selfish 
Generous 5.2 6.8 3.3 
Selfish 3.2 2.8 24 
tokens. 


Nole. Controls denoted a mean number of 4.2 


preaching generosity. These results are shown 
in Table 4. 

Differences in donating behavior for sub- 
jects with high and low moral judgment 
found in the first test did occur on retest, 
but only when age was not covaried from 
the analysis, F (1,104) = 8.55, p< .004. 
With age introduced as a covariate, the signifi- 
cant relationship between moral judgment and 
generosity disappeared, F (1, 103) = 2.42. 
Moral judgment did not exert any significant 
interaction effect. 

Control subjects donated a mean number of 
4.2 tokens on the retest, there being a corre- 
lation of r = .71 (df = 16, ġ < .001) between 
their behavior on Test 1 and that on Test 2. 
The consistency of behavior between the first 
test and retest 8 weeks later was also consid- 
erable for the experimental group, 7 (114) = 
51, p < .001. 

Discussion 


Modeling has been shown, in the context of 
this experiment at least, to be a powerful 
determiner of children’s sharing behavior. 
Further, this behavior evidences both dura- 
bility and generality. Thus, according to 
Kreb’s (1970) criteria, it is reasonable to 
conclude that new learning or internalization 
of this moral behavior has occurred. It would 
seem that a model does more than simply 
induce temporary situational conformity me- 
diated by some vaguely defined form of “de- 
mand characteristics” or “experimenter ef- 
fect.” If the usual demand characteristics 
explanation were viable, one would expect the 
model’s preachings to work as an additional 
cue to the child, especially when preaching 
and modeling contradicted one another. This, 
in the immediate test, was not the case, de- 
spite our attempts to strengthen substantially 
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the preaching manipulation. This finding thus 
replicates the results reported by Bryan anc 
Walbek (1970a, 1970b). 

The effectiveness of the preaching m: 
lation can hardly be doubted given the 
preaching was a powerful source of the ci: 
evaluations of the model and (b) preachi 
did have a major effect on the 2-month io! 
low-up study. 

The finding that while preaching did no! 
directly influence subject’s behavior it 4 
influence subject’s evaluations of the » 
again replicates Bryan and Walbek (1° 
In addition, however, this study demons 
a significant Preaching X Modeling in 
tion on evaluations (see Table 3) 

Bryan and Walbek (1970a, 1970b) faili to 
find. Selfish models who said “This is rely 
fun” were seen as more attractive than sv’ 
models who preached generosity or selfi S; 
This suggests that rather than the mode! pro- 
viding a “neutral” preaching, he pr 

a source of positive affect. Table 2 pr: 
some support for this suggestion. Imitat 

the model was highest, for both genero: 
selfish models, in the neutral condition. 

4 provides further support. The significant 
Modeling X Preaching interaction on rí 
partially due to the generous model prc 
neutral messages producing the most d 

in the generous model condition. Mode 
consisting of the model saying “I am haj 
has previously been shown to increase 
sequent imitative generosity (Bryan, ! 
particularly when immediately follow 
model’s behavior. Bryan (1971), ho 
failed to find that imitative measures w: 
lated to the subject’s judgments of the 
Bryan also, however (Bryan & Walbek. | 
1970b), failed to find the subject’s beh 
related to the model’s preaching. The d 
ancy between the findings reported here 
those reported by Bryan is probably due to 
the increased strength of the preaching ma- 
nipulation in the present study, which 
strengthened preaching in respect to (a) its 
timing; (b) its direction; and (c) the power 
of its deliverer. 

; The strength of the preaching manipula- 
tion emerged strongest in regard to the 2 
month retest. This “delay effect” is highly 
interesting and appears to be due to the 


and 
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preachings differentially affecting the decay 
rates of the behaviors originally induced by 
the modeling. Generally, when preaching was 
consistent with the modeling, less decay or 
regression to the mean occurred. On the other 
hand, when preaching was discrepant with 
the modeling, most regression to the mean 
occurred. The explanation for this finding is 
probably best sought at one or more of the 
component subprocesses in the information 
processing system governing modeling phe- 
nomena. These subprocesses include atten- 
tional mechanisms governing initial registra- 
tion of modeled events; storage of the mod- 
eled events; retrieval of the events from 
storage after 2 months; and the motivational 
processes that determine whether the remem- 
bered events are activated into overt per- 
formance (Bandura & Jeffrey, 1973). 

One possible explanation within this frame- 
work would be that in the initial test the per- 
ceptual dominance of the model’s actions 
caused the subject to rely predominately upon 
his visual memory when playing the game 
himself. This is what would be expected from 
Bruner et al.’s (1966) work on the develop- 
ment of cognitive processes in children. Chil- 
dren of the age range used in this experiment 
would be expected to be more attentive to 
perceptual attributes of situations rather than 
to symbolic attributes. However, both on the 
2-month retest and on the experimenter’s 
questioning the subject of his evaluation of 
the model, the subject is of necessity required 
to rely more upon symbolic process and se- 
mantic memory either to recall the initial 
situation to himself or to process the meaning 
of the questions. Thus the model’s verbaliza- 
tions are perforce made more equally salient 
to the model’s actions. 

An alternative explanation in terms of the 
information processing system would be that 
inconsistencies in stimulus input would cause 
differential memory loss over a long period of 
time although not over a very short period 
due to the fact that memory loss in the long- 
term store appears to be primarily due to 
semantic confusion (Baddeley & Dale, 1966). 
Thus the child has greater difficulty retrieving 
the relevant information in the discrepant 
condition due to the competing memory 
traces that were stored canceling each other. 
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In this study, independent measures of the 
state of the information being processed at 
different points were not taken. Ideally, future 
research will attempt to elucidate the ele- 
ments in the information processing system 
responsible for the observed regularities in the 
overt behavior. 

That the delay effect was no chance occur- 
rence is demonstrated by (a) the power of its 
statistical association, both as a main effect, 
and in interaction; (b) the rational pattern- 
ing of that interaction; and (c) the finding 
that exactly the same patterns of results oc- 
curred across retest situations. Words and 
deeds appeared to have independent effects on 
the children’s behavior. 

It would appear that while the social learn- 
ing paradigm is relatively explicit and the 
conditions under which models exert maximal 
effects are fairly well understood, the relation 
of verbal communications to behavior re- 
quires much more extensive investigation. 

On the basis of earlier work (Emler & 
Rushton, 1974; Rubin & Schneider, 1973) it 
was expected that predictions of a child’s 
generosity could be made with better than 
chance results from a knowledge of the child’s 
level of moral judgment. This expectation 
was confirmed in the present study where an 
association was found between moral judg- 
ment and generosity both on the immediate 
test and on the delayed test. However when 
age was covaried from the analysis the 
strength of the relationship was weakened. 
Indeed on the 8-week retest, covarying age 
reduced the relationship to below an accepta- 
ble level of statistical significance, In some 
ways this is surprising. That moral judgment 
might show only a weak effect in the im- 
mediate test when the manipulation of mod- 
eling was having its maximum effect is un- 
derstandable if it can be argued that the 
modeling effect was blanketing all other ef- 
fects (an idea that is supported by the lack 
of normally expected age relationships with 
the immediate test of generosity). It would 
then have been expected that in the retest 
condition, when any enduring individual char- 
acteristics would reemerge (as with the age 
relationship to generosity), moral judgment 
would exert a stronger effect, However this 
was not found. 
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On the question of causality, the results 
might suggest that moral judgment did not 
have causal influences on the generosity scores 
of the children. This argument would rest on 
the assumption that if moral judgment had 
been a determiner of generosity, rather than 
just a covariate, it would be expected to inter- 
act with, and affect the reception of, other 
determiners of altruistic behavior. The behav- 
ior of a model, whether behaving generously 
or selfishly, might be expected to have had a 
differential effect depending on whether it 
was observed by a child with a high or a low 
level of moral judgment. Such interactions did 
not occur. It should be noted, however, that 
there was a significant Preaching X Moral 
Judgment interaction on the child’s evaluation 
of the model’s attractiveness. Children with a 
high moral judgment score rejected the selfish 
preacher while those with a low moral judg- 
ment score did not. Thus moral judgment was 
exerting an effect. Finally, this study demon- 
strates the value of examining social learning 
and cognitive developmental variables over 
time and across situations. 
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On Being Imitated: Its Effects on Attraction and 
Reciprocal Imitation 


Mark H. Thelen, Stephen J. Dollinger, and Michael C. Roberts 
University of Missouri—Columbia 


An attempt was made to assess the effect of being imitated on children’s 
attraction toward and reciprocal imitation of an imitating person as compared 
with a nonimitating person. In order to be able to generalize from the data, 
three different imitating confederates were used. It was found that being imi- 
tated increased the attraction of children toward the imitating confederate. 
Being imitated also produced greater subsequent imitation of the imitating 
confederate than the nonimitating confederate. Furthermore, the results on 


attraction and reciprocal imitation were 


toward either confederate. 


Both natural observation and the prolific 
research on imitation during the last decade 
attest to the significance and frequency of 
imitation in naturalistic settings. However, it 
is clear from a review of the large body of 
research on imitation that we have frag- 
mented the imitation process. We have 
stopped the sequence with the assessment of 
imitation in a setting where the model has 
been a confederate who was aware of the 
purpose of the situation. However, in natural- 
istic situations, the model can be as naive as 
the observer. Just as the model’s behavior 
may affect the observer (as reflected in sub- 
sequent imitation), so too the observer’s imi- 
tation of the model may subsequently affect 
that same model. Research on the possible 
influence of being imitated on the model has 
been grossly neglected, even though, with an 
attentive model, this possible influence occurs 
with nearly the same frequency as does imita- 
tion. The purpose of the present research was 
to explore two possible effects which may 
ensue from being imitated, namely, the mod- 
el’s liking of the observer and the model’s 
subsequent imitation of the observer. 

One of the prevailing considerations under- 
lying this research is that being imitated is 
rewarding * and that a person who affords 
positive reinforcement is seen as attractive. 
Byrne (1971) has argued that attraction 
toward a person is largely a function of the 
reward received from that person. 

Most of the relevant research has been his- 
torically tangential to the body of literature 


not a function of differential looking 


on imitation. It has been repeatedly demon- 
strated that attitude similarity leads to in- 
creased attraction (Byrne, 1969). Byrne and 
Clore (1967) have suggested that attitude 
similarity increases attraction because it re- 
duces “effectance arousal.” Byrne and Clore 
have argued that people are uncomfortable in 
many situations because they are uncertain 
about the appropriateness of their attitudes 
and behavior. Therefore, in unfamiliar situ- 
ations, consensual validation reduces effec- 
tance arousal and is thereby rewarding. In 
addition, people are attracted to or like the 
person who is associated with a familiar 
situation. With this in mind, we attempted to 
increase effectance arousal in the current 
study. 

The authors are grateful to Linda Cohen, Sue 
Coverdell, Greg Jones, Andrew Lester, Marc Mytar, 
John Sweet, Connie Weiberg, and Jim Worstell for 
their capable assistance during various phases of 
this work. The authors wish to thank John T. 
Lanzetta for his useful comments concerning possible 
implications of this research. The authors also wish 
to thank the teachers at Our Lady of Lourdes 
School and the University of Missouri Laboratory 
School for their gracious cooperation in providing 
children for this research. 

Requests for reprints should be sent to Mark H. 
Thelen, Department of Psychology, McAlester Hall, 
University of Missouri, Columbia, Missouri 65201. 

1 Fouts (1972) and Haugan and McIntire (1972) 
attempted to demonstrate that being imitated was 
reinforcing by showing an increment in the imitated 
response. However, both studies were not controlled 
for the possible (or probable) imitation of the con- 
federate which might have occurred even if the 
confederate had not previously imitated the subject. 
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If being imitated leads to increased liking, 
it is also expected that being imitated leads 
to reciprocal imitation. There is some relevant 
research consistent with this prediction. Chal- 
mers, Horne, and Rosenbaum (1963) demon- 
strated that prior agreement with the subject 
by a confederate on a two-part guessing task 
resulted in increased imitation of the confed- 
erate on the same task during the first block 
of trials and decreased imitation on later 
blocks of trials. However, all conditions were 
confounded by reward to the model and to 
the subject. 

Lesser and Abelson (1959) reported that 
children, whose subject preferences in a two- 
choice preference situation were endorsed by 
the experimenter, subsequently endorsed the 
preferences of the experimenter more than did 
a group of subjects whose earlier preferences 
were not chosen by the experimenter. A num- 
ber of researchers have reported increased 
imitation as a function of interest and atti- 
tude similarity (Baron, 1970; Rosekrans, 
1967; Smith, Smythe, & Lien, 1972). One 
might infer that behavioral similarity (imita- 
tion) would affect imitation in the same way 

as attitude or interest similarity. 

Some research on conformity is relevant to 
the focus of this study. It has been demon- 
strated that when a subject’s choice in a modi- 
fied Crutchfield situation was affirmed by 
four stooges, the subject was more likely to 
endorse the erroneous judgments of the four 
stooges in a subsequent test of social influ- 
ence (Hollander, Julian, & Haaland, 1965; 
Julian, Regula, & Hollander, 1968). 

Research on attitude similarity and con- 
formity is consistent with the prediction that 
being imitated increases attraction and recip- 
rocal imitation. In exploring these questions 
within an imitation paradigm, we used a 
behavioral emphasis to assess the influence of 
being imitated on attraction and reciprocal 
imitation. Consequently, the confederates who 
imitated the subjects were in no way de- 
scribed by the experimenter, and they were 
completely silent throughout the experiment. 

In the interest of design and especially to 
control for the strong disposition to imitate, 
two confederates were juxtaposed. One con- 
federate imitated the subject, while the other 
confederate did not imitate the subject. In 
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order to allow for some generalizability of the 
data, three confederates were used, and each 
served as both the imitating confederate and 
the nonimitating confederate with each of the 
other two confederates. This resulted in six 
combinations of imitating and nonimitating 
pairs of confederates. 

In order to assess possible differences in 
orientation toward the imitating confederate 
and the nonimitating confederate, the looking 
behavior of the subject toward the imitating 
confederate and the nonimitating confederate 
was scored for both frequency and duration. 


METHOD 
Subjects 


Children attending the first grades of a university 
laboratory school (13 males, 8 females) and a local 
parochial school (11 males, 16 females) served as 
subjects. The subject pool was balanced across con- 
federate pairs with respect to sex and (as far as 
possible) school attended. The experimenter and 
confederates were male graduate students; the scorers 
were two female undergraduates. 


Tasks and Materials 


The imitating and nonimitating roles of the con- 
federates were established via two tasks in which 
the subject selected one of four response options 
followed in turn by the confederates. Task 1 con- 
sisted of eight trials in which a 3-inch (7.62 cm) 
square piece of paper was presented to the subject 
accompanied initially by the instructions, “Here are 
four different color crayons; I'd like you to pick 
one and draw a circle.” On the fifth and remaining 
trials, the experimenter instructed the subject to 
draw a square. Following the subject’s response, 
each confederate was given an opportunity to select 
one of the same four colors. On each trial, the imi- 
tating confederate selected the same option as the 
subject, while the nonimitating confederate selected 
a different one.” 

Materials for Task 2 consisted of eight pictures of 
emotionally expressive human faces (two each of 
male adults, female adults, male children, and female 
children). Each picture was mounted on an 8} X11 
inch piece of paper. Below each picture were four 
nonsense syllables. As the experimenter presented the 
first picture, he stated that “these are make-believe 
names, and I'd like you to use your pencil to point 


2The nonimitating confederate’s response choice 
on both tasks was dependent upon the response of 
the subject: On Trial 1, the nonimitating confederate 
selected the third option on the right of the sub- 
ject’s; on Trial 2 he selected the second to the right, 
and on Trial 3 he selected the first to the right. 
This “3-2-1” procedure was then repeated on subse- 
quent trials. 
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to the name you would like to go with this face.” 
As in Task 1, the subject responded first, followed 
by the confederates, in the same way as described 
above for the crayon task. The imitation and non- 
imitation by the confederates were enhanced as the 
experimenter asked each to leave his pencil in place 
until the other confederate had responded. 

On the odd-numbered trials of both tasks, the 
confederate on the subject’s left responded after the 
subject, followed by the confederate on the right. On 
even-numbered trials, the confederate on the sub- 
ject’s right was the first to respond after the subject. 
The position of the imitating confederate and the 
nonimitating confederate was counterbalanced for 
each of the six confederate pairs, that is, a given 
imitating confederate sat on the subject’s right for 
four subjects (two of each sex) and on his left for 
four subjects. 


Procedure 


Subjects were individually escorted by the experi- 
menter into the experimental room, where they were 
asked to stand at a table between the seated confed- 
erates. The scorers were located at a second table 
facing the confederates, where they recorded the 
frequency and duration of the subject’s looking 
behavior directed toward each confederate. The two 
imitation tasks were immediately administered. 

After completion of the two imitation tasks, an 
“inquiry” consisting of five forced-choice questions 
was conducted to determine which confederate the 
subject preferred.’ In order to conduct the inquiry, 
the experimenter escorted the subject to a seat 6 
feet (1.8 m) behind the confederates, where he asked 
the subject to indicate his confederate preference for 
various activities by pointing. Following the “in- 
quiry,” the experimenter told the confederates to be 
seated at opposite ends of a table on which were 
placed two identical sets of children’s building 
blocks. After positioning the subject away from the 
table and an equal distance from both confederates, 
the experimenter stated, “While I am busy, you can 
go over and play with whichever person you choose.” 
Within 5 sec after the subject had chosen, or if the 
subject failed to choose within a 60-sec limit, the 
experimenter asked the subject and the confederates 
to return to their positions at the table where the 
test for reciprocal imitation was conducted. T 

The presentation of eight additional pictures simi- 
Jar to those employed in Task 2 comprised the test 
for reciprocal imitation. The confederate to the left 
of the subject responded first. Henceforth, the con- 


8The inquiry items were as follows: (a) Which 
person would you like to play catch with, him or 
him? (The experimenter points to the left then to 
the right confederate.) (b) Which person would you 
like to go to the movies with, him or him? (The 
experimenter points to the right then to the left con- 
federate.) (c) Which person would you like as a 
teacher? (d) Which person would you like to go on 
a picnic with? (e) Which person would you like to 
have take you back to the room? 
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federates alternated in making the first response, 
but both confederates responded before the subject. 
The first confederate to respond on each trial se- 
lected a predetermined random response, while the 
second confederate used the “3-2-1” procedure (see 
Footnote 2). Upon completion, the subject returned 
to his room accompanied by the confederate of his 
choice. 

In the interest of increasing effectance arousal, the 
confederates never established eye contact with or 
verbally addressed the subject at any time during 
the experimental session. In addition, the scorers were 
silent throughout the session, 


RESULTS 


Since the hypotheses were directional, the 
tests of significance were one-tailed unless 
otherwise specified. In addition, the Z sta- 
tistic, based on the binomial expansion and 
corrected for continuity, was used unless 
stated otherwise. 

The inquiry, block play, and imitation 
data were reviewed for possible sex differ- 
ences. It was apparent that there were no sex 
differences, so the data were pooled across the 
sex groups for the statistical analyses. 

The inquiry data were analyzed by identi- 
fying those subjects who showed a preference 
for the imitating confederate on three or more 
of the inquiry items (see the first column of 
Table 1) and those subjects who showed a 
preference for the nonimitating confederate 
on three or more of the inquiry items (see 
the second column of Table 1). The inquiry 
data are presented separately in Table 1 for 
each of the three imitating confederates. As 
explained in the Procedure section, each of 
the imitating confederates was paired with the 
other two confederates. The data in Table 1 
are pooled for both nonimitating confederates 
as they are paired with a given imitating con- 
federate. As can be seen in Table 1, Confed- 


TABLE 1 


NUMBER OF SUBJECTS SHOWING A PREFERENCE FOR 
THE IMITATING VERSUS THE NONIMITATING 
CONFEDERATES, BASED ON THE INQUIRY 


Imitating Stati Fort 
K . | Imitating | Nonimitating P 
confederate | confederate confederate £ $ 
A 13 3 OL 
B 9 7 ns 
C 12 4 .04 
Total 34 14 -003 
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erates A and C, as imitating confederates, 
were significantly preferred over the nonimi- 
tating confederates (p= .01 and .04, respec- 
tively). The totals in Table 1 show that 34 
of the subjects preferred the imitating con- 
federate in comparison with 14 subjects who 
preferred the nonimitating confederate. This 
difference was highly significant (p = .003). 

In the block-play situation, 27 of the sub- 
jects approached and/or played with the imi- 
tating confederate and 14 of the subjects 
approached and/or played with the nonimi- 
tating confederate. This provides a Z of 1.88, 
which is statistically significant at the .03 
level. Thus, the subjects not only showed a 
preference on the inquiry for the imitating 
confederate, but they also indicated a pref- 
erence to actually play with the imitating 
confederate. 

On the test for reciprocal imitation, the 
subject could imitate the imitating confed- 
erate, the nonimitating confederate, or he 
could choose one of the two names which 
neither confederate had chosen. Table 2 de- 
picts the number of subjects who showed more 
imitation of the imitating confederate (column 
2) and the number of subjects who showed 
more imitation of the nonimitating confed- 
erate (column 3). Subjects who showed no 
imitation of either confederate or equal imi- 
tation of the two confederates were not in- 
cluded in this analysis, because we were 
interested in the direction of imitation when 
there is a difference in the imitation of two 
confederates. As can be seen in Table 2, when 
Confederates A, B, or C served as the imitat- 
ing confederate, many subjects showed recip- 
rocal imitation (more imitation of the imi- 
tating confederate). The analysis pooled 
across confederates illustrates the difference 


TABLE 2 


confederate | Nonimitating | z 

(person) i hs 
A 1 | 2.84 | .002 
B 3 1.45 | .07 
c 2 | 1.81 | .035 

Total 6 | 3.48 | 0003 
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TABLE 3 


RELATIONSHIP BETWEEN INQUIRY CHOICE 
AND IMITATION 


Inquiry choice 


Imitation Romer 
Imitating | Nonimitating 
confederate | confederate 
Imitating confederate 22 5 
Equal 8 7 
Nonimitating confederate 4 2 


Note. Entries are numbers of subjects. 


even more clearly. Twenty-seven subjects 
showed reciprocal imitation in comparison 
with only 6 subjects who showed more imi- 
tation of the nonimitating confederate (Z = 
3.48; p < .0003). 

Table 3 depicts the relationship between 
inquiry preferences and reciprocal imitation. 
Subjects who showed a preference for the 
imitating confederate were also very likely to 
show reciprocal imitation of this confederate 
as opposed to the nonimitating confederate 
(Z = 3.33, p< .0005). In contrast, those 
subjects who preferred the nonimitating con- 
federate did not show greater imitation of this 
confederate (Z = .76, p> .10). Seven sub- 
jects preferred the nonimitating confederate 
and showed “equal” imitation of the two 
confederates. Six of these subjects failed to 
imitate either confederate. It is difficult to 
speak with confidence about the subjects who 
showed an inquiry preference for the non- 
imitating confederate because of the small 
number of subjects involved. 

The looking measures were obtained during 
the two imitation tasks and the test for recip- 
rocal imitation. For each task, each scorer 
recorded the total duration of looking toward 
one confederate and the frequency of looks 
toward both confederates. Consequently, 
scorer reliability was determined for the fre- 
quency measure only. Reliability was deter- 
mined by means of the ratio of agreements 
to agreements plus disagreements, where 
agreement between scorers (for each subject 
on each task) was defined as a difference of 
not greater than one. Using this measure of 
reliability, the scorers obtained 91% agree- 
ment. To determine whether differential look- 
ing occurred in the direction of either confed- 
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erate, ¢ tests for related measures were ap- 
plied to the measures of total number of looks 
(summed across tasks) and total duration of 
looking (summed across tasks). There were 
no significant differences in frequency (¢ = 
84, p > .10) or duration (t = .68, p > .10) 
of looking toward either confederate. 

In order to explore possible relationships 
between personality variables and influence 
from being imitated, teacher ratings were ob- 
tained on the subjects. Each child’s teacher 
rated him for leadership, timidity, internal 
versus external locus of control, looking to 
others or looking to self in solving problems, 
curiosity, and attention seeking. Since the 
inquiry data did not relate to these items, 
these analyses are not reported here. The 
imitation data were analyzed by subtracting 
the frequency of each subject’s imitation of 
the nonimitating confederate from the fre- 
quency of that same subject’s imitation of 
the imitating confederate. This provided a 
determination of the disposition to show re- 
ciprocal imitation apart from the disposition 
to simply imitate. All tests of significance 
were two-tailed. Subjects rated as tending to 
attribute their school success and failure to 
the ease or difficulty of the task (and not to 
their own effort or ability) showed signifi- 
cantly more reciprocal imitation than sub- 
jects rated as accepting responsibility for 
their successes and failures (p < .025). Sub- 
jects who looked to others for solutions to 
their problems showed more reciprocal imi- 
tation than subjects who generally attempt to 
solve their own problems (p < .05). The re- 
maining items did not relate to reciprocal 
imitation. 


Discussion 


In this study it has been demonstrated 
that being imitated has effects on the model, 
It was found that being imitated increased 
the subject’s attraction toward the person who 
imitated him and increased the subject’s sub- 
sequent imitation (reciprocal imitation) of 
the person who imitated him. The use of three 
confederates allows us to speak with some 
confidence about the data. Furthermore, at- 
traction to the model and reciprocal imita- 
tion were not a function of differential look- 
ing at either confederate. A few comments 
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about the attraction and reciprocal imitation 
variables are in order. 

If people are attracted to those who reward 
them (Byrne, 1971), increased attraction as 
a function of being imitated may be inter- 
preted to suggest that being imitated is re- 
warding. With somewhat mixed results, at- 
traction has been studied as an antecedent of 
imitation (e.g., Sampson & Inkso, 1964; 
Willis, 1963). We may now suggest that be- 
ing imitated may be seen as an antecedent of 
attraction. 

There are a number of implications of the 
imitation—attraction finding. We know from 
naturalistic observation that parents often 
imitate their young children, Such imitation 
may be an important facilitator of the devel- 
oping attraction of the child to his parents. 
At a more inferential level, the child who 
imitates his parents may be more attractive 
to his parents than his sibling, who imitates 
very little. Similarly, the child who imitates 
his teacher may be more liked by the teacher 
than his nonimitating classmate. 

It may be suggested from the data on re- 
ciprocal imitation that “imitation” stands 
along side of instructions and reward as an 
effective means to facilitate imitative be- 
havior. One may infer from the data that 
caretakers, such as parents and teachers, may 
encourage or facilitate a child’s imitation of 
them if they will first imitate the child. 

There is another possible implication of the 
affectional and reciprocal imitation effects of 
being imitated. Normative behavior may be 
sustained and perpetuated partly through the 
process of reciprocal imitation, For example, 
if a person demonstrates a given normative 
behavior and another person imitates him, the 
first person may be even more likely to dem- 
onstrate and value the normative behavior. 
This may be particularly likely in unstruc- 
tured or unfamiliar situations or when the 
persons involved are strangers. 

If imitation leads to reciprocal imitation, 
what is to keep an endless reciprocal imita- 
tion chain from occurring? There may be a 
number of possible constraints against such a 
development. First, many structured situa- 
tions, for example, the classroom, do not allow 
for an unlimited number of repetitions of a 
given behavior. Second, the person’s motives 
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for the initial imitation may be important. 
For example, if A imitated B in order to elicit 
an imitative or an affectional response from 
B, A might not imitate B once that objective 
is attained. Third, excessive imitation might 
lead to a negative or hostile reaction. In other 
words, overreliance on the behavior of an- 
other, instead of eliciting reciprocal imitation 
and affection, may result in rejection and 
counterimitation. 

There are a number of directions that need 
to be pursued in future research on this rela- 
tively virgin topic. What is the influence of 
being imitated when the child is acquainted 
with the task situation and/or the person who 
imitates him? Similarly, will a decrease in 
“effectance arousal” influence the effects of 
being imitated? What characteristics of the 
confederate increase the probability of recip- 
rocal imitation and attraction upon being imi- 
tated? What is the influence of being imi- 
tated on self-concept? Does being imitated 
increase the probability of the modeled be- 
havior? How does being imitated compare 
with other reinforcers? What is the effect of 
nonimitation subsequent to imitation? Does 
reciprocal imitation generalize across tasks, 
and is it sustained with time? Is the presence 
of a nonimitator a necessary condition to 
obtain the reciprocal imitation effect; that is, 
would the findings obtain in a between-groups 
design? Finally, in a somewhat different di- 
rection, what is the influence of not being 
imitated in a context where others are imi- 
tated? 
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An Attributional Approach to Moods and the Menstrual Cycle 
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Acceptance of an attribution pattern linking negative moods (depression, 
irritability) to the approach of menstruation and the likelihood of internal and 


external attributions were examined in 


a questionnaire study in which cycle 


phase (pre- versus postmenstrual), mood (positive versus negative), and 
environment (pleasant versus unpleasant) were varied, Subjects’ ratings indi- 
cated that (a) biology was judged important for explaining negative moods 
occurring premenstrually; (b) inconsistency between mood and environment 


produced more internal 


(personality) attributions, while consistency enhanced 


external attributions; and (c) emotionally expressive behavior was thought to 


reflect underlying personality dispositions despite 


extenuating situational fac- 


tors (assumed personal causation). The theoretical relevance of the findings to 
a new conceptualization of premenstrual emotionality and to an attributional 
chain relating female self-concept and premenstrual tension is discussed. 


Despite the well-established assumption 
that hormonal fluctuations underlie cyclic 
mood changes in women, conclusive evidence 
of the specific physiological bases of the 
phenomenon is lacking. While many promis- 
ing biological leads exist, recent evidence also 
supports the role of perceptual and social 
conditioning factors in menstrual cycle mood 
swings. Paige (1971), for example, found 
premenstrual and menstrual anxiety were tied 
more closely to the number of days of men- 
strual flow and the observance of the sex 
taboo than to hormonal variations. Presum- 
ably socially learned connotations mediated 
the anxiety. 

Subjects’ self-reports (Koeske, Note 1) 
suggest that a fairly coherent set of beliefs 
about menstruation and associated mood 
changes exists. These beliefs form an attribu- 
tion pattern linking negative mood swings 
(depression, irritability) to the approach of 
menstruation. This attribution pattern may 
form part of an attributional chain involving 
physical and emotional symptoms and self- 
attributions. A, conceptualization of the 
sources of cyclic mood change which builds 
* on this chain of attributions could prove more 

fruitful than an approach based on either 
biology or social factors alone. 

The present study was designed to isolate 
the attribution pattern linking negative moods 
to the approach of menstruation and assess its 
influence on attributions about mood and 


personality. Stories in which mood, situation, 
and phase of the menstrual cycle were varied 
independently provided a test of two hypothe- 
ses. Hypothesis 1 predicted that biology 
would be important in explaining negative 
(but not positive) moods occurring premen- 
strually but unimportant for postmenstrual 
moods. Following attribution theory (Jones 
& Davis, 1965; Kelley, 1967), it was also 
predicted (Hypothesis 2) that Stronger in- 
ternal attributions (to the target person’s 
personality) would result when mood was in- 
appropriate to the situation. 


METHOD 
Subjects and Procedure 


Forty-nine male and 53 female introductory social 
psychology students at the University of Pittsburgh 
served as subjects. The experiment, administered on 
the first day of classes by the second author, was 
presented as a questionnaire study of “how people 
use information to make decisions,” Student reac- 
tions to an “excerpt from a normalt college stu- 
dent’s interview with a (clinical) counselor” were 
requested. Two levels of each of the” three inde- 
pendent variables, menstrual cycle phase, CP, (pre- 


This research was supported by National Insti- 
tute of Mental Health Grant MH06718. 

Requests for reprints should be sent to Randi K. 
Koeske, Department of Psychology, Carnegie-Mellon 
University, Pittsburgh, Pennsylvania 15213. 

1 Instructions stressed that the person was not in 
need of psychological help but that her history had 
been presented in a clinical format merely as a 
means of getting at decision-making styles, 
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versus postmenstrual), environment, E, (pleasant 
versus unpleasant), and mood, M, (positive versus 
negative) were included in the factorial design. 

Information about a hypothetical female student 
(Miss A) was contained in five categories. All sub- 
jects received identical information about Miss A’s 
background (sex, age, student status), current re- 
sponsibilities (full-time student, part-time job), and 
personality (which they were to infer from the con- 
tent of the interview). Menstrual cycle phase in- 
formation (“menstrual period expected in 2 days” 
versus “3 weeks”) was provided as medical informa- 
tion, and a summary of the events mentioned by 
Miss A in the interview was included under current 
environment. Depending on condition (pleasant 
versus unpleasant), one of two lists of events ap- 
peared: (a) (positive) results of examination re- 
ceived, news of father’s recovery, news of brother’s 
absence, apologetic boyfriend; or (b) (upcoming) 
course examination, news of father’s illness, news of 
brother’s return, exasperating boyfriend. The infor- 
mation concerning the brother was always incon- 
sistent with the tone of the other information. 

Subjects also received an interview excerpt con- 
taining Miss A’s account of a typical day. For ex- 
ample, the negative mood-pleasant environment 
condition was created as follows: 


Well, today was just an average day, I guess. 
I felt tense although I wasn’t rushed for once and 
kind of tired even though last night was my night 
off from work. My spirits weren't even lifted 
after I got the results of my examination. I did 
pretty well even though I missed some items I 
really should have known. 

I should have been relieved about my father 
too. I got a letter from my mother today and she 
said my father was doing well after a recent hos- 
pitalization. I had been kind of worried about him. 
But even that news didn’t raise my spirits. My 
mother also said my older brother will not be 
coming home for the holidays. I’m really sorry 
that I won't be able to see him. It’s been quite a 
while since we saw each other. 

I guess I was pretty tired and had a lot on my 
mind by the time I saw my boyfriend at lunch. 
He was apologetic about our last quarrel and I 
really should have felt close to him. I suppose it 
was silly but instead I just blew up right out in 
public. By the time the lunch was over, I was on 
the verge of tears. 


The excerpts were written so as to minimize the 
use of mood or personality terms and were parallel 
except that in half of the stories Miss A’s mood was 
positive and in half it was negative. Thus her mood 
was either consistent with the overall tone of the 
day’s events or it was not. All stories concluded 
with the following paragraph: 


I spent the evening quietly. I didn’t feel like 
being energetic or going out or anything. So I 
just sat around and listened to records. I did a 
lot of thinking—I was in that kind of a mood— 
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and I thought about all that had happened that 
day. 


Dependent Measures 


Three sets of dependent measures were collected 
on the pages following the interview excerpt. 
Assessment of mood. Two separate measures of 
mood were included. Subjects first rated Miss A’s 
mood by writing a number from 1 (not at all) to 
7 (very) on the line next to each of 13 adjectives. 
The list included such adjectives as happy, angry, 
and unpredictable. After rereading the final para- 
graph, subjects were asked to decide what Miss A 
had meant when she described herself as being in 
“that kind of a mood” by rating the likelihood (1 = 
not at all likely; 7= very likely) with which each _ 
of eight moods characterized her feelings. Sample ~ 
moods were quiet, depressed, and peaceful. 

Assessment of attribution patterns. Patterns of 
causal attribution elicited by the interview were 
evaluated in two measures. Subjects were first re- 
quired to divide 100% among the five categories of 
information (background, current responsibilities, 
medical information, current environment, and per- 
sonality) according to their contribution to Miss A’s 
mood. A more detailed index was provided by rat- 
ings (from 1 to 7) of the importance of each of 
eight items in determining her mood. The list in- 
cluded sex (female), menstrual cycle phase, the four \s~ 
separate elements in the story which constituted i 
current environment, part-time job, and personality. 

Characterizations of personality. Subjects were 
asked to rate Miss A’s personality by writing a 
number from 1 (not at all) to 7 (very) on the line 
next to each of 17 personality characterizations. The 
list included descriptions such as argumentative, 
sensitive, and warm. Following completion of the 
questionnaire booklets, subjects were debriefed. 


RESULTS 
Effectiveness of the Mood Manipulation 


Separate 2 X 2 X 2 multivariate analyses’ 
of variance ° were performed on the two mea- 
sures of mood. These analyses were under- 
taken primarily to provide confirmation of the 
mood (M) manipulation but they secondarily 
evaluated subjects’ use of the cues of cycle 
phase (CP) and environment (E) for mood 
perception. The latter findings are reported 
below. 

Analyses revealed that mood was success- 
fully manipulated and resulted in significant’ 
main effects of M for both measures, F (13, 
81) = 19.266, p < .001; F (8,86) = 25.412, 


a 


2 Analyses were based on Dean J. Clyde’s multi- 
variate analysis of variance program which uses 
Wilks’s Lambda Criterion for multivariate tests of 
significance. 


Note. Higher mean scores indicate higher ratings on the mood 
listed. 
*p <05. 


ATTRIBUTIONAL APPROACH TO Moops AND THE MENSTRUAL CYCLE 


TABLE 1 


Means, STANDARD DEVIATIONS ND UNIVARIATE F 
VALUES FOR THE Two SET: Moop Scares: 
Errect or Moop MANIPULATION 


Positive mood | Negative mood 
Mood scales 
M SD M SD 
Happy 2.255 150.128*** 
Angry S| 52.129%%* 
Depressed | 103.965*** 
Upset | 120,123*** 
Emotional a 1.182 
Excited i 15.469%** 
Anxious Re 10.113*** 
„Peaceful -885 43,182*** 
‘Joyful 04. 139.576*** 
Sensitive 46 5.972* 
Energetic 75 16.449*** 
Unpredict- OF 1.125 
able 
Exhilarated 46.125*** 
Quiet -191 
Depressed 109.617*** 
Peaceful 54.019*** 
Lazy 3.686 
Unsettled 68.412*** 
Happy 117.232*** 
Thoughtful 5.438* 
Contented 65.193*** 


ED <01, 
+p < 001, 


p < .001. Of the 13 mood scales, only emo- 
tional and unpredictable failed to reach sig- 
nificance. Among the eight likelihood-of-mood 
scales six attained significance, one (lazy) 
was marginally significant, and one (quiet) 
„Was nonsignificant. Table 1 presents the 
means, standard deviations, and univariate F 
values for both sets of scales. 


Evaluation of Hypotheses 


In accordance with established techniques 
for handling attributional data (e.g., Frieze 
& Weiner, 1971), separate analyses of vari- 
ance were performed on each item of the two 
attribution measures, percentage of contribu- 
tion, and importance. Four of the 13 analy- 
Ses of variance performed were judged criti- 
cal to the hypotheses. Hypothesis 1 stipulated 
significant Cycle Phase X Mood interactions 
for the categories, medical information (per- 


hich did 
8For the percentage measure, scores wl 

not total 100% were adjusted to preserve the rela- 
tionships between the categories. 
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centage measure) and menstrual period (im- 
portance measure), with an association be- 
tween negative moods and the premenstruum 
expected. The second hypothesis that moods 
inappropriate to the environment would be 
more internally attributed (to the personality ) 
should be reflected in significant Mood x 
Environment interactions for the two mea- 
sures of attribution to personality. 

The results of the four critical analyses of 
variance revealed that both hypotheses were 
confirmed. Significant CP x M interactions 
were obtained for both menstrual period 
measures, F (1,94) = 10.472, p <.005 and 
F (1,93) = 4.367, p< .05, for medical in- 
formation and menstrual period, respectively, 
and the M X E interactions were significant, 
F (1,94) = 5.981, p < .05 and F (1,92) = 
11.2001, p < .005, for percentage contributed 
and importance of personality, respectively. 

Inspection of the cell means relevant to the 
two sets of interactions (Tables 2 and 3) in- 
dicated that the effects were in the predicted 
direction in three out of four cases. For the 
menstrual period measure, both positive and 
negative moods enhanced attributions to the 
approach of the period, with the attribution 
stronger for negative moods. For medical 
information, there was again an association 
between negative moods and the premen- 
struum, with a lesser association between 
positive moods and the other cycle phase. 

Of the two personality measures, only the 
importance rating showed the predicted effect 
clearly. The percentage measure indicated that 
positive moods were more internally attributed 


TABLE 2 
MEAN CONTRIBUTION RATINGS AS A FUNCTION oF 
Moop AND MENSTRUAL CYCLE PHASE: 
MENSTRUAL PERIOD MEASURES 


Mood 
Cycle phase a = 2 
Positive Negative 

Medical information* 

2 days 10.00 21.40 

3 weeks 14.02 7.80 
Menstrual period? 

2 days 3.00 4.43 

3 weeks 2.52 2.27 


^ These numbers represent percentage of contribution. 
b These numbers represent importance of contribution ona 


7-point scale, 
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TABLE 3 


MEAN CONTRIBUTION RATINGS AS A FUNCTION OF 
Moop AND PLEASANTNESS OF ENVIRONMENT: 
PERSONALITY MEASURES 


Mood 
Environment = 
Positive Negative 

Personality® 

Pleasant 15.80 14.35 

Unpleasant 29.38 12.44 
Personality” 

Pleasant 4.40 4.78 

Unpleasant 5.12 323 


a These numbers represent percentage of contribution. 
b These numbers represent importance of contribution on a 
T-point scale. 


regardless of environment, with personality 
judged most responsible for the inappropriate 
positive mood. Because the percentage mea- 
sure did not permit total independence in the 
category ratings, however, the increased rat- 
ing given to medical information for nega- 
tive moods occurring premenstrually may have 
been responsible for the relatively lower per- 
sonality rating for negative moods. As can be 
seen from Table 4, the predicted effect was 
obtained for the statistically independent im- 
portance measure. 

Other findings based on the total set of 
analyses of variance, although not specifically 
predicted, provide further confirmation of the 
second hypothesis. Subjects readily used the 
combined M and E cues (M X E interaction) 
to evaluate the effects of background, F (1, 
94) = 9.55, p < .005, current environment, 
F (1,94) = 28.347, p < .001, and three as- 
pects of the environment, father, F (1,94) = 
46.453, p < .001, boyfriend, F (1,94) = 
16.274, p < .001, and examination, F (1, 94) 
= 66.738, p < .001. As with the personality 
measures, background was judged a more po- 
tent contributor when mood and environment 
were inconsistent. Current environment and 
the three significant elements of it, father, 
boyfriend, and examination were judged more 
important when mood was appropriate to the 
situation. Since the environment involves ex- 
ternal factors and background, as personality, 
can be seen to involve internal factors, these 
results lend support to the attribution-theo- 
retic expectation that consistencies would be 
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attributed externally and inconsistencies in- 
ternally. 


Other Findings 


Perception of mood. Analyses indicated that 
assessments of mood were affected by the 
independent variable manipulations. Multi- 
variate F values indicated a significant effect 
of E for the 13-scale set, F (13,81) = 2.126, 
p < .021, and of CP for the eight-scale set, 
F (8,86) = 2.114, p < .043. These results 
reflect the subjects’ tendency to judge mood 
as appropriate to the situation and to give 
more negative mood ratings when the target 4 
person was premenstrual. Univariate F values 
for only three scales in both sets reached sig- 
nificance, however (anxious, for the 13 mood 
scales; depressed and happy for the eight 
likelihood scales) although several were mar- 
ginally significant. 

On both measures, the M X E interaction 
reached significance, F (13,81) = 4.004, p < 
.001; F (8,86) = 3.260, p < .003. Univariate 
Fs for the following scales were significant: ¥ 
happy, depressed, excited, joyful, sensitive, 
and unpredictable for the 13-scale set and 
happy for the eight-scale set. Generally, sub- 
jects judged positive moods appropriate to 
the situation more positive and inappropriate 
negative moods more negative. Both M and E 
cues were used to assess mood, with the for- 
mer usually weighted more heavily. 

Patterns of attribution. Because of the large 
number of tests performed for this set of 
measures, unpredicted effects were evaluated” 
more stringently, using a critical p = .01. The 
following effects were obtained: 

1. Positive moods resulted in more internal 
(personality) attributions than did negative 
moods, as evidenced by the significant M 
effect obtained for both the percentage, F (1, 
94) = 8.307, p < .01, and importance, F (1, 
a = 7.193, p < .01, measures of personal- 
ity. 

2. Unpleasant environments produced morg’ 
internal (personality) attributions, especially 
when the menstrual period was not expected 
for 3 weeks—CP X E effect for the percentage 
measure of personality, F (1,94) = 50.243 
p < .001. Inspection of the cell means indi- 
cated that this pattern resulted from the in- 
ordinately high ratings obtained for positive 
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moods occurring in unpleasant environments 
and the tendency to rate personality as more 
influential in producing postmenstrual moods. 

3. Premenstrually, the menstrual period, 
F (1,93) = 10.970, p < .005, was judged to 
have a greater influence on behavior than 
postmenstrually. No triple interaction was 
significant, ` 

Characterization of personality. As with 
the mood scales, a 2X22 multivariate 
analysis of variance was performed on the 17 
personality scales. Only two of the multivari- 

„ate Fs were significant. 

Table 4 presents the means, standard devi- 
ations, and univariate Fs for the larger effect, 
that of mood on personality characterization, 
The significant M effect, F (17,78) = 6.063, 
$ < .001, indicated that moodiness was over- 
assimilated to the personality. The display of 
negative moods led subjects to give more neg- 
ative personality characterizations (e.g., argu- 
mentative, hostile) while the display of posi- 
tive moods enhanced positive descriptions 
(e.g., warm, happy-go-lucky). This effect is 
reminiscent of subjects’ well-documented 
tendency to assume personal causation (e.g., 
Jones & Harris, 1967) when making attribu- 
tions, 

The barely significant M x E interaction, 
F (17,78) = 1.784, p< .045, reflected the 
higher ratings given to characterizations like 
sensitive, competitive, and assertive when 
mood was appropriate to the situation. Sub- 

jects apparently used both M and E cues to 
“make assessments of personality, usually 
weighting the former more heavily. 


Discussion 


The ‘data reported above generally confirm 
the-two hypotheses under investigation. Sub- 
jects display a clear-cut attributional pattern 
linking negative moods (depression, irritabil- 
ity) to the premenstruum. They generally 
make more internal (personality) attributions 

“When moods are inappropriate to the situa- 
tion and emphasize aspects of the situation 
moré when mood is appropriate. Subjects also 
appear to assume personal causation regard- 
less of extenuating situational factors: the 
display of emotionally expressive behavior 
(moodiness) is thought to reflect underlying 
personality dispositions. 
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TABLE 4 
Mrans, STANDARD DEVIATIONS, AND UNIVARIATE F 


VALUES FOR RATINGS OF PERSONALITY: 
Errect or Moop MANIPULATION 
Positive mood |Negative mood 
Personality Y Univariate F 
scales (df = 1,93) 
M | SD | M|- sp 
Argumentative | 2.063 3.068 | 1.495 | 12.864: 
Depressive 2.693 4.423 | 1.667 | 28.562 
Sen 4.955 5.639 | 1.195 | 10,150 
Happy-go-lucky | 4.448 2,328 | 1.487 | 43.107 
Friendly 5.360 4.148 | 1.698 | 15.794 
Competitive 4.447 3.614 | 1.480 | 1.879 
Emotional 5.239 5.474 | 1,262 | <1 
Independent 4.946 4.428 | 1.536 | "2.868 
Assertive 3.848 4.168 | 1.523] 1.536 
Unreasonable | 1.803 2,323 | 1.278 | 4.501% 
Moody 4.059 4.618 | 1.722 | 2,097 
Impulsive 4.184 3.654 | 1.931 | 2,393 
Warm 5.320 4.210 | 1.523 | 15,3384 
Hostile 1:513 2.403 | 1.307 | 15,1474 
ensive 3.896 4.174 | 1.638 | <1 
4.542 4,433 | 1.659 | <1 
4.733 3.023 | 1.373 | 34.04% 


Note. Higher mean scores indicate higher ratings on the 
attribute listed. 
05, 


The isolation of the attribution pattern 
linking negative moods and the premenstruum 
has several implications, First, the pattern 
itself may represent an oversimplification of 
the actual causes of premenstrual irritability. 
What amounts to the same attribution pat- 
tern, formalized in terms of hypotheses about 
hormonal fluctuations and the “premenstrual 
tension syndrome,” has been part of the sci- 
entific community for years. However, it has 
not yet, in any of its versions, received un- 
equivocal verification. The acceptance of es- 
sentially the same “hypothesis” by both sub- 
jects and experimenters represents a confound 
in the design of scores of research studies. 
Caution would indicate the use of double 
blind procedures and less reliance on self- 
report or recall of moods in future research 
on this question. 

More important, perhaps, is the possible 
role played by such an attribution pattern in 
causing premenstrual tension. One recent in- 
vestigator (Wineman, 1971) has found that 
sympathetic arousal characterizes the pre- 
menstrual phase of the cycle. Following 
Schachter and Singer (1962), emotionality 
should result from the interaction of this 
arousal with emotional labels provided by the 
situation. The simultaneous operation of a 
cognition linking negative moods and the 
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premenstruum could result in the discounting 
of situational factors and an emphasis on bi- 
ology (a type of internal attribution) in ex- 
plaining negative but not positive moods. This 
attributional pattern, then, could reflect not 
the actual incidence of positive and negative 
moods at premenstruation, but the extent to 
which women’s negative but not positive 
moods are seen as explained by biology. Ex- 
aggeration of the likelihood or recall of nega- 
tive moods with a reversal for positive moods 
might also result. 

Acceptance of this attribution pattern 
might then adversely affect women’s self- 
esteem. Valins and Nisbett (1971) have ar- 
gued that a vicious circle of self-condemnation 
and anxiety may result when negative be- 
havior patterns are internally attributed and 
situational factors discounted. Unreported 
analyses of the present data indicated that 
acceptance of the attribution pattern was 
greater among female subjects. Women were 
also more likely to make either internal or 
external attributions (the men’s attributions 
were more diffuse) and attributed more per- 
sonal causation for emotionally expressive 
behavior. These tendencies, if widespread,* 
could predispose women to suffer guilt, anxi- 
ety, and depression as a function of their 
belief in the biological explanation of pre- 
menstrual tension and might make action to 
alter upsetting situations more unlikely. As 
part of a complex attributional chain involv- 


4The relevance of the present findings to other 
samples should be evaluated before generalizing to 
women as a group. It is probable that the pre- 
menstrual expectation of negative moods is more 
prevalent among older, married women. Coppen and 
Kessel (1963) and Tonks, Rack, and Rose (1968), 
for example, found premenstrual tension more com- 
mon among older (30 to 45 year old) women living 
with a man. 
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ing biology, behavior, and self-concept, this 
belief may represent one of the more signifi- 
cant aspects of socialization on female per- 
sonality and deserves explicit consideration in 
future research. 


REFERENCE NOTE 


1. Koeske, R. K. Unpublished data, 1973. (Avail- 
able from Dr. Randi K. Koeske, Department of 
Psychology, Carnegie-Mellon University, Pitts- 
burgh, Pennsylvania 15213.) 
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Turning Play into Work: E 
Extrinsic Rewards on C 


Preschool children engaged 
expected reward conditions, 
highly attractive toys by 
conditions, subjects had 
subjects in the surveillance 


A necessary corollary to the Systematic use 

A of contingent extrinsic incentives by one per- 
‘son to modify or control the behavior of an- 
ther is surveillance, the constant or Periodic 
monitoring of a “subordinate’s” behavior by 
a “supervisor” with power or authority over 
him. Hence, any comprehensive account of 
the dynamics and empirical consequences of 
explicit contingency systems ultimately has to 
include an analysis of the effects of surveil- 
lance, as well as the effects of the incentives 
hemselves. As a method of providing data 
‘or the continuing evaluation of performance, 
surveillance is a practice common to a vari- 
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ult Surveillance and 
trinsic Motivation 


ety of everyday settings, 
(Silberman, 1970), indus 
and custodial or “total? 
man, 1961). Yet despite it 
society, the consequences 
side of the immediate sit 
occurs have received surpr: 
One perspective which 
search in this area, howeve 
tion theory (Bem, 1967, 1 
1973), which Suggests that the use of sur- 
veillance and powerful extrinsic incentives 
may, in some cases, have unintended and pos- 
sibly adverse effects upon both the supervisor 
who maintains surveillance over another and 
the subordinate who is placed under surveil- 
lance, Consider, for example, the effects of 
rewards and surveillance on the person placed 
in the role of supervisor over another. 
Strickland (1958) and others (Kipnis, 1972: 
Kruglanski, 1970) have conducted a series 
of intriguing studies which demonstrate an 
attributional cycle in which surveillance pro- 
duced distrust of a subordinate’s motivations, 
which in turn produced further surveillance, 
In Strickland’s (1958) classic study, sub- 
jects were asked to serve as supervisors 
simultaneously over two subordinates, During 
an initial work period, the supervisor was al- 
lowed relatively high surveillance over one of 
the subordinates and relatively low surveil. 
lance over the other worker, together with the 


including schools 
try (Homans, 1950), 
” institutions (Goff- 
S prevalence in our 
of surveillance out- 
uation in which it 
isingly little study, 
has stimulated re- 
r, has been attribu- 
972; Kelley, 1967, 
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power to reduce wages for inadequate work 
on supervised trials. Although in the end both 
workers had performed equally well at the 
task, this variation in the amount of surveil- 
lance had profound effects on the supervisor's 
attributions concerning the two subordinates’ 
motivations. Relative to the subordinate 
under low surveillance, the supervisors in this 
study saw the high-surveillance subordinates 
as motivated primarily by the surveillance 
itself, and hence as less internally motivated, 
less trustworthy, and less likely to perform 
adequately in the absence of surveillance. 
Thus, when subsequently given a choice of 
which subordinate to monitor, subjects chose 
more often to monitor the previously moni- 
tored subordinate, bringing the process full 
circle.* 

On the other side of the coin, from the 
perspective of the subordinate in such a situ- 
ation, self-perception theory (Bem, 1967; 
1972) suggests that a fundamentally similar 
attribution process may well occur in the sub- 
ordinate’s perceptions of his own behavior, 
leading him ultimately to fulfill the super- 
visor’s low expectations of him. To the extent, 
the theory suggests, that a person observes 
himself engaging in an activity in a situation 
in which extrinsic pressures to do so are 
strong and salient, he, like others, attributes 
his own behavior to the extrinsic pressures in 
the situation and comes to see himself as 
lacking any intrinsic interest in the activity 
or any intrinsic motivation to perform well. 
Hence, if a person is asked to undertake a 
task under conditions of strong external pres- 
sure, the theory suggests that any initial in- 
trinsic interest the person may have had in 
that activity is undermined by this pressure, 
and that when a person is subsequently pre- 


1 Actually, Kruglanski’s (1970) replication of the 
original Strickland study suggests that this last link 
in the chain—the relationship between attributed 
trustworthiness and future monitoring—may be 
somewhat more complex than indicated here. In a 
condition in which supervisors had the power to 
reward rather than fine the monitored subordinates, 
and in which the subordinate under initially high sur- 
veillance never performed well enough to receive a 
reward, supervisor-subjects trusted previously highly 
monitored subordinate less, yet monitored the other, 
more trusted subordinate more, thereby offering him 
a chance to earn rewards in the subsequent session. 


Mark R, LEPPER AND Davi GREENE 


sented with the same activity in the abse 
of strong external forces, he is less like! 
choose to engage in the activity. 

This “overjustification” hypothesis—t 
the use of overly sufficient extrinsic pre 
decreases subsequent intrinsic motiva! 
has recently found experimental supp‘ 
studies of the effects of extrinsic rewards b 
Lepper, Greene, and Nisbett (1973) an 
Greene and Lepper (in press). In th 
studies, children in individual sessions 
were led to expect and then were giv 
extrinsic reward for engaging in an -A 
ity of initial intrinsic interest; were givi 
same reward unexpectedly after they h 
ished with the activity; or were offere 
extrinsic reward for engaging in the act 
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Subsequently, unobtrusive measures 0! € 
children’s intrinsic interest in the activity 
were obtained in their classrooms, in the 


absence of any expectation of reward. The 
results in both studies indicated that > 
tation of a reward while engaging le L 
activity, relative to the other condition pa \ 
nificantly undermined the children’s in 

interest in that activity. 

In the context of this previous researc} 
the present study has two major pu 
(a) to extend the generality of the « 
sions of earlier studies concerning the 
of extrinsic rewards on later intrinsic int 
by attempting to replicate earlier results with 
an experimental activity and extrinsic reward 
quite different from those employed in previ- 
ous research; and (b) to examine, at thes 
same time, the effects of adult surveillance 
during the experimental sessions on sub- 
sequent intrinsic interest in the activity. 
Furthermore, because the implications 
nipulations which undermine children’s in 
trinsic interest in activities extend far beyond 
the laboratory setting, the present study seeks 
to emphasize the relevance of any results ob- 
tained for real-world settings by examining 
the effects of surveillance and expectation ot 
extrinsic rewards on children’s intrinsic in- 
terest in a natural classroom setting. 

To these ends, the study involved a 3 x 2 
design in which surveillance (high—low-no) 
and expectation of reward (expected-unex- 
pected) were manipulated orthogonally. Pre- 
school children were asked to engage in a 
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novel activity in individual sessions. In the 
expected reward conditions, subjects expected 
to be able to win a chance to play with a 
highly attractive set of toys by engaging in 
the activity; while in the unexpected reward 
conditions, subjects had no knowledge of 
these toys until they were finished with the 
activity. All subjects were asked to undertake 
the activity in the experimenter’s absence. 
Orthogonal to the manipulation of reward 
expectancy, subjects in the surveillance con- 
ditions were told that the experimenter would 
x be monitoring their performance during the 
session either most of the time (high surveil- 
lance) or only occasionally (low surveillance) , 
while subjects in the nonsurveillance condi- 
tions were given no such expectation. From a 
self-perception perspective, it was predicted 
that both surveillance and the expectation of 
an extrinsic reward would decrease the 
amount of interest children would show in 
the activity later, in their classrooms, where 
extrinsic pressures were absent. 


METHOD 
Subjects and Experimental Setting 


The subjects for this study were 80 preschool 
children, ranging in age from 4 years to 5 years 3 
months, selected from the student population at the 
Bing Nursery School, located on the Stanford Uni- 
versity campus. These children came from predomi- 
nantly white, middle-class backgrounds, and the 
sample included 39 males and 41 females. An addi- 
tional 12 subjects, distributed across conditions, 
were discarded from the experiment when they were 
F unable to solve the puzzles presented during the 
experimental session. None of these children had 
participated in any earlier related experiments on 
intrinsic motivation. 

The Bing Nursery School and its associated re- 
search facilities served as the experimental setting 
for this study. The nursery school consists of three 
classrooms which conduct similar and simultaneous, 
but independent classes. Two of the classrooms, 
equipped with large one-way mirrors and sound 
equipment, were used in the study. In these classes, 
the program is, by intention, relatively unstructured, 
with considerable “free play” time in which children 
“are allowed to choose from among a variety of both 
continuously and periodically available activities. 
For the purposes of this experiment, this arrange- 
ment allowed the introduction of an experimental 
activity into the ongoing nursery school program by 
the nursery school teachers, without intrusion into 
the classroom by researchers, in a situation in 
which children’s responses to this activity could be 
anobtrusively observed and recorded. 
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Experimental Materials 


To assess the generality of the findings obtained 
in previous studies and to provide an activity with 
explicit performance criteria, the present study em- 
ployed a task different from that used in earlier 
studies. This activity consisted of a set of 20 puz- 
zles, each consisting of a 10X 10X4 in (2.54X 
2.54 X .013 cm.) board with ł in (.006 cm.) insets 
cut to accommodate multicolored plastic puzzle 
pieces in various geometric shapes. Pretesting sug- 
gested this activity to be of initial intrinsic interest 
to most children and of sufficient similarity to other 
normal classroom activities so as not to appear 
strange in the classroom setting. Of this set of 20 
puzzles, 10 were selected for use during the experi- 
mental sessions. All the puzzles were used later in 
the classroom. 

Similarly, in the interests of generality, this study 
employed a different extrinsic reward from previous 
studies. In contrast to the largely symbolic “Good 
Player Award” used in earlier studies, the present 
study employed a “Premack” procedure (Premack, 
1965), in which the opportunity to play with a col- 
lection of highly attractive toys (e.g., a miniature 
garage and gas station, racing cars, a toy dog, a 
lunar lander and robot, a doll, etc.) served as the 
extrinsic incentive. Such a procedure, it was expected, 
would ensure that at least some of the available 
toys would provide a strong incentive for each of 
the subjects. 

Finally, in order to study surveillance per se, in 
the absence of any concurrent feedback to the child 
concerning his performance or interaction between 
the subject and the experimenter, this study em- 
ployed a television camera, through which the child 
could be told that he was being observed. For this 
purpose a General Electric television camera, 
mounted on a movable metal tripod and fitted with 
a Zeiss zoom lens, was used. This procedure, in 
addition to eliminating considerable extraneous 
variance, also permitted a clear variation in the 
amount of time during the experimental sessions 
that the subject believed he was actually under 
surveillance. 


Experimental Sessions 


For the experimental session, each child was es- 
corted individually to one of the research rooms 
attached to the nursery school by an adult experi- 
menter. In the experimental room were two long 
child-sized tables, each holding a set of puzzles, and 
a television camera mounted on a movable metal 
tripod placed next to one of the tables. In addi- 
tion, in the corner of the room by the entrance, was 
the set of attractive toys, hidden from sight by a 
cloth screen. 

As the subjects entered the room, the manipula- 
tion of reward expectancy took place. In the ex- 
pected reward condition, the experimenter pulled 
back the cloth screen, exposing the assortment of 
toys, and asked the subject if he would like a 
chance to play with them. When the subject as- 
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sented, the experimenter explained to him that he 
would be able to earn an opportunity to play with 
these toys if he did a good job on the puzzles, work- 
ing hard and solving them as quickly as he could. 
The cloth curtain was then replaced. In the unex- 
pected reward conditions, the toys were not exposed 
and no mention was made of their existence. 

The subject was then seated at a table which 
contained four puzzles, which were presented to the 
child as “practice” puzzles and which established for 
the child a procedure for the remaining puzzles. The 
experimenter demonstrated to the subject how to 
solve the first puzzle and then gave the subject a 
chance to solve the remaining three puzzles. If at 
any point the child needed help in solving the puz- 
zles, the experimenter came to his aid, but then 
asked the child to solve it for himself. When the 
four puzzles had been finished, the child was asked 
to spread them out on the table and to ring a small 
bell as a signal that he had finished the puzzles. 

When the child had finished the practice puzzles, 
the experimenter indicated that it was now time for 
the child to solve the “real” puzzles on his own. To 
emphasize further the discontinuity between the 
two sets of puzzles, the child was asked to move to 
a second table next to the television camera. This 
second table contained the six puzzles comprising 
the actual experimental task and, again, a small bell. 
The experimenter indicated to all subjects that for 
these puzzles it was important to work hard and to 
solve the puzzles as fast as they could. In the ex- 
pected reward conditions, the experimenter also indì- 
cated that how fast the subject did the puzzles and 
how hard he worked would determine how much 
time he would be given to play with the toys he 
had been shown. In short, for all subjects, the ex- 
perimenter indicated an explicit concern with the 
child’s performance on these puzzles the child was to 
do alone. On these puzzles surveillance was varied. 

In the surveillance conditions, the television 

camera beside the table was equipped with a zoom 
lens pointed directly at the table where the subject 
would be working, and on the table directly in front 
of the subject was a small light. After the child 
had been told that the experimenter would be 
leaving the room, the experimenter explained that 
he would still be able to see how well the child was 
doing on the puzzles, from time to time, through 
the nearby television camera. The experimenter 
pointed out the camera and explained further that 
the child would be able to tell when the experi- 
menter was watching him, because the small light 
on the table in front of him would turn on when- 
ever he was being watched. In the nonsurveillance 
conditions, the television camera was turned to face 
away from the table, the lens was removed from 
the camera, and the small light was removed from 
the table. No mention was made of the camera or 
of any surveillance by the experimenter. All sub- 
jects were told, however, that when they finished 
the puzzles they were to lay them out on the table 
and to ring the bell, as they had done with the 
practice puzzles, to indicate that they had finished 
them. 
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Before leaving the room, the experimenter rein- 
stated the critical elements of procedure for each 
subject and then exited, leaving the subject alone. 
As the subject worked on the puzzles, within the 
surveillance conditions, the amount of surveillance 
was manipulated by an observer watching the ex- 
perimental session from behind a one-way mirror 
along one side of the room. In the low-surveillance 
conditions, the light signaling the subject that he 
was being watched by the experimenter was turned 
on while the subject was working on one of the six 
puzzles; in the high-surveillance conditions, the light 
was turned on during four of the six puzzles. In 
both cases, the choice of puzzles to be undertaken 
under surveillance was determined by consultation 
with a predetermined random order. In addition, 
this observer recorded the amount of time the 
subject took to solve each puzzle and any com- 
ments the child made during the session. 

After the subject had completed the six puzzles 
and had rung the bell, the experimenter reentered 
the room and indicated to all subjects that they 
had done a very good job of solving the puzzle 
quickly. In the expected reward conditions, the 
experimenter added that because the subject had 
done such a good job he had earned the chance to 
play with the toy collection; for subjects in the 
unexpected reward conditions, the experimenter 
added simply and in a noncontingent manner that 
he also had a collection of toys with which the 
subject might like to play. All subjects were given 
10 minutes to play with the toys. At the end of this 
period, the subjects were thanked and returned to 
their classrooms by the experimenter, 


Classroom Observations 


One to 3 weeks after the completion of these 
individual experimental sessions, the primary mea- 
Sure of subsequent intrinsic interest in the activity, 
in the absence of any expectation of extrinsic re- 
ward, was taken in the subjects’ classrooms follow- . 
ing the procedure established in previous studies, 
For the first hour of three consecutive class sessions, 
the experimental activity was set out by the class- 
room teachers, among the other periodic activities 
chosen by the teachers for each day, at a table near 
the door to the classroom. To increase the accuracy 
of the measures, when the materials were available 
the teachers were asked to render inaccessible other 
similar playing materials and to refrain from sitting 
at the target table. Otherwise, the teachers behaved 
in a normal fashion. Hence, when the materials 
= Presented, the c ildren were free to choose 
p me Wade activity and the variety of other Ý 
te Ae eee and outdoors, offered by 
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near-perfect reliability (r=.99) the time spent at 
the target table for each of the children in the 
classroom. 


RESULTS 


. Preliminary analyses indicated no signifi- 
cant effect of sex of child or interaction of 
sex with experimental conditions; hence data 
were collapsed across. this dimension for 
further analysis. Preliminary analyses also 
revealed no significant differences between the 
high-surveillance and low-surveillance condi- 
tions on any measure; and these two treat- 
ments were therefore collapsed into a single 
condition for purposes of analysis, yielding a 
2 X 2 factorial design (Expected—Unexpected 
Reward X Surveillance-Nonsurveillance) . 

Figure 1 presents the data of primary in- 
terest in this study—the proportion of sub- 
jects in each of these four conditions who 
showed an interest in the experimental activ- 
ity in the classroom setting. These propor- 
tions were transformed to arcsines, and a 2 X 
2 analysis of variance was performed on these 
transformed data (cf. Langer & Abelson, 
1972).? This analysis yielded significant main 
effects of both reward expectancy, F (1/2) 
= 4.86, p < .05, and surveillance, F (1, %) 
= 4.86, p < .05, but no interaction between 
these two variables (F < 1). Thus, as in pre- 
vious studies, expectation and receipt of an 


2 An analogous analysis was also performed on the 
amount of time each child spent with the materials. 
The results of this analysis were in the same direc- 

* tion as the nonparametric data, although only the 
effect of the surveillance, F(1,76) = 4.30, p<.05, 
and not the effect of the reward expectancy manipu- 
lation, F(1, 76) = 2.42, p > .10, was statistically sig- 
nificant. This parametric analysis presented a less 
adequate picture of the data in the present study 
than the nonparametric analysis, however, for two 
reasons. First, the marked lack of homoscedasticity 
in these data raises a considerable question as to 
the appropriateness of the analysis of variance with 
unequal cell frequencies (cf. Scheffé, 1959). Second, 
the time data seemed influenced by a number of 
factors other than interest per se. While in earlier 

= studies the use of drawing materials allowed the 
child to work in his own way for virtually unlim- 
ited amounts of time, the use of puzzles with a clear 
criterion of completion in the present study created 
a situation in which extraneous factors such as the 
number of other children at the target table “tying 
up” other puzzles, the child’s skill in solving puzzles, 
or his motivation to solve them quickly, as well as 
the child’s interest in the activity, seemed to affect 
the amount of time spent with the materials. 
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Ficure 1. Subsequent intrinsic interest in the target 
materials, by treatment conditions, 


extrinsic reward for engaging in an activity 
was sufficient to produce decreased intrinsic 
interest in the activity in the classroom set- 
ting; and, orthogonal to this effect, surveil- 
lance by the experimenter during the task pro- 
duced an additional decrease in later interest 
in the activity. | 
Because this study did not include mea- 
sures of preexperimental interest in the target 
materials, of course, it is possible to argue 
that interest in the materials may have been 
increased under the nonsurveillance and un- 
expected reward conditions rather than de- 
creased by surveillance and expected rewards. 
Although imperfect, some relevant data are 
available from a pretest of the materials in 
another classroom at the same nursery school, 
in which the materials were presented for a 
single session following the same procedures 
as in the experiment proper. In this case, 29 
of 34 children (85%) played with the ma- 
terials, suggesting quite a high level of initial 
interest in the activity. The present interpre- 
tation of the data from this study, moreover, 
is consistent with previous data collected in 
conceptually analogous experiments in this 
setting which have shown (a) expected re- 
wards to produce significant decreases in in- 
terest from baseline classroom measures (Lep- 
per et al., 1973), and (b) expected rewards 
to produce significant decreases in subsequent 
interest relative to both no-reward and unex- . 
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pected reward control groups, which did 
not themselves differ (Lepper et al., 1973; 
Greene & Lepper, in press). 

In addition to these data of principal in- 
terest, the study also yielded a score for each 
subject on the amount of time he required to 
solve the six puzzles during the experimental 
session, and these scores were also submitted 
to a 2 X 2 analysis of variance. This analysis 
yielded a marginally significant effect of re- 
ward expectancy, F (1,76) = 3.08, p < .10, 
with subjects expecting to receive a reward 
tending to solve the puzzles more quickly 
than subjects not expecting a reward, but no 
significant effect of surveillance (F < 1) or 
interaction of surveillance with reward ex- 
pectancy (F = 1.26) on puzzle speed. It 
should be noted, however, that this speed 
measure was probably not a good indicator of 
the immediate effects of surveillance on per- 
formance, since any potentially facilitating 
effects of the surveillance manipulation may 
have been obscured for some subjects by the 
obviously distracting effect of the presence of 
the camera and light. 

EN 
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DISCUSSION 


The results of the present study, then, rep- 
licate and extend the findings of previous 
research on the ways in which extrinsic in- 
centives may undermine children’s intrinsic 
interest in an activity. As in previous studies 
(Lepper et al., 1973; Greene & Lepper, in 
press), children tended to work more quickly 
during experimental sessions when they ex- 
pected to earn a reward; nevertheless, chil- 
dren who anticipated and received an extrinsic 
reward for engaging in an activity of initial 
interest were less likely to show interest in 
that activity subsequently, in a classroom 
situation in which extrinsic incentives were 
absent, than children who had not expected 
a reward during the experimental sessions. 
Furthermore, the fact that the present study 
was able to replicate previous results with an 
activity, a reward, and a procedure quite dif- 
ferent from those employed in earlier studies 
attests to the generality of the basic phe- 
nomenon. 

At the same time, the present study also 
indicates that adult surveillance during the 
time the child was engaged in the task was 
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itself sufficient to produce a similar decre- 
ment in subsequent intrinsic interest in the 
activity, as suggested by a self-perception 
analysis of this situation. The knowledge that 
one’s performance at a task is being observed 
and evaluated by someone else, even when 
there is no explicit expectation of any tangible 
reward for engaging in the activity, appears 
sufficient to decrease later interest in the task. 

Interestingly, the data from the present 
study suggest that it made little difference 
whether this surveillance was only occasional 
or nearly constant. Whether this result is due 
to an insufficiently powerful manipulation of 
amount of surveillance or whether the means 
by which surveillance adversely affects in- 
trinsic interest is an all-or-none process re- 
mains an interesting question for further re- 
search. Similarly, although the methodologi- 
cal advantages of the present televised sur- 
veillance technique are clear, it is also impor- 
tant for future research to examine, as well, 
the effects of face-to-face monitoring, which 
may depend heavily on the character of the 
interaction between the agent and the object 
of surveillance.’ 

Notwithstanding the theoretcal subtleties 
of this process, however, the practical impli- 
cations of these studies seem considerable for 
schools and other institutions in which sys- 
tems of extrinsic incentives and periodic sur- 
veillance are employed to control and ma- 
nipulate behavior. To the extent, for example, 
that many of the activities we ask children 
to attempt in school may be of some initial © 
intrinsic interest to at least some of the chil- 
dren, the effect of presenting these activities 
in the context of a system of extrinsic incen- 
tives and adult surveillance may be to under- 
mine that intrinsic interest in those activities, 
Unwittingly, these studies suggest, we may 


3 Since the surveillance manipulation in this study 
bears more than a passing resemblance to proce- 
dures employed by Duvall and Wicklund (1972) to 
manipulate “objective self awareness,” it may be 
worth noting that at least inferentially in this study 
this manipulation served to increase “external” 
rather than “internal” attributions. While it is cer- 
tainly not clear that an objective self-awarenses po- 
sition would speak to the present situation, these 
data may suggest the importance of looking at such 
manipulations under “Jow-dissonance” aso well as 
“high-dissonance” conditions. 


often turn activities of initial interest into 
drudgery which children engage in only when 
external pressures are Present to force or lure 
them to do SO—a suggestion which is highly 
congruent with the observations of a number 
of analysts of our current educational system 
Holt, 1964; Jackson, 1968; Silberman, 
1970). As Jackson has Suggested in his per- 
tive book, Life in classrooms: 


“he distinction between work and play has 
‘ar-reaching Consequences for human af- 
‘airs, and the classroom is the setting 
1 which most people encounter this dis- 
nction in a Personally meaningful way. 
ccording to one of its many definitions, 
vork entails becoming engaged in a pur- 
seful activity that has been prescribed 
us by someone else; an activity in 
which we would not at the moment be 
engaged if it were not for some system of 
authority relationships. As preschoolers the 
students may have played with the concept 
of work, but their fanciful enactments of 
adult work usually lack one essential in- 
` lient, namely: the use of some kind of 
external authority system to tell them 
t to do and to keep them at their job. 
© teacher, with his prescriptive dicta and 
his surveillance over the students’ atten- 
n, provides the missing ingredient that 
ces work real. 


lay be objected, of course, that many 
A scuvilles we ask of children in school are of 
“little or no intrinsic interest to the children; 

that there are many important activities in 

h children would ‘not engage spontane- 
y without external pressure or offer of 
external reward. We agree completely and 
suggest that in these cases the use of extrinsic 

*. rewards is necessary and appropriate. Cer- 
tainly the “lesson” to be learned from these 
studies is not that extrinsic rewards should be 
:bandoned, but rather ‘that if one wishes to 
foster an interest in activities which would 
manifest itself in situations or at times when 
extrinsic pressures are absent, one would be 
well advised to employ the minimal amount 

of pressure sufficient to elicit or maintain the 

- + desired behavior. A virtually identical recom- 
‘ „ mendation, it might be noted, has recently 
been offered by researchers concerned with 
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the practical problem of promoting “generali- 
zation” of the effects of token economy pro- 
grams beyond the immediate situation in 
which rewards are available (O'Leary, Drab- 
man, & Kass, 1973), Suggesting further 
the applicability of these results in natural 
settings. 

It is also worth noting, however, that any 
observer’s estimate of the inherent interest 
value an activity has for an actor is colored 
by the setting in which he observes the actor 
engaging in the activity. If we, like the super- 
visors in the Strickland (1958) study, ob- 
serve an actor engaging in an activity when 
extrinsic pressures are great, we would at- 
tribute his behavior to the external contin- 
Sencies in the situation, In fact, if the self- 
Perception account is correct, it would suggest 
that the use of overly sufficient pressure to 
induce a person to engage in an activity may 
produce a self-sustaining and self-fulfilling 
cycle since both the source and the recipient 
of that pressure would be subject to the same 
attributional “bias,” Thus, the teacher or 
supervisor comes to believe that the child or 
worker is motivated only by external pres- 
sures and therefore maintains that external 
pressure, while the maintenance of that pres- 
sure in and of itself may ultimately conyince 
the child or worker himself that he is moti- 
vated by the external pressure, making him 
less likely to engage in the activity in the 
later absence of that pressure.* 

Such self-fulfilling cycles, as “superstitious” 
behavior, are not likely to be discovered by 
the participants, who have no reason to be 
skeptical of the necessity of a system of overt 
extrinsic controls; indeed, the mere existence 
of such controls bears testimony to their neces- 
sity. Demonstrating that they are not nec- 
essary, moreover, is possible only to the ex- 
tent that the participants do not already be- 


4 Indeed, one subtle message of a supervisor’s use 
of extrinsic incentives to control a subordinate’s be- 
havior may be his distrust of that subordinate’s 
“intrinsic” motivations, or even his own dislike for 
the activity, and such an implicit communication 
may well be one part of the process by which an 
Overjustification effect may be produced. Further 
research to sort among these various attributional 
alternatives, perhaps through the use of observer- 
subjects as well as actors, would certainly seem 
appropriate. 
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lieve the contention to be disproved. If, as 
Kipnis (1972) has proposed, the mere avail- 
ability of institutional powers to a supervisor 
is virtually sufficient to ensure their use, the 
prevalence of self-sustaining cycles of sur- 
veillance and extrinsic control in our society 
is not surprising. A self-perception perspec- 
tive suggests that the prospects for being able 
to set up institutions so as not to perpetuate 
these cycles would be substantially improved 
by further attention to the processes by which 
people make inferences about their own and 
others’ motivations, 
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Individual Versus Dyadic Performance on a Disjunctive Task 
as a Function of Initial Ability Level 


Patrick R. Laughlin and Deborah S. Bitz 
University of Illinois 


Individuals and cooperative pairs of six ability levels were compared on a 
task thought to be clearly disjunctive, in which the group solves the problem 
if any group member can solve the problem. After taking the Mednick Remote 
Associate Test as individuals, 255 college students were trichotomized as high 
(H), medium (M), or low (L) in ability and retook the same test in one of 
the six cooperative pairs HH, HM, HL, MM, ML, LL or as control H, M, or 
L individuals. The prediction of a truth-wins process on a disjunctive task 
was tested by an application of the theoretical model of social decision schemes 
of J. H. Davis. Both an equiprobability and error-wins social decision scheme 
could clearly be rejected, while the predictions of a truth-wins social decision 
scheme closely fit the obtained results (a) for individual items and over 
items, (b) over all ability levels and within the six ability levels, and (c) as 
estimated from both first-test performance of the actual group members and 
from second-test performance of control individuals of the appropriate ability 
level. It is suggested that clear support for a truth-wins process will be ob- 
tained only for tasks at the extreme disjunctive end of the basic conjunctive- 


disjunctive dimension of task characteristics. 


Recent theory and reviews of individual 
versus group performance have emphasized 
the importance of the group task (e.g., Davis, 
1969; Hackman, 1969; Kelley & Thibaut, 
1969; Steiner, 1966, 1972). The basic task 
dimension is probably a continuum anchored 
by-strictly conjunctive tasks, where the group 
solves the problem only if all individual mem- 
bers of the group can solve it, and strictly 
disjunctive tasks, where the group solves the 
problem if any individual member can solve 

. it. Many tasks may be further divided physi- 
cally or conceptually into subtasks. On such 
complementary tasks the group members may 
combine their unshared resources and thus 
exceed their performance as individuals solv- 
ing the task independently. Furthermore, the 
ability level of the group members interacts 
with task difficulty on a complementary task, 
so that cooperative groups may or may not 
exceed the performance of the same individu- 

„als working independently. For example, 
Laughlin and Johnson (1966) found that 
pairs of high-ability individuals performed 
better than a single high-ability person on the 
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Terman Concept Mastery Test, while me- 
dium-ability pairs did not differ from medium- 
ability individuals, and low-ability pairs did 
not differ from low-ability individuals. In 
contrast, Goldman (1965) obtained virtually 
opposite results on the much less difficult 
Wonderlic Intelligence Test. However, intelli- 
gence test items do not possess the strong 
compelling insight properties commonly at- 
tributed to disjunctive or “eureka” tasks, so 
a group member of a given ability who in 
fact knows the correct answer on this type of 
task may not be able to persuade the other 
group members to accept his judgment. Thus, 
the major purpose of the following study was 
to compare individuals and cooperative pairs 
of six ability levels on a task thought to be 
clearly disjunctive, the Remote Associates 
Test (RAT) developed by Mednick and 
Mednick (1967) to operationalize an asso- 
ciative theory of creativity (Mednick, 1962). 

The RAT consists of 30 sets of three words 
which can be related in some way by a com- 
mon remote association, For example, the 
three words cookies, sixteen, and heart can be 
related by the common association sweet. By 
definition, the items of the test are postulated 
to be insight or eureka tasks in which the 
correct answer is immediately evident and 
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accepted once it is proposed. Hence, each item 
of the RAT may be considered a disjunctive 
task, while the test as a whole may be con- 
sidered a complementary task. Accordingly, 
pairs of all ability levels should be able to 
perform at least as well as their best member, 
and should exceed their individual perform- 
ance to the extent that each member knows 
items not known in common by both mem- 
bers. As a test of this prediction, 255 indi- 
viduals were given the RAT, They were then 
trichotomized into high (H), medium (M) 
and low (L) levels of ability and immediately 
retook the same test as cooperative pairs in 
one of the six possible ability combinations 
HH, HM, HL, MM, ML, or LL. Control H, 
M, and L subjects retook the same test as 
individuals. The basic prediction that the 
groups at all six ability levels would solve 
each item if either member of the group could 
solve it was tested by an -application of the 
theoretical model of social decision schemes 
(Davis, 1973). 

Essentially, the theoretical model of social 
decision schemes is a four-step process. Ini- 
tially, the number of distinguishable distribu- 
tions of member preferences for a finite set 
of response alternatives is determined as a 
joint function of the number of group mem- 
bers and the number of response alternatives. 
Second, a row vector of the probabilities of 
each of these distinguishable distributions of 
preferences is determined from the multino- 
mial expansion, using the probability of each 
of the individual responses as estimated from 
either a measure of previous performance of 
the actual individuals who compose the group 
or an independent sample of individuals who 
do not actually interact as a group. Third, 
this row vector is postmultiplied by a social 
decision scheme matrix, which assigns a prob- 
ability to each response alternative for each 
distribution of member preferences, to obtain 
a row vector of the predicted probabilities of 
each response by the group. This social deci- 
sion scheme itself is extraexperimental, and 
may be based upon the constitution or bylaws 
of a group (e.g., majority rules), prevailing 
theory (e.g., truth wins), custom or tradition, 
the law, and the like. Finally, the predicted 
distribution of group responses is compared 
with the actual distribution of group re- 
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sponses. If the predictions generated by a 
particular social decision scheme do not differ 
significantly from the actual group responses, 
the social decision scheme may be considered 
a plausible model or explanation of the under- 
lying group process. 

More specifically, following the notation of 
Davis (1973), for some set of n mutually ex- 
clusive and exhaustive response alternatives, 
Aj, A2, ..., An, the preferences of individuals 
may be described by a probability distribu- 
tion, pi, P2, . . . Pn across the alternatives, 
For a group of r individuals, the internal dis- 
tributions of preferences may be represented 
by the occupancy numbers, 71, r2,. . ., n, and 


n 
z i= r. 
jl 
For a given r and n, the m distinguishable 
distributions of member preferences within a 
group is given by the formula * 


n+r—1) 
r 


The m sets of occupancy numbers are the 
solutions to m+72+...,+%=r. The 
probability, m; that the group members will 
arrange themselves according to the ith dis- 
tribution, i= 1, 2, . . . m, is given by the 
general term of the multinomial expansion 


(pit pot e + py)? 


and is as follows: 


r 
«=| Jor proton (1) 
Fi, 12) °*°y Pp, 


With two response alternatives this reduces to 
the binomial expansion. 

Thus, with the two-person groups (r = 2) 
and the two response alternatives (n= 2) of 
correct (C) and incorrect (1) on each item 


of the RAT, there are m = 3 distinguishable; 


distributions of member preferences for the 
two alternatives. These three are (a) both 
group members correct as individuals (2,0), 


„>In this formula Davis (1973) follows the nota- 
tion of Feller (1950). See Feller (1957, p. 36 ff.) for 
a discussion of the derivation and application of the 
formula to occupancy problems, 
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TABLE 1 
Trutu-Wins (Disjunctive), EQUIPROBABILITY, AND Error-Wins (CONJUNCTIVE) 
SOCIAL DECISION SCHEME MATRICES FOR CORRECT AND INCORRECT 
RESPONSE FOR A Two-PERSON GROUP 
in Social decision scheme 
Distinguishable distributions 
of member preferences aa Tere vý 
Truth wins Equiprobability | i! Error wins 
Correct Incorrect Correct Incorrect Correct Incorrect Correct Incorrect 
2 0 1.00 00. 1.00 00 1.00 00 
1 1 1.00 00 50. 50 00 1.00 
0 2 00 1.00 00 1.00 00 1.00 


(b) one member correct and one member in- 
correct (1,1), and (c) both members incor- 
‘rect (0,2). The values for each of these 
three distributions were obtained from Equa- 
tion 1. Two separate estimates of the p values 
were used: (a) the probability that the mem- 
bers of the pair correctly solved the item as 
individuals and (b) the probability that the 
control individuals correctly solved the item 
on the second test. This resulted in a row 
vector, m = (72,0, 71,1, 70,2)+ 

A social decision scheme is a means of op- 
erating upon this row vector to yield a pre- 
dicted probability distribution P = (Po, Pı) 
of the group responses. Given the ith distribu- 
-tion of group member preferences, there is 
some probability dj; that the group will choose 
the jth alternative. The values of dj for all 
i=1,2,...,mandj=1, 2,..., are de- 
fined by a social decision scheme, which may 
be represented as an m X n stochastic matrix, 
D, with entries [dy] and 


3 [dy] = 1.00. 


j=l 


As previously indicated, this social decision 
scheme comes from some extraexperimental 
source, such as current theory. The “truth- 
wins” social decision scheme (Lorge-Solomon, 
1955, Model A) posited for a disjunctive task 
assigns a probability of 1.00 to the correct 
group response if any group member knows 
the correct answer. Consequently, the social 
decision scheme matrix defined by a truth- 
wins theory assigns a probability of 1.00 to 
the correct response for both the (2,0) and 
(1,1) distributions, and a probability of .00 


to the correct response for the (0,2) distribu- 
tion. Conversely, the truth-wins social deci- 
sion scheme assigns a probability of .00 to the 
incorrect response for the (2,0) and (1,1) 
distributions, and a probability of 1.00 for 
the (0,2) distribution. The resulting truth- 
wins social decision scheme matrix is given 
in Table 1. Two other feasible social decision 
schemes for the two-person group on the 
two-alternative task are equiprobability, 
where the probability that a group with one 
correct and one incorrect member will make 
the correct response is .50, and the “error- 
wins” decision scheme posited for a conjunc- 
tive task, where the probability that a group 
with one correct and one incorrect member 
will make the correct response is .00. The 
matrices for these two social decision schemes 
are also given in Table 1. Note that the three 
social decision schemes assign identical proba- 
bilities to the (2,0) and (0,2) distributions 
but different probabilities to the (1,1) distri- 
bution. 

Given a social decision scheme, the proba- 
bility distribution P = (Po, Px) of a correct 
group response as predicted by the social deci- 
sion scheme may be computed by postmulti- 
plying the row vector = (72,0, 71,1, 70,2) by 
the social decision scheme matrix D: 


(Po, Px) = (72,0, 7,1) m2) ° D. (2) 


Equation 2 was used to compute the predicted 
probability of a correct group response from 
each of the three social decision schemes on 
each item within each ability condition, as 
well as over items and/or over ability condi- 
tions. Comparison of these predictions with 
the actual obtained group responses is there- 
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fore a test of the three competing truth-wins, 
equiprobability, and error-wins social decision 
schemes as theories of the underlying group 
process on this particular experimental task. 
It was predicted that the truth-wins social 
decision scheme would fit the obtained group 
results, while the equiprobability and error- 
wins social decision schemes could be rejected. 

The second interest of the study was the 
overall level of second-test performance for 
the six group and three control individual 
conditions. In a comparable study with the 
Terman Concept Mastery Test, Laughlin and 
Johnson (1966) found that HH pairs per- 
formed better than H individuals, while MM 
pairs did not differ from M individuals and 
LL pairs did not differ from L individuals. 
Although HM pairs performed better than H 
individuals, neither HL and H nor ML and M 
differed. These results were interpreted as in- 
dicating increasingly greater homogeneity of 
ability as the ability level decreased, such that 
the medium-ability members of a pair shared 
more of their total knowledge in common 
than did high-ability members, and low-abil- 
ity more than medium-ability members. 
Laughlin and Johnson also proposed that 
vocabulary ability as measured by the Ter- 
man test is unidimensional in the sense that a 
person of greater ability will know most if 
not all of the words known by a person of 
lesser ability. Consequently, a person of lesser 
ability would contribute little unique infor- 
mation in a pair including a person of greater 
ability. Such homogeneity of ability within 
ability levels should be greater on the vocabu- 
lary items of the Terman test than on the 
remote association items of the RAT, in 
which greater heterogeneity of remote associ- 
ations across people of the same ability level 
would be expected from the manner of con- 
struction of the test (Mednick, 1962). How- 
ever, it is reasonable to expect that RAT abil- 
ity, like the Terman ability, is unidimensional 
in the sense that a person of higher ability 
will know most if not all of the remote asso- 
ciations known by a person of lesser ability. 

Consequently, from the three assumptions 
of (a) a truth-wins process on a disjunctive 
task, (b) greater heterogeneity of ability with- 
in medium- and low-ability levels on the RAT 
than on the Terman, and (c) unidimension- 
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ality, the overall second-test performance was 
predicted to be as follows: HH > (HM = 
HL = H) > MM > (ML = M) > LL > L. 


METHOD 


The subjects were 255 college students of both 
sexes in introductory and social psychology courses 
at the University of Illinois. Each subject first took 
the college and adult Form 1 of the RAT under the 
standard instructions (Mednick & Mednick, 1967). 
After the allowed 40 minutes, the answer sheets were 
collected. The first 180 subjects were then immedi- 
ately assigned at random to same-sex pairs or indi- 
vidual conditions for the second administration of 
the same test. The pairs were instructed to discuss 
each item and take the test as a cooperative group, 
recording their responses on a single test booklet. 
These 180 individual first tests were then rank or- 
dered and trichotomized into high (H), medium 
(M), and low (L) thirds, resulting in scores of 20 
and higher (of a possible 30) being defined as high, 
15-19 as medium, and 14 and below as low. The 
ability combination of each of the resulting pairs 
was determined as HH, HM, HL, MM, ML, or LL. 
The remaining 75 subjects similarly took the RAT 
as individuals on the first test. After their tests were 
collected, they immediately took a 10-minute filler 
task (writing plot titles for brief scenarios) while 
their tests were scored. They were then assigned to 
the six pair or three individual control conditions as 
necessary to achieve 17 replications in each of the 
nine conditions. Over all 255 subjects, the mean first 
test score was 16.45. 


RESULTS AND DISCUSSION 


Estimation of Second-Test Group Perform- 
ance over All Ability Conditions from Proba- 
bility of Item Correct as Individuals 


The predictions of second-test group per- 
formance for the truth-wins, equiprobability, 
and error-wins social decision schemes were 
computed by Equation 2. Table 2 presents 
these predictions for each of the 30 items over 
the 102 pairs (ignoring the ability level of 
the pair). The z values were obtained from 
the probability of a correct response on the 
item on the first test by the 204 individuals 
involved in the 102 pairs. The actual obtained 
probabilities of a correct response on each 
item for the pairs on the second test are also 
given in Table 2. The significance of the dif- 
ference between predicted and obtained values 
for each of the three social decision schemes 
was tested by the statistic z = VN (2 arcsin 
VPprea — 2 arcsin VPort), where VPpreq is 
the square root of the predicted proportion 


PREDICTED PROBABILITIES OF Group CORRECTLY 


Erpe ap Tuy or ITEM CORREC 
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TABLE 2 


ANSWERING EACH ITEM FOR THREE SOCIAL DECISION 


Scuemes or TruTH Wins, EQUIPROBABILITY, AND ERROR WINs, ESTIMATED FROM 


T AS INDIVIDUALS ON FIRST Test 


Item 
Decision scheme g 
v 1 2 3 4 6 7 8 9 10 
Truth wins 9299 | .9556 | .9375 | .8789 9309 | .o873 | 9104 | 9615 | 9192 
Equiprobability 17353 |. 7892 | i7500 | ‘6520 17549 | 8873 | 17108 | ‘8039 | ‘7187 
Error wins (5407 | 16228 | :5625 | 14251 :5699 | 17873 | ‘5052 | ‘6403 | 15122 
Obtained 39020 | ‘9608 | 9412 | 18725 2 | 3o20 | (9902 | ‘9118 | ‘9706 | 9412 
z truth wins-obtaineda 1.09 al 03 4 1.50 5 b 
5 equiprobabilit =5.61 | —5.58| —4.08 
z error wins-obtained 10,10 —9.65 | 
oD ica Nee 
X Truth wins «7093 | .5290 | 8181 
Equiprobability 4608 3137 5735 
Error wins 2123 0984 3289 
Obtained 7549 | 34118 | 18529 
z truth wins-obtained —.91 2.43 —.82 
z equiprobability-obtained =6.10| -211| —6.41 
z error wins-obtained =11.54 | —7.54 | —11,33 
21 22 23 24 25 26 27 28 | 29 30 
Truth wins 6767 | .5685 | .8139 | .4077 15222 | 4375 
Equiprobability 4314 | 13431 | 15686 | [2304 ‘3088 | (2500 
. Error wins 1860 | [1177 | :3233 | 10531 dosa | {0625 
Obtained 7059 | ‘5784 | ‘3431 | ‘4608 6078 | 15490 
s truth wins-obtained —.66 ~:20 —.80 | —1.02 —1.74| —2,23 
sequiprobability-obtained | —6.41| —5) —si80| —4.91 | —6.13| —4.95 =6.08 | —6.30 
s error wins-obtained —11:93 | —10.36 | —11.13,| —10.34 | —11.27 | —10,50 —11.50 | —11,88 


correct, V Pont the square root of the obtained 
proportion correct, and arcsin*the inverse sine 
(Walker & Lev, (1953, p. 424). The results of 
these tests on each item are also given in 
Table 2. A significant z value indicates that 
the social decision scheme in question can be 
rejected as a plausible explanation of the 
process underlying the group decision, while 
a nonsignificant z value indicates that the 
= social. decision scheme cannot be rejected. 
Thus, the smaller the z value, the hetter the 
fit of the prediction from a social’ decision 
scheme and the obtained group results. All z 
values were assessed by two-tailed tests at 
the .10 level of confidence, which, with the 
relatively large number of pairs, should pro- 
vide a conservative test of sufficient statistical 
power to essentially accept the null hypothe- 
sis. For the special case of two persons and 
“two response alternatives, the predictions of 
the equiprobability social decision scheme are 
identical to the probability that the item was 
correct for individuals, so the difficulty of 
„each item can also be determined from Table 


2. 


As indicated in Table 2, the error-wins so- 
cial -decision scheme or conjunctive task 


ag = 1.65, p <.10 (two-tailed test); s = 1.96, p < .05 (two-tailed test); s-= 3.65, p < .001 (two-tailed test), 


model, which requires both members of the 
pair to know an item for the pair to correctly 
answer the item, can be rejected as under- 
predicting the obtained results on each of the 
30 items, Likewise, the equiprobability social 
decision scheme can be rejected on all 30 
items. However, the truth-wins social decision 
scheme or disjunctive task model can only be 
rejected on 5 of 30 items, overpredicting on 
3 items (11, 15, and 19) and underpredicting 
on 2 items (29 and 30). Over all 30 items 
combined, the predicted proportion of correct 
group responses for the three social decision 
schemes was .7960 for truth wins, .5484 for 
equiprobability, and .3007 for error wins, 
while the obtained proportion correct was 
.7598. Again the prediction of the truth-wins 
model did not differ significantly from the 
obtained results (z = .88), while both the 
equiprobabiltiy (z = —4.54) and error-wins 
models (z= —9.66) differed significantly 
from the obtained results. 

In summary, we may conclude that the 
truth-wins social decision scheme closely pre- 
dicted the obtained results, while the equi- 
probability and error-wins social decision 
schemes may clearly be rejected as plausible 
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models of the underlying group process on 
this task. 


Estimation of Second-Test Performance for 
Each Ability Condition from Probability of 
Item Correct as Individuals 


In addition to the previous predictions over 
all pairs, the predictions of the truth-wins, 
equiprobability, and error-wins social decision 
schemes were computed for each item within 
each of the six ability conditions. Table 3 
presents the number of items (out of 30) on 
which each of the three social decision 
schemes could be rejected for each ability 
condition. As indicated in Table 3, the truth- 
wins social decision scheme could be rejected 
on few items, error wins on virtually all, and 
equiprobability on most items within each 
ability condition. More importantly, there 
was little difference in the applicability of a 
given social decision scheme across ability 
conditions, as the differences between the six 
obtained frequencies for each of the three 
social decision schemes (see Table 3) were all 
nonsignificant, x° (5) = 4.00, for truth wins; 
x’ (5) = 3.74, for equiprobability; x? (5) = 
.24, for error wins. The applicability of the 
truth-wins model across ability conditions was 
further tested by an analysis of variance on 
the difference scores for the truth-wins pre- 
diction minus obtained proportion of correct 
responses (both proportions transformed to 
arcsin or œ transformations) on each item 
within each ability condition, considering the 
30 items as replications. This analysis resulted 
in a nonsignificant difference between the six 
ability conditions, F (5,174) < 1. Thus, the 


TABLE 3 


NUMBER or Irems on WHICH PREDICTIONS or TRUTH- 
Wins, EQUIPROBABILITY, AND ERROR-WINS SOCIAL 
DECISION SCHEMES CAN BE REJECTED WITHIN 
Srx ABILITY CONDITIONS, ESTIMATED FROM 
PROBABILITY OF ITEM CORRECT AS 
INDIVIDUALS on First TEST 


AS Ability condition 
Social decision 
scheme 


HH | HM | HL | MM | ML | LL 


Truth wins 3 4 7 3 6 1 
Equiprobability | 24 | 24 | 25 | 23 | 27 | 15 
Error wins 27 | 29 | 29 | 30 | 30} 28 


Note. HH, HM, HL, MM, ML, and LL refer to combinations 
of high, medium, and low ability. N = 30. 
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truth-wins model applied equally across the 
six ability conditions, increasing the general- 
izability of the model as a plausible explana- 
tion of the underlying group process on this 
type of task. 


Estimation oj Second-Test Group Perform- 
ance over all Ability Conditions from Proba- 
bility of Item Correct for Control Individuals 
on Second Test 


Since the pairs retook the same test for a 
second time, it is possible that this added 


practice on a timed test resulted in additional ,—~ 


learning or insight into items neither indi- 
vidual had previously solved on the first ad- 
ministration. This possibility may be assessed 
from the probability that the control indi- 
viduals would correctly solve a given item on 
their second test. Thus, the predictions of 
second-test group performance for the truth- 
wins, equiprobability, and error-wins social 
decision schemes were also computed from 7 
values based on the probability of a correct 
response on each item by the 51 control indi- ~ 
viduals on the second test. These predictions 
and z tests of the predicted versus obtained 
proportions of correct answers for each social 
decision scheme are presented in Table 4. As 
indicated in Table 4, the error-wins social 
decision scheme was rejected on all 30 items, 
equiprobability on 26 of 30, and truth wins on 
only 8 of 30 items. Over all 30 items com- 
bined, the predictions of the three social deci- 
sion schemes were .8337 for truth wins, .5922 
for equiprobability, and .3507 for error wins, 
while the obtained proportion correct was 
-7598. The predictions of the truth-wins social 
decision scheme did not differ significantly 
from the obtained results (z = 1.32), while 
both the equiprobability (z= —2.58) and 
error-wins (z= —6.07) predictions differed 
significantly from the obtained results. 
Thus, the truth-wins social decision scheme 
closely predicted the obtained results when 
the probability that the control individuals 
would solve the item on the second test was 
used to estimate the x values, while the equi- 
probability and error-wins social decision 
schemes were again rejected. These analyses 
controlled for the possibility that the added 
practice on the second test enabled the pairs 
to solve items that neither of the pair mem- 


th wins | -8755 
probability | -6471 

Error wins | 4187 

_ Obtained 9020 
8 truth wins-obtained ~.60 
 equiprobability-obtained =4.54 

# error wins-obtained 7.84 


mene. 
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TABLE 4 
PREDICTED PROBABILITY OF GROUP CORRECTLY ANSWERING EACH Trem FOR THREE SOCIAL Decision 


SCHEMES or TRUTH Wins, EQUIPROBABILITY, 
or ĪTEM CORRECT BY Contr 


AND ERROR Wins, ESTIMATED FROM PROBABILITY 
OL INDIVIDUALS ON SECOND TEST 


Decision scheme = = 


Item 


Truth wins -9688 
Equiprobability 18235 
Error wins -6782 
Obtained 9020 
z truth wins-obtaineds 2.01 
Z equiprobability-obtained 


z error wins-obtained 


Truth wins 
Equiprobability 

Error wins 

Obtained 

z truth wins-obtained | 
z equiprobability-obtained | 


# error wins-obtained 


*s = 1.65, p < .10 (two-tailed test); s = 1.96, p <.05 (two-tailed test); z = 3,65, b < 001 (two-tailed test), 


bers had solved as individuals. From another 
perspective, this indicates that both the prob- 
ability of a correct response from an inde- 
pendent group of individuals and the proba- 
bility of a correct response for the actual 
group members on the first test will equally 
predict the obtained group results, suggesting 
that estimations from either of these proba- 
bilities will give comparable results in an 


AEDE EOR of a social decision scheme analy- 
is. 


Estimation of Second-Test Performance for 
Each Ability Condition from Probability of 
Item Correct for Control Individuals on 
Second Test 

Since each individual in the control condi- 
tions took the same test two times, there are 
four possible response patterns: correct on 


bootie 1 and correct on Test 2 (C,Cs), correct 


») 


on Test 1 and incorrect on Test 2 (C,I»), in- 
correct on Test 1 and correct on Test 2 
(I,C2), and incorrect on Test 1 and incorrect 
on Test 2 (I,Ig). These four patterns occur 
at each of the three ability levels, yielding 
the probabilities for the 12 patterns, 


Hee, Heis” 


++) Dni» 


These 12 probabilities may be used to esti- 
mate the probability that a pair of each of the 
six ability combinations will solve the item on 
any postulated social decision scheme. Since 
there were two group members and four pos- 
sible response patterns over the two tests for 
each member, there are 16 possible patterns 
for a given pair. These 16 possible patterns 
are illustrated for an HM pair in Table 5. 
Similar patterns occur for each of the other 
five ability combinations, The sum of the 16 
probability values for a given pair is 1.00. On 
the assumption of the truth-wins social deci- 
sion scheme, the pair should solve the item 
unless both members are incorrect on the 
second test. Inspection of Table 5 indicates 
that both members are incorrect on the second 
test on Patterns 6, 8, 14, and 16, while at 
least one member is correct on the second 
test on the other 12 patterns. Similarly, the 
assumption of the equiprobability social deci- 
sion scheme is that the group will solve the 
item for all patterns where both members are 
correct on the second test (Patterns 139 
and 11) and will solve the item with a prob- 
ability of .50 on the eight patterns where one 
member is correct and one incorrect on the 
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TABLE 5 
SIXTEEN POSSIBLE PATTERNS FOR AN HM Pair 


Pattern HM pair 
1 Hoyo: N Mocs 
2 Hoc: N Mon 
3 Hoos N Mnc: 
4 Hac Mia 
5 Herts Moyes 
6 Hoyt, Mon 
7 Hotz N Mnc: 
8 Hens O Mins 
9 Hnc: N Moc: 
10 Hic: O Mon 
11 Hnc: A Mno: 
2 Hno: O Mints 
13 Hit N Mews 
14 Hits O Mon: 
15 Huni N Mies 
16 Hin Mig 
Note, Abbreviations are as follows: C = correct, I = in- 


correct, 1 refers to Test 1, 2 refers to Test 2, and H and M refer 
to high and medium ability conditions, 


second test (Patterns 2, 4, 5, 7, 10, 12, 13, 
and 15). The assumption of the error-wins 
model is that the group will solve the item 
only on those patterns where both members 
are correct on the second test (Patterns 1, 3, 
9, and 11) and will not solve the item on 
the other 12 patterns. Thus, the 12 probabil- 
ities that control individuals at one of the 
three ability levels would have one of the 
four patterns C,Co, etc., were used for the 
predictions of the truth-wins, equiprobability, 
and error-wins social decision schemes for 
each of the six ability levels. The appropriate 
probabilities were multiplied for each of the 
16 patterns within each of the six ability 
levels, for example, 


Hoye, X Mojen He,c, X Me,r, etc., 


and the appropriate products were summed to 
determine the predictions of each of the three 
social decision schemes. Table 6 presents the 
number of items (out of 30) on which each 
of the three social decision schemes could be 
rejected for each of the six ability conditions, 
As indicated in Table 6, within each ability 
condition, truth wins could be rejected on the 
least number of items, equiprobability on 
about half of the items, and error wins on 
most items. More importantly, there was 
little difference in the applicability of a given 
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social decision scheme across ability condi- 
tions, as the differences between the six ob- 
tained frequencies for each of the three social 
decision schemes were all nonsignificant, 
x?(5) = 5.00, for truth wins; x°(5) = 3.32, 
for equiprobability; x°(5) = 2.63, for error 
wins. An analysis of variance on the differ- 
ence scores for the truth-wins prediction 
minus obtained proportion of correct re- 
sponses (both proportions transformed to ¢) 
on each item within each ability condition 
was also nonsignificant, F(5, 174) = 1.36. 
Thus, as with the estimation of second-test 
group performance over all ability conditions, 
these analyses within ability conditions con- 
trolled for the possibility that the added 
practice on the second test enabled the pairs 
to solve items that neither of the pair mem- 
bers had solved as individuals, and indicated 
that such possible learning effects were not 
differential across ability conditions. These 
analyses therefore further support the appli- 
cability of the truth-wins social decision 
scheme over various ability levels on this task. 


Overall Second-Test Performance 


Means and results of Newman-Keuls com- 
parisons for overall second-test performance 
(maximum = 30) of the six pair and three 
individual control conditions are given in 
Table 7. The analysis of variance was highly 
significant, F(8, 144) = 41.57, p < .001. As 
indicated in Table 7, the overall order of 
means was exactly as predicted. Moreover, 
Newman-Keuls comparisons indicated that all ` 
predicted differences between pairs of means 
were significant except for the failure of HH 


TABLE 6 
NUMBER OF Trems on WHICH PREDICTIONS OF TRUTH- 
Wins, EqUIPROBABILITY, AND ERROR-WINS SOCIAL 
DECISION Sciemes Can BE REJECTED WITHIN 
Sıx ABILITY CONDITIONS, ESTIMATED FROM 
PROBABILITY OF [TEM CORRECT BY 
CONTROL INDIVIDUALS ON 
Srconp TEST 


Ability condition 


HH an | HL | MM | ML | LL 


Truth wins 4 7 9 9 8 3 i 
Equiprobability | 15 | 16 | 19 | 13 | 19 | 1 1 
Error wins | 22 | 23 | 30 | 24 | 30 | 29 


Note. N = 30. HH, HM, HL, MM, ML, a re 
combinations of high, medium, and low ability a “© "efer to 
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TABLE 7 
MEAN SEconp-TEst PERFORMANCE AND LEVELS OF SIGNIFICANCE OF COMPARISONS BY 
NewMan-Keuts Tests ror THE NINE ABILITY CONDITIONS 
Ability L LL M ML MM HL H HM HH 
çondition 12.29 17.24 17.77 20.94 22.94 23,53 23.71 25.41 26.71 

L -001 -001 001 001 O01 001 001 001 
LL ns 001 001 001 001 001 001 
M OL 001 -001 001 001 001 
ML 05 05 05 -001 001 
MM ns ns 05 001 
HL ns ns OL 
H ns OL 
HM ns 


to perform significantly better than HM. 
Conversely, ML performed better than M, 
while no difference was predicted. Thus, a 
person of medium ability contributed more 
than expected to a partner of high ability, 
and a person of low ability contributed more 
than expected to a partner of medium ability, 
which indicates a somewhat greater hetero- 
geneity of ability across ability levels than 
expected. However, as predicted, HL did not 
differ from H, which indicates that a person 
of low ability contributed nothing to a part- 
ner of high ability. As predicted, all three 
pairs of a given ability level exceeded their 
individual controls (HH vs. H, MM vs. M, 
and LL vs. L), which indicates greater hetero- 
geneity of ability within a given ability level 
on the remote associates of the RAT than on 
the vocabulary items of the Terman-Concept 
astery Test (Laughlin & Johnson, 1966). 
y% overall comparison of the two studies 
thus illustrates the importance of the group 
task in the expected performance of pairs of 
various ability levels relative to the appropri- 
ate individual controls: A person of lesser 
ability may be expected to make a relatively 
greater contribution on a task such as the 
RAT than on a task such as the Terman. 


GENERAL CONCLUSIONS 


y In summary, the results strongly support 
a truth-wins social decision scheme on a dis- 
junctive or insight task such as the RAT. 
Using the theoretical model of Davis (1973) 
to estimate second-test performance, the 
truth-wins social decision scheme could rarely 
< be rejected as a plausible explanation of the 

` underlying group process, both over all items 


» 3 L, LL, M, ML, MM, HL, H, HM, and HH refer to high, medium, and low ability. 


and for each individual item, and over all 
ability levels and within the six ability levels. 
Conversely, the data clearly rejected both an 
equiprobability and error-wins social decision 
scheme as a plausible explanation of the 
underlying group process. Moreover, the same 
conclusions were supported by estimations of 
group performance from the individual per- 
formance of the actual group members and 
the second-test performance of separate 
samples of control individuals of each ability 
level, which controls for learning by the 
groups over the second administration of the 
same task. 

This strong support for a truth-wins social 
decision scheme on the RAT contrasts with a 
number of previous studies in which an 
equiprobability social decision scheme was 
strongly supported on different problem- 
solving tasks, for example, Davis, Hornik, 
and Hornseth (1970) on sequential choice 
behavior, Davis and Restle (1963) on the 
well-known “gold-dust” and “rope” verbal 
problems, and Olson and Davis (1964) on 
arithmetic tasks from the School and College 
Aptitude Test. The crucial difference seems 
to be the task: the RAT clearly seems to be 
an insight or disjunctive task in which the 
correct answer is immediately evident and 
accepted by the group once it is proposed, 
while the problem-solving tasks of these other 
investigations do not possess this character- 
istic to the same degree, and consequently, 
any answer proposed has an equal chance of 
being adopted as the group’s solution. More- 
over, the present study indicates that the 
insight or disjunctive characteristic of the 
RAT is relatively independent of the ability 
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level of the group, as the truth-wins model 
applied equally across the six ability condi- 
tions. (Of course, this conclusion is limited 
to the college students of the study, for 
which the arbitrary relative designation 
“low? may still reflect considerable absolute 
ability.) Thus, the results offer some of the 
strongest evidence yet reported for the intui- 
tively obvious yet rarely supported logic of 
the Lorge-Solomon (1955) position that 
grouping as such neither facilitates nor 
hinders group relative to individual problem 
solving, but simply increases the probability 
that the group will include at least one 
problem solver. Apparently this logic and the 
corollary truth-wins social decision scheme 
will be supported only for tasks at the ex- 
treme disjunctive end of the basic conjunc- 
tive-disjunctive dimension of task character- 
istics. Also, it is worth noting that in one 
sense this conjunctive-disjunctive dimension is 
not an independent variable but a dependent 

variable: Despite the plausible intuitions of 

experimenters, a task is not an insight task 

by definition but by demonstration in actual 

group performance, In this sense the method- 

ology of Davis’ (1973) social decision 

scheme theoretical model can be applied to 

determine empirically the position of a given 

task on the conjunctive-disjunctive dimension 

of group task characteristics. 

Finally, it should be emphasized that these 
results were obtained with two-person groups. 
With increasing group size, a truth-wins social 
decision scheme may become decreasingly 
applicable for the simple but powerful reason 
that “for truth to win, truth must be heard,” 
An individual who in fact knows the correct 
solution may well have an increasingly dif- 
ficult problem of obtaining the attention and 
comprehension of the group with increasing 
group size. There is also a provocative anal- 
ogy with conformity research, in which naive 
individuals faced with a single erroneous 
confederate in a two-person group rarely 
conform, while conformity to two or more 
unanimously erroneous confederates increases 
greatly (e.g., the prototypic study of Asch, 
1956). Future research should further explore 
the interactions between group size, the group 
task, the ability composition of the group 
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relative to the task, and social decis 
schemes as a theory of the underlying grou, 
process. 
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personal distance; 


This article bridges the literature on inter- 
personal distance and mutual gaze." It focuses 
specifically on the eye-contact/distance rela- 
tionship reported by Argyle (1967). Accord- 
ing to Argyle ( 1967), various nonverbal be- 
haviors such as eye contact (EC) and inter- 
personal distance contribute to maintaining a 
level of intimacy for each interaction. In order 
to maintain a certain level of intimacy a 
change in one behavior, such as interpersonal 
distance, must be compensated for by a 
change in another, such as EC, The model 
assumes the use of nonverbal behavior is 

rned during early socialization. 

Although the equilibrium model predicts 
interactional effects, few studies manipulate 
more than one variable. Roles and norms 


This article is based on the author’s doctoral re- 
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Felipe Russo, Educational Affairs Office, American 
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Washington, D.C. 20036. 

‘Reciprocal looking into each other's eyes by two 
interactants has been called by many terms, includ- 
ing eye contact (Argyle & Dean, 1965), mutual 
glance (Exline, Gray, & Schuette, 1965), and mutual 
gaze (Kendon, 1967). These terms shall be used 
interchangeably in this article. 


and interpersonal distance, Dependent varia- 


of eye-contact 
distance; (b) 
with 


(b) was higher for females than for males; and (c) was 
significantly greater for friends than n 


- It was concluded that the 
tendencies, while the total 


« The failure of the mean length of EC to increase with 


seem to be involved but have not been exten- 
sively investigated, Aiello and Jones (1971) 
found both sex and cultural differences 
(among black, white, and Puerto Rican sub- 
cultures) in Proxemic behavior in children, 
supporting Hall’s (1966) position that “dif- 
ferences in the Structuring of space between 
minority groups and the dominant white 
culture are basic” (p. 165) and thus products 
of early socialization, The interaction between 
spatial behavior and other nonverbal behav- 
iors, such as EC, has not been studied with 
children as subjects, however, 

Argyle suggests that increased eye contact 
during interaction indexes an increase in 
intimacy (given that the input of other be- 
haviors is not concomitantly changed). Other 
explanations have been advanced to explain 
the EC/distance relationship, however, Tt 
may be that an increase in EC with increased 
distance is due to information seeking rather 
than expression of intimacy. It has been pre- 
dicted that the amount of eye-contact should 
increase in proportion to the square of the 
distance because the area visible for the same 
angle of vision is inversely proportional to 
the square of the distance. This formulation 
seems overly naive, and the finding that the 
observed curve does not match the theoretical 
prediction should not be interpreted as par- 
ticularly damaging to the information-seeking 
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explanation (Argyle & Ingham, 1972). As Hall 
(1966) has pointed out, with increased dis- 
tance the input from other senses is also re- 
duced. There is thus a correspondingly greater 
need for reliance on visual receptors, but for 
complex reasons. 

Theoretically, the proportion of time a 
dyad is involved in mutual gaze does not have 
to be associated with longer mutual glances. 
It could also be associated with the use of 
more frequent, but shorter EC. Indeed, 
inspection of the data of a study by Argyle 
and Ingham (1972) shows a reversal in effect 
when these two measures are compared. 
Those authors found that male dyads, seated 
3 feet and 6 feet apart, used EC 22.1% and 
23.4% of the time, respectively. Length of 
mutual glance went from 1.06 seconds to 
.86 seconds, however. For females the situa- 
tion was reversed. The percentage of involve- 
ment in EC at 3 feet versus 6 feet dropped 
from 39,9 to 37.9 seconds, while length of 
mutual glance increased from .68 to 1.42 
seconds. The authors did not comment on 
these findings. How their measuring tech- 
nique was sensitive enough to deal with a 
mean of .68 seconds is unclear, especially 
when the reader cannot tell from the report 
whether one observer was recording the gaze 
direction of both interactants or if two ob- 
servers, one per subject, were used. Argyle 
(1967) has reported that mean length of 
mutual gaze indexes intimacy, Given the dif- 
ference in direction of findings when propor- 
tion of time in mutual gaze versus mean 
length of mutual glance are compared, fur- 
ther study on this point seems warranted. 
Thus, both proportion of time involved in EC 
and mean length of mutual glance were used 
as dependent variables in this study. 


METHOD 
Subjects 


Twenty-four males and 24 females were drawn 
in same-sex pairs from kindergarten, third, and sixth 
grade classes, respectively, of Northeast School in 
Ithaca, New York. Half of these dyads were friend- 
ship pairs as selected by the children themselves, 
and half were selected as “not particularly friends” 
as judged by the teacher. The subjects were lower- 
middle to middle-class whites, living in a rural area. 
There was a total of 72 dyads, or 144 children in 
the study. 
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Experimental Setting 


The experimental manipulation took place in a 
large, well-lit vacant art classroom. Chairs were used 
which conformed to the sizes used in kindergarten, 
third, and sixth grades, respectively, and which were 
marked with a piece of white tape down the middle 
of the seat of each chair. In the middle of the 
room, a 10-foot strip of white measuring tape 
extended on the floor in a line between two spe- 
cially constructed one-way observation booths. This 
tape was used as an orientation axis for placement 
of chairs and for efficient measurement of the con- 
versational distances. The chairs were oriented in a 
face-to-face position with the line bisecting each 
seat lined up over the floor tape. Each observer had 
a clear line of vision over an interactant’s shoulder 
to a target’s face for measurement of gaze direction. 

The observation booths were decorated to look 
like play houses and remained in the art room at 
all times so that they were familiar to the subjects. 
Each booth contained an observer who was unfamil- 
iar with nonverbal research and not informed of 
the hypotheses of the study. An Esterline-Angus 
Recorder was used to measure the eye contact of 
the subjects. 


Reliability of Visual Recording 


Reliability of observation using a method similar 
to that of Exline (1963) was tested with the ob- 
servers placed between 8 feet and 11 feet from a 
target (the range of subject-observer distance in the 
experimental situation). Reliability ranged” from .80 
to .95. The reliabilities for measures of number of 
mutual glances and for length of mutual gaze were 
comparable. 


Validity 

It has been suggested that increased observer error 
at greater distances may underline the EC/distance 
finding (for a summary and criticism of the argu- «> 
ment see Argyle & Ingham, 1972). After observer 
training, pretesting sessions which involved corre- 
lating the observers’ records of gaze direction with 
the interactants’ reports were held. When the inter- 
actants were asked to converse “as naturally as pos- 
sible” given the recording task, correlations between 
.89 and .95 were obtained. Under these experimental 
conditions at least, when the subject said he was 
looking at the other’s eyes, the observers’ record 
agreed with him. 


Design 


The study was in two parts. Part 1 was concerned 
with the development of approach distance and is 
not reported in this article. Part 2 dealt with the 
eye-contact/distance relationship. 

A four-way classification design was used, 3 X 3 
% 2X2, A factor being distance between the chairs 
of the interactants, B factor being grade in school, 
Cc factor being sex, and D factor being level of 
friendship. Selection of male and female friendship 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS: PROPORTION OF TIME ENGAGED IN MUTUAL GAZE 
OvER CLOSE, MIDDLE, AND FAR DISTANCES 
x 
Kindergarten Third grade | Sixth grade 
Measure Male Female Male Female Male Female 
F NF F NF F NF F | NF F NF F NF 
Close distance 
M Ad 22 21 16 24 .22 32 26 16 17 A5 32 
we” 208 E | 21 HDS AO | 07 1910" fh 510) 1 ",09 (08 A a |) tee 
Middle distance 
M .22 .20 „30 22 -20 .20 28 +35 23 At AT 39. 
SD A3 12 13 AS -10 04 19 13 AO 06 3 10 
Far distance 
M 21 22 30 -26 13 29 38 29 25 25 31 37 
SD -06 10 Al 09 12 13 19 09 14 14 24 A2 


Note. F = friends; NF = nonfrienda. 
k 
and nonfriendship dyads was alternated. The chairs 
were placed at three distances during the conversa- 
tion, which lasted for a total of 6 minutes, 2 minutes 
at each distance. Interchair distances of 1 foot, 
34 feet, and 6 feet were used, The order of presen- 
tation of distances was controlled so that every 
order was presented once at the level of the D 
factor, the friendship condition, which had an » 
of six dyads in each sell. Also, the order of condi- 
tions was controlled as much as possible so that any 
effects from increased practice of raters was dis- 
tributed over all of the conditions. This control was 
A limited in that sometimes classes were unavail- 
** able and detailed scheduling was impossible. But 
generally, the control was maintained. 

The dependent variables were (a) the percentage 
of time in mutual gaze for each of the three dis- 
tances (i.e., the proportion of the 2 minutes actually 
spent in eye contact) and (b) mean length of 
mutual glance at each of the three distances. 


Procedure 
After a dyad with the appropriate characteristics 


®& was selected and brought to the classroom used for 


the experimental setting, subjects were told that the 
author was interested in studying how “kids use 
space.” For the study of mutual gaze each dyad 
was asked to “try out some distances and tell me 
how you like them.” Each dyad sat and conversed 
on a topic of their choice for 2 minutes at each of 
the three distances while the experimenter left the 
room, After each 2-minute interval, the experimenter 
reentered the room and had the subjects rate how 
they liked talking at that particular distance before 


moving to a different one. In general, the subjects 
enjoyed their task and the observers remarked how 
involved the subjects became in their conversations, 
even at the kindergarten level. 


RESULTS 


So little is known in this area that Type 1 
errors have a high cost value. The study 
therefore maintains an attitude of flexibility 
and speculation in the analysis of the results, 
If a trend reaches the .10 level it is noted as 
a potentially important effect, which further 
investigation may substantiate. Both overall 
analyses of variance and separate analyses for 
each grade are performed, where appropriate.” 


Percentage of Time Engaged in Mutual Gaze 


Table 1 summarizes the overall means of 
the variables broken down by distance, grade, 
sex, and friendship. Percentage of time in 
contact increases linearly with distance and 
with grade; females use more eye contact 
than males, and friends use more eye contact 
than nonfriends. 


2Several authors have noted the importance of 
detecting research possibilities in the preliminary 
investigation of a problem, For discussions of the 
potential errors involved in this approach see Block 
(1960) and Petrinoyich and Hardyck (1969). 
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An overall four-way analysis of variance of 
these variables with repeated measures on 
distance showed that the effect of distance 
was significant, F (2, 120) = 3.7, < .05, and 
there was a strong trend for a grade effect, 
F(2,60) =3.0, p < .10, percentage of time 
in eye contact increasing linearly with in- 
crease in distance and grade. The sex effect is 
highly significant, F(1, 60) = 19.6, p < .001, 
females engaging in more mutual gaze than 
males. There also was a trend for a Grade X 
Sex interaction, F(2, 60) = 2.8, p < -10. The 
trend for the effect of Sex X Friendship was 
also strong, F(1,60) = 3.0, p<.10. The 
other trend of note was that of Distance X 
Sex X Friendship, F(2, 120) = 2.6, p < .10. 

Since there was a trend for a Grade X Sex 
interaction, separate analyses of variance 
were performed. for the kindergarten, third, 
and sixth grades, respectively. At the third 
grade level, sex became significant at the .05 
level, F(1,20) = 5.4, with females again en- 
gaging in more mutual gaze than males. The 
interaction effect of distance, sex, and friend- 
ship was also significant, F(2,40) = 5.1, 
p< .05. The analysis of variance for the 
sixth grade subjects showed the effect of dis- 
tance to be significant, EC increasing with 


TABLE 2 


MEANS AND STANDARD Deviations MEAN LENGTH or EYE CONTACT 
Over CLOSE, MIDDLE, AND Far Distances 
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gaze, yield differing patterns of significance. 


distance, F (2, 40) = 3.4, p < .05, and the ef- 
fect of sex to be significant, F(1, 20) = 12.7, 
p < .01, females using more eye contact than 
males. 


Mean Length of Mutual Gaze 


Mean length of mutual gaze was measured 
by counting the number of units on the 
Esterline-Angus graph paper that were cov- 
ered when gaze was recorded. Table 2 shows 
the means and standard deviations of the 
variables used in the analysis of variance. 
An overall analysis of variance showed that -~ 
the effects of grade, F(2, 60) = 4.1, p < .05, 
sex, F(1,60) = 12.6, p < .001, and friend- 
ship, F(1, 60) = 5.4, p < .05, are significant, 
The Sex X Friendship interaction is signif- 
cant, F(1,60) = 4.5, p< .05, with dyads 
that are both female and friends having 
longer mutual glances. Distance, however, does 
not yield a significant effect, F(2, 120) = 1.9. 


Discussion 


The two measures, proportion of time in- 
volved in EC and mean length of mutual 


Friendship pairs had a significantly larger 
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Kindergarten Third grade Sixth grade 
Measure Male Female Male Female Male Female Jg 
t } e | i 
F | NE F NF | F NF F NF | oF NF F NF 
x Close distance 3 
M 2.67 | 3.09 | 445 | 315 | 298 | 3.08 asi | 319 | 284 | 275 | 4 
SD A 180 lea E N toate gone ase ot | ose 
Middle distance 3 3 
M 3.75 | 3.00 | 3.75 | 3.17 | 3.04 | 2.76 | 3.80 | 415 | 390 | 319 | 698 | 4 
SD | 1.73 | 110 | 96 | 213 | £72 | 53 | 138 | 206 | 132| 142 Ae 
SFS i Sahe LN | | | 3 
Far distance 
M sa | ao an am | A0 A a a ae eo l ase 
SD 1.67 | 53 | 6 | .77 | 1.02 | 154 | 298 | 107 | 229 | 128 | 381 4 


Note, F = friends; NF = nonfriends. 
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mean length of mutual glance, confirming 
Argyle’s assertion that the mean length of 
mutual glance indexes intimacy in the inter- 
action, However, Proportion of time involved 
in mutual gaze did not significantly differ 
with friendship level, 

The sex differences in visual behavior could 
also support Argyle’s hypothesis, As Argyle 
(1967) points out, females are socialized to 
have a higher affiliative orientation which 
they may express through using more EC. 
However, other explanations can account for 
the higher use of mutual gaze by females, 
A higher affiliatiye orientation may increase 
the female’s sensitivity and/or responsivity to 
social cues. Since the eyes of the other are 
& prime source of information about another's 
attention, females socialized in our culture 
į would be expected to look more at the other 

person’s eyes for this social information, thus 

‘~orpasing the amount of time a female dyad 

Heider a EC, and perhaps (but not neces- 

sarily) incréasing the mean length of mutual 

glance for the dyad. The data show that 
looking time in general shows an increase in 
the significance of the sex effect with age, 

Females may rely more on external cues in 

general, and insofar as the eyes of another 

are salient cues, the percentage of time in 
monitoring them should increase. 

Since the sex effect was significant whether 
the dependent variable Was percentage of 
time engaged in mutual gaze or mean length 
of mutual gaze, both expressive and monitor- 

. "ing functions (Kendon, 1967) could be at 
work. Further research must uncover how 
they interact, which variables selectively in- 
fluence their effects (e.g., will affect the reli- 
ance on external cues but not expression of 
affiliation), and how the behavior of one 
member of a dyad influences the perceptions 
| of the other, 

N Argyle’s model does not adequately account 
< for the entire pattern of results found in this 
P study. Length of mutual gaze is considered 
` the more sensitive measure of affiliative orien- 
tation of the interactants, It is associated 
with both the sex and friendship manipula- 
tions. Yet it did not signiñcantly increase 
with distance between interactants. Both the 
sex and friendship manipulations could be 
considered as effecting a change in affiliative 
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motivation although the sex manipulation 
theoretically could also involve a change in 
the importance of external cues, When the 
“intimacy level” is thus changed by selection 
of the motivation of the interactants, average 
length of mutual glance is increased, When the 
“intimacy level” is manipulated by increasing 
the distance between parties, the concomitant 
change in length of mutual glance is not ob- 
served, though percentage of time in mutual 
gaze increases, Percentage of time of looking 
at the other in general also increases, 

It appears that the two measures reflect 
different underlying dynamics. Not only was 
the hypothesized relationship between length 
of glance and distance unsupported, the 
friendship and interactional effects on per- 
centage of EC did not materialize, Proportion 
of time does not seem to be as strongly asso- 
ciated with affiliative motivation. An alterna- 
tive hypothesis is that the percentage of time 
engaged in mutual gaze indexes information 
seeking. Increased length of mutual glance is 
associated with the female sex role and 


friendship conditions, but not with increased — 


interpersonal distance, The affiliative female 
sex role interacts with friendship to yield 
significantly longer mutual gaze, but does not 
produce a similarly reliable effect on per- 
centage of time engaged in mutual gaze, 
Argyle's equilibrium model, which is based 
on the approach forces of affiliative motiva- 
tion is thus not Supported by the findings 
with respect to mutual glance in this study, 
The implications of the patterns of signifi- 
cance are intriguing. Further research must 
explore and replicate the differences between 
the measure of Proportion of mutual gaze and 
mean length of mutual gaze. It is possible 
that the effect of friendship on the propor- 
tion of mutual gaze measure would attain sig- 
nificance had it been manipulated differently, 

In summary, it seems the proportion of EC 
and mean length of mutual glance reflect 
different dynamics of the conversational situ- 
ation. Percentage of time in mutual gaze in- 
creases with interpersonal distance when 
mean length of mutual gaze does not under 
the conditions of the study. The data suggest 
that affiliative motivation is not the appropri- 
ate variable to explain the percentage of 
EC/distance relationship, What the appropri- 
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ate explanatory variables are remain in the 
realm of speculation. It may be that the 
greater the distance between interactants, the 
greater their dependence on visual rather 
than nonvisual cues, for a variety of complex 
reasons. The necessity to increase attention 
to visual cues and to focus on those cues with 
high information value, that is, the eyes and 
face of the other, thus can adequately explain 
the percentage of eye-contact/distance rela- 
tionship. Further research must separate the 
two dependent variables of length of and 
percentage of time engaged in mutual gaze 
and pursue this hypothesis. 
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% that the judgments of actors made by passive observers evidenced less “coy- 
respondence” than did the judgments of active observers, Finally, a compari- 


studies, 


Building upon the theoretical work of often discussed by Jones and Nisbett, 1971) 
Heider (1958), Jones’ and Nisbett (1971 ) refers to an individual who neither influences 
engages Pty formulated a „Provocative hy- nor is influenced by the actor he is observing, 
cools + gerning the “divergent percep- Perhaps the most Obvious reason for sus- 
4 tions?” of actors and observers, These authors pecting attributional differences between ac- 

~ have proposed that there is a general tendency tive and Passive observers is suggested by a 

for actors to attribute their own actions to consideration of the different role require- 
situational requirements but for observers of ments which characterize the two types of ob- 
the same actions to explain them by reference servers. For the active observer, it is psycho- 
to the actors stable personal dispositions. logically and strategically necessary to be 
Empirical support for Jones and Nisbett’s able to understand and predict the behavior 
Proposition has come from direct tests of the of those with whom he interacts, In many 
hypothesis (Nisbett, Caputo, Legant, & Ma- situations, this motivation will prompt an ac- 
recek, 1973; Lay, Ziegler, Hershfield, & tive search for information pertaining to the 

4 Miller, 1974) as well as indirect sources other’s stable dispositions, In many other 
a) (Jones & Harris, 1967; Jones, Rock, Shaver, situations, however, the’ accessible information 

Goethals, & Ward, 1968). is limited. In these situations, the psycho- 

In the present study, an attempt was made logical need to predict and understand the 
to probe more deeply the perceptual discrep- other’s behavior may result in the distortion 
ancies found between actors and observers. of the meaning and completeness of the 
More specifically, the major objective of the 

+ Present study was to empirically and theo- This research was supported by a Canada Council 

retically extend the basic actor-observer dis- Doctoral Fellowship awarded to the senior author, 

tinction to include active as well as passive The authors would like to thank Janinne Braband, 
} observers. The term active observer refers to John Holmes, and Michael Ross for their helpful 
any participant in a social interaction situa- 
tion who, in addition to observing the behav- Miller, Department of Psychology, University of 
ior of the other participants, influences the Western Ontario, London, Canada. é 
behavior of the other participants and is him- * The distinction between active and passive ob- 


A . ‘= Servers was made by Jones and Nisbett (1971) in 
self behaviorally influenced by the other pap their original statement, but they proffered no data 


ticipants. Active observ ers, therefore, function relevant to this distinction, nor as far as the present 
as both actors and observers. The termine aoa know, has any relevant research been forth- 
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available information. Distortion of this kind 
is likely to be in the direction of inferring 
more person information than is warranted. 
Furthermore, the early judgments of active 
observers, through either the process of per- 
ceptual assimilation (Jones & Gerard, 1967) 
or the phenomenon of the self-fulfilling 
prophecy, are likely to receive confirmation 
as the interaction continues and lead to a 
strong conviction that a great deal has been 
learned about the personality of the actor. 

The role requirements of the passive ob- 
server, on the other hand, do not make it as 
necessary to be able to predict and under- 
stand the behavior of the actor. The personal 
element borne of mutually contingent inter- 
action is absent in this relationship. More- 
over, the failure to make an early, simple, 
and correct assessment of the nature of the 
actor is not as critical for the passive observer 
as it is for the active observer. Passive ob- 
servers, unlike active observers, can entertain 
contradictory evidence and suspend judg- 
ment until later in the interaction. 

The reader may recognize in the preceding 
discussion and essence of Jones and Davis’ 
(1965) concepts of “hedonic relevance” and 
“personalism.” By hedonic relevance, Jones 
and Davis refer to the extent to which an 
actor's behavior promotes or undermines the 
purposes of the perceiver. An action is con- 
sidered personalistic to the extent that the 
action is uniquely conditioned by the per- 
ceiver’s presence. It is the contention of 
Jones and Davis that, generally, the greater 
the hedonic relevance and personalism of an 
action for a perceiver, the greater the strength 
of the person inferences drawn by that per- 
ceiver. Considering the earlier discussion, it 
would seem clear that an actor’s behavior 
will be both more hedonically relevant and 
personalistic for an active observer than for 
a passive observer. 

In summary, it appears that the different 
role requirements of active and passive ob- 
servers may very well affect their utilization 
of behavioral data and, consequently, their 
tendencies to make person attributions. The 
role requirements of active observers appear 
to conspire to make observers of this kind see 
more personality in the behavior of actors 
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than do the role requirements of passive ob- 
servers. 

One of the more important implications of 
the proposed perceptual differences between 
active and passive observers emerges with a 
consideration of the attributional principle of 
“correspondent inferences” advanced by Jones 
and Davis (1965). The concept of corre- 
spondent inferences is concerned with the oc- 
currence of person attributions and has as its 
central tenet the assumption that the absence 
of coercive environmental restraints is a pre- 
condition for the assignment of dispositional 
qualities. Generally stated, “Correspondence 
of inferences declines as the action to be ac- 
counted for appears to be constrained by the 
setting in which it occurs” (Jones & Davis, 
1965, p. 223). Within the present context, it 
seems reasonable to expect that the degree of 
correspondence evidenced in the inferences 
drawn by active and passive observers may be 
different. Active observers, as a consequence 
of their greater tendency to see disposition in 
the behavior of the actor, can be expected to 
discriminate less between constrained and 
nonconstrained behavior than can passive ob- 
servers. To use Heider’s (1958) phrase, the 
behavior of the actor will be more likely to 
“engulf the perceptual field” for active ob- 
servers than for passive observers. 

By employing an interaction setting, 
therefore, it is possible to examine perceptual 
and attributional differences between active 
and passive observers. The use of an inter- 


action setting also permits, however, the ex- ^ 


amination of actor-observer differences in the 
way situational attributions of responsibility 
are distributed over personal and impersonal 
forces. Behavioral causality which is not at- 
tributed to the actor can be attributed to 
either the other participant in the interaction 
or to some normative or physical aspect of 
the structure in which the interaction took 
place. 

While there are no direct data comparing 
actors and observers in this respect, a number 
of interpersonal bargaining studies do provide, 
from an actor’s perspective at least, some 
evidence relevant to this point (Kelley, 
Helus, Lamm, Servais, Veen, Zaleska, & 
Wouters, 1971; Kelley, Shure, Deutsch, 


y 
yo 


bie, & Thibaut, 1970; Pruitt & Drews, 1969; 
Shure & Meeker, 


‘than in impe 
Whether or 
tendency remains an empirical question, 

The experimental vehicle chosen for the 
investigation of the questions raised above 
was the familiar Prisoner’s Dilemma (PD) 
game. The PD (see Figure 1) is a two-person 
game which provides binary response alter- 
natives on each trial, with payoffs contingent 
on the values in the resultant cell. The two 
responses in the PD are generally referred to 
as the cooperative choice (X) and the com- 
petitive or defective choice (Y). Each choice 
is usually made” independently and without 
communication, 

The PD provides a useful framework for 
the present study for a number of reasons. 
* First, it is possible for the experimenter to 
yimpose a considerable degree of control over 
the interaction sequence. Control can be in- 
troduced with respect to the amount of feed- 
baek actors and observers receive, as well as 
with respect to the specific nature of the 
| feedback. In such a context, it is Possible to 
provide actors and observers with identical 
information. 
` In addition to the virtue of experimenter 
* control, the PD has the important advan- 
tage, from the viewpoint of the present study, 
of favoring a competitive Strategy over a 
` cooperative Strategy (Kelley & Stahelski, 
1970; Rapoport, 1967; Miller & Holmes, 
in press). In other words, the situational 
forces are stronger in the direction of com- 
petition than they are in the direction of 
Cooperation. As a consequence of the differen- 
tial viability of cooperation and competition 
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FIGURE 1. The gaming matrix, 


in the PD, the postulate of correspondent in- 
ferences suggests that more person informa- 
tion should be inferred from a player who 
plays cooperatively in such a game than from 
a player who, consistent with the coercive 
qualities of the game, plays competitively. 

Having discussed both the theoretical 
framework and the experimental vehicle for 
the present study, it is now possible to state 
the hypotheses which constitute the focus of 
this study, 


Hypothesis 1. Players in the PD (actors) 
will attribute more responsibility for their 
behavior to extërnai causal agents than will 
nonplayers (observers), F urthermore, players 
will perceive their behavior as less disposi- 
tionally influenced than will nonplayers, This 
hypothesis derives directly from the basic 
Jones and Nisbett Proposition concerning the 
“divergent perceptions” of actors and obsery- 
ers, 

Hypothesis 2, Active observers (observers 
engaged in interaction with the actor) will 
attribute more behavioral responsibility to 
the actor than will Passive observers (obsery- 
ers not behaviorally engaged with the actor), 
Furthermore, active observers will perceive 
the behavior of the actor to be more dispo- 
sitionally influenced than will passive obsery- 
ers. The assumption underlying this hypothe- 
sis is that the more involved a person is with 
the target of his Perceptions, the greater will 
be the tendency to perceive that target person 
(actor) in dispositional terms, 
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Hypothesis 3. Consistent with the principle 
of correspondent inferences, it is predicted 
that more person inferences will be drawn 
from cooperative behavior in the PD than 
from competitive behavior. It is also predicted 
that the differential tendency to draw person 
inferences from the two types of behavior 
will be different for active and passive ob- 
servers, Active observers, since they are ex- 
pected to be more willing than passive obsery- 
ers to infer dispositions from actors’ behav- 
ior, are expected to be less influenced by sit- 
uational constraints on the actor’s behavior 
when making person inferences, 

Finally, although not guided by a firm 
hypothesis, the Present study attempted to 
determine if the situational attributions of 
actors and observers followed a similar pat- 
tern. More Precisely, do the situational at- 
tributions of actors and observers emphasize 
personal and impersonal forces to a similar 
degree? 


METHOD 
Subjects 


The subjects were 40 first-year female students 
at the University of Waterloo who took part in the 
experiment as Part of an introductory psychology 
Course requirement, 


Independent Variables 


Perspective manipulation. Subjects were run two 
at a time. The two subjects scheduled for each 
experimental session arrived 
areas and were randomly designated as active or 


observers were not informed of the Presence of the 
passive observers). Both active and passive observ- 
ers Were taken individually to separate experimental 
rooms which were outfitted with intercom systems, 
The intercom systems were set up in such a way 
that the active and passive observer could speak 
directly to the experimenter in another room but 
not to each other, The experimenter was able to 
Speak directly to both types of observers, 

Program manipulation, There was no second 
player in the study. Subjects designated as players, 
thus, had a dual role. With respect to judgments 


spect to judgments about the other player, they 
served as active observers. The behavior of the 
alleged second player was controlled by 
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perimenter. This manipulation was concealed fro 
the subjects by prohibiting direct communicatio: 
between bargainers or between bargainers and ob- 
servers. The simulated behavior of the second player 
took one of two general forms. For one half of the 
interaction sequences, the experimental program 
was “cooperative,” and for the other half, the ex- 
perimental program was “competitive.” 

Cooperative Programs consisted of a tit-for-tat 
treatment for 7 out of every 10 trials, with the 
other 3 responses being randomly introduced coop- 
erative responses (A tit-for-tat treatment is a 
strategy in which the other player [program] con- 
sistently reciprocates the subject’s behavior with a 
one-trial lag.) The competitive program also con- 
sisted of a tit-for-tat treatment for 7 of every 10 
trials, but instead of 3 cooperative responses, 3 
competitive responses were randomly introduced 
every 10 trials. 

The design of the present study, thus, was a 2 X 2 
factorial, with two levels of perspective (active vs. 
passive observer) and two levels of experimental 
program (cooperative vs. competitive) .2 


Procedure 


On their arrival at the reception room, all sub- 
jects were told that the study involved an examina- 
tion of bargaining behavior. Active observers were 
told that they would be playing a bargaining game 
with another girl, while Passive observers were told 
that they would be observing two girls playing a 
bargaining game. Both active and passive observers 
were told that the other subject(s) had arrived 
and had been taken to the experimental area, The 
rooms active and passive observers were taken to 
were identical, each haying a small table, a chair, 
and an intercom outlet. 


the study was to investigate bargaining behavior 
and the participants’ Perceptions of the bargaining 
interaction. Passive observers were also told that 
the purpose of the study was to investigate bar- 
gaining behavior, but Passive observers Were told, 
in addition, that the experimenter Was interested in 
how bargaining behavior was viewed by People not 
involved in the actual bargaining. Both active and 
Passive observers instructions 
for the game, a game matrix identical to that pre- 
sented in Figure 1, and a sheet on which to record 
the trial-by-trial Tesponses of the players. 

Active observers, who were always referred to as 
Player A, were old that they would not be able to 


B but are Teferred to as actors 
r and observers when 
the focus of their judgments is Player A, 
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fluous communication to a minimum. The function 


> of the intercoms was described to the active observ- 


ers as follows: 


When you have decided upon your move for a 
particular trial, indicate this choice by means of 
the intercom to the experimenter. The other 
player will do the same. When both moves have 
been relayed to the experimenter, he will indicate 
to you, also by means of the intercom, the moves 
made by each of the players, as well as the re- 
sultant payoff points. You will then record these 
moves and points and decide upon your next 
move. This procedure will continue until the ex- 
perimenter indicates that the experiment is over. 


Passive observers were shown the instructions 
given to the active observers and were told that they 
would be able to listen as the experimenter an- 
nounced the moves and points made by each player. 
Passive observers were also told to record the moves 
as well as the points. 

The game lasted for 30 trials, at which time a 
postbargaining questionnaire was distributed to both 
active and passive observers. When the subjects had 
completed this questionnaire, the experiment was 
officially over, and the subjects were debriefed. 

Dependent variables. Items on the postbargaining 
questionnaire constituted the dependent variables in 
this study. These questions focused on the perceived 
causal influence of Players A and B, and the game 
situation, as well as on the person information 
inferred from the bargaining behavior. 


RESULTS 
Differences between Actors and Observers 


To test the first hypothesis, four dependent 
measures were employed. The first two mea- 
sures asked subjects to indicate (a) how re- 
sponsible Player A was for Player B’s be- 
havior and (b) how responsible the game sit- 
uation was for Player A’s behavior. Analyses 
of variance performed on the responses to 
these questions yielded two main effects of 
perspective.” Specifically, actors (Player A) 
saw themselves as more responsible for Player 
B’s behavior and saw the game as less re- 
sponsible for their own behavior than did 
passive observers (F = 4.21, p< 05, and F 
= 4,26, p < .05, respectively).* 

The measures relevant to the second part 
of Hypothesis 1 included the following two 
questions: (a) How accurately did Player 
A’s (your) behavior reflect her (your) per- 
sonality? and (b) How much could be learned 
about Player A (you) from her (your) be- 
havior in the game? Analyses of variance 
again yielded main effects of perspective. Ac- 


TABLE 1 


MEAN RESPONSES TO ITEMS TAPPING STRENGTH OF 
PERSON INFERENCES DRAWN FROM 
Prayer A’s BEHAVIOR 


Perspective 
Active Passive 
Item observers observers: 
Coop | Comp | Coop | Comp 
pro- | pro- | pro- | pro 
gram | gram | gram | gram 
How accurately did Player 
A’s (your) behavior ře- 
fect her (your) general 
personality? 3.40 | 4.20 | 5,00 | 5.40 
How much was it possible 
to learn about Player A 
(you) from her (your) 
behavior? 3.50 | 4.30 | 4.20 | 5.80 


Note. Lower values indicate greater accuracy or that more 
has been learned. Coop refers to the cooperative program and 
Comp to the competitive program, 


tors believed (see Table 1) that their be- 
havior more accurately reflected their person- 
ality than did observers (F = 3.64, p< 
.065). Actors also indicated that more could 
be learned about them from their behavior 
than did observers (F = 3.96, p < .05). 

Hypothesis 1, thus, was not confirmed. 
Contrary to the predictions derived from 
Jones and Nisbett’s (1971) proposition, ac- 
tors attributed more behavioral responsibility 
to themselves and perceived more disposition 
in their behavior than did observers. 


Differences between Active and Passive 
Observers 


The first four dependent measures used to 
test perceptual differences between active and 


3Since actors and observers were run in pairs, it 
may be argued that perspective should be treated 
as a within factor in the analysis of the data. How- 
ever, analyses of variance of the data as a 2X2 
factorial design with one between and one within 
factor, compared to analyses with two between and 
no within factors, revealed no differences in the 
error terms or associated F values. In order to 
present the data in the simplest possible manner, 
therefore, most of the reported analyses of variance 
refer to a 2 X 2 factorial design, with both perspec- 
tive and program considered as between factors. 

4 Due to the large number of analyses conducted, 
it was decided that a description of program and 
interaction effects should be withheld until the dis- 
cussion of Hypothesis 3. 
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passive observers were identical to the mea- 
sures used to test actor-observer differences, 
except that these measures focused on sub- 
jects’ perceptions and evaluations of Player 
B (the program). 

On the two measures asking subjects the 
extent to which Player B was a locus of 
causality, analyses of variance produced two 
main effects of perspective. First, Player B 
was judged to be more responsible for Player 
A’s behavior by active observers than by 
passive observers (F = 14.90, p< .001). 
Second, active observers saw the game as less 
responsible for Player B’s behavior than did 
passive observers (F = 8.71, p < .006). 

On the two measures concerning the extent 
to which person information could be inferred 
from Player B’s behavior, analyses of vari- 
ance revealed no significant effects of per- 
spective. However, the sum of the responses 
to the two measures yielded a significant 
Perspective X Program interaction (F = 4.52, 
p < .05).5 Inspection of the means (see Table 
2) indicates that passive observers, under the 
competitive program, believed that they 
learned less about Player B and thought 
Player B’s behavior less accurately reflected 
her personality than did active observers. The 
means of the sums of these two measures 
( = 10.9 vs. X= 8.3, respectively) were 
significantly different (F = 4.32, p < .05). 


TABLE 2 


MEAN Responses TO [reMs TAPPING STRENGTH OF 
PERSON INFERENCES DRAWN FROM 
PLAYER B’s BEHAVIOR 


Perspective 
Active Passive 

Item observers observers 

Coop | Comp | Coop | Comp 

pro- | pro- | pro- | pro- 

gram | gram | gram | gram 
How accurately did Player 
B’s behavior reflect her 

general personality? 3.90 | 4.40 | 3.80 | 5.50 
How much was it possible 
to learn about Player B 

from her behavior? 4.10 | 3.90 | 4.30 | 5.40 

Sum 8.00 | 8.30 | 8.10 | 10.90 


Nolte. Lower values indicate greater accuracy or that more 
has been learned. Coop refers to the cooperative program and 
Comp to the competitive program. 
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In summary, the preceding results sup- 
port Hypothesis 2, Active observers perceived 
Player B as being much more causally potent 
than did passive observers. Consistent with 
the causality data, although admittedly less 
compelling, is the evidence which suggests 
that active observers more readily inferred 
personality from Player B’s behavior than 
did passive observers. 

Additional dependent measures relevant to 
active-passive observer differences included 
eight semantic differential scales. Four of 
these scales (active-passive, cooperative- 
competitive, aggressive-defensive, and flexi- 
ble-inflexible) have previously been shown 
(Miller, Note 1) to be particularly relevant 
to the evaluation of bargaining behavior. A 
multivariate analysis of variance was carried 
out on these four items (see Table 3) and 
revealed a main effect of perspective (F = 
2.64, p < .05), a main effect of program (F 
= 7.65, p < .001), and no interaction effect 
(F < 1). The main effect of perspective was 
due primarily to the fact that active observ- 
ers described Player B as being more active, 
more aggressive, and more flexible than did 
passive observers. Taken together, these dif- 
ferences would seem to suggest that active 
observers were perceiving the actor as being 
more dynamic than were passive observers. 

The remaining four semantic differential 
scales (likable-unlikable, mature-immature, 
intelligent-unintelligent, and  sincere-insin- 
cere) were believed to be more sensitive to 
general dispositions than were the game-re- 
lated scales. An attractiveness dimension was 
assumed to underly these items. The distinc- 
tion between the two types of scales was not 


5 These two dependent measures were significantly 
correlated (r= .315, p < .05, two-tailed). 
s SA multivariate analysis of variance is simply a 
joint test on a set of dependent variables. It is 
essentially the same as a standard analysis of vari- 
ance on data that have been summed over a set of 
dependent measures. Multivariate analysis of vari- 
ance, however, does not assign an equal weight to 
each dependent measure as occurs in analyses of 
sums. Multivariate analysis, while taking into ac- 
count the intercorrelations between dependent mea- 
sures, weights these measures differentially to maxi- 
mize the multivariate F ratio, For a more detailed 
discussion of the multivariate analysis of variance 
the interested reader is directed to Morrison (1967). 
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TABLE 3 


MULTIVARIATE ANALYsIS OF DYNAMISM AND ATTRACTIVENESS SCALES 


Constituent scale 


Dynamism 
Active-passive 
Aggressive-defensive 
Cooperative-competitive 
Flexible-inflexible 


Multivariate 


Attractiveness 
Likable-unlikable 
Mature-immature 
Sincere-insincere 
Intelligent—unintelligent 


Multivariate 
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Perspective effect Program effect Interaction effect 
4.041" 7,.802*** <i 
5.432** <1 <1 
<1 8.020"** <1 
5.646** 9.3041 <1 
2.637** 7,650**** <1 
<i 3.600* 14,400**** 
1.793 <1 <t 
2.404 1.447 <1 
5.788** 14.483**** <i 
1.575 5,052*** 4,393"** 


Nolte, For multivariate / values, df = 4, 33; for others, df = 1, 36, 


*p <,07. 
Dp <05. 
PIED 01s 
+p < 1001. 


made salient to the subjects, and their order 
on the questionnaire was randomized. 

The multivariate analysis performed on 
these four scales (see Table 3) revealed no 
significant effect of perspective (F = 1.57, p 
<.20). A significant multivariate effect of 
program emerged, however, indicating that 
Player B was perceived as more “attractive” 
under the cooperative program than under 
the competitive program (F = 5.05, p< 
.003). A significant multivariate Perspective 
X Program interaction indicated that the pro- 
gram effect was primarily due to the responses 
of active observers. While active observers 
found Player B more attractive in the coop- 
erative condition than in the competitive con- 
dition, passive observers were not affected by 
program (F = 4.39, p < .006). 

Generally, the data gathered from the 
eight semantic differential scales complement 
the data gathered from the other measures. 
The finding that active observers judged 
Player B to be more dynamic than did pas- 
sive observers is consistent with the finding 
that active observers attributed more causal 
responsibility to Player B than did passive 
observers. Similarly, the finding that active 
observers discriminated between programs on 
the attractiveness scales while passive observ- 
ers did not is consistent with the finding that 
active observers drew more person inferences 


than did passive observers. Active observers 
would seem to have a more highly differenti- 
ated view of the target person than would 
passive observers. 


Active-Passive Observer Differences in 
Correspondence of Inferences 


Hypothesis 3, concerning correspondent in- 
ferences, is based on the assumption that the 
structural properties of the PD precipitate 
competitive behavior. To test the validity of 
this assumption, subjects were asked whether 
the nature of the game lent itself more to 
cooperation or competition. Ninety percent of 
the 40 subjects gave competition as their re- 
ply. It was also assumed that subjects would 
perceive the behavior of Player B (program) 
as cooperative or competitive, according to 
condition, and therefore as either consistent 
with or inconsistent with the bias of the 
game, Subjects did in fact see the behavior 
of Player B as more cooperative in the coop- 
erative program than in the competitive pro- 
gram (F = 7.70, p < 01). 

The data relevant to Hypothesis 3 come 
from two sources. The first source concerns 
the extent to which subjects drew person 
inferences about Player B under different 
program conditions. Consistent with the prin- 
ciple of correspondent inferences, subjects al- 
located more responsibility for Player B’s 
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behavior to the game under the competitive 
program than under the cooperative program 
(F = 8.71, p < .006) and indicated that the 
behavior of Player B more accurately re- 
flected her personality in the latter condition 
than in the former condition (F = 4.87, p < 
.05). In other words, subjects’ responses seem 
to reflect recognition of the fact that situa- 
tional forces may have been more responsible 
for the actor’s behavior when the actor played 
competitively than when he played coopera- 
tively. 

The second source of data relevant to cor- 
respondent inferences was collected in con- 
junction with the eight semantic differential 
scales. Along with each of the eight judg- 
ments, subjects were asked to indicate how 
certain they were about the judgment on an 
accompanying 7-point scale. Whereas the se- 
mantic differential scales were designed to 
assess the nature of the person inferences 
(i.e., what subjects learned about Player B), 
the certainty scales were designed to assess 
the strength of these inferences (i.e., how 
much subjects learned about Player B). 

The four certainty scores accompanying 
the dynamism scales were summed, as were 
the scores accompanying the attraction scales. 
Analyses of variance on each of the two sums 
failed to reveal effects of program (F < 1 
and F <1, respectively). In other words, 
subjects were no more confident of their 
judgments in the cooperative program than 
they were in the competitive program. 

While the certainty scores did not demon- 

- Strate that subjects drew stronger inferences 
from cooperative behavior than competitive 
behavior, the measures discussed earlier sug- 
gest that the inferences drawn from coopera- 
tive behavior did tend to be more correspon- 
dent than did the inferences drawn from com- 
petitive behavior. In short, the first part of 
Hypothesis 3 received partial confirmation, 

We now turn to the second part of Hy- 
pothesis 3 concerning active-passive observer 
differences and correspondent inferences. It 
was predicted that active observers would 
discriminate less between constrained and 
nonconstrained behavior than would passive 
observers. To determine the strength of the 
support for the second part of Hypothesis 3, 
an examination was made of the Perspective 
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X Program interaction effects on items mea- 
suring the extent to which subjects inferred 
person information from Player B’s behavior. 

As was previously reported, an interaction 
effect was found on the sum of the two mea- 
sures concerning the accuracy with which 
Player B’s behavior reflected her personality 
and the amount that could be learned about 
Player B from her behavior. The pattern of 
the means in this interaction (see Table 2) 
indicated that passive observers acquired 
more dispositional knowledge in the coopera- 
tive program than in the competitive pro- 
gram (F = 4,32, p < .05), while the disposi- 
tional knowledge acquired by active observers 
did not differ across program conditions. 

Analyses of variance on the summed cer- 
tainty ratings accompanying the attractive- 
ness scales showed no significant interactions 
effect (F <1). An interaction effect did oc- 
cur, however, on the certainty ratings ac- 
companying the dynamism scales (F = 3.21, 
p< .08). The distribution of means in this 
interaction indicated that passive observers 
were more certain of their judgments when 
the program was cooperative than when it 
was competitive, while active observers were 
more certain of their judgments when the 
program was competitive than when it was 
cooperative. Individual comparisons within 
this interaction, however, revealed that the 
only significant mean difference was a simple 
effect of perspective within the competitive 
condition. Active observers in the competitive 
condition were more certain of their evalua- 
tions than were passive observers (F = 5.01, 
p < 05). 

Of additional relevance to the question of 
correspondent inferences is the finding that 
while both active and passive observers dif- 
ferentiated between programs on the dyna- 
mism scales, only active observers differenti- 
ated between programs on the attractiveness 
scales. Although susceptible to a number of 
explanations, this finding may be interpreted 
as suggesting that the judgments of active 
observers more closely corresponded to the 
actors’ behavior than did the judgments of 
passive observers. Since the attractiveness 
scales are thought to be only indirectly re- 
lated to bargaining behavior, however, it is 
not clear that there could or should be cor- 
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respondence between bargaining behavior and 
judgments on these scales, It is perhaps more 
reasonable to assume that the active observ- 
ers’ ratings of Player B on the attractiveness 
scales were mediated by a general positive or 
negative impression stimulated by the nature 
of the experienced interaction. To label the 
inferences of active observers as correspon- 
dent would seem justified only to the extent 
that these inferences had some objective basis 
in the behavior of Player B. 

In review, it appears that the data provide 
consistent, yet inconclusive, support for the 
second part of Hypothesis 3. The evidence 
from some measures indicated that the in- 
ferences of passive observers revealed more 
sensitivity to possible situational influences 
on the behavior of Player B than did the 
inferences of active observers, but other mea- 
sures failed to demonstrate differences be- 
tween the two types of observers in the cor- 
respondence between their perceptions and 
the actors’ behavior. 


Actor-Observer Differences in Situational 
Attributions 


In an exploratory fashion, the data were 
analyzed to determine the extent and manner 
in which situational attributions of actors and 
observers differ. The measures considered 
relevant to this comparison were those that 
asked subjects: (a) How responsible was 
Player B for Player A’s (your) behavior? 
and (b) How responsible was the nature of 
the game for Player A’s (your) behavior? 
An analysis of variance using perspective, 
program, and source of responsibility (game 
or Player B) as factors was performed on the 
data, with source considered as a within fac- 
tor. A highly significant Perspective X Source 
interaction emerged from this analysis, 
F (1,36) = 35.29, p < .001. The distribution 
of the means (see Table 4) indicates that 
actors saw Player B as most responsible for 
their behavior, whereas observers saw the 
game as most responsible for their (Player 
A’s) behavior. 

Data were also analyzed to test differences 
in situational attributions of active and pas- 
sive observers (i.e., with respect to Player 
B’s behavior). An analysis similar to that 
performed for actors and observers revealed 
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TABLE 4 


RESPONSIBILITY ALLOCATED TO DIFFERENT CAUSAL 
AGENTS FOR THE BEHAVIOR OF 
PLAYER A AND B 


Player A | Player B 


Perspective C agents 


Game | Player A | Game 


Player B 
Active observers 6.75 |520| 6.70 | 4.95 
Passive observers 4.50 5.25 6.35 


| 6.55 


Note. The larger the value, the greater the allocated res 
sponsibility, 


another significant Perspective X Source in- 
teraction, F (1,36) = 12.72, p < .002. Par- 
alleling the results for actors and observers, 
active observers judged themselves to be 
more responsible than the game for Player 
B’s behavior, while passive observers judged 
the game to be most responsible. 


Self-Other Differences 


To determine if there were any differences 
between active observers’ judgments of 
themselves and their judgments of Player B, 
a 2 X 2 analysis of variance, employing pro- 
gram and player (self ys. B) as factors, was 
performed. Player was treated as a within 
factor. None of the analyses, on either the 
responsibility or the personality questions, 
revealed any significant differences between 
active observers’ judgments of themselves and 
their judgments of Player B. 


Discussion 


The most intriguing finding in the present 
study was the fact that the differences found 
between actors and observers were contrary 
to those predicted by the hypothesis of Jones 
and Nisbett (1971). That is, actors saw more 
disposition in their own behavior and attrib- 
uted more behavioral responsibility to them- 
selves than did observers. The search for an 
explanation of this discrepancy led to a re- 
examination of the processes purported to 
underly actor—observer differences, 

Jones and Nisbett’s analysis of the actor- 
observer perceptual differences, while con- 
sidering a range of factors, emphasized the 
information-processing differences between 
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the two perspectives. Actors are thought to 
have both more and different information 
available to them than are observers. Further- 
more, actors and observers are believed to 
attend to different aspects of that information 
which they hold in common. 

Information-processing differences, as out- 
lined by Jones and Nisbett (1971), unques- 
tionably exist between actors and observers. 
Whether or not these differences can account 
for the strength and direction of all actor— 
observer differences, however, is problematic. 
Certainly, it is not obvious how such an in- 
terpretation can explain the results of the 
present study. 

From the present authors’ viewpoint, the 
results reported here are most satisfactorily 
accounted for by a motivational rather than 
a cognitive interpretation. More specifically, 
it is believed that the need to see oneself as 
exercising control over one’s environment 
was an operative motive in the present con- 
text and was responsible, in large part, for 
the observed pattern of results. The pervasive 
tendency for people to wish to believe that 
they have environmental control is an inte- 
gral postulate in a number of psychological 
theories. Among those theoretical concepts 
most directly concerned with his motive are 
the notions of effectance (White, 1959), ori- 
ginship (deCharms, 1968), and internal con- 
trol (Rotter, 1966). More recently, Kelley 
(1971) has discussed the interplay between 
attribution processes and the need to perceive 
oneself as exercising effective control: “At- 
tribution processes are to be understood, not 
only as a means of providing the individual 
with a veridical view of his world, but as a 
means of encouraging and maintaining his 
effective exercise of control in that world” 
(p. 22). 

The attributional differences found between 
actors and observers in the present study 
seem to be compatible with Kelley’s asser- 
tion. The tendency for actors to assume re- 
sponsibility for their behavior and to indicate 
that their behavior was consistent with their 
dispositions, may be seen as a manifestation 
of the need of the actor to perceive himself 
as exercising effective control. To the extent 
that actors allocate responsibility for their 
behavior to external causal agents (personal 
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or impersonal), their perceived causal po- 
tency is threatened or reduced. For observ- 
ers, on the other hand, the question of effec- 
tive control is less salient or at least is 
reacted to in a different manner. The percep- 
tion of effective control from the perspective 
of the observer can only be served by being 
able to understand and predict behavior. To 
attribute the actor’s behavior to identifiable 
environmental agents would seem to best 
promote the observer’s sense of control. 

While the results of the present study may 
appear in sharp conflict with those of previ- 
ous actor—observer studies, the interpretation 
imposed upon them is similar to one pro- 
posed by Jones and Nisbett (1971) to ac- 
count for some of the results they reported. 
In a cursory consideration of certain motiva- 
tional factors thought to be relevant to actor- 
observer differences, Jones and Nisbett pro- 
posed that “the individual’s desire for control 
over his environment” (p. 13) may account, 
to some extent, for the fact that actors see 
their behavior as a response to the situation, 
while observers see the actors’ behavior as 
a manifestation of the actors’ dispositions. 
Postulating a close relationship between per- 
ceived control and perceived freedom, Jones 
and Nisbett argued that an actor “would lose 
his sense of freedom (control) to the extent 
that he acknowledged powerful dispositions 
in himself” (p. 14). While this statement is 
essentially true, it is also true that assuming 
oneself to be under complete situational con- 
trol is threatening to one’s sense of personal 
control. 

To have personality traits is not in and of 
itself threatening to one’s perceived freedom 
or perceived control. The major problem in 
comparing the present results with those 
found in the studies reported by Jones and 
Nisbett is that “personality” was defined in 
the latter studies by the experimenters, while 
in the present study it was defined by the 
actors/observers. Jones and Nisbett, adopting 
a widely accepted view, assumed that the 
degree of “personality” actors attribute to 
themselves is directly reflected in the degree 
of behavioral consistency they admit to. This 
definition may be in accordance with most 
psychologists’ operationalization of personal- 
ity, but actors may perceive personality in 
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somewhat different terms. Indicating that 
one’s behavior varies across situations does 
not mean that one does not perceive himself 
as having a personality, just as it does not 
mean that one perceives himself as being 
entirely under situational control. As Jones 
and Nisbett note, actors may view their per- 
Sonalities not “as a collection of response 
tendencies” but “as a configuration of values 
and strategies.” For the actor, therefore, a 
Perception of behavioral inconsistency may 
be completely compatible with a belief that 
his actions demonstrate effective control and 
reflect his personality, 

To summarize briefly, then, it is the pres- 
ent authors’ contention that the need of an 
actor to see himself as free and exercising 
effective control over his environment medi- 
ated the present actor—observer differences, 
Moreover, it is suggested that the results of 
the present study are not wholly inconsistent 
with the tests of the Jones and Nisbett 
hypothesis, particularly those results reported 
by Nisbett et al, (1973). When asked if 
one’s behavior is going to be consistent over 
situations, the need to perceive oneself as 
being free and having effective control is best 
served by responding in the negative. When 
asked if one were responsible for one’s behay- 
ior and if this behavior accurately reflected 
one’s personality, however, the control motive 
would seem best served by responding in the 
affirmative. The same motive, therefore, may 
often be manifested in different and seem- 
ingly contradictory forms, 

It should be mentioned at this time that 
the need to exercise effective control is most 
certainly not the only motive which effects 
actor’s judgments of themselves. An effec- 
tance motive is but one of a number of mani- 
festations of the pervasive need to view one- 
self “positively.” Among the other derivatives 
of this more general need are the drives for 
self-enhancement, self-protectiveness, and self- 
consistency (cf. Feather, 1969, 1971; Heider, 
1958; Kelley, 1967). 3 3 

In many instances these motives may be in 
opposition with one another. That is, the 
success of one drive may serve to thwart the 
success of another. The likely resolution of a 
dilemma of this type is for the motive which 
best promotes a positive image of oneself to 
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triumph. Following this logic, it is conceivable 
that there are many situations where asserting 
one’s effectiveness will militate against, rather 
than promote, one’s positive self-image. Take 
the example of someone whose behavior has 
precipitated an evaluatively negative set of 
events. If such a person were to assume per- 
sonal responsibility for his actions and their 
consequences, his self-image would likely be 
threatened rather than bolstered. In many 
cases, then, the prepotent need to see oneself 
in a positive light may dominate the motive 
of personal effectance and produce “external” 
rather than “internal” attributions. 

Finally, it should be emphasized that while 
the present authors believe that motivational 
processes are important in understanding the 
perceptual differences between actors and ob- 
servers, we do not wish to suggest that all 
actor—observer differences can be accounted 
for by motivational concepts. The notion of 
effective control may be able to parsimoni- 
ously explain the results of the present Study 
as well as many of those reported by Jones 
and Nisbett, but it is far from obvious how 
effective control, or any motivational concept 
for that matter, can account for all of the 
results reported by Jones and Nisbett, It is 
left to future actor-observer research to 
assess the relative explanatory value of the 
motivational concepts outlined here and the 
cognitive concepts discussed by Jones and 
Nisbett. 

Tn addition to actor-observer differences, 
the present study revealed a number of dif- 
ferences in the perceptions of active and 
passive observers, Active observers—observ- 
ers engaged in interaction with the target of 
their perceptions—attributed more causal 
responsibility to the target and appeared to 
have a more differentiated view of the target 
person than did passive observers. Further- 
more, active observers appeared to discrimi- 
nate less between constrained and noncon- 
strained behavior than did passive observers. 
Although active observers indicated that the 
situational constraints of the game favored 
competition, active observers attributed as 
much personal causality to the actor (target 
person) in the competitive condition as in the 
cooperative condition. In other words, while 
accurately perceiving the nature of game, 


514 Dare T. MILLER AND STEPHEN A. NORMAN 


active observers tended to underestimate its 
impact on the interaction. 

The active versus passive observer differ- 
ences are consistent with the experimental 
predictions and may be construed as support 
for the hypothesis that the different role re- 
quirements of active and passive observers 
tend to make individuals in the two perspec- 
tives use information differently. Specifically, 
it may be much more important for active 
observers than for passive observers to be- 
lieve that they have personal knowledge of 
the actor. The finding that the active observ- 
ers’ judgments of themselves did not differ 
significantly from their judgments of the ac- 
tor, however, suggest that another, although 
not unrelated, process may be operative. 

At this point, we return to the notion of 
effective control. If actors are motivated to 
see themselves as free agents and in control 
of their environment, then it seems reason- 
able that they should attribute the same qual- 
ities of freedom and control to those with 
whom they interact. If the active observer is 
interacting with someone who is under situa- 
tional control, then he cannot exercise control 
over that person. To the extent, then, that 
structural forces influence either of the par- 
ticipants, the freedom and control of both is 
reduced. Kelley (1971) essentially made this 
point in his discussion of effective control 
and bargaining behavior when he asserted 
that the causal locus of conflict, from the 
participants’ perspective, is most rationally 
attributed to a personal agent, “rather than 
to the less influenceable external framework 
within which the bargaining occurs” (p. 24). 

Passive observers, on the other hand, have 
no need to be able to exercise control over 
the interaction, From their perspective, they 
may be able to see how constrained the play- 
ers are and find little evidence for individual 
causation. Furthermore, viewing the game as 
the most powerful causal agent in the inter- 
action would seem to foster the passive ob- 
server’s sense of control, 

The finding that active observers reported 
that they were more influenced by the other 
actor than by the game, while passive obsery- 
ers reported the opposite to be true, is con- 
sistent with Kelley’s hypothesis that actors in 
a conflict situation tend to “underestimate” 


the influence of the external cause of the 
conflict and “overestimate” the causal role 
of the opponent. The finding that actors at- 
tributed more responsibility to themselves 
than did observers, and that actors’ judg- 
ments of themselves and the other participant 
did not differ, however, suggests that the hy- 
pothesis may have to be extended to include 
the tendency for actors to overestimate their 
own causal influence as well as that of the 
opponent. 

In conclusion, the present results suggest 
that a strong relationship may exist between 
causal perceptions and effective control. It is 
left to subsequent investigations to test the 
generalizability of this relationship by extend- 
ing the range of social contexts examined to 
include nonconflict situations. Finally, in or- 
der to be able to ascertain the veridicality of 
the perceptions of individuals with different 
perspectives, it is necessary for future investi- 
gations to employ criteria which can objec- 
tively assess the causal role played by the 
participants and the situation, 
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This study explores attitudinal differences between public letter writers 
expressing favorable sentiments toward student protesters or prison rioters and 
those expressing negative sentiments. Using a 2 X 2 X 2 factorial design on a 
sample of 70 public authors, the study found that sex of author and the kind 
of protest activity addressed were not significantly related to the attitudes 
measured. However, multivariate comparisons did indicate: (a) that contra- 
authors have a more pessimistic and simplistic view of human nature than 
do proauthors, (b) that the two groups differ in their assessment of the past, 
present, and future state of national affairs, and (c) that their respective 
authoritarianism and anomie levels do not differ significantly. 


What follows is an empirical study of the 
attitudinal correlates of public letter writing 
in response to two, highly visible instances 
of social dissent in the Buffalo, New York, 
area: student protest and the Attica prison 
riots. In February and March of 1970 the 
State University of New York at Buffalo was 
the scene of serious and protracted campus 
disorders. The 1971 Attica prison uprising 
took place only 20 miles away from the 
Buffalo metropolitan area. Both incidents 
were instances of large-scale social dissent 
within the immediate social environment of 
Buffalo’s citizenry. 

This study began with a systematic review 
of the published portion of letters written to 
the editors of the two local newspapers in 
response to the events detailed above. Next, a 
battery of attitudinal measures was admin- 
istered to two contrasting subsamples of actual 
letter writers: one taking a predominantly 
sympathetic stance toward student and prison 
dissenters, the other articulating mainly nega- 
tive sentiments toward same. 
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This research was not guided by any one, 
or any set of, formal hypotheses. It was con- 
ceived mainly as an exploratory probe into a 
young and largely unexplored area of social 
research. Public mail has never drawn more 
than the sporadic attention of a small group 
of social scientists. So far, their endeavors 
have yielded only scattered bits of empirical 
information, but no cohesively articulated 
body of knowledge. 

It has been ascertained with some degree 
of consistency that public letter writers are 
generally of better than average educational, 
economic, and occupational status; that they 
are drawn predominantly from urban rather 
than rural areas; and that they do not tend 
to be representative of public opinion at large 
(Converse, Clausen, & Miller, 1965; McEvoy, 
Chesler, & Schmuck, 1967; Sussman, 1959, 
1963; McEvoy, Schmuck, & Chesler, Note 
2). Previous research also suggests that the 
assumption of pro- or contrapositions by pub- 
lic letter writers on salient social issues is 
highly predictive of the author’s degree of 
sociopolitical conservatism or liberalism 
(Cowdry, Keniston, & Cabin, 1970; Hoog- 
velt, 1969; Klempner, 1966; Toch, Deutsch, 
& Wilkins, 1960). 

, As to the motives of why people write pub- 
lic letters, these may be twofold: instrumental 
or expressive. The instrumental author writes 
to obtain a desired outcome; the expressive 
author, on the other hand, tends to write in 
order to derive some form of personal gratifi- 
cation from the act of public letter writing 
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itself (Klempner, 1966; Wyant & Herzog, 
1941). It has also been observed that public 
letter writers are generally recruited from the 
ranks of those who fear that their side or 
cause is losing; the need to write appears to 
become more compelling when one’s social, 
political, or ideological goals are endangered 
rather than when they seem assured (Suss- 
man, 1963). Finally, public letter writing 
provides the author with optimal personal 
autonomy: He chooses his own issue, his own 
words, his own moment to act, and he can 
address his letter to any self-selected public 
figure or medium he wishes, Such opportuni- 
ties are generally not afforded him in the 
other, more regulated, areas of civic or public 
functioning (Sussman, 1959), 


METHOD 
The Letter Sample 


With the onset of the disturbances at the Buffalo 
campus, this investigator began to keep a systematic 
record of all letters to the editor being published in 
the two local newspapers in response to those dis- 
turbances. He monitored the editorial columns of 
these dailies for a 6-week period following the pub- 
lication of the first letter after the initial outbreak of 
the disorders. As the letters actually published (V = 
125) ranged over a wide topical area (some of them 
articulating writer opinions on such issues as the 
role of university administrators, the news media, 
etc. in the disorders), it was decided that letters 
not addressing the central issue of dissident action 
(student protest), or not dealing with its principal 
actors (student activists), would be excluded from 
study. This procedure left 91 letters for further 
analysis. The latter were then subjected to content 
analysis, the focus of analysis being whether the 
letters expressed favorable, unfavorable, or mixed 
writer sentiment about the campus disorders. With 
the eruption of the Attica prison uprising 14 years 
later, this researcher repeated the same procedure on 
a new group of public letter writers, this time react- 
ing to the phenomenon of prison dissent. Excluding 
from a total of 89 published letters those that did 
not address the central issue of prison dissent and 
its principal initiators, he retained 59 letters for 
further study. These, too, were subjected to content 
analysis relative to the general attitudinal valence 
expressed by their authors: favorable, unfavorable, 
or mixed. 4 

Tn both instances, the actual content analysis ; 
performed by two independent raters. In classifying 
the letters either as favorable, unfavorable, or mixed, 
four criterion measures were used: (a) the perceived 
legitimacy or illegitimacy of the dissent in question; 
(b) the attribution of positive or negative conno- 
tative characteristics to student or prison rioters; 
(c) the projection of positive or negative socio- 
institutional consequences directly relatable to stu- 
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dent disorders or prison uprisings; and (d )the ad- 
vocacy of positive or negative measures to be taken 
for, or against, the dissenters. Letters that could 
not be clearly identified as favorable or unfavorable 
on any one, or any combination, of the content 
measures above, or in which positive and negative 
connotations balanced each other out, were classified 
as mixed and removed from the sample. A total of 
19 such letters were discarded, Interrater reliability 
for the campus letters was r= +865, for the Attica 
letters r= „85. 


The Letter Writers 


Subsequent to the content analysis of letters, those 
authors, who had written either favorable or un- 
favorable letters in response to the campus disorders 
or the Attica uprising, were reached by telephone 
and asked to cooperate in the next phase of the 
investigation: the completion of a battery of four 
attitude scales. In view of the fact that not all letter 
writers could be Positively identified (incomplete 
signatures, pseudonyms), this part of the data- 
collecting phase aimed less at obtaining a repre- 
sentative sample of such authors than at generating 
the maximum number of returns from them, The 
campus sample was contacted in the period of April 
through June 1970, the Attica group in October to 
November 1971. In general, each author was con- 
tacted within a 60-day period after publication of 
his or her letter, Next, the attitudinal measures were 
hand delivered to the various letter writers who had 
indicated their willingness to cooperate, together with 
a detailed set of instructions for the completion of 
the scales, A stamped, self-addressed envelope was 
provided for return of the completed materials. In 
order to avoid unnec y response bias and to 
guarantee optimal response rate, the survey was 
conducted under the auspices of an out-of-state 
community development institute; the letter writers 
were also given explicit assurance of personal ano- 
nymity. The study was presented to them as part 
of an inquiry into citizen opinion on the state of 
national affairs and on current social issues, Of the 
104 letter writers who initially agreed to cooperate, 
70 actually completed and returned the scales, 
Twenty-two of these had authored favorable, and 
48 unfavorable, letters, 


Measuring Instruments 


The following measuring instruments were em- 
ployed in this study: 

Wrightsman Philosophy of Human Nature Scale, 
This is a Likert-type instrument developed by 
Wrightsman (1964) to assess a person's expectancies 
about human behavior and its underlying motive 
forces. The scale is composed of six subscales; (a) 
trustworthiness—the extent to which people are seen 
as moral and reliable; (b) altruism—the extent of 
unselfishness and concern for others; (c) indepen- 
dence—the extent to which people can maintain 
their own convictions in the face of conformity- 


1In a few instances the measuring instruments 


were forwarded by mail. 
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inducing pressures; (d) will and rationality—the 
extent to which a person understands the motives 
underlying his behavior and actively directs their 
gratification; (e) human complexity—the extent to 
which people are seen as complex and difficult to 
understand versus simple and easy to understand; 
and (f) human variability—the extent of indi- 
vidual differences in basic human nature. In this 
study only the positively worded items of the scale 
were used, each subscale having 7 items.* Item 
scores range from +3 (agree strongly) through —3 
(disagree strongly). In scoring the test, a constant of 
4 was added to each item, changing the effective 
item score to 1 through 7 and yielding a theoretical 
subscale range from 7 (extremely negative view of 
human nature) to 49 (extremely positive view). A 
Subject’s total test score can range from 42 to 294. 

Cantril Self-Anchoring Striving Scale. This is an 
abbreviated version of the self-anchoring scale used 
by Cantril in his cross-national study of human 
concerns (Cantril, 1965). Presenting the respondent 
with a ladder device representing in essence an 11- 
point rating scale, the instrument requires the re- 
spondent to indicate where he feels the country 
stands today in relation to his own subjective aspira- 
tions for it; where he believes it stood 10 years ago; 
and where he thinks it will be 10 years hence. The 
respondent’s three ladder ratings, indicating his own 
perception of the country’s present, past, and future 
state, can be compared; they yield a measure of the 
Subject’s personal sense of national accomplishment 
(rating shifts from past to present) as well as his 
sense of optimism concerning the country’s future 
(rating shifts from present to future). The scale 
allows a score range from 0 (worst possible state of 
national affairs) through 10 (best possible state). 

Srole Anomie Scale. This is a 5-item scale of 
anomie developed by Srole (1956). The items, pre- 
sented as opinion statements and measuring an indi- 
vidual’s sense of social malintegration, are statements 
of personal pessimism and of alienation from others. 
Agreement with an item is indicative of anomie. The 
scale allows for a score range of O through 5. 

Srole Authoritarianism Scale. This scale is made 
up of five items of the California F scale as em- 
ployed by Srole (1956). The subject is asked to 
agree or disagree with each item, with agreement in 
each case indicating authoritarianism. This scale, too, 
has a score range from O through 5. 


Analysis of Data 


The research design employed in this study, was a 
2% 2 2 factorial design with sex, locus of dissent, 
and letter valence as the independent variables and 
the attitudinal test data as the dependent variables. 
After collection of the attitude test data, all infor- 


? Pilot tests on a small sample (W = 6) had indi- 
cated that administration of the whole scale was too 
time consuming. Practical considerations necessitated 
such abbreviation. To the extent that acquiescence 
response set may have been introduced as a result, 
it must be assumed to have operated as a constant 
error in all subject responses. 
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mation obtained was transferred to IBM cards and 
subjected to data analysis. The multiple attitudinal 
test data were subjected to a three-way, fixed ef- 
fects, and multivariate and univariate analysis of 
variance in the manner prescribed by Bock (1963) 
and Finn (Note 1). The multivariate method is 
based on the psychometric assumption that the 
various dependent variable measures in a design are 
interdependent and, thus, are to be treated simul- 
taneously, as a set. Under the, Bock-Finn linear 
model, an exact partition of dispersion (variances 
and covariances) is obtained for a given ordering of 
effects in the analysis of variance model, for both 
equal and unequal 7 cases, 


RESULTS 


Separate multivariate analyses of variance 
were performed on the multiple attitudinal 
test data yielded by each of the three atti- 
tude scales. Thus, three separate multivariate 
analyses of variance were obtained: one for 
the six Wrightsman subscale scores; another 
one for the three Cantril ratings; and a third 
one for the two Srole scale scores. In addi- 
tion, supplementary univariate analyses of 
variance were run on each individual atti- 
tudinal measure separately as well as on the 
Wrightsman total test score. The focus of 
analysis was on the identification of those 
scales or subscales which discriminate signifi- 
cantly between sex (male-female), locus of 
dissent (campus—Attica), and letter valence 
(pro-contra) groups. 


Philosophy of Human Nature 


As for the Wrightsman Scale scores, the 
multivariate tests of all possible interactions 
between the independent variables (ABC, 
AB, AC, BC) failed to reach significance at 
p < .05. The parallel, but not independent, 
univariate F statistics for the seven dependent 
variable measures, considered separately, also 
failed to reach significance, substantiating the 
lack of interaction at any level between the 
three design factors. As for main effects, it 
was observed that neither sex nor locus of 
dissent registered sufficiently differential re- 
sults to yield significance at p < .05, either 
multi- or univariately. This finding implies 
that, on the scales in question, no functional 
relationship was observed between attitudinal 
response and male or female authorship of a 
public letter. Similarly, letter writers address- 
ing student protest activities do not appear to 
differ attitudinally from those responding to 


Pusiic Lerrer Wrrrine 


prison riots. However, the multivariate test of 
the letter valence main effect (pro-contra) 
exceeds the critical value at p < OS, multi- 
variate F(6, 57) = 3.55, P < 004. The con- 
comitant univariate analysis of variance per- 
formed on the 7 Wrightsman scale scores 
separately yielded significant differences for 
pro- and contraletter writers on: (a) the 
total scale; (b) the altruism subscale; and 
(c) the complexity subscale. As can be in- 
ferred from Table 1, contraauthors ob- 
tained significantly lower scores on the three 
measures than did the proauthors, 

These results suggest that the letter writer, 
who condemns the kind of dissidence exempli- 
fied by student protesters and prison rioters, 
has a generically more pessimistic view of hu- 
man nature than does the author with a more 
sympathetic stance toward these dissidents, 
More specifically, the former perceives people 
as less altruistic and more simplistic in their 
motivational make-up than does the latter. 


State of National Affairs; Cantril Ratings 


As for the subject ratings of the country’s 
past, present, and future state of affairs, the 
multivariate tests of all interactions of two or 
three independent variables failed to establish 
significance at p < .05. Similarly, the analo- 
gous univariate F statistics, considering the 
dependent variable scores separately, failed 
to indicate interactive significance at any 
level. Again, no significant main effects were 
found for sex and locus of dissent, either 
multivariately or univariately. The only sig- 
nificant differences in state of domestic affairs 


TABLE 1 
MEAN Scores anp P Ramos Pontic Lerree Wairens 
on Waricutsman Pintosorny or Human Narunk 
SCALES DIFFERENTIATED NY LETTER VALENCE 


Public letter writers 


Variability 


Phin man ye ra a = z 
Pro ont 
EER (u = 22) | (a = 48) | bre ga 
Total test 194.70 | 17680 | 3.61" 
Trustworthiness 35.18 30.83 2.7 Ay 
Altruism 33.59 28.58 Say 
Independence 32.05 | 31.75 05 
Will and rationality 31.82 33.12 0, 
Complexity 32.32 26.00 | 10.28 
2941 | 26.29 | 240 


+ Multivariate F = 3.55, p < 05. 
*o <05. 
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TABLE 2 


MEAN Scores axo F Ratios Punuc Lerrea Warries 
ON CANTRIL SELY-ANCHORING STRIVING SCALE 
DLPPERENTIATED HY LETTER VALENCE 


Public letter writers 
Cantril — z Prei 


Self-Anchoring P 
Striving Scale | p c j 
wean | Gendt) bey ye 
Past 4.18 5.71 18.28* 
Present 5.32 4.75 1.70 
Future 5.54 525 | 58 


* Multivariate F = 10,64, p <05, 
*p <u, 


ratings were those between pro- and contra- 
letter writers, As Table 2 indicates, the multi- 
variate test of significance yields an overall 
difference for all three dependent variable 
measures considered simultaneously, well in 
excess of the p< .05 value, multivariate F 
(3, 60) = 10.64, p < .0001. The parallel, uni- 
variate tests of significance suggest that this 
difference is accounted for largely by the 
highly differential ratings of the past by these 
two groups of public letter writers. Proau- 
thors give a significantly less favorable assess- 
ment of the past state of domestic affairs than 
do the contraauthors, 

A second analysis was performed on trans- 
formations of the same data. Using a so-called 
“nested” classification, in which the within- 
group differences between past and present 
ratings and present and future ratings were 
compared for both pro- and contraletter 
Writers, the study establishes an interesting 
differential data trend for the two groups, 
The mean scores and their significance levels 
can be found in Table 3, 

The analysis above indicates that the pro- 
and contraletter writers of this study give 
evidence of a marked difference in his- 
torical perspective on the state of national 
affairs. Comparing the present condition of 
the country with the past, the proauthors 
perceive a present that compares favorably 
with the past, and, thus, implicitly admit to 
a significant sense of positive national accom- 
plishment over time, In contrast, the contra- 
letter writers manifest a noticeable sense of 
domestic pessimism: To them, the present 
appears rather bleak in the perspective of 
what allegedly was a more favorable past. 
As for the letter writers’ projective percep- 
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TABLE 3 
Mean Wirtutn-Grove CANTRIL SCALE DIFFERENCES 
AND F Ratios FOR PRO- AND CONTRALETTER 
WRITERS 


Public letter writers 


| om 


< Fe 
eh; Contra ivari: 
(univari- = | (univari- 
ate) Ges #2) | ate) 
Present 
minus 
past 1.14 14.77* —.96 22.92* 
Future 
minus 2 
present :22 64 -50 6.84* 
a Multivariate F = 8.17, p < ,05. 
b Multivariate F 13.20, p < 05. 
*p <05. 


tions of the nation’s future, proauthors essen- 
tially anticipate no significant shifts between 
present and future, whereas contraauthors 
foresee a slight, yet significant, change for the 
better. This differential trend between the 
pro- and contraletter writers in their respec- 
tive time perspectives on the state of national 
affairs suggests that the sense of frustration 
that may have caused both of them to write 
may have been engendered by two, funda- 
mentally different, generative conditions: pro- 
authors see gains, made over the past, in 
danger of being blocked from further actuali- 
zation into the future; contraauthors, on the 
other hand, view the current social scene as 
one that threatens to erode the valued 
acquisitions of a better national past. 


Anomie and Authoritarianism 


As for the Srole authoritarianism and 
autonomie scores, the multivariate as well as 
univariate tests of significance for interaction 
between the three design factors, excluding 
main effects, failed to establish significance at 
p < .05. Furthermore, no significant main ef- 
fects were found for any one of the three 
independent variables, either multivariately 
or univariately, indicating that there was no 
statistically acceptable difference in anomie 
and authoritarianism scores between male 
and female letter writers; between those ad- 
dressing themselves to campus unrest; and 
those responding to prison riots; and, most 
importantly, between authors writing in sym- 
pathy with, if not support of, the dissidents 
and those writing in condemnation. 
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DISCUSSION 


The preceding data appear to indicate that 
the larger and most consistent differences in 
attitudinal orientation observed are those be- 
tween public letter writers who write in sup- 
port of student activists and prison rioters 
and those who express mainly condemnatory 
sentiments toward these dissenters. Sex of 
authors as well as the kind of dissidence ad- 
dressed (student protest or prison riot) were 
found not to be related in any consistent 
manner to the attitudinal traits measured. 

Most of the data enumerated here appear 
to suggest that the differences between the 
pro- and contraletter writers of this study are 
similar to those that differentiate two known 
politico-ideological types: the right-wing con- 
servative and the left-wing liberal or radical, 
thus substantiating some studies cited earlier 
(Cowdry et al., 1970; Klempner, 1966). 

First, the public letter writer who expressed 
sentiments antagonistic to protesters and 
rioters was identified as having a more depre- 
ciative notion of human nature than his pro- 
counterpart; he is more disbelieving of man’s 
capacity for altruism and he entertains a 
rather simplistic notion of the motive forces 
that underlie human behavior. These findings 
are in substantive agreement with previous 
probes into sociopolitical conservatism and 
its personality correlates (Abcarian, 1971, 
p. 143; Kirscht & Dillehay, 1967, p. 84; 
Lipset & Raab, 1970, pp. 8-10). 

The second set of data generated in this 
study established that pro- and contraletter 
writers differ significantly in their assessment 
of the state of national affairs and its qualita- 
tive evolution over time. Proauthors admit 
to a national present better than the past, 
but they express pessimism about the prob- 
ability of continued improvement into the 
future. Contraauthors, on the other hand, 
enunciate the belief that the quality of na- 
tional life has deteriorated, thus implicitly 
imputing to present day society the erosion 
of past national accomplishments. Again, 
these results concur with previous studies on 
sociopolitical conservatism and its correlates. 
The fact that sense of national progress is a 
function of basic political orientation was first 
established empirically by Cantril. In 1968 
the latter observed in a national sample si 
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Americans that individuals of liberal persua- 
sion rated the national present higher than 
the past and that they did so in significantly 
larger Proportion than did conservatives 
(Free & Cantril, 1968, p. 111); similar find- 
ings were obtained in 1971 (Cantril & Roll, 
1971). Others have asserted, largely on con- 
ceptual grounds, that it is inherent in the 
preservationist commitment of the conserva- 
tive to idealize the past; to be anachronisti- 
cally attached to it; to view it as more glori- 
ous and perfect than any present he may 
confront. This preponderant symbolic in- 
vestment in past national accomplishments, 
values, and norms has come to be identified 
as conservatism’s “quondam complex” (Lipset 
& Raab, 1970, pp. 460-462). 

The data above on ratings of the national 
state of affairs may have additional sig- 
nificance. The generative condition that 
prompted the pro- and contraauthors of this 
study to write may have been fundamentally 
different for both groups. As indicated earlier, 
in the one case, public letter writing may 
have to be understood as a reactive response 
to the threatened erosion of valued past ac- 
complishments (contraauthors); in the other, 
it may have received its impetus from the 
prospective anticipation that gains, made so 
far, are in danger of being blocked from 
further actualization into the future (pro- 
authors). Both groups are being impelled by 
equally acute sentiments of threat and frus- 
tration, but the one reacts to what appears 
to be an experience of significant loss, the 
other responds to a perceived threat of im- 
peded progress. Assuming such analysis to be 
fundamentally accurate, this researcher sub- 
mits that the contraletter writer may have 
acted out of a sense of decremental, and 
the proauthors out of aspirational, depriva- 
tion, two essentially divergent antecedents to 
Human discontent (Gurr, 1970, pp. 46-52). 
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Subjects were 510 college students who first took Part 1 of the difficult Ter- 
man Concept Mastery Test as individuals and then retook the same test in 
cooperative high-ability or low-ability groups of sizes two through five or as 
control individuals. Nine a priori social decision scheme models were tested as 
theories of the underlying group process. An a posteriori social decision scheme 
was also induced from a comparable previous study with 240 four-person 
groups and then tested against the current groups. The best-fitting a priori 
social decision scheme suggested an underlying “truth-supported wins” process, 
a finding which was more pronounced with increasing group size and for high- 
ability groups. However, the a posteriori principle, representing a combination 
of a truth-supported wins process, strong conformity pressures against a single 
correct member, and an increment from grouping when no member was cor- 
rect, fit the current data better than any of the a priori social decision schemes. 


A recent theoretical approach to small 
group performance is to organize the social 
interaction as a social decision scheme (e.g., 
Davis, 1969, 1973; Davis, Hoppe, & Hornseth, 
1968; Davis, Hornik, & Hornseth, 1970; 
Smoke & Zajonc, 1962; Zajonc, 1966; Zajonc, 
Wolosin, & Wolosin, 1972; Zajonc, Wolosin, 
Wolosin, & Sherman, 1968, 1969). In many 
situations the individual group members may 
initially prefer different response alternatives, 
and the group task is essentially to reach a 
collective decision when confronted with the 
various alternatives. Groups facing such de- 
cisions frequently develop formal social deci- 
sion schemes or voting mechanisms in order to 
formulate a single collective response. For 
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example, the formal decision schemes of 
majority, plurality, or unanimity are provided 
by the constitution, statutes, or bylaws of 
many groups such as juries, legislatures, cor- 
porate boards, or committees, in order to 
adjudicate among individual preferences in 
achieving a group decision. Moreover, informal 
groups and laboratory groups may operate 
on the same fundamental principles although 
they have no formal rules and may not even 
take explicit ballots. The discovery of such 
implicit social decision schemes is an empirical 
problem that may be guided by constructing 
theoretical models of social decision schemes 
and comparing their predictions with the 
actual group decisions. Thus, the comparison 
of theoretical predictions generated by the 
models with the results of an experiment may 
lead to a fuller understanding of the underlying 
group processes. Clearly the number of feasible 
social decision schemes is an increasing func- 
tion of group size and the number of available 
response alternatives. Moreover, different de- 
cision schemes may apply to groups with 
different ability levels, social contexts, tasks, 
norms, and so on. In other words, there is no 
a priori reason to believe that social decision 
schemes are invariant across groups, tasks, and 
situations, 
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Socar DECISION SCHEMES 


The General Social Decision Scheme Model 


Let the set of » mutually exclusive and 
exhaustive response alternatives be A GAs ces) 
An. Assume there exists a probability distribu- 
tion, pı, ps, ..-, Pn across the n outcomes that 
reflects individual response preferences. For 
a group of r randomly sampled individuals, let 
the distribution of group preferences be repre- 
sented by r1, f2, ..., tn. The rj are occupancy 
numbers, and ry indicates the number of mem- 
bers preferring the jth alternative. For every 
distinguishable distribution r re, ..., Tn 
n+rn+...+%=r. The number, m, of 
distinguishable distributions of member prefer- 
ences (so called because alternatives but not 
people are treated as distinguishable) is given 
n+r-1 

r 


by m = ). The probability of the 


ith distinguishable distribution of member 
preferences, i = 1, 2, ..., m, may be obtained 
from: 


r 
w (z AE „Ja ‘Pe p a) 


This is a familiar term from the multinomial 
expansion, (pit pa-t «++ + pa)". 

Assume that social interaction affects a 
group decision (selection of some Aj) and that 
this decision is a joint function of the initial 
member preferences and the social decision 
scheme which translates these preferences into 
the group decision, For any given decision 
task and social context several social decision 
schemes may be plausible in view of previous 
research, current theory, or intuition. For 
each social decision scheme, assume there 
exists a probability, dij, that the jth alternative 
is chosen given the ith internal preference dis- 
tribution. We can then form an m X » social 
decision scheme matrix D with entries [diy]. 
Each row of this matrix D is thus a statement 
of the probability of each alternative being 
chosen given the m possible distinguishable 
distributions of initial member preferences. 
Thus, the group probability distribution, 
Py, Pa, «+5 Pn, over alternatives Ax, Assis; 


i i isi heme 

1 The notation for the general social decision sc) 
model follows Feller (1957). See Feller ( 1957, pp. 36 ff.) 
for a discussion of the derivations and applications of 


the formulae to occupancy problems. 
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where m; is the probability of the ith dis- 
tinguishable distribution as estimated from 
Equation 1. The p; of Equation 1 can be esti- 
mated in one or both of two ways: (a) from 
the previous individual responses of the mem- 
bers who later interact as a group, (b) from a 
separate sample of individuals who do not 
actually interact as a group. The resulting 
predictions, P;, as generated by each social 
decision scheme matrix, D, can be compared 
with the actual distribution of group responses. 
If the predictions of a given social decision 
scheme do not differ from the actual distribu- 
tion of group responses, the social decision 
scheme and the underlying theory it formal- 
izes may be considered a plausible explanation 
of the underlying group process. A more com- 
prehensive treatment of this general framework 
or theoretical model of social decision schemes 
can be found in Davis (1973). 

Employing this approach, the following 
experiment compared homogeneous high- or 
low-ability groups of sizes 2, 3, 4, and 5 on 
a difficult intellective task, Part 1 of the Ter- 
man Concept Mastery Test (Terman, 1956). 
After taking the test as individuals, college 
students were dichotomized at the median 
into high and low ability and then assigned to 
retake the same test in cooperative groups of 
high or low ability of sizes 2 through 5. Control 
high and low individuals retook the same test 
alone. Each of the 115 items of this test con- 
sists of a pair of words with the two response 
alternatives “same” or “opposite” in meaning 
(e.g, “flammable” and “inflammable”). Thus, 
each item can be considered a complementary 
or divisible task (Steiner, 1966, 1972) on which 
individuals may pool their unshared resources 
and thus theoretically exceed their perform- 
ance as individuals working independently. 
For example, if one individual knows the first 
word but not the second word, and vice versa 
for the second individual, the pair can cor- 
rectly respond “same” or “opposite” for an 


wn 


item neither could answer correctly alone. 
Moreover, this may even extend to an indi- 
vidual word, for example, if one member knows 
the meaning of a Greek prefix and another the 
meaning of a root. Similarly, the test as a 
whole over all 115 items can be considered a 
complementary task. Although each decision 
on the 115 items as specified by the instruc- 
tions permitted the three response alterna- 
tives (same, opposite, or omit), they were 
collapsed into two (correct and incorrect), 
both for simplicity of exposition and analysis 
and because the responses incorrect and omit 
are equally failures to achieve the stated ob- 
jective of correct answers. Thus, the task 
defined two alternatives (x = 2) for the four 
different group sizes (r = 2, 3, 4, 5). 


Nine A Priori Social Decision Schemes 


Nine specific social decision schemes (SDS) 
are described below. Each SDS constitutes a 
set of assumptions, represented formally by 
Dy, h= 1, 2, ..., 9. The nine SDS matrices 
thus define special case models of the general 
social decision scheme theory or framework, 
and are given in Table 1 for groups of sizes 
2, 3, 4, and 5 for a two-alternative task such 
as the Terman in this experiment. 

Equiprobability. The rationale for this scheme 
is that the group decision is equally probable 
among those alternatives advocated by one or 
more members. This scheme makes no pro- 
vision for any inequality in the number of 
members favoring different alternatives, nor 
does it consider the particular features of the 
alternatives such as correctness, payoff, or 
morality. Note that if only one alternative is 
proposed there is a probability of 1.00 that it 
will be adopted? 

Proportionality. Here the basic hypothesis 
is that the probability of a group decision is 
the proportion of members advocating that 
alternative, Like equiprobability, the propor- 
tionality idea does not consider any charac- 
teristics of the alternatives, but only the 
internal distribution of group members. Both 
equiprobability and proportionality may be 


2The entries for which dj; = 1.00 or .00 reflect a 
liberty we have taken for convenience of exposition. 
More precisely, such parameters are assumed to take 
values very close to one or zero. 
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considered versions of group equalitarianism— 
equiprobability an equalitarianism of positions 
and proportionality an equalitarianism of 
people. 

Truth-wins. The title of this scheme is due 
to Steiner, who described it in his consideration 
of disjunctive tasks (Steiner, 1966, 1972; 
Steiner & Rajaratnam, 1961). It is assumed 
that a correct answer is so persuasive that 
the group immediately adopts the alternative 
if any group member proposes it. This basic 
hypothesis has been widely used in group 
problem-solving research (e.g., Davis & Restle, 
1963; Laughlin, McGlynn, Anderson, & Jacob- 
son, 1968; Vine & Davis, 1968). Lorge, Fox, 
Davitz, and Brenner (1958) labeled such tasks 
“Eureka” problems to emphasize their prop- 
erty of unique, self-confirming answers or 
“insight” characteristics. Moreover, this ex- 
treme hypothesis has been formalized as a 
mathematical model by Lorge and Solomon 
(1955) and in various modifications by Davis 
(1969) and Steiner and Rajaratnam (1961). 
The basic idea has also been widely used in 
its “empirical sampling” form (e.g., Dunnette, 
Campbell, & Jaastad, 1963; Marquardt, 
1955; Taylor, Berry, & Block, 1958), especially 
in studies of the alleged value of brainstorming. 
However, despite the apparently widespread 
confidence in the special value of group work 
that is implied by this model, very few re- 
searchers have actually reported that it 
accounts for group decisions or solutions (a 
recent exception is Laughlin and Bitz, 1975). 
Rather, the model is almost always seriously 
in error by overpredicting group performance. 
$ Truth-supported wins. A variation of the 
truth-wins model hypothesizes that a correct 
answer is adopted by the group if and only if 
a member advocates it and he is supported 
by at least one other member. One rationale 
for this idea is that the persuasive arguments 
for correctness are insufficient in number or 
quality if only a single person proposes the 
answer, but an additional advocate is a suffi- 
cient increment to persuade all other members 
to adopt the alternative, 

Majority, equiprobability otherwise. Familiar 
social decision schemes such as majority rules 
may not be applicable for some internal dis- 
tributions of member response preferences. 
For example, two members of a four-person 
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group may favor each of two possible alterna- 
tives. A social decision scheme may therefore 
also include a subscheme which applies if the 
primary scheme does not. For example, many 
committees follow a plan in which the majority 
rules with ties broken by a vote of the chair- 
man, who normally does not vote. The major- 
ity, equiprobability otherwise, social decision 
scheme thus combines the democratic ideal 
of majority rules with a subscheme of equi- 
probability to handle nonmajorities among the 
possible group distributions of preferences. 
Consequently, a group decides for an alterna- 
tive if and only if at least a majority advocates 
it, otherwise the decision is equally probable 
among those alternatives advocated by the 
members. Note that this considers simple 
majorities, while other orders of majorities 
(e.g., two thirds) may sometimes be applicable 
and could be expressed as a social decision 
scheme, 

Majority if correct, equiprobability otherwise. 
A slight variation on the previous SDS, the 
basic assumption here is that the majority 
determines the decision if and only if it is 
correct, otherwise the group decision is 
equally probable among those advocated by 
the members. This model combines elements 
of the member distribution with the idea that 
some alternatives have a special appeal such 
as correctness, while using a subscheme for 
instances of nonmajorities. 

Majority if incorrect, equiprobability other- 
wise. This scheme is in a sense a mirror image 
of the previous. It is somewhat fanciful since 
neither intuition nor data would seem to 
support the idea that a majority determines 
the decision if and only if it is incorrect, while 
for the nonmajorities the decision is equally 
probable among those advocated. However, 
we have included both this and the following 
decision scheme to serve as baselines or 
markers. The underlying idea is to demonstrate 
that social processes leading to decisions are 
orderly, and that hypotheses about social 
interaction chosen capriciously do not lead 
to models that fit the data as well as those 
consistent with the current state of social 
psychological theory and research. : 

Evror-wins. Here the assumption is that if 
an incorrect alternative is advocated the group 


adopts it. While perhaps implausible, this is 
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the basic idea of a conjunctive task model 
(Steiner, 1966) and is exemplified by any 
“real-life” task where every member of the 
group must solve the task successfully for the 
group to solve it, such as a relay race at a 
track meet. Moreover, error-wins is the logical 
counterpart of the truth-wins or disjunctive 
task model. 

Error-supported wins. This scheme is a 
mirror image of the truth-supported wins 
hypothesis. It assumes that the social process 
results in an incorrect decision only when more 
than one member advocates an incorrect 
decision, otherwise the group decision is 
correct. This scheme gains in plausibility 
from research on conformity, where the in- 
fluence of two or more wrong confederates 
greatly increases conformity by a naive 
subject. 

Thus, these nine a priori social decision 
schemes were tested as possible explanations 
of the underlying group process in the solution 
of the Terman Concept Mastery Test. The 
Terman would seem to be a rich and interest- 
ing task for a test of the models, as it enables a 
number of social combination processes to 
achieve correct answers by the group that 
exceed the capacity of a single individual work- 
ing independently, because the correct alterna- 
tive is not necessarily self-evident once pro- 
posed and because different models may apply 
for high-ability and low-ability groups. In 
addition, data from a previous study of 240 
4-person groups was used to estimate the 
social decision scheme that best fit that study 
(Laughlin & Branch, 1972). The resulting 
induced or a posteriori social decision scheme 
was also tested on the 4-person groups of the 
present study. Finally, an apparent underlying 
principle in this induced or a posteriori social 
decision scheme was extended to the 2-, 3-; 
and 5-person groups of the present study. 


METHOD 


The subjects were 510 college students of both sexes 
from introductory, experimental, and social psy- 
chology classes at Loyola University of Chicago and 
the University of Illinois at Urbana-Champaign. 
Each subject first took Part 1 (Synonyms and Anto- 
nyms) of Form T of the Terman Concept Mastery Test 
(Terman, 1956) as an individual, either during a 
regular class period or as participation in an experiment. 
No time limit was imposed, but most subjects finished 
in about 20 minutes. 
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The 510 individual scores were dichotomized at the 
median (32 of a possible 115 on a right minus wrong 
correction procedure) into high and low ability. Means 
were 48.53 for highs and 15,83 for lows, with a range 
of —12 to 92 over the entire sample of 510. Subjects 
were then randomly assigned to same-sex groups of 2, 
3, 4, or 5 within each of the high- and low-ability levels 
for the second administration of the same test. Control 
high and low individuals retook the same test alone, 
The groups were instructed to take the test as a co- 
operative group, discussing each item and recording 
their answers on a single common answer sheet. No 
further instructions were given regarding possible modes 
of structuring the task, methods for reaching consensus, 
and so forth, Again, no time limit was imposed, but the 
typical group finished in about 30 minutes. There were 
17 replications in each of the 10 treatment conditions 
defined by high or low ability and group sizes 1 
through 5, 


RESULTS AND DISCUSSION 
Overall Second Test Performance 


Means for overall second test performance 
(number of the 115 items right minus the 
number wrong) are presented in Table 2. 
Overall second test performance was analyzed 
separately at each of the high- and low-ability 
levels by one-way analyses of variance com- 
paring the five group sizes, In addition, 
analyses of covariance were performed at each 
ability level, with the mean first test indi- 
vidual score for the individuals in each group 
as covariate (for example, a group consisting 
of four high-ability individuals with first test 
scores of 55, 60, 60, and 65 would have a mean 
first test individual score of 60). Second test 
means as adjusted by this covariate are also 
presented in Table 2, 

There was a significant effect of group size 
for the high-ability groups, F(4, 80) = 6.28, 
< .001. A trend analysis indicated a highly 


significant linear trend, F(1, 80) = 23.94, 
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p < .001, in which performance increased 
linearly with increasing group size, while the 
departures from linearity were nonsignificant, 
F(3, 80) < 1. The magnitude of the effect 
of group size increased by analysis of covari- 
ance, F(4, 79) = 18.32, p < 001. Similarly, 
the linear trend was highly significant within 
the analysis of covariance, F(1, 79) = 71.11, 
p ~ 001, while the departures from linearity 
were nonsignificant, F(3, 79) < 1, Duncan 
multiple-range comparisons on the adjusted 
means indicated that all pairs differed sig- 
nificantly except group size three versus four, 
Thus, there was a direct linear increase in 
performance with increasing group size from 
one to three, no difference from three to four, 
and then a further increase from group size 
four to five. These results extended the sepa- 
rate previous results of Laughlin and Johnson 
(1966), Laughlin, Branch, and Johnson ( 1969), 
and Laughlin and Branch (1972), in which 
high-ability dyads, triads, and tetrads ex- 
ceeded high individuals, respectively, to a 
demonstration of a linear increase in the per- 
formance of high-ability groups as a function 
of group size within the same study. 

The effect of group size for the low-ability 
groups was not significant by either analyses 
of variance, F(4, 80) = 1.36, or covariance, 
F(4, 79) = 2.14, However, although the 
overall F was nonsignificant, both the linear 
and nonlinear trends and the pattern of 
Duncan comparisons were examined. Trend 
analysis within the analysis of variance indi- 
cated a significant linear trend, F(1, 80) 
= 4.56, p< .05, and nonsignificant depar- 
tures from linearity, F(3, 80) < 1, Similarly, 
trend analysis within the analysis of covari- 
ance indicated a significant linear trend, 


TABLE 2 


MEANS AND ADJUSTED MEANS FOR OVERALL SECOND TEST PERFORMANCE 


Group size 
Ability One Two Three Four Five 
M ad pet M pues M Salinen M Smad M Adjusted 
i 52.82 59.94 59.59 67.12 66.57 71.41 69.10 74.00 75.88 
eg 24 22.66 26.29 25.82 25.76 26.72 31.18 30.98 30.82 31.80 


a Adjusted for the mean first test individual score for the members of the group as covariate, 
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F(1, 79) = 8.33, b < 01, and nonsignificant 
departures from linearity, F(3, 79) <1, 


Duncan comparisons on the adjusted means 
indicated that the only two significant differ- 
ences between the 10 pairs of means were 
better performance for four-person groups 
than individuals (p < .05) and for five-person 
groups than individuals (p< .05). Thus, 
there was no difference as a function of group 
size except for groups of size four or five 
versus the individual controls, indicating a 
weak effect of increasing group size. This was 
consistent with the three previous studies 
(see previous paragraph for references) in 
which neither low-ability dyads nor triads 
exceeded the performance of low-ability in- 
dividuals, 

However, in contrast to the current study, 
Laughlin and Branch (1972) found that 
groups of four low-ability persons did not 
perform better than the low-ability individual 
control. The discrepancy is probably due to 
the dichotomization of ability as high and low 
in the present study and the trichotomization 
of ability as high, medium, and low in the 
previous study, so that some subjects defined 
as low here would be defined as medium there. 
Laughlin and Branch found that a group of 
four medium-ability subjects performed better 
than either the medium- or low-ability con- 
trols. However, in that study, the performance 
of the medium-ability persons was much more 
like that of the lows than of the highs. 

Taken together, the results of the four 
studies are completely consistent in the con- 
clusion that performance on a difficult intellec- 
tive task increases with increasing group size to 
a much greater extent for high-ability levels 
than for low- or medium-ability levels. This 
general finding has obvious implications for 
practical applications of research on group 
problem solving. 


Nine A Priori Social Decision Schemes 


Using the nine social decision scheme 
matrices from Table 1, the probability of a 
correct response on each item from the indi- 
vidual first tests, and Equation 2, the dis- 
tribution of corréct-incorrect second test 
group responses was predicted for each of the 
nine models for each item within each of the 
eight conditions of ability and group size. 
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Goodness-of-model-fit for each of the nine a 
priori social decision scheme models was then 
tested within each of the eight conditions of 
ability and group size by forming a likelihood 
ratio (LR) over the set of items as a whole. 
Assume that for the ith item nri out of Nr 
individuals were correct and ngi out of Ng; 
groups were correct. Assuming a general bi- 
nomial model for both individual and group 
problem solving, if pr; is the probability that 
an individual will solve the ith item, and Pai 
is the probability a group will solve the ith 
item, the likelihood of the observed frequencies 
of individual and group correct responses is: 


Ni; 
Tis ( DA = p) ”n 
Ii 


n 


Ni 
3 ( : ) Paral — pa )Nor~"os, (3) 


ngi 


In the general case we assume no necessary 
relation between individual and group per- 
formance. Hence the maximum likelihood 
estimators of pr; and Pai are the observed pro- 
portions of correct individual and group 
responses, that is, fr = nri/Nri, and pg; 
= ngi/N gis Substituting these values into 
Equation 3 yields the maximum likelihood of 
the ith item data under the general model, 
The maximum likelihood of the entire data 
set over 115 items would be given by the 
product of the itemwise maximum likelihoods; 


115 


L=]] L. 


i=l 


(4) 


Equation 4 represents the numerator of the 
LR. 

Under a particular decision scheme, how- 
ever, pr: and pg; are functionally related, The 
truth-wins decision scheme, for example, 
posits the simple function : j 


Pa = 1—(1 = pu)’, (5) 


where r= group size as previously. Thus, 
under the truth-wins Social decision scheme. 
Equation 3 has only one unknown parameter. 
The maximum likelihood estimator of this 
parameter, p;;*, under the truth-wins decision 
scheme was found using the STEPIT iterative 
search program, The corresponding maximum 
likelihood 2* comprises a component of the 


maximum likelihood of the entire data set 
under a truth-wins model: 


115 


e EAA 


(6) 


Thus, the LR under the truth-wins model is 
the following: 

LR= L/L*. (7) 
—2 loge LR is distributed approximately as 
X%,15 when the truth-wins model. holds. Al- 
though the functional relationship between 
prs and pai is more complex for other social 
decision schemes, the same procedures were 
followed to. compute an LR similar to Equation 
7 for each of the other eight models. 


l FA i oe 


PR: strict proportionalit 
EQ: majority if correct, equiprobability other 
ESWIN: error supported wins, DEST: 
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TABLE 3 
ResuLTs or Likeifoop Rario Tests SHowinG RELATIVE Goopness OF Fir or EACH 
Mopet Across ALL IremMs ror Eacn Group SIZE AND ABILITY LEVEL 
Low ability High ability 
Group size Social decision scheme 

x Prob. x Prob. 

«Pive STEQ 256.00 <.001 374.46 <.001 
STPR 175.69 <.001 253.60 <,001 

TWIN 673.72 <.001 409,55 <.001 

TSWIN 177.10 <.001 118.48 <.50 

MAEQ 177.16 <.001 161.20 <.01 

MCEQ 174.78 <,001 105.98 >.50 

MIEQ 513.21 <,001 607.74 <.001 

EWIN 1,525.05 <.001 1,396.72 <.001 

ESWIN 549.21 <.001 500,51 <.001 

DEST 129.68 <25 128.33 <.25 

Four STEQ 260.44 <.001 313.94 <.001 
STPR 203.74 <.001 237.82 <.001 

TWIN 550,64 <.001 346.67 <.001 

TSWIN 150.29 <.05 119.46 <.50 
MAEQ 183.92 <.001 180.12 <.001 

MCEQ 174.06 <.001 127.00 <.25 
MIEQ 344.86 <.001 424.55 <,001 
EWIN 1,218.40 <.001 1,145.21 <.001 
ESWIN 349.41 <.001 330.23 <.001 

DEST 140.33 <.10 122.32 >.50 
Three STEQ 134.72 <.10 196.58 <.001 
STPR 119.93 <.50 170.11 <.001 
TWIN 507.68 <.001 249.42 <,001 

TSWIN = ESWIN = MAEQ 142.23 <.05 136.72 <.10 

MCEQ 189.67 <,001 96,46 >.50 

MIEQ 266.75 <.001 364.19 <,001 

EWIN 697.21 < 001 774.40 <.001 

DEST 125.61 <.25 118.59 <.50 

Two STEQ = STPR = MAEQ = 166.66 <.001 142.32 <.05 

MCEQ = MIEQ 

TWIN = ESWIN 271.52 <,001 205.76 <.001 

TSWIN = EWIN 414.11 <.001 379.63 <.001 

* DEST 250.02 <.001 175,70 <.001 


TWIN: truth wins, 


SWIN: truth supported wins, MAEQ: 
majority if incorrect, equi 


ise, ot ajor 
estimated from Laughlin and Branch (1972), 


Results of these LR tests for each of the 
nine a priori social decision scheme models 
for each of the high-ability and low-ability 
conditions for group sizes two through five 
are given in Table 3. Note that a smaller value 
of X? indicates a better fit of the model, and 
a corresponding higher probability (e.g. 
p < .25) indicates a better fit than a lower 
probability (e.g., p < .001) in’ the probability 
columns of Table 3. 

As indicated in Table 3, the best fitting 
models for the five-person high-ability groups 
were the closely related truth-supported wins 
and majority if correct, equiprobability other- 
wise social decision schemes, while the other 
seven models could all be rejected. For the 
five-person low-ability groups, all nine a 
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priori models were rejected. The truth-sup- 
ported wins scheme best fit both the high- 
ability and low-ability four-person groups, 
while majority if correct, equiprobability 
otherwise also fit the high-ability but not the 
low-ability four-person groups. For the three- 
person groups of high ability, majority if 
correct, equiprobability otherwise, and truth- 
supported wins had the best fit. However, for 
the three-person groups of low ability, a strict 
proportionality model had the best fit, sug- 
gesting that there was greater recognition of a 
correct answer by the high-ability than low- 
ability groups of three persons. Finally, none 
of the nine a priori social decision schemes, 
which reduce to only three sets of predictions 
for a two-person group, fit either high-ability 
or low-ability dyads. 

Thus, the best fitting a priori social decision 
scheme was a formalization of an underlying 
truth-supported wins process as a model of the 
group interaction. This basic finding was 
stronger for high-ability groups than low- 
ability groups within a given group size from 
three to five and applied increasingly better 
with increasing group size. To our knowledge 
such strong support for a truth-supported 
wins group process has not previously been 
reported on any group task. The support for 
a truth-supported wins process is especially 
striking since the models formalizing such 
traditional theories as truth-wins, equiprob- 
ability, or majority, equiprobability otherwise, 
were all rejected, as were the baseline models 
such as error-wins chosen to demonstrate that 
the process of social interaction is in fact 
orderly rather than capricious. Finally, the 
relative differences in fit for both different 
group sizes and the two ability levels indicate 
the importance of group size and member 
ability as basic parameters of group problem 
solving and decision making. 


An Induced A Posteriori Social Decision Scheme 
from Previous Data 


As previously indicated, the nine a priori 
social decision schemes were chosen to examine 
a range of social processes representing current 
theory, plausible intuition, tests of the gen- 
erality of previous research, and implausible 
baselines. Another possible approach is to 
estimate the actual social decision scheme 
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operating in a given experiment, rather tha 
assuming a social decision scheme a priori and 
testing it against the data of the experiment. 
In a previous experiment (Laughlin & Branch, 
1972) four-person groups and control indi- 
viduals completed the same Terman task 
under the same conditions as the present study 
composing the 15 possible combinations of 
high (H), medium (M), and low (L) four 
person groups HHHH, HHHM, ..., LLLI 
and control H, M, and L individuals. Sinc: 
there were 16 replications of each of the 15 
combinations, a total of 240 four-person 
groups are available on the same task. In a 
sense, the design of the study can be con 
sidered a randomized-blocks design with the 
15 ability conditions as blocks, thus giving a 
more controlled sampling of ability within the 
240 groups than would a random assignment 
of the same 960 subjects to four-person groups, 
and consequently increasing the precision of 
the estimation of the underlying social decision 
scheme, D. Accordingly, this large data set 
was used to estimate the social decision scheme 
in the Laughlin and Branch study. 

Equation 2 was first rewritten as a system 
of linear equations: 


Py = midun + Todor + +++ + Tmdmi 
Pa = midis + Tado + +++ + mde 
Ps = midis + modos + +t + mans (8) 


midin + radon + 2° + tnd 


Recall from Equation 1 that for a given group 
size r the value of r; is a function of the pj. 
Thus, if the p; and P; are known or can be 
estimated from individual and group responses, 
respectively, the r; are also known or can be 
estimated, #;. Consequently, only the dj; are 
unknown, as in the Laughlin and Branch data 
n= 2 and m = 5. Assuming that the true D 
remains constant over a sufficiently long 
sequence of the 115 items of the Terman, we 
may write (changing notation to include a 
trial or item index) Equations 9: 


Pia = mindy + Toad + +++ + tm rds 
Piz —i Ti dij $ To, odoj o Tm, 2d mj 
Pis = Tah; + raad + ++ + amadas (9) 


P, 


Pis = misd + 2,32; ++ Tanlar 
where P;,s is the probability of a group decid- 
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ing for A; on the sth item, and s = eines 
the number of items chosen for study. The 
maximum value in this experiment is S = 115, 
and m= 2 (correct or incorrect), although 
both the Equations 8 and Equations 9 are 
written for the general case. 

The parameters P;,, and Tj, Were estimated 
in the same manner as in the previous analyses 
for the nine a priori social decision schemes. 
Thus, P;,, is the relative frequency of groups 
deciding for the jth alternative on the sth 
item, and #;,, is given by: 


(10) 


where pjs equals the relative frequency of 
individuals deciding for the jth alternative on 
the sth item and 7; equals the jth occupancy 
number for the ith distinguishable distribution. 
The prediction equation (Equation 2) for a 
single item is the following: 


P,=7,-D. (11) 


Equation 11 represents a set of simultaneous 
equations in m X n unknowns (the dj). How- 
ever, since 


n 

YP; = 1.00 and Ð dy = 1.00, 
jul 
the final equation is linearly dependent upon 
the former ones, and the problem becomes one 
of n — 1 equations in m X (n — 1) unknowns. 
Obviously the problem is underdetermined for 
a single item, but one can add another (n — 1) 
equations for every new item considered (as- 
suming nonidentity between item group and 
individual distributions), and by using more 
than m items, the problem becomes over- 
determined. 

One solution to the estimation problem, 
which was chosen here, is to select those dij 
which minimize the sum of squared discrep- 
ancies between predicted and estimated (i.e., 


observed) Pj;,s: 


Q = goodness-of-fit criterion 


where P = #-D. This Q quantity is a quadratic 
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TABLE 4 
SocIAL DECISION SCHEME Marricks, D ESTIMATED 
FROM INDIVIDUAL AND FOUR-PERSON GROUP 
DATA FOR 115 ITEMS FROM LAUGHLIN 
AND Brancu (1972) 


Estimated SDS matrix, Î, 


for each ability level 
Distinguishable 
distributions 
High Medium Low 
ability ability ability 
Cor. | Inc. | Cor. | Inc. | Cor. | Inc. | Cor. | Inc. 
4 0 1.00 | .00 98 02 | 1.00 .00 
3 1 1.00 | .00 | 1.00 00 | 1.00 .00 
2 2 80} .20] .78] .22 72 28 
1 3 -00 | 1.00 00 | 1.00 00 | 1.00 
0 4 -23 77 18} 82 «12 | 88 


Nolte. Entries have been rounded to two decimal places, The 
di; = 1,00 or .00, should actually be regarded as values very 
near to 1.00 or .00, 


function of the unknown dij, and may be 
minimized by standard numerical methods. 
In the present case the Quadratic Program- 
ming program of the University of Illinois 
SOUPAC package was used to obtain di, 
$= 4,2, ..., m7 = 1, 2, ..i,n—1, Then 
n=l 
din e KD dij. 
jmi 

This program is based on the simplex tech- 
nique described by Carr and Howe (1964), 

The results of this estimation procedure are 
given in Table 4. For the distributions of 
member preferences m where either all four 
members were correct (4-0) or three members 
were correct (3-1), the group was virtually 
certain to answer correctly the item at each 
of the three ability levels high, medium, and 
low (as indicated in Table 4, five of these six 
dij values are 1.00 and the sixth is .98 for the 
correct response alternative). When two per- 
sons were correct and two incorrect (2-2), the 
group was correct with a probability of .80 
for high ability, .78 for medium ability, and 
.72 for low ability. Thus, at each ability level 
the group did better than the .50 prediction of 
a strict equiprobability or proportionality 
process, but worse than the certainty of a 
correct response when only one member was 
wrong. In other words, a truth-supported wins 
process enabled the groups in which two per- 
sons favored each alternative to exceed .50, 
but not to the extent where three persons 
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“supported truth.” Moreover, there was a 
slight systematic increase in the probability 
of a correct response for a 2-2 distribution as 
the ability level increased from low to medium 
to high. 

The most unexpected and dramatic finding 
in Table 4 is for the 1-3 distribution where one 
person had the correct answer as an indi- 
vidual, while the other three did not. In these 
cases there was a probability of .00 for a correct 
answer, at all three levels of ability. This sug- 
gests extremely strong conformity pressures 
from the unanimously incorrect majority 
against the single correct individual. Sur- 
prisingly, the correct individual was never able 
to get the group to adopt his answer. This 
result is even more striking in relation to the 
0-4 distribution where no group member was 
correct as an individual. Here there was a 
probability of a correct response of .23 for 
high-ability groups, .18 for medium, and .12 
for low. Thus, a group with no correct mem- 
bers performed better than a group with one 
correct member, a finding which was not pre- 
dicted by any of the nine a priori social decision 
scheme models and would not seem to be pre- 
dicted by any current social psychological 
theory. Perhaps a group with no correct indi- 
viduals is more willing to consider both alterna- 
tives than a group with one correct individual. 

Alternatively, this anomalous pattern of 
results for the 1-3 and 0-4 distributions may 
somehow be an artifact of the decision to 
collapse the three response alternatives right, 

wrong, and omit into correct and incorrect. 
The 0-4 distributions may include a greater 
proportion of omits relative to wrongs than 
the 1-3 distributions, with differential guessing 
patterns as a result. While this is of course 
available in the data, it would involve a 
great increase in the complexity of analysis 
relative to the apparent further information it 
would reveal. For example, rather than m = 5 
of the current analysis, m would be 15, with 
corresponding increases in complexity at 
every level of both a priori and a posteriori 
social decision schemes. 

Thus, the analysis of the previous Laughlin 
and Branch data revealed a social decision 
scheme which did not entirely correspond to 
any of the nine a priori social decision schemes. 
However, once the a posteriori social decision 
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scheme was determined, it could then be 
applied as an a priori social decision scheme 
and tested against the 4-person groups of the 
present experiment. Moreover, since the pres- 
ent experiment also included groups of size 
2, 3, and 5, in addition to 4, it would be of 
interest to extrapolate the social decision 
scheme to these other group sizes. Also, the 
present study used two levels of ability by 
dichotomizing, while the Laughlin and Branch 
data used three levels of ability by trichtomiz- 
ing, so that some of the current highs and 
lows would be mediums in the previous study. 
The following analysis represents a tentative 
extrapolation from the previous study with 
trichotomous 4-person groups to the dichot- 
omous groups of 2 through 5 in the pres- 
ent study. 


Application of the A Posteriori Social Decision 
Scheme to the Present Data 


Assume that there is a constant “guessing 
correctly and group acceptance” probability 
for 0,7 distributions for (41, correct; A», 
incorrect), Assume further that while 1, r — 1 
distributions result in some kind of conformity 
or yielding by the single individual with the 
correct answer to the unanimous majority of 
size r—1 with the incorrect answer, the 
=r» distributions behave differently due 
to one or more additional “truth sayers” (for 
groups where an 7; = rs distribution can occur, 
i.e even-sized groups when there are only 
two response alternatives). The rı = re dis- 
tribution entails an increment in the prob- 
ability of a correct response over 50, which 
consists of the probability of a correct response 
in the 0,7 distribution and an additional 
increment from the influence of more than 
one correct advocate. (The probability of a 
correct response for the 0, 4 distribution in 
Laughlin and Branch (1972) was .23, .18, and 
12 for high, medium, and low groups, respec- 
tively.) Finally, assume that this process is 
independent of group and subgroup size 
beyond the limits mentioned. 

More formally, let P[4A:}-(0, r)] = dmi = 
Pll (1 = r)]= PA E 50 H 

Ms ; i 
the original tendency for A; in the case of 
equal distribution of member preferences, a 
is the initial 0,7 correctness-acceptance con- 
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stant, and 6 is the rı > 1 increment for the 
correct alternative A;. From inspection of the 
Du, Du, and Dz obtained at the three ability 
levels from the four-person groups of the 
Laughlin and Branch study (Table 4) and 
then allowing for two rather than three 
levels of ability, we propose the following: 
PÇA] (0,7), high ability] = a, = .20, and 
b= .10. Therefore, for high-ability group 
members, PLA;| (rı = r2), high ability] = .50 
+an +b = .50 + .20 + .10 = .80, and the re- 
maining distinguishable distributions of mem- 
bers follow a majority social decision 
scheme. For low-ability group members, 
P[Ay| (ri = 72), low ability] = .50 + .15 + 
10 = .75, where b= .10 as before and 
P[A,| (0, r), low ability] = .15. 

The social decision scheme matrices for 
groups of size 2, 3, 4, and 5 constructed in this 
manner from the estimated D from the 
Laughlin and Branch data are given in Table 
5. This estimated social decision scheme is 
subsequently abbreviated DEST, Likelihood 
ratio tests for this DEST social decision 
scheme for each of the eight conditions of 
ability and group size were conducted in the 
same manner as previously described for the 
nine a priori social decision schemes. These 
results are also given in Table 3 in the bottom 
line for each group size. 

As indicated in Table 3, the DEST social 
decision scheme fit the obtained data very 
well for three-, four-, and five-person groups. 
Moreover, while the truth-supported wins and 
majority if correct, equiprobability otherwise 
schemes fit the high-ability groups well, the 
DEST scheme was the only one to fit both 
high-ability and low-ability groups at a given 
group size of three, four, or five. Thus, we 


4 Strictly, with # = 2 response alternatives, these 
social decision scheme matrices are limited to even- 
sized groups of four or more members, since (rı = rs) 
cannot occur in an odd-sized group with only two 
alternatives. In generalizing the principle to five-person 
groups, the 2, 3 case for r = 5 was considered to be 
analogous to the 1, 3 case for ry = 4, with the addition 
of a second correct member, rather than as analogous 
to the 2, 2 case for r = 4, with the addition of a third 
t member, Thus, the conformity processes 
hypothesized in the 1,3 distribution for r = 4 were 
assumed to apply to the 2, 3 distribution for r= 5. 
Because of this hypothesized conformity pressure, 
a=0. The values of b= .20 for high ability, and 

= .15 for low ability remained since r > i 


incorrec 
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TABLE 
SOCIAL DECISION SCHEME MATR , D, GENERALIZED 
ro Four Group Si Us 3 
INFERRED FROM 
INDEPENDENT RESE; 
Distinguishable High Low 
Eo distributions ability ability 
size 
Cor, Inc, Cor. Ine. Cor, Ine, 
Two 2 0 1.00 -00 1.00 -00 
1 1 -80 -20 75 ui 
0 2 20 -80 15 85 
Three 3 0 1.00 .00, 1.00 00 
2 1 1.00 00 | 1.00 «00 
1 2 00, 1,00 -00 1.00 
0 3 20 80 15 „85 
Four 4 0 1.00 00, 1.00 00. 
3 1 1.00 00 | 1.00 .00 
2 2 80 20 75 25 
1 3 00 | 1,00 00 | 1,00 
0 4 .20 «80 AS 185 
Five 5 0 1.00 00 | 1.00 00 
4 1 1.00 00 | 1,00 00 
3 2 1.00 00 | 1,00 00 
2 3 -20 .80 15 285 
1 4 00 | 1,00 00 | 1,00 
0 5 20 80 15 185 


may assert the general superiority of the a 
posteriori DEST scheme over any of the nine 
a priori social decision schemes for groups of 
sizes three, four, or five. However, like all 
nine a priori social decision schemes, the DEST 
scheme did not fit the dyads at either high- 
ability or low-ability levels. 

This inability of any social decision scheme 
to account for dyadic performance was not 
entirely unexpected. A number of theorists 
have speculated that two-person interaction is 
different in fundamental ways from the social 
interaction in larger groups. Also, the nine 
social decision schemes assumed a priori in 
this study reduce to only three sets of unique 
predictions for the two-person group (see 
Table 1), so dyadic performance is necessarily 
less sensitive to a social decision scheme 
analysis than larger groups. However, in 
another study with dyads on a postulated 
insight task (the Mednick Remote Associates 
Test, Mednick & Mednick, 1967) Laughlin 
and Bitz (1975) found that a truth-wins scheme 
fit the data very well, while equiprobability 
and error-wins (representatives of the other 
two sets of unique predictions) could be re- 
jected. This again suggests that different social 
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decision schemes apply for different types of 
group tasks. 

In summary, the best fitting social decision 
scheme was the DEST model as induced from 
previous research. The assumptions of this 
model as a theory of the group process on this 
class of intellective task were previously for- 
malized and may be summarized as follows 
for groups of sizes three or larger: (a) both 
groups consisting of all correct members and 
groups consisting of all but one correct member 
are virtually certain to give the correct re- 
sponse, (b) groups with an equal number of 
correct and incorrect members have a prob- 
ability of .75 to .80 of giving the correct re- 
sponse, consisting of the .5 probability of an 
equiprobability model, the tendency to cor- 
rectness in a group with no correct members, 
and an increment from the influence of more 
than one correct member, (c) groups with 
only one correct member are virtually certain 
to give the incorrect response, (d) groups with 
no correct members have a probability of .12 
to .20 of giving the correct response. Strong 
conformity pressures against the single correct 
member by the unanimously incorrect majority 
are assumed to underlie c, while d may reflect 
greater willingness to consider both alterna- 
tives in a group with no correct members. 

Finally, the results strongly support the 
bidirectional logic of social decision scheme 
theory. A priori social decision schemes can be 
tested against actual group performance by 
assuming a social decision scheme matrix 
which formalizes any set of theoretical assump- 
tions and then predicting the group response 
distribution from the individual response dis- 
tribution (either of the actual group members 
or a separate independent sample of indi- 
viduals who do not interact as a group). Con- 
versely, a posteriori social decision schemes 
can be induced from the actual group and 
individual response distributions by solving 
for the social decision scheme matrix. More- 
over, as in the present study, an a posteriori 
social decision scheme principle induced from 
one experiment can then be tested as an a 
priori social decision scheme in a subsequent 
experiment. Thus, the bidirectional logic of 
social decision scheme theory allows a pro- 
gressive interaction between theory and data 
in the construction of a general theory of 
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group processes. Clearly, future research 
should test the generality of the findings of 
this experiment on other types of group tasks, 
member ability relative to the task demands, 
numbers of response alternatives, and various 
group sizes. 
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Positive and Negative Reactions to Male and F emale Succes: 
and Failure in Relation to the Perceived Status and 
Sex-Typed Appropriateness of Occupations 
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Male and female subjects answered a questionnaire involving items in which 
both a male and female character either succeeded or failed at an occupation. 
The items covered 12 occupations which varied in status and the degree to 
which they were typically filled by males or females. For each item subjects 
indicated which character would be happier about success (or unhappier about 
failure) and rated the level of happiness (or unhappiness) for each character. 
Subsequently, subjects provided reasons for their answers and rated the 12 
occupations in regard to the degree to which they were typically filled by males 
or females and in regard to their prestige or status. Results indicated a strong 
tendency for responses to be consistent with social stereotypes about the 
appropriateness of occupations for one or the other sex. A content analysis of 
reasons supported the importance of social stereotypes as factors influencing 


responses. 


When one considers the range of occupa- 
tions that exist in Western society it is appar- 
ent that they differ in a number of general 
characteristics. One important characteristic 
is the status or prestige attached to jobs. This 
seems to depend upon a set of interrelated 
factors including the value that society tradi- 
tionally places on the type of work involved, 
the preparation and qualifications required 
for the occupation, how difficult it is to enter 
the occupation, and the benefits that flow 
from it. In addition to status, however, 
occupations differ in the degree to which they 
are filled by males or by females. Some occu- 
pations (usually the higher status ones) ap- 
pear to be almost exclusively the province of 
males while other occupations (including 
many of considerably lower status) are more 
likely to be occupied by females. These two 
characteristics of occupations undoubtedly co- 
vary. But, while at first glance it may appear 
that the masculine dominance of an occupa- 
tion is positively related to status, the rela- 
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tionship may in fact be more comp! 
perhaps curvilinear, since one can 
several low status occupations in 
physical and unskilled labor that a 
by males. Whatever the relationship, 
ever, it is clear that level of status an 
degree to which occupations are fil 
males or by females are important ch 
istics of the occupational hierarchy i 
society. 

It can be expected that these two charac 
teristics of occupations are learned by 
dren as they develop, so that gradually 
acquire sets of beliefs about what are 
“better” or higher status jobs and about w! 
jobs are more appropriate for males than for 
females (and vice versa). These belief 
stereotypes since they are probably 
shared) are residues of the socialization 
ess. In terms of our own theoretical app 
they may be incorporated into “abstract 
structures” that enable a relatively stable 
consistent view of social reality (Feath: 
1971). They encompass not only bel 
about the characteristics of different occ 
tions but the normative expectations about 
the usual consequences of success or failure 


at these occupations, depending upon the 
circumstances, 
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REACTIONS TO MALE AND FEMALE Success AND FAILURE 


The study to be described was an attempt 
to map into these social stereotypes, particu- 
larly as they relate to sex roles at the occupa- 
tional level. The general hypothesis was that 
success at an occupation is viewed more posi- 
tively if this success is consistent with societal 
conceptions about the sex role than if it is 
inconsistent and that failure at an occupation 
is viewed more negatively if this failure is 
inconsistent with societal conceptions about 
the sex role than if it is consistent. This 
hypothesis is obviously compatible with the 
general assumption that people tend to prefer 
consistent states of affairs—an assumption 
that is central to the various consistency 
theories (Abelson, Aronson, McGuire, New- 
comb, Rosenberg, & Tannenbaum, 1968; 
Feather, 1971). More specifically, success is 
more attractive (in various ways) for males 
succeeding at a male-dominated occupation 
(e.g., becoming a company director) than for 
females succeeding at the same occupation, 
but less attractive for males succeeding at a 
female-dominated occupation (e.g., becoming 
a typist-stenographer) than for females suc- 
ceeding at the same occupation. In addition, 
failure is more distressing (in various ways) 
for males failing at a male-dominated occupa- 
tion (e.g., becoming a janitor) than for fe- 
males failing at the same occupation, and less 
distressing for males failing at a female- 
dominated occupation (e.g., becoming a do- 
mestic worker) than for females failing at 
the same occupation. In each case, the at- 
tractiveness or repulsiveness of the outcome 
(its valence) depends upon whether that 
occupation is seen as appropriate or inap- 
propriate for the sex of the person involved 
as well as upon its status. 

This general hypothesis was investigated in 
a recent study by Feather and Simon (1975). 
In this study, subjects from a fairly exclusive 
girls’ school were presented with three short 
verbal cues in which a male or a female 
character was described as either at the top 
or bottom of the class after first-term finals 
at either medical school, teachers college, or 
nursing school. Subjects rated the character 
in each cue on semantic differential scales, 
evaluated the importance of different possible 
causal factors involved in each success or 
failure, and judged the likelihood of a number 
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of possible consequences that could follow the 
success or failure. The results indicated fairly 
pervasive tendencies for successful males to 
be upgraded relative to unsuccessful males 
and for successful females to be downgraded 
relative to unsuccessful females. But these 
differences did not interact with occupations 
as expected. Although interesting in their own 
right, they were general effects applying to 
all three occupations (medicine, teaching, and 
nursing )—occupational areas that had been 
deliberately selected so as to vary in masculine 
dominance. 

Feather and Simon (1975) noted some pos- 
sible reasons for the failure to confirm the 
general hypothesis concerning occupations. 
These reasons suggest that whether one ob- 
tains evidence for a particular stereotype 
depends upon the nature of the group tested 
and the sensitivity of the procedure that is 
employed (see also Mischel, 1970). For ex- 
ample, a highly achievement-oriented group 
might respond to the technique developed by 
Feather and Simon simply in terms of an 
achievement stereotype (“Success is good; 
failure is bad”)—particularly since the cues 
identified characters as at the top or bottom 
of the class—whereas, a less achievement- 
oriented group might also take the sex of the 
character into account (see Simon & Feather, 
Note 1). Similarly, groups differing in family 
background, school attended, and other fac- 
tors that could be assumed to influence their 
socialization, might react differently to the 
Feather and Simon procedure, some providing 
relatively detailed and articulated stereotypes, 
reflecting the effects of finely differentiated 
sex-role socialization, others responding in 
very simple ways. In addition to subject or 
group differences, however, it is possible that 
more subtle stereotypes would be revealed by 
more sensitive procedures that more closely 
mirrored the actual conditions under which 
stereotypes are evoked in everyday life. 

The present study was designed to explore 
this last possibility. It involved the develop- 
ment of a new procedure which attempted to 
make stereotypes about sex roles especially 
salient by providing subjects with items that 
sampled a wide range of occupations vary- 
ing in both status and masculine dominance, 
from occupations almost exclusively filled 
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by males to occupations almost exclusively 
filled by females. The salience was also in- 
creased by presenting subjects with situations 
in which both a male and a female were in 
competition, either both succeeding or both 
failing to the same degree, on the assumption 
that the presence of mixed-sex competition 
would be especially likely to elicit ideas about 
what were and what were not appropriate 
achievements for males and females—in a 
general sense, ideas about what were the 
distinctive “territories” of males and females 
to which one or the other could lay claim. 


METHOD 
Subjects 


There were 280 subjects (146 male, 134 female) of 
whom 260 subjects were enrolled in the introductory 
course in psychology at Flinders University. Most 
of the remaining 20 subjects were at Adelaide Uni- 
versity and were included in order to obtain approxi- 
mately equal ns in the different experimental condi- 
tions. The Flinders subjects were tested in April 
1973, on the same day in two separate class sessions 
with an hour's interval between the two. Subjects 
who answered the questionnaire in the first class 
session were requested not to communicate to those 
who were to be tested in the second class session. 
The Adelaide subjects were tested 3 days subse- 
quently. All of the testing was conducted by a male 
experimenter (the author). Twelve subjects (5 male, 
7 female) were excluded from the total sample for 
one of the following reasons: misread the instruc- 
tions, major parts of the questionnaire not com- 
pleted, or evidence in their written comments that 
they had not taken the test seriously. There re- 
mained 268 subjects of whom 141 were males and 
127 females. In the success condition there were 74 
males and 66 females. In the failure condition there 
were 67 males and 61 females. In some of the analy- 
ses to be reported subsequently there were minor 
departures from these ns due to missing data. 


Procedure 


Success and failure conditions were created by 
developing two questionnaires which were distributed 
at random so that approximately half of the sub- 
jects answered one and the remaining subjects 
answered the other. One questionnaire was titled 
“Reactions to Success Questionnaire”; the other was 
titled “Reactions to Failure Questionnaire.” 

Each questionnaire presented subjects with 12 
items in each of which two persons (a male and a 
female) were in a competitive situation involving 
either an examination or an interview for a job. 
Different names were assigned to the two persons 
in the different situations. In the success question- 
naire both persons were successful in all 12 situa- 
tions; in the failure questionnaire both persons were 
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unsuccessful in all 12 situations. In the instructions 
on the title page, subjects were told that the ques- 
tionnaire was concerned with the general problem 
of how males and females react to various kinds of 
academic and occupational success (or failure). They 
were instructed to use their imagination in thinking 
about the two people who succeeded (or failed) and 
the situation in which success (or failure) occurred. 
They were asked to work rapidly, to give their 
first impressions, and to treat each new situation 
quite separately without being influenced by their 
answers to questions about previous situations. All 
answers were obtained under anonymous conditions. 

In the initial instructions for the success question- 
naire, subjects were told the following: 


For each situation you will be asked to indicate 
which of the two people would feel happier about 
the success. In doing this try to think about the 
consequences of the success for each person and 
remember that the consequences of success are not 
always entirely positive; there can be negative 
side-effects as well, In some cases a person might 
feel happy but also secretly a bit apprehensive 
about succeeding. For each situation decide which 
of the two people would probably feel happier, 
more contented, more free of conflict about the 
success. Then go on to rate how happy you think 
each person would be using the scales provided. 


The following examples indicate the form of the 
items in the success questionnaire. 


In the highly competitive final year Law School 
examinations David and Mary both pass at the 
same level and thus qualify to become solicitors. 
Who would feel happier about the success? 


Following interviews of a number of competing 
candidates Dick and Sue are selected to become 


newspaper reporters for a business magazine. Who , 


would feel happier about the success? 


Subjects answering the success questionnaire 
circled the name of the person whom they thought 
would probably feel happier in each situation. They 
then rated how happy they thought each person 
would be by putting a cross on a 5-inch (.13 cm) 
rating scale for that person, labeled “Not happy at 
all” at one extreme, “Uncertain” in the middle of 
the scale, and “Extremely happy” at the other ex- 
treme. The rating scales were later divided into 15 
equal parts and responses were scored from 1 to 15 
in the direction of increasing happiness. 
Corresponding instructions were employed for the 
failure questionnaire. In the initial instructions 
subjects were told the following: 


For each situation you will be asked to indicate 
which of the two people would feel unhappier 
about the failure. In doing this try to think about 
the consequences of the failure for each person 
and remember that the consequences of failure are 
not always entirely negative. In some cases a 
person might feel unhappy but also secretly a bit 
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relieved about failing, For each situation decide 
which of the two people would probably feel 
unhappier, more distressed, more upset about the 
failure. Then go on to rate how unhappy you 
think each person would be, using the scales 
provided. 


The following examples indicate the form of the 
items in the failure questionnaire: 


In the highly competitive final year Medical 
School examinations, Anne and John both fail at 
the same level and thus do not qualify to become 
doctors. Who would feel unhappier about the 
failure? 

Following interviews of a number of competing 
candidates Jack and Pamela fail to be selected to 
become shop assistants in a department store. Who 
would feel unhappier about the failure? 


Subjects answering the failure questionnaire 
circled the name of the person whom they thought 
would probably feel unhappier in each situation. 
They then rated how unhappy they thought each 
person would be by putting a cross on a 5-inch (.13 
cm) rating scale for that person, labeled “Not un- 
happy at all” at one extreme, “Uncertain” in the 
middle of the scale, and “Extremely unhappy” at 
the other extreme. As for the success questionnaire, 
the rating scales were later divided into 15 equal 
parts. Responses were scored from 1 to 15 in the 
direction of increasing unhappiness. In both ques- 
tionnaires, the subjects were told that they should 
put a cross somewhere on each line and that they 
should not hesitate to use the extremes of the 
rating scale if necessary. The names of the persons 
involved were the same for corresponding situations 
in both the success and failure questionnaires. 

The 12 items in each questionnaire covered the 
following occupations (for each occupation the names 
of the male and female competing are also given in 
the order in which they were presented): company 
director (Harry, Gwen), janitor (Helen, Stan), solic- 
itor (David, Mary), doctor (Anne, John), school 
principal (Bill, Joyce), newspaper reporter (Dick, 
Sue), secondary schoolteacher (Jill, Tom), laboratory 
assistant (Lisa, Peter), physiotherapist (Julie, Steve), 
"shop assistant (Jack, Pamela), domestic worker 
(Betty, Jim), typist-stenographer (Michael, Jennifer). 
These occupations were deliberately chosen so as to 
cover a wide range of status using the information 
provided by Congalton (1969) on the status and 
prestige of occupations in Australia. They were also 
selected to cover a range of masculine dominance at 
different levels of occupational status, although it 
was difficult to find a very high status occupation 
that was female dominated. E 

Each questionnaire was produced in four forms, 
as a control on possible order effects, each form 
involving a different random order of the 12 items. 
The four forms were equally represented in the 
distribution of the questionnaires to subjects, 

When subjects had completed the 12 items, they 
were asked to list the main factors that influenced 
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them in making their choices. These reasons were 
subjected to content analysis (see below). They then 
rated each occupation for masculine dominance— 
“according to the degree to which the occupation 
is filled by males”—using numbers from 1 to 7 with 
the following labels: 1, The occupation almost 
exclusively employs males; 2, The occupation em- 
ploys a lot more men than women; 3, The occupa- 
tion employs a few more men than women; 4, The 
occupation employs just about equal numbers of men 
and women; 5, The occupation employs a few more 
women than men; 6, The occupation employs a lot 
more women than men; 7, The occupation almost 
exclusively employs females. Subjects then proceeded 
to rate the occupations for status—“according to 
their social standing in the community”—using 
numbers from 1 to 7 with the following labels: 
1, Very high status; 2, High status; 3, Slightly above 
average status; 4, Moderate status; 5, Slightly below 
average status; 6, Low status; 7, Very low status. 
For both the rating of masculine dominance and 
occupational status, the 12 occupations were pre- 
sented in the same order as they had been in that 
form of the questionnaire, The rating tasks were pre- 
sented on separate, successive pages of the question- 
naire and subjects were requested on the ques- 
tionnaire not to turn back to see how they had 
previously answered. 


RESULTS 


Masculine Dominance and Occupational 
Status Ratings 


Table 1 presents the mean ratings for 
masculine dominance and occupational status 
for each of the 12 occupations, based upon 
ratings from all subjects. There it can be 
seen that the occupations involved in the 


TABLE 1 


or M JLINE DOMINANCE AND 
8 BY ALL RESPONDE 


TO Dir 


MEAN RATI) 
OCCUPATIONAL 
IN RELATIC 


Occupation s eel 
Company director 1.36 1.52 
Solicitor 1.95 1,56 
Janitor 2.15 5.87 
Doctor 2.23 1.29 
School principal 2.30 2.03 
Newspaper reporter 2.68 3.40 
Secondary schoolteacher 3.91 3.03 
4.42 3.98 
4.47 2.47 
5.53 5.41 
6.47 5.88 
6,02 4,46 


higher occupationa . On the rating sc 
1-7 with 4 as the the ical neutral point, Rho = i 
in calculating item means ranged from 265 to 266, 
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questionnaire did in fact cover a wide range 
of masculine dominance and status, at least 
as perceived by the respondents. Thus, an 
important requirement of the procedure was 
met. The Spearman rank-order correlation 
between the means for masculine dominance 
and the means for occupational status across 
the 12 occupations was .64. 


Analysis of Choice Data 


Table 2 presents the percentage of times 
the male characters were judged as happier 
with success and unhappier with failure for 
each of the 12 occupations. (The correspond- 
ing percentages for the female characters can 
be found by subtracting the male percentages 
from 100 since each item involved a forced 
choice between male and female characters.) 
Data are presented for all subjects combined 
since male and female subjects showed no 
significant differences in their choices in either 
the success or failure conditions (except for 
janitor in which the male character was more 
likely to be seen as happier with success by 
female subjects, p < .05). 

Table 2 shows that subjects were more 
likely to judge the male to be happier with 


TABLE 2 
PERCENTAGE OF RESPONDENTS JUDGING MALE RATHER 
THAN FEMALE CHARACTERS AS HAPPIER WITH 
Success AND UNHAPPIER WITH FAILURE 
FOR DIFFERENT OCCUPATIONS 


Percentage of respondents 
Occupation Judging male | Judging male 
as happier | ‘as unhappier 
with success with failure 
than female than female 
Company director 59 80*** 
Solicitor 56 8ite* 
Janitor 65 82** 
Doctor 44 70*** 
School principal 56 80*** 
Newspaper reporter 55 68* 
Secondary schoolteacher 45 60* 
Laboratory assistant 20 39%" 
Physiotherapist 34 49* 
Shop assistant 17 24 
Domestic worker 13 13 
Typist-stenographer 14 16 
Note. Ns used ’in calculating percentages ranged from 138 to 


140 for items in success condition and from 126 to 128 for items 
in failure condition. Percentages in relation to female characters 
can be found by subtracting the percentages above from 100. 
Decimals have been rounded to the nearest whole percentage, 
*p <.05. 
“p < 01, 
atp < 001, 
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success and unhappier with failure than the 
female for jobs that were perceived as male 
dominated (newspaper reporter and above). 
They were more likely to judge the female to 
be happier with success and unhappier with 
failure than the male for jobs that were 
perceived as female dominated (laboratory 
assistant and below). The only exceptions 
were in regard to doctor and secondary 
schoolteacher where, in both cases, subjects 
were more likely to judge the female as 
happier with success but the male as unhap- 
pier with failure. It should be noted that, on 
the average, the schoolteacher was judged as 
close to the midpoint of the masculine domi- 
nance scale (see Table 1). 

In the success condition the rank-order 
correlation between percentage choice of male 
as happier and mean masculine dominance 
across occupations was .88; in the failure 
condition the rank-order correlation between 
percentage choice of male as unhappier and 
mean masculine dominance across occupa- 
tions was .94—both correlations indicating 
the close relationship between the choice of 
male or female and the degree to which an 
occupation was seen as dominated by one 
or the other sex.* That it was the sex appro- 
priateness of the job that was the key element 
rather than its status is evident from the fact 
that, when the rank-order correlations across 
occupations were between percentage choice 
and mean occupational status, the correla- 
tions dropped to .38 and .53, respectively, in 
the success and failure conditions. 

The results of x? tests applied to the fre- 
quency data for each occupation showed that 
the male character was more likely to be 
judged as unhappy with failure than as happy 
with success, and (on the other side of the 
coin) that the female character was more 
likely to be judged as happy with success 
than as unhappy with failure—in all but the 


>In these and subsequent rank-order correlations 
involving masculine dominance and occupational 
status it should be noted that lower numbers de- 
noted higher levels of the variable under considera- 
tion (i.e., higher levels of masculine dominance or 
occupational status). In all cases when Spearman rho 
was calculated the ranks were assigned from 1 to 12 


from the highest level of a variable down to the 
lowest level. 


MEAN RATINGS or HAPPINESS ABOUT Si 


Reactions To MALE AND FEMALE SUCCESS AND FAILURE 


541 


TABLE 3 


UT SUCCESS AND UNHAPPINESS ABOUT FAILURE BY ALL RESPONDENTS 
FOR MALE AND FEMALE CHARACTERS FOR DIFFERENT OCCUPATIONS 


Happiness with success Unhappiness with failure 
Occupation Significant effects 
Male Female Male Female 
Company director 12.97 11.88 13.49 10.59 CoA BOS 
Solicitor 13.39 12.66 13.55 12.36 uar 
Janitor 8.91 8.50 9.70 7.46 TAS ACY BC 
Doctor 13.30 13.48 13.76 13.04 | BCH 
School principal 12.78 11.85 11.16 E) a 
Newspaper reporter 12.20 11.82 10.73 Br, ‘cree 
Secondary schoolteacher 11,84 12.11 12.65 Be 
Laboratory assistant 10.63 12.20 11.32 BOOS 
Physiotherapist 12.08 13.16 13.02 ARSTE 
Shop assistant 8.61 10.61 10.40 Atore 
Domestic worker 6.06 9.34 9.95 l A aad 
Typist-stenographer 8.11 11.80 11.86 cre 


Note, Higher numbers denote more happiness or unhappiness. On the rating scale the range was 1-15 with 8 as the theoretical 


neutral point. In the 2 X 2 X 2 analysis of variance, Factor A 


= sex of respondent, Factor B = outcome, Factor C = sex of 


Crater Minan departures from sample Ns occurred in calculating item means due to missing data. 


*p <.0: 
wp < 01 


Hp < 001 


last three occupations which were perceived 
as strongly female dominated. 

Thus, there was compelling evidence sup- 
porting the general prediction, since subjects’ 
judgments of which sex would be happier or 
unhappier with success or failure were clearly 
related to the perceived masculine dominance 
of the occupation and, by assumption, to how 
appropriate the occupation was seen for each 
sex. There was also evidence that, especially 
in male-dominated occupations, males were 
more likely to be seen as unhappier about 
failure than happier about success (and vice 
versa for females since items were in a 
forced-choice format). 


` Analysis of Happiness and Unhappiness 
Ratings 


Table 3 presents the mean ratings of hap- 
piness about success and unhappiness about 
failure for male and female characters for 
each of the 12 occupations. Data are pre- 
sented for all subjects combined since there 
were very few differences associated with the 
sex of the respondent. The data were 
analyzed using analysis of variance for a 
2x22 design with sex of respondent 
(male or female) as Factor A, outcome 
(success or failure) as Factor B, and sex of 
character (male or female) as Factor C, with 


repeated measures on the last factor, since 
each respondent completed two ratings, one 
for the male and one for the female character 
in each item. The method of unweighted 
means was employed to allow for slightly 
unequal ms (Winer, 1962), The significant 
effects indicated by these analyses are also 
presented in Table 3. 

Table 3 shows that there were pervasive 
main effects of sex of character (C effects). 
In general, male characters were rated as 
happier about success than female characters 
and unhappier about failure than female 
characters when they succeeded or failed at 
male-dominated jobs. Similarly, female char- 
acters were rated as happier about success 
than male characters and unhappier about 
failure than male characters when they suc- 
ceeded or failed at female-dominated jobs. 
Again the only exceptions were in regard to 
doctor and secondary schoolteacher and, in 
the former case, there was a significant B X C 
interaction effect (see below). Once. more, 
therefore, there was a compelling support for 
the general prediction. Whereas the choice 
data in Table 2 provided support in regard 
to the likelihood or probability that one sex 
would be seen as happier or unhappier than 
the other, the rating data provided support 
in regard to the Zevel of perceived happiness 
or unhappiness, 
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Table 3 also shows that there were some 
highly significant interactions involving sex of 
character with outcome (BC effects). These 
occurred for the occupations of company 
director, janitor, doctor, and school principal, 
and in each case they were in the same direc- 
tion: Ratings for male unhappiness after 
failure were higher than ratings for male 
happiness after success, whereas ratings for 
female happiness after success were higher 
than ratings for female unhappiness after 
failure. These interaction effects therefore 
partly paralleled the significant y? effects 
noted in the previous section, but they were 
less extensive across occupations. 

A small number of main effects involving 
sex of respondent (A effects) and outcome (B 
effects) and one interaction involving sex of 
respondent and sex of character (AC effect) 
also emerged from the analysis but they were 
not easily interpretable. In regard to the A 
effects, female respondents tended to assign 
higher happiness and unhappiness ratings 
to success and failure, respectively, for 
the physiotherapist than male respondents, 
whereas these ratings tended to be higher for 
male respondents in relation to the shop 
assistant and domestic worker. In regard to 
the B effects, ratings tended to be higher fol- 
lowing failure than following success for the 
secondary schoolteacher, but higher following 
success than following failure for the news- 
paper reporter and laboratory assistant. The 
AC interaction that emerged for janitor was 
due to the fact that whereas female respon- 
dents tended to provide higher ratings for the 
male character than for the female character, 
male respondents did the reverse, providing 
higher ratings for the female character than 
for the male character, 


Relationship of Ratings to Occupational 
Status and Masculine Dominance 


Tt is clear from Table 3 that the mean 
ratings of happiness and unhappiness varied 
considerably across the 12 occupations. An 
analysis was conducted in which the rank- 
order correlations of rated male and female 
happiness about success with occupational 
status and masculine dominance across occu- 
pations were separately obtained for each 
subject who answered the success question- 
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naire. Similarly, the corresponding rank-order 
correlations involving rated male and female 
unhappiness about failure were also separately 
obtained for each subject who answered the 
failure questionnaire. These correlations were 
also analyzed using the analysis of variance 
for a 2 X 2 X 2 design with sex of respondent 
(A), outcome (B), and sex of character (C) 
as factors, with repeated measures on the last 
factor—first for the correlations relating hap- 
piness-unhappiness ratings to occupational 
status, then for the corresponding correlations 
involving masculine dominance. The mean 
rank-order correlations for each condition and 
the results of the analyses of variance are 
presented in Tables 4 and 5, respectively. 

The analyses revealed highly significant sex 
of character (C) effects (p < .001) for both 
sets of correlations—those involving occupa- 
tional status and those involving masculine 
dominance: The correlations were higher 
when they applied to male characters than 
when they applied to female characters, im- 
plying that ratings of happiness-unhappiness 
across occupations were seen as more closely 
associated with occupational status and mas- 
culine dominance for males who succeeded or 
failed than for females. Each analysis also 
revealed significant outcome (B) effects 
(p< .01 for occupational status; p< .05 
for masculine dominance): The correlations 
were higher when they applied to happiness 
ratings in the success condition than when 
they applied to unhappiness ratings in the 
failure condition. These effects should also be 
considered, however, in relation to the signifi- 
cant interactive effects of sex of character 
with outcome (BC) that occurred in both 
analyses (p< .05 for occupational status: 
$ < .01 for masculine dominance): Although 
there was little difference in the mean cor- 
relations between success and failure condi- 
tions in relation to the male characters, the 
mean correlations Were especially low when 
they applied to the ratings of unhappiness for 
female characters who failed. 

These differences in correlations reflect the 
fact that the ratings of happiness about suc- 
cess and unhappiness about failure would 
depend upon the benefits and costs assumed 
to arise when one succeeds or fails at an 
occupation. These positive and negative con- 
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TABLE 4 


MEANS OF RANK-ORDER CORRELATIONS RELATING 
RESPONDENT'S RATINGS OF HAPPINESS 
OR UNHAPPINESS TO RATINGS 
OF OCCUPATIONAL STATUS AND 
MASCULINE DOMINANCE ACROSS OCCUPATIONS 


Occupational Masculine 
status dominance 
Respondent F. 
Male | Female | Male | Female 
Male 
Success condition | .67 40 55 AL 
Failure condition 66 .26 52 —.08 
Female 
Success condition | .66 50 .49 .09 
Failure condition | .62 33 55 -00 


Note. Minor departures from sample N's occurred in calcu- 
lating means due to missing data. 


sequences would be a function of at least 
two factors: the status level of the job and 
the perceived appropriateness of the occupa- 
tion for the sex concerned. It can be assumed 
that, for male characters, these two factors 
worked in the same direction to influence the 
happiness—unhappiness ratings for success- 
failure over the range of occupations used in 
the present study, where occupations with 
higher status were also seen as more likely 
to be filled by males. Success at a higher 
status job for a male, for example, would not 
only result in higher benefits of various kinds 
associated with status, but it would also be 
rewarding because the job was seen as appro- 
priate for males—success would be consistent 
with the male sex role. Hence there would 
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be clear differentiation in the benefits of 
success across occupations for male charac- 
ters, resulting in higher correlations of happi- 
ness ratings with occupational status and 
masculine dominance. Similarly, the costs of 
failure would be more clearly differentiated 
across occupations for the male characters, 
resulting in higher correlations of unhappiness 
ratings with occupational status and mascu- 
line dominance. It can also be assumed, how- 
ever, that for female characters the two 
factors (status level of the job and the per- 
ceived appropriateness of the occupation for 
the sex concerned) worked in opposite direc- 
tions since the higher status jobs were male 
dominated and the lower status jobs were 
female dominated. Success at a higher status 
job for a female, for example, would presum- 
ably also result in higher benefits (as for the 
male) but there would be a counteracting 
reduction in attractiveness due to the fact 
that success at the occupation would not be 
seen as appropriate for females—success 
would be inconsistent with the female sex 
role. Hence there would be less clear differen- 
tiation in the benefits of success across occu- 
pations for female characters, resulting in 
lower correlations of happiness ratings with 
occupational status and masculine dominance. 
For similar reasons, correlations involving 
unhappiness ratings for female characters 
across occupations would also be attenuated. 

The results in Table 4 further suggest that, 
for the range of occupations used in the pres- 
ent study, differences in status had greater 


TABLE 5 
RESULTS OF ANALYSIS OF VARIANCE 
Occupational status Masculine dominance 
Source — —|— á ~ mos de EE 4 
df MS F dj MS F 
2 e dent (A; 1.46 1 O14 18 
Ci 8:77» l sos | 630 
AXB .23 1 .264 3.34 
Error 1 227 079 
Q, shara s 122,97%** 1 29.080 234.52"** 
Ser O ss (C) 182° i “055 Pr 
B x € 6.75* 1 864 6.97"" 
4 ~ -00 | 1 | -002 02 
Pee F(a ages 7 
rror TASE URIE, fyr: A 
*p <.05. 
** p <01. 


* p < 001. 
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weight in influencing happiness-unhappiness 
ratings than differences in masculine domi- 
nance since the rank-order correlations 
involving occupational status were higher 
throughout. 


Content Analysis of Reasons 


As indicated previously, subjects were asked 
to list the main factors that influenced them 
in making their choices. These reasons were 
subjected to content analysis using the 
following categories: 

A. Male provider: The reason indicated 
that the male was seen as the provider, need- 
ing finance to support the family, etc. 

B. Male dominant: The reason indicated 
that the male was seen as dominant in soci- 
ety; that he was pressured by society to 
succeed and do well, moving to higher status 
positions and a sound job. 

C. Female caretaker: The reason indicated 
that the female was seen as the main nurtu- 
rant caretaker of the family; that her occupa- 
tional choice took into account her responsi- 
bility for raising and caring for her family. 

D. Sex-typed occupations: The reason in- 
dicated that certain occupations are tradi- 
tionally seen by society as appropriate for 
males while other occupations are viewed as 
more appropriate for females. 

E. Overcoming traditional sex roles: The 
reason indicated a concern with overcoming 
traditional sex roles as they exist in society; 
females succeeding in male-dominated jobs, 
etc. 

F. Sex-related personality or physical char- 
acteristics; The reason involved a concern 
with those characteristics of the male (or fe- 
male) that might make him (or her) suitable 
or unsuitable for particular jobs, for example, 
physical strength, aggression, emotionality, 
dependence, confidence, etc. 

G. Occupational benefits: The reason re- 
ferred to benefits associated with the job such 
as status, salary, good working hours— 
without much further elaboration. 

H. Occupational qualifications: The reason 
referred to qualifications or preparation neces- 
sary to enter the occupation such as amount 
of study, ability, education—without much 
elaboration. 
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I. Miscellaneous: The reason could not be 
clearly classified in any of the categories 
above. 

These categories were determined after 
careful analysis of the reasons given by sub- 
jects. Categories A-F related to sex roles in 
society. Coding was not easy since some sub- 
jects provided a lot of material, others very 
little, and some of it was poorly expressed. 
Moreover, although the categories finally 
chosen were fairly exclusive, in a small num- 
ber of cases it was difficult to decide which of 
two categories should be coded (e.g., Category 
D versus Category F). A reason was not 
coded more than once for any protocol and 
then only if it was clearly stated.? An inter- 
scorer reliability check was conducted based 
upon 125 randomly selected protocols. This 
check indicated that the percentage of agree- 
ment between the two scorers (Christine Bar- 
clay and the author) and the relevant ¢ co- 
efficients were as follows: * Category A (96%, 
= .90), Category B (82%, $ = .43), Cate- 
gory C (98%, ¢ = .89), Category D (86%, 
$ = .71), Category E (96%, $ = .91), Cate- 
gory F (87%, $ = .64), Category G (87%, 
$= .68), Category H (92%, $ = .67), and 
Category I (74%, ¢ = 43). 

Table 6 presents the percentage of times 
each type of reason occurred in each of the 
four groups, together with the p values asso- 
ciated with overall x? tests applied to the fre- 
quency data (the frequency with which each 
category was scored—not scored in each of the 
four groups). The data in Table 6 are useful 
in providing information about what respon- 
dents saw as the main factors influencing 
their responses to the items as a whole. Since 
each respondent usually gave a number of 
different reasons (and not in any order of 
importance), it was not possible to investi- 
gate whether the choices and ratings provided 
were systematically related to distinctive 


? The coding was conducted by Christine Barclay. 

3 Agreement would involve both scorers coding the 
Presence (1,1) of a particular reason and both 
scorers coding the absence (0,0) of a particular 
reason. The percentage of 1,1 and 0,0 agreements 
was calculated for each Category over the 125 proto- 
cols. The ¢ coefficient provided a further index of 


interscorer reliability analo; i 
A gous to a c 
AA orrelation 
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TABLE 6 
PERCENTAGE OF RESPONDENTS REPORTING EACH TYPE or REASON IN RELATION 
TO SEX Or RESPONDENT AND NATURE OF OUTCOME 
Percentage of times each reason reported 
Type of reason Males Females t 
Success Failure Success Failure 
(n = 72) (n = 66) (n = 66) (n = 61) 
Male provider 10 35 21 33 <.01 
Male dominant 17 11 26 31 <.05 
Female caretaker 11 9 35 13 <.001 
Sex-typed occupations 29 32 32 41 ns 
Overcoming traditional sex roles 50 12 39 21 <.001 
Sex-related personality and 
physical characteristics 33 21 18 21 ns 
Occupational benefits 26 15 17 5 <.05 
Occupational qualifications 8 20 6 20 <.05 
Miscellaneous 26 48 27 33 <.05 


Note. Two males in the success condition and one male in the failure condition did not provide reasons. Decimals have been 


rounded to the nearest whole percentage, 


types of reasons subsequently noted by the 
respondents. However, it was possible to dis- 
cover the frequency with which certain types 
of reasons occurred both in relation to the 
sex of respondents and to the type of out- 
come in the questionnaire they answered (that 
is, to success or failure)—interesting informa- 
tion in its own right. 

Inspection of Table 6 indicates that there 
were some differences in percentages between 
the groups which were associated with signifi- 
cant overall x’ values. Females were more 
likely than males to provide reasons referring 
to male dominance—especially in the failure 
condition. When compared to male respon- 
dents they were also more likely to report 
“male provider” and “female caretaker” rea- 
sons in the success condition. Both male and 
female respondents were more likely to give 
overcoming traditional sex roles as a reason 
for their choices in the success condition than 
in the failure condition, and this difference 
was more pronounced for the male respon- 
dents. There were no significant differences in 
regard to reasons concerning traditional sex 
roles or sex-related characteristics (Categories 
D and F) but these reasons featured promi- 
nently in all four groups. Reference to “oc- 
cupational benefits” was more likely to occur 
in the success condition than in the failure 
condition, but reference to “occupational 
qualifications” was more likely to occur in 


the failure condition than in the success con- 
dition. These differences should be evaluated 
in the context of the differences in interscorer 
reliability for the different categories noted 
above. 


Discussion 


The present results provided strong support 
for the general hypothesis. Not only were 
males or females more likely to be judged as 
happier (or unhappier) about success (or 
failure) solely on the basis of their sex and 
the degree to which the occupation was seen 
as likely to be filled by males or females, but 
differences in the levels of rated happiness- 
unhappiness followed the same pattern, Sub- 
jects’ responses thus appeared to be influenced 
by their conceptions or stereotypes about 
what jobs were appropriate and inappropriate 
for men and women within society. The pro- 
cedure used in the present study with its 
emphasis on male and female competition was 
successful in making these stereotypes salient. 
The content analysis of reasons also sup- 
ported assumptions that sex-role stereotypes 
were important factors influencing the re- 
sponses that subjects provided to the items as 
a whole. These stereotypes concerned not only 
conceptions about appropriate or traditional 
jobs for men and women but also more gen- 
eral beliefs about sex roles, some of which 
were more likely to be mentioned by one sex 
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or the other (e.g., “male dominance” by fe- 
male subjects) or by all respondents in one 
condition rather than in the other (e.g., “over- 
coming traditional sex roles” in the success 
condition) . 

At the same time there was also evidence of 
differences in the way males and females were 
seen to react to success and failure, apart 
from the general effects just noted. Both the 
percentage of choice data and the rating data 
indicated that failure at male-dominated oc- 
cupations was deemed to be especially dis- 
tasteful for males. They were more likely to 
be judged as unhappy with failure than as 
happy with success at these occupations, and 
the differences between ratings of unhappiness 
about failure and happiness about success 
were especially evident for these occupations. 
Feather and Simon (1975) found that fail- 
ure was generally viewed as more distressing 
for the male than for the female in a number 
of different ways whereas success was a rela- 
tively uncomplicated blessing for the male. 
The results of the present study qualify these 
previous findings somewhat. It is failure at 
male-dominated occupations that is seen as 
especially upsetting for the male—failure at 
occupations that are considered to be his terri- 
tory. That masculine dominance is the im- 
portant element in producing these effects 
rather than occupational status is clear from 
the fact that they occurred for the occupa- 
tion of janitor, a very low status job. 

In contrast, there was evidence from both 
the percentage of choice data and the rating 
data that success at male-dominated occupa- 
tions had a special attraction for females. 
They were more likely to be judged as happy 
with success than as unhappy at failure at 
these occupations, and the differences between 
ratings of happiness about success and un- 
happiness about failure were especially evi- 
dent for these occupations. Apparently there 
was a special cachet or distinction for the 
female who succeeded in male territory. As 
mentioned above, both male and female re- 
spondents referred to the overcoming of tradi- 
tional sex roles as a factor when giving the 
reasons for their responses, especially in the 
success condition. One can conclude that the 
emphasis over the past few years on redefin- 
ing the female sex role in relation to achieve- 
ment was having an effect, at least for the 
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present group of subjects (see also Feather & 
Raphelson, 1974). 

Across all 12 occupations the likelihood that 
the male would be judged as happier (or un- 
happier) with success (or failure) than the 
female was closely related to the perceived 
masculine dominance of the job—much more 
so than to its perceived status. The ratings 
of happiness about success and unhappiness 
about failure, which may be taken as indi- 
cators of the strength of the positive valence 
of success and the negative valence of failure, 
respectively (Feather, 1967), also varied 
considerably across occupations in line with 
the benefits and costs that might be expected 
to ensue from succeeding (or failing) at an 
occupation of a particular perceived status 
and of a particular perceived appropriateness 
for one or the other sex. 

These differences, however, should not di- 
vert attention from the dominating general 
finding that reactions to male and female suc- 
cess and failure for an occupation depended 
upon the perceived appropriateness of the 
occupation for the sex concerned—success be- 
ing evaluated more positively if it was con- 
tent with societal conceptions about the 
sex role than if it was inconsistent, and fail- 
ure being evaluated more negatively if it was 
inconsistent with societal conceptions about 
the sex role than if it was consistent. Much 
of the literature on the so-called “fear of 
success” can be viewed in this light since 
typically in these studies girls have been 
asked to write projective stories to a verbal 
cue about a female character who succeeds 
in a male-dominated occupation (Horner, 
1972a, 1972b). In terms of our general pre- 
diction, such a success would be viewed as 
less attractive for the female than for the 
male succeeding at the same occupation, given 
the stereotypes that exist in society.t The 
fact that these effects can be identified by 


‘It is interesting to note that this difference did 
not occur for the occupation of doctor in the present 
study (see Table 3). Perhaps one should also take 
into account qualitative aspects of the occupation. 
Medicine is typically perceived as involving helping 
behavior (perhaps more consistent with the tradi- 
tional feminine sex role) whereas some of the other 
male-dominated occupations (e.g., company director) 
may have been seen as involving greater degrees of 
aggression and drive (perhaps more consistent y ith 
the traditional masculine sex role). A 
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male subjects as well as by female subjects 
supports the notion that the fear of success 
studies are tapping social stereotypes—widely 
shared beliefs about the appropriate achieve- 
ments for each sex in our society. Whether 
these studies are also tapping motives is a 
question demanding further investigation, in- 
volving not only well-controlled laboratory 
studies of choice, performance, and persistence 
(see Feather & Raphelson, 1974) but also 
close analysis of the meaning of success (Rob- 
bins & Robbins, 1973; Tresemer, 1973). The 
present study greatly extends the scope of 
research in this area since it has considered 
failure as well as success across a wide span 
of occupations and has provided a reason- 
ably objective means for investigating re- 
sponses. 

What direction might future research take? 
It would be useful in future studies to extend 
the inquiry beyond university samples to 
other groups covering a wide spectrum of 
society and to obtain from subjects back- 
ground information about a variety of fac- 
tors that would affect their socialization (e.g., 
socioeconomic status, religion, race, sex). One 
could then determine the degree to which 
responses do in fact reflect generally held 
stereotypes about the consequences of suc- 
cess and failure for men and women in soci- 
ety and the extent to which these stereotypes 
vary with differences in social learning. These 
future studies might also employ the related 
procedure developed by Feather and Simon 
(1975) as well as the present one, and ex- 
tend the inquiry to other types of reaction to 
success and failure (e.g., direction of causal 
attribution; impressions of personality, 
Feather & Simon, 1973, 1975) in addition to 
obtaining information relevant to the positive 
and negative valence of the outcome. Clearly, 
also, it would be useful in further research to 
investigate other occupations, especially high- 
status, female-dominated occupations if a 
sufficient number of them can be found. Fol- 
lowing careful pretesting, one might then be 
able to embark upon a more systematic fac- 
torial experiment involving simultaneous co- 
variation of both degree of male dominance 
and status at different levels, so as to bring 
out the main effects of each component and 
their interactive influence on responses. The 
present study was reasonably successful in 
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covering a wide span of occupations in a 
systematic way, but further sampling of oc- 
cupations is obviously required." 

Finally, there are many other ways in 
which the items developed in the present ques- 
tionnaire could be modified so as to provide 
answers to different questions that might be 
asked. For example, one might study the 
effects of varying the number and ratio of 
males and females who were in competition 
for a particular job; one might investigate the 
effects of providing information about other 
characteristics of the people in competition 
in addition to their gender since in a real-life 
situation, in which a lot of other information 
about people is available, one would expect 
gender to become less important as a basis for 
judgments about reactions to success and 
failure; one might diminish the artificiality 
of the forced-choice format by providing an 
“uncertain” or “don’t know” category, etc. 
These studies, extending to the population at 
large and examining a wide range of occupa- 
tions, would be of obvious value in exploring 
the extent to which there are consistent views 
about the effects of male and female success 
and failure in different occupations. Hope- 
fully, these studies might also provide clues 
about how these stereotypes develop in rela- 
tion to the social learning experiences of peo- 
ple in different segments of society. 

5It would be interesting to treat the traditional 
role of female as wife, mother, and housekeeper in 
occupational terms and include this occupational 
specification in the list. 


REFERENCE NOTE 
1. Simon, J. G., & Feather, N. T. Achievement 
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This study examined the relationship between locus of control and academic 
performance by evaluating the academic achievement of defensive externals, 
which has been considered a confounding factor in previous research. A further 
objective was an examination of the effect of two mediating variables in their 
relation to locus of control and academic performance. One hundred sixty col- 
lege students were administered the Internal, Powerful Others, and Chance 
scales, the Fear of Negative Evaluation scale, and the Achievement via Inde- 
pendence/via Conformance scales. Obtained results demonstrated that internals 
were academically superior to both defensive and congruent externals. Defensive 
externals were shown to have higher grade point averages than congruent 
externals, and female defensive externals achieved greater academic success 
than their male counterparts. Further results suggested that the relationship 
between locus of control and academic performance may not be a direct one 
but may be modified by specific mediating variables. 


The concept of defensive externality, dis- 
cussed by Rotter (1966), refers to the notion 
that an external locus of control may be 
adopted as an ego-defensive, anxiety-reducing 
measure. The rationale for this concept is that 
in attributing reinforcement control to forces 
outside himself, the individual is freed from 
the responsibility for outcomes of his own 
behavior and is thus relieved of the personal 
threat that those outcomes may incur. De- 
fensive externality may therefore be consid- 
ered a means of evading the responsibility for 
anticipated negative reinforcement. 

Horner (1970) and Thurber (1972) have 
suggested that successful academic attain- 
ment is a source of anxiety for many college 
students. Such an observation is tenable, since 
academic success presents a challenge to one’s 
view of oneself and is, therefore, ego involv- 
ing. Also, college grades are a way of obtain- 
ing social recognition as well as representing 
actual achievement (Eisenman, 1967). Pre- 
vious research (Phares, Ritchie, & Davis, 
1968) would suggest that internals respond to 
such achievement-related anxiety by action- 
oriented solutions resulting in greater success 
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and ultimately less anxiety. Other individuals, 
however, might adopt an external orientation 
as a defense against the anxiety associated 
with possible academic failure (Hjelle, 1970). 
These persons, defensive externals, who were 
initially highly competitive would still main- 
tain a comparatively strong achievement 
motivation in the clearly structured, competi- 
tive college academic situation. They would, 
however, defensively account for failure by 
externally controlled attitudes. As a conse- 
quence, such individuals who are somewhat 
similar to internals (e.g., achieving, indus- 
trious) would be more successful academically 
than congruent externals, or persons whose 
behavior is consistent with their external 
“world view.” In comparison to internals, 
however, defensive externals would demon- 
strate a somewhat lower level of academic 
performance, since they adopt defensive ex- 
ternality as an anxiety-reducing measure while 
internals tend to respond to anxiety with 
task-oriented solutions. 

Previous research investigating locus of 
control as a determinant of college academic 
achievement (e.g., Eisenman & Platt, 1968; 
Hjelle, 1970; Prociuk & Breen, 1973) classi- 
fied individuals as either internal or external 
on the basis of their responses to the Rotter 
(1966) Internal—External (I-E) scale. In 
other words, defensive externals were not 
differentiated from congruent externals. Thus, 
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in terms of academic performance, any poten- 
tial differences between internals and exter- 
nals might have been confounded by the rela- 
tively high level of achievement of defensive 
externals. Although this possibility has been 
frequently noted (e.g., Hjelle, 1970; Prociuk 
& Breen, 1973), no systematic investigation 
of the problem has been conducted. The most 
obvious reason has involved the methodologi- 
cal question of differentiating defensive ex- 
ternals from congruent externals., However, 
with the introduction of the Internal, Power- 
ful Others, and Chance scales (Levenson, 
1972), such an investigation now appears 
possible. 

Thurber (1972) has suggested that, in the 
academic situation, the tendency toward de- 
fensive externality is related to the presence 
of a professor who is a powerful extrinsic 
source of reinforcement and to whom the re- 
sponsibility for rewards and punishments may 
be readily surrendered. A similar view is 
expressed by Hersch and Scheibe (1967), who 
stated that in a highly competitive social 
situation (such as college academics), a per- 
son may describe himself as an external, since 
the actions of others may have great rele- 
vance for the success of his own efforts. These 
points thus argue that in college academics, 
the specific external forces to which responsi- 
bility for reinforcement is defensively attrib- 
uted are powerful others. As a logical exten- 
sion, defensive externals may be identified as 
those individuals who believe that powerful 
others (e.g., professors) are responsible for 
their reinforcements (e.g., grades) but who 
tend to behave somewhat similarly to inter- 
nals, regarding reinforcement as primarily 
obtained through their own effort. This ap- 
parent inconsistency may be understood by 
the fact that an external belief in powerful 
others implies that a potential for personal 
control exists (Levenson, 1972, 1974). Con- 
gruent externals, on the other hand, may be 
identified as those individuals who possess the 
belief that their level of academic perform- 
ance is determined by luck, chance, or fate. 
As a consequence, such persons would be least 
effective and least efficient in academic en- 
deavors (Prociuk & Breen, 1974), thus achiev- 
ing a lower degree of academic success com- 
pared to either defensive externals or inter- 
nals. 
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As previously noted, defensive externality 
refers to the attribution of responsibility for 
reinforcement to external forces as a defense 
against the anxiety associated with anticipated 
negative evaluation or failure. Therefore, it 
is reasonable to assume that fear of negative 
evaluation may be an important mediating 
variable in an investigation of this concept. 
Specifically, fear of negative evaluation might 
increase one’s predisposition toward defensive 
externality if the individual is in a state of 
anxiety and believes that he can reduce the 
anxiety by adopting such a defense mecha- 
nism. Consequently, a further objective of 
this research was to examine the social eval- 
uative anxiety experienced by individuals in 
the college academic environment. 

Additionally, research (Thurber, 1972) 
suggests that conformance versus indepen- 
dence may also be an important mediating 
variable in research on defensive externality. 
This view is consistent with several studies 
which have demonstrated that an external 
orientation predisposes one to be more sensi- 
tive to the reactions or demands of outside 
agents, especially those in high-status posi- 
tions (Biondo & MacDonald, 1971; Ritchie 
& Phares, 1969). As a logical extension, de- 
fensive externals, who attribute responsibility 
for rewards and punishments to powerful oth- 
ers or those in high-status positions, would 
tend to become more sensitive to the demands 
of such others. Consequently, the defensive 
external’s level of academic performance 
might reflect a type of conforming depen- 
dency in which the rules of the academic 
game are followed in exchange for reinforce- 
ment (grades) from powerful others (profes- 
sors). 

An investigation of conformance versus 
independence is also seen as particularly 
relevant for interpreting the difference in 
academic performance between males and 
females. Externality represents such attitudes 
as conformity and dependency, which have 
been traditionally associated with the female 
role (Thurber, 1972). Therefore, females may 
be more motivated to adopt defensive exter- 
nality because it simultaneously leads to an 
affirmation of expected feminine attitudes. If 
such an external orientation effectively deals 
with achievement-related anxiety in women, it 
follows that females should show academic 
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performance superior to males, a finding re- 
ported by Eisenman and Platt (1968). 

In summary, the following relationships 
were hypothesized. Internals, male and fe- 
male, possessing a belief that academic per- 
formance is a direct function of their own 
ability and effort, are more successful aca- 
demically than either corresponding defensive 
externals or congruent externals. Second, de- 
fensive externals of either sex, who believe 
that powerful others control their sources of 
reinforcement, are more successful academi- 
cally than corresponding congruent externals, 
who believe that their academic success is 
determined primarily by luck or chance fac- 
tors, Finally, female defensive externals, for 
whom defensive externality simultaneously 
affirms traditional feminine characteristics, 
are more successful academically than male 
defensive externals. Regarding mediating 
variables, it was predicted that achievement 
via independence is a significant variable ac- 
counting for the academic performance of 
internals, and that achievement via conform- 
ance is a significant variable accounting for 
the levels of academic achievement of defen- 
sive externals and congruent externals. Also, 
it was predicted that fear of negative evalua- 
tion is a significant variable accounting for 
the academic performance of congruent ex- 
ternals, who employ neither task-oriented 
solutions nor defensive externality as a means 
of reducing achievement-related anxiety. 


METHOD 
Subjects 


Subjects were 66 male and 94 female college stu- 
dents enrolled in second- and third-year psychology 
courses at the University of Manitoba. All subjects 
participated voluntarily, as part of a classroom exer- 
cise. Upper year students were selected, since a 
measure of academic performance, grade point aver- 
age, was available for them. 


Scales 


Each of the Internal (I), Powerful Others (P), 
and Chance (C) scales developed by Levenson (1972, 
1974) consists of eight items in a 6-point format. 
The items are administered to the subject as a 
unified attitude scale of 24 items. The scales consist 
of several items adapted from the I-E scale and a 
set of statements written specifically for these new 
scales, The possible range of scores on each of these 
measures is from 0 to 48. 

The Achievement via Conformance (Ac) and 
Achievement via Independence (Ai) measures are 


551 


two of the 18 scales of the California Psychological 
Inventory (Gough, 1964). The Ac and Ai were de- 
veloped to identify those interest and motivational 
factors which facilitate achievement in any situation 
where conformance or independence, respectively, are 
regarded as positive behavio The Ac sis 

38 items, while the Ai has 32 items; both scales em- 
ploy a true—false format. 

The Fear of Negative Evaluation (FNE) scale 
was developed by Watson and Friend (1969) to 
measure the anxiety of being evaluated. Defined by 
these researchers, fear of negative evaluation refers 
to an individual’s apprehension about other’s eval- 
uations, distress over negative evaluation, and avoid- 
ance of evaluative situations. Watson and Friend 
(1969) stated that this concept differs from test 
anxiety in that it is not specific to testing situations 
but may operate in a variety of social-evaluative 
instances, lt is, therefore, considered relevant in any 
potential evaluative situation where social anxiety 
may act as a mediating variable. The FNE consists 
of 30 true—false items. 


Procedure 


The battery of questionnaires was administered to 
four undergraduate psychology classes. The subjects 
were told that the scales they were completing were 
designed to measure certain psychological constructs, 
the results of which were to be used for research 
purposes only. Additionally, they were informed 
that participation was on a voluntary basis. The 
measure of academic performance, grade point aver- 
age (GPA), was obtained from the University Reg- 
istrar’s Office for each subject. 


Statistical Analyses 


Subjects were subdivided according to sex, and 
each individual’s scores (his score on cach of the 
I, P, and C scales) were converted to their standard 
score equivalents. Employing a direct comparison of 
such subject's three standard scores, he was identified 
as either an internal, defensive external, or congruent 
external, which depended upon whether his highest 
standard score occurred on the I, P, or C scale, 
respectively. Each subject was thus placed into one 
of six groups according to sex (two levels) and 
locus of control (three levels). 

A stepwise regression analysis (Afifi & Azen, 1972) 
was calculated for each of the six subject groups. In 
each case, academic performance (GPA) was the 
criterion or dependent variable with scores on the 
Ac, Ai, FNE, and the specific Levenson scale (e.g. 
I scale for internals) as the predictor or independent 
variables. The first step in this selected the 
best single variable predicting GPA. The second step 
found the variable best predicting GPA, given the 
first variable already entered into the regression 
equation. In subsequent steps, either a variable was 
entered which best improved the prediction of GPA, 
given all other variables entered from previous 
steps, or a variable was removed from the predictor 
set if its predictive power fell below a given level 
(see Afifi & Azen, 1972, p. 128). Employing this 
analysis, it was possible to determine which predictor 
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variables accounted for the greatest proportion of 
the variance in GPA, that is, which variables best 
predicted GPA. Given that the proportion of vari- 
ance accounted for by any given variable was de- 
pendent upon its effect and importance, such an 
analysis permitted an evaluation of the relationship 
among locus of control, the mediating variables, and 
academic performance. For example, if Ai scores 
accounted for a significant proportion of the variance 
in the GPAs of male and female internals, such 
results would provide support for the prediction that 
achievement via independence was an important 
variable mediating the relationship between academic 
performance and internal locus of control. 

Following such analyses, the mean GPAs of spe- 
cific groups were compared according to stated 
hypotheses using the Bonferroni t statistic (Kesel- 
man, 1974; Perlmutter & Myers, 1973), which thus 
specified the relationship between locus of control 
and academic performance. This planned compari- 
son procedure was employed to control the experi- 
mentwise error rate, that is, the probability that one 
or more erroneous conclusions will be drawn in a 
given study (Kirk, 1968), since these multiple com- 
parisons of means were nonorthogonal. 


RESULTS 


Means and standard deviations of all varia- 
bles were calculated for males and females 
and orthogonal ¢ tests computed to determine 
whether any mean differences might be at- 
tributed to sex. Results indicated no signifi- 


TABLE 1 


SUMMARY TABLE: STEPWISE REGR 
OF GRADE POINT Avera 


ON ANALYSES 


Group pa rs t Rib 
Males 
Internals FNE | .121 2.88** | 332 
i 145 2.38* 
Ai 055 1.28 
Defensive externals | P 094 | —1.47 151 
Congruent externals | Ai -362 3.11** | .389 
Females 
Internals Ai 140 2.52* | .200 
Defensive externals | P -107 2.38* | .251 
Ai -129 2.01 
Congruent externals | Ai 126 2.67% | .313 
C 147 2.16* 


Note, Abbreviations of scales are as follows: I = Internal, 

= Powerful Others, C = Chance, Ai = Achievement via 
Independence, and FNE = Fear of Negative Evaluation. 

* 7? is the squared semipartial correlation coefficient. 

b R? value is based on the entire set of four predictor 
variables, 
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cant sex differences on any of the measures 
except GPA, where females were superior, 
t(158) = 2.50, p < .05. 

As previously noted, through stepwise re- 
gression analysis it was possible to select the 
best subset of predictor variables, in order of 
their predictive power, while simultaneously 
removing those variables whose predictive 
ability fell below a given level. Preliminary 
analysis indicated that any variable which 
accounted for 4% or less of the variance in 
GPA did not add significantly to the predic- 
tive power of the subset; hence this cutoff 
level was adopted. Table 1 summarizes the 
results of the stepwise regression analysis 
calculated for each subject group.? Squared 
multiple correlation coefficients (R?) based on 
the entire predictor-variable set, are also re- 
ported. 

Results indicated that for male internals, 
FNE scores accounted for 12.1% of the vari- 
ance in GPA, (19) = 2.88, p < -01, while I 
scores accounted for 14.5%, ¢(19) = 2.38, p 
<.05. For female internals, Ai scores ac- 
counted for 14.0%, #(36) = 2.52, p < .05, of 
the variance in GPA. 

Although none of the predictor variables 
accounted for a significant proportion of the 
variance in GPA for male defensive externals, 


1 For researchers interested in how well GPA was 
predicted from the entire set of six predictor yaria- 
bles, without regard to any locus of control classi- 
ficatory moderations, the following supplementary 
analyses were calculated, For males (n= 66), the 
correlations between GPA and predictor variables 
I, P, C, Ac, Ai, and FNE were .41, p< .01; hh 
P< 05; —A4l, p<.01; 32, P<.03; 46, p<.01; 
and —.09, respectively. For females, (n= 94), cor- 
responding correlation coefficients were 47, p<.01; 
701; =.22, p < .05; 35, b <.01; 44, p <.01; and 
—.01. For males, a stepwise regression analysis indi- 
cated that Ai accounted for 21.3%, (59) = 2.92, p 
S.01, while I accounted for 11.6%, t(59) 


not predicting GPA significantly. The entire predictor 
variable set accounted for 35.8% of the GPA vari- 
ance, F (6,59) =5.47, < .01. For females, such 
analysis indicated that I accounted for 22.1%, t(87) 
= 3.37, p < .01, and Ai accounted for 8.7%, t(87) 
=2.97, p<.01, of the GPA variance; the other 
variables not Predicting significantly. The six varia- 
bles, combined, accounted for 36.1% of the variance 
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TABLE 2 


GRADE POINT AVERAGE MEANS AND STANDARD 
Deviations FOLLOWING STEPWISE 
REGRESSION ANALYSES 


Locus of control 
Sex 
Internals | Oeiras | externals 

Males 

n 24 23 9 

2.81 2.37 2.03 

SD 31 21 32 
Females 

A AL 27 26 

x 2.93 2.72 2.29 

SD 20 34 28 


the P scale accounted for the greatest per- 
centage, namely, 9.4%. For female defensive 
externals, P scores accounted for 10.7%, 
t(22) = 2.38, p < .05, of the GPA variance. 

For male congruent externals, Ai scores 
accounted for 36.2%, t(14) = 3.11, p < .01, 
of the GPA variance. Ai scores accounted for 
12.6%, t(21) = 2.67, p< .05, and C scores 
accounted for 14.7%, t(21) = 2.16, p< .05, 
of the variance in GPA for female congruent 
externals. 

Table 2 provides a summary of the GPA 
means and standard deviations, for the six 
subject groups, following stepwise regression 
analyses. Bonferroni £ statistics were then 
calculated comparing the mean GPA values. 
To control the experimentwise error rate, the 
probability of Type I error was set at a= 
‘05 for the entire collection of comparisons. 
Results of the calculated pairwise compari- 
sons are summarized in Table 3. 

Results indicated that male and female in- 
ternals were more successful academically 
than corresponding congruent externals with ¢ 
values of 7.34 and 9.95 for males and females, 
respectively (p < 001). Also, results (£= 
5.59; t = 2.81) demonstrated that male and 
female internals, respectively, achieved a sig- 
nificantly (p < .0096) higher level of aca- 
demic success than corresponding defensive 
externals. 

Pairwise comparisons of mean GPAs of 
male and female defensive externals with 
corresponding congruent externals indicated 
the former to be more successful academically. 
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Bonferonni ¢ values were 3.31 and 4.95 for 
males and females, respectively, (p < .0096). 
Finally, a comparison of the mean GPAs of 
female with male defensive externals showed 
that females achieved a higher level of aca- 
demic success, £ = 4.38, p < .0096. 

Supplementary pairwise comparisons dem- 
onstrated no significant differences between 
the levels of achievement of male versus fe- 
male internals and male versus female con- 
gruent externals. Consistent with the evalua- 
tion of previously reported mean differences 
and respecting a priori established confidence 
levels, these supplementary comparisons were 
evaluated employing a = .0096. 


Discussion 
Locus of Control and Academic Performance 


Results of Bonferroni ¢ statistics supported 
the hypothesized relationship between locus 
of control and academic performance. The 
findings indicated that internals were more 
successful academically than either defensive 
externals or congruent externals. Such re- 
sults are consistent with the theoretical rela- 
tionship between locus of control and aca- 
demic performance suggested by Rotter 
(1966). Rotter hypothesized that internals, 
who believe that reinforcements (grades) are 
a direct result of their own ability and effort, 
manifest more achievement-directed behavior 
than do externals. The consequent effect, that 
is, internals achieve a higher degree of aca- 
demic success than defensive externals or con- 


TABLE 3 


PAIRWISE NONORTHOGONAL COMPARISONS OF 
GRADE POINT AVERAGE MEANS 


Comparison t 
Males 
Internals vs. congruent externals 7.34** 
Internals vs. defensive externals 5.59% 
Defensive externals vs. congruent 
externals 3.31" 
Females 
Internals vs. congruent externals 9.95"* 
Internals vs. defensive externals 2.81* 
Defensive externals vs. congruent 
externals 4,95* 
Defensive externals 
Females vs. males 4.38* 
* p < .0096. 
** p < 001. 
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gruent externals, was demonstrated by the 
obtained data. 

The hypothesis that defensive externals are 
more successful academically than congruent 
externals was also supported. As previously 
stated, defensive externals may be identified 
as those persons who ascribe reinforcement 
responsibility to powerful others primarily as 
a defense against possible failure. Despite 
this external orientation, these individuals 
continue to maintain a comparatively strong 
achievement orientation (Hjelle, 1970) and 
thus achieve a higher degree of academic 
success than congruent externals, whose 
achievement-related behaviors reflect their ex- 
ternal belief that chance, luck, or fate control 
their reinforcements. This specific finding is 
considered particularly important for inter- 
preting the nonsignificant or equivocal results 
of previous research (e.g., Eisenman & Platt, 
1968; Hjelle, 1970; Prociuk & Breen, 1973) 
in which defensive externals were not differen- 
tiated from congruent externals. Rather, 
these two groups were classified together sim- 
ply as externals. Thus in terms of academic 
performance, any noted differences between 
internals and externals would have been at- 
tenuated as a result of the relatively high level 
of achievement by defensive externals demon- 
strated by the present study. 

Finally, results supported the hypothesis 
that female defensive externals are academi- 
cally superior to their male counterparts. 
Thurber (1972) suggested that females may 
be more predisposed to adopt defensive ex- 
ternality as a defensive measure for allevi- 
ating achievement-related anxiety, since it 
simultaneously leads to an affirmation of 
expected feminine attitudes such as depen- 
dency. Results of the present study provide 
some indirect support for this contention. 
Specifically, for female defensive externals, 
the data indicated that the greater the belief 
in control by powerful others, the higher was 
the level of academic success. For male defen- 
sive externals, however, the relationship be- 
tween belief in powerful others and academic 
achievement, while nonsignificant, was the 
inverse. 

It should be noted that females, as a group, 
had a significantly higher mean grade point 
average than males, a finding also reported 
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by Eisenman and Platt (1968). 
the present study enable some clarif:< 
this particular finding. Specifically. 
search demonstrated that female 
externals were more successful academ 

than their male counterparts and that c 
ences between male versus female inte 

or congruent externals failed to reach sig: 
cance. Therefore, the difference in acade! 
success between males and females may 
least be partially explained by the superi 
achievement of female versus male defensi 
externals. 

While the results of this investigation sup 
ported the hypotheses concerning locus of 
control and academic performance, cross- 
validation and replication is considered desir- 
able. Confirmatory data from new samples 
and different institutions would extend the 
generalizability of the present findings. 


de 


Effect of Mediating Variables 


Contrary to expectation, fear of negative * 


evaluation was positively related to grade 
point average for male internals and ac- 
counted significantly for their level of aca- 
demic performance. It thus appears that 
while such individuals may generally feel con- 
fident in social evaluative situations, which 
is demonstrated by the negative relationship 
between internal control and fear of negative 


evaluation (Watson & Friend, 1969), they | 


nevertheless express some concern regarding 
the possible negative evaluation of their per- 
formance in the college academic situation. 
This concern might very likely result from 
a realistic appraisal of the consequences asso- 
ciated with academic failure. Such an inter- 
pretation is consistent with previous research 
suggesting that internals utilize information 
effectively (Phares, 1968) and is also con- 
gruent with social learning theory (Rotter, 
Chance, & Phares, 1972), which emphasizes 
the importance of the psychological situation 
in accounting for human behavior. The fact 
that these findings are specific only to male 
internals may be explained in terms of social 
expectation, as it is generally accepted that 
academic success is more important in deter- 
mining worldly success for males than for 
females. 

As anticipated, achievement via indepen- 
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dence was a significant variable accounting 
for the level of academic achievement of fe- 
male internals, and while not significant, ac- 
counted for more than 4% variance, the 
adopted cutoff level, for male internals. These 
results provide some support for the predic- 
tion that internals, described as independent 
and self-directed (Odell, Note 1), employ an 
independent achievement method in college 
academics, 

Results also demonstrated that achieve- 
ment via independence was an important 
variable in accounting for the academic per- 
formance of congruent externals. While these 
persons might be expected to be conforming 
(Odell, Note 1), such results suggest that 
they nevertheless attempt to achieve academic 
success by self-directed methods. Previous re- 
search (Davis & Phares, 1967) has demon- 
strated that externals are less able to actively 
seek information relevant to problem solu- 
tion. Given that such is the case, congruent 
externals might concentrate on and study 
topics which they consider to be important 
and not necessarily those relating to academic 
work. As a consequence, it would be expected 
that such individuals would demonstrate the 
relatively low level of academic performance 
indicated by the obtained results. 

Within the broader context of college aca- 
demic achievement, it should be noted that 
achievement via independence was an impor- 
tant variable in accounting for the academic 
performance of many individuals, irrespective 
of their locus of control. Such a finding has 
interesting implications given previous re- 
search (Domino, 1971) on the interactive 
effects of achievement orientation and teach- 
ing style. If the current college environ- 
ment is encouraging independent and original 
thought, as is suspected, then it is not al- 
together surprising that many students should 
adopt an independent versus conforming 
achievement orientation. However, in view of 
previous findings (Domino, 1968), it is only 
those persons who are characteristically inde- 
pendent and self-directed who would gain 
from such a curriculum emphasis. Con- 
versely, individuals described as conforming 
and other-directed might be seen as taught 
in a manner dissonant with their preferred 
achievement orientation, thus lessening their 
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opportunity for academic success. With re- 
spect to locus of control, this interactive 
effect of achievement orientation and teaching 
emphasis may serve to widen the academic 
performance level between internals, who are 
described as independent, and externals, who 
are described as conforming and dependent. 
An investigation of this question is considered 
necessary. 


Implications for Future Research 


Although several researchers (e.g., Hjelle, 
1970; Prociuk & Breen, 1973) noted that 
defensive externality may be a factor con- 
founding the relationship between locus of 
control and academic performance, no sys- 
tematic investigation of this problem had 
been previously conducted. The present 
study, considering this factor, demonstrated 
that defensive externals were academically 
superior to congruent externals, thus pro- 
viding a plausible explanation for the results 
of previous research (e.g., Eisenman & Platt, 
1968; Hjelle, 1970). Future research on 
locus of control should consider defensive 
externality, where this factor is suspected as 
exercising a confounding effect. 

Additionally, this research, unlike previous 
investigations, considered the effect of medi- 
ating variables. Obtained results, for exam- 
ple, suggested that achievement via indepen- 
dence may be an!important variable in ac- 
counting for the academic performance of 
individuals who believe that chance, luck, or 
fate control their reinforcements. Such find- 
ings appear to indicate that the relation be- 
tween locus of control and academic per- 
formance is not a direct one but may be 
modified by specific mediating variables. 

Results of the present study also suggest 
that the Levenson (1972, 1974) scales are a 
viable alternative to the Rotter I-E scale, 
particularly where a measure of belief in 
powerful others, separate from a chance, luck, 
or fate expectancy, is considered desirable. 
Obtained results demonstrated that the Inter- 
nal, Powerful Others, and Chance scales were 
effective measures in accounting for the aca- 
demic performance of male internals, male 
and female defensive externals, and female 
congruent externals, respectively. Such find- 
ings attest to the usefulness of these scales 
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in comparison with the I-E scale, which was 
found to be virtually ineffective in predicting 
academic achievement (Warehime, 1972). 
Although it is recognized that this comparison 
relates only to research involving academic 
performance, additional evidence supporting 
the effectiveness of the Internal, Powerful 
Others, and Chance scales has been reported 
in research examining social action involve- 
ment and information seeking (Levenson, 
1972, 1974), study habits and attitudes 
(Prociuk & Breen, 1974), and Machiavel- 
lianism (Prociuk & Breen, in press). It is 
thus suggested that the Levenson scales be 
considered in future research involving the 
locus of control construct. 


REFERENCE NOTE 


1. Odell, M. Personality correlates of independence 
and conformity. Unpublished master’s thesis, Ohio 
State University, 1959. 
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Viewing leadership behavior as a joint function of situational factors, indi- 
vidual predispositions, and followers’ responsive behavior, this study used 64 
Subjects, half of them high and half of them low in measured leadership self- 
concept. Half had a high and half a low stake in a group task. Each subject 
was twice appointed leader of a three-man group, once exposed to “followers” 
(experimenter’s confederates) altercasting him to lead, once to followers alter- 
casting him to follow. Multivariate analyses of variance showed effects of 
followers’ behavior on profiles of a subject’s behavior (when subjects were 
pushed to lead they led, when pushed to follow they increased socioemotional 
activity) and perceptions of followers. These effects were specified by leader- 
ship self-concept (little difference between groups when confronting pushy 
followers, more when confronting passive ones), prompting an interpretation in 


terms of role incongruity. 


What is it that determines how men lead? 
Hollander and Julian (1968, 1969) are prin- 
cipal exponents of the view that leadership 
involves the interaction of three important 
determinants: “(a) the ‘leader,’ with his per- 
sonality perceptions and resources relevant to 
goal attainments; (b) the ‘followers,’ with 
their personalities, perceptions, and relevant 
resources; and (c) the situational context 
within which these variables function. To 
ignore entirely any of these factors destroys 
the inherently dyadic character of the phe- 
nomenon” (1968, p. 890). This view of 
leadership as process, widely reiterated in the 
contemporary social psychological literature 
(cf. Gibb, 1968, 1969; Jones & Gerard, 
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1967; Secord & Backman, 1964), has direct 
implications for a research paradigm on 
leadership processes. The dependent variable 
to be explained is the behavior of those oc- 
cupying leadership positions (whether formal 
or informal) within the group. 

The paradigm also specifies three important 
sets of independent variables. One of these 
is the responsive behavior of followers to the 
leader’s attempts to influence them. A useful 
approach to exploring this kind of reciprocal 
relationship is the perspective of altercasting 
(Weinstein & Deutschberger, 1963). The 
formal position one occupies in a group does 
not determine in toto one’s role performance. 
In any group situation there may be a subtle 
process of negotiation for roles. In this proc- 
ess, not only do persons lay claim to certain 
role performances, but act to cast others 
(“alters”) into particular roles, Altercasting 
processes are particularly germane to research 
on leadership in which persons with structural 
authority to lead are subject to altercasting 
pressure from their followers. 

Most important, leaders are altercast through 
pressures to exemplify the followers’ values 
and to meet their role expectations with re- 
spect to leadership style—leaders follow fol- 
lowers. While leaders may have some flexibil- 
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ity in departing from group norms (Hollander, 
1958), there is a clear line of demarcation 
beyond which leaders become more sanc- 
tioned for deviance than ordinary followers 
(Wiggins, Dill, & Schwartz, 1965). Con- 
gruent with this pattern are the findings of 
Haythorn, Couch, Haefner, Langham, and 
Carter (1956) and Sanford (1952) that 
groups differing in the authoritarianism of 
the membership tended to elicit different 
kinds of leadership. Over time, leaders tended 
to meet the role demands of their groups, 
regardless of their own personalities. 

The Haythorn study is one of the few 
which attempted to make an explicit associa- 
tion of leadership and followership behavior. 
Even so, constituting groups of different 
levels of authoritarianism is not the same as 
directly incorporating different expectations, 
and more important, different overt role 
demands, as variables in the analysis. Direct 
manipulation of followers’ behavior did occur 
in the work of Pepinsky, Hemphill, and 
Shevitz (1958). Confederates of the experi- 
menters gave either positive or negative feed- 
back to attempts at leadership behavior on 
the part of real subjects. Positive reinforce- 
ment increased leadership behavior, while 
negative reinforcement decreased it. The sub- 
sequent work of Bavelas, Hastorf, Gross, and 
Kite (1965) and Zdep and Oakes (1967) 
confirm this basic pattern. In their work, 
leadership behavior was reinforced for se- 
lected target subjects in two ways: first, 
directly, by the experimenter positively rein- 
forcing the target’s verbal behavior and nega- 
tively reinforcing his nonparticipation; and 
second, indirectly, by inducing followers to 
altercast the target into a leadership position 
by positively reinforcing nontargets for agree- 
ment with targets and negatively reinforcing 
them for competing leadership responses. Our 
work is in this tradition in its concern with 
the role support the group gives to or with- 
holds from the leader. However, we are deal- 
ing with appointed leaders whose position is 
legitimated by external authority and who 
have been directed to express formal leader- 
ship responsibility. And the reinforcement 
they receive is not operant but in the form 
of continuous role demands. 

With followers’ input to the group task 
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constant in quantity and quality, what differ- 
ence does it make to the behavior of a for- 
mally appointed leader if his followers either 
demand directive leadership from him or, con- 
versely, demand that he assume a subordinate 
role in the task activity? The answer is, of 
course, it depends. If the followers are effec- 
tively accomplishing the tasks at hand, lead- 
ers should conform to the followers’ role 
demands. But what happens when the task 
is put in jeopardy because of followers’ in- 
adequate role performances? When followers 
demand directive leadership no problem arises 
because the leader simply leads. But when 
followers demand subordination from a for- 
mal leader and simultaneously endanger the 
task, conflict arises for the leader between 
members’ role expectations and task demands. 
How the leader resolves this conflict is one 
of the principal concerns of this study. 

Characteristics of the situational context, 
a second of Hollander and Julian’s critical 
independent variables should be important in 
affecting how leaders resolve the conflict. In 
the present study, we are concerned with one 
such condition, the importance of the task 
being put in jeopardy by inadequacies of the 
role performance of the followers. The more 
important the task is to the group, the more 
likely should leaders be to engage in the kinds 
of activity that would ensure success, regard- 
less of members’ expectations. Thus, in gen- 
eral, we expect more directive activity on the 
part of appointed leaders when the stake in 
the group’s performance is high than when it 
is low. When task success is relatively un- 
important, leaders’ activities are expected to 
vary with role demands of the followers. 

The last of Hollander and Julian’s inde- 
pendent variables is the personality resources 
the leader brings to the role. And central to 
those resources is the orientation of the 
leader himself to the leadership role, that is, 
the extent to which the leader sees himself 
as capable and effective in leadership roles 
and oriented toward assuming them. When 
someone who is uncomfortable with and tends 
to avoid leading is placed in a leadership 
position, role incongruity results. Smelser 
(1961) has studied the effects of role incon- 
gruity on group productivity and interper- 
sonal perception by systematically assigning 
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J people very high and very low in dominance 
to either dominant or subordinate roles in 
work groups. He found that dyads charac- 
terized by congruence of personality and role 
assignment performed more productively than 
those characterized by incongruence. 

Berkowitz (1956) explored the effects of 
incongruity between personal dispositions and 
role assignment upon leadership itself. He 
placed natural leaders and nonleaders in 
either central or peripheral positions in a 
communication network, positions demanding 
either leader or follower behavior in order for 
the group to be effective. Berkowitz found 
role incongruity to be resolved in favor of 
role demands from the environment; non- 
leaders in central positions came to lead more 
over time, while natural leaders in a position 
not requiring leader activity reduced their 
level of directiveness. 

But how is role incongruity resolved when 
leaders confront conflicting demands from the 
environment? We have been discussing three 
sources of pressure on an appointed leader: 
pressures from his responsibility for the suc- 
cess of the task (which vary directly with its 
importance); pressures from his self-concep- 
tion as a leader (or nonleader) ; and pressures 
from the role demands of his followers. These 
pressures can operate in complementary or 
competing ways. Take first the person who 
sees himself as a leader. When pushed by his 
followers to lead, he can be expected to do 
what comes naturally—and actively lead. We 
would expect his leadership to be more direc- 
tive when the group task is important and to 
be more concerned with group cohesion with a 
less important task. 

But when the natural leader finds his 
formal position disregarded by his followers, 
he must cope with role incongruity. If he has 
relatively little interest in the group outcome, 
he can be expected to resolve the incongruity 
in favor of the demands of his followers. 
Socioemotional factors would predominate 
over task considerations and the good leader 
would be a good follower. When the incon- 
gruent demands of the followers are threaten- 
ing an important task, however, the conflict 
js exacerbated and the leader is apt to be 
most directive and authoritarian, even hostile 


and antagonistic. 
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Now consider someone who sees himself as 
a nonleader. When others demand that he 
cede leadership (which he does not want in 
the first place), he can be expected to go 
along cheerfully, at least when the stake in 
the outcome is low. A high stake may create 
some tension, but it would be too much to 
expect him to adopt the uncomfortable lead- 
ership role in the face of role demands of 
others. Instead, he could be expected to 
remain in the group, contributing in a 
passive way. 

Role incongruity for the nonleader occurs 
when the group demands that he actively 
lead. When the stake is low, his personal 
predisposition should predominate; since there 
is little motivation to participate actively in 
an uncomfortable situation, he is likely to be 
passive or withdraw. Passivity or withdrawal 
is less likely when nonleaders have more 
vested interest in task success. Since active 
direction is not a personally comfortable re- 
sponse to the situation, however, the non- 
leaders should focus on the definition of the 
situation and on group members’ roles in an 
effort to get others to assume more active 
responsibility. 


METHOD 


The experimental subjects were drawn from stu- 
dents taking introductory sociology classes in a 
southern university. There were a total of 64 sub- 
jects, all male. Willingness to accept the leadership 
role was measured by a self-concept questionnaire 
designed specifically for this study.* 

Below are sample items from the Leadership Self- 
Concept Scale. The directions were as follows: “You 
are to indicate, by circling a number to the right 
of each statement, how appropriate or how charac- 
teristic that quote would be of you.” Following are 
the sample items: 


I often feel hesitant to speak out in a group, being 
somewhat self-conscious about what people will 
think. 


1 The initial appointment to a position of leader- 
ship, legitimated by the status of the appointer, 
creates one source of incongruity for the subject 
who does not see himself as a leader. This incongru- 
ity is then compounded by the heavy demands for 
directive action from his “follower” co-workers. 

2 The leadership self-concept scale consisted of 14 
items and was pretested at Western Kentucky Uni- 
versity using a total of 62 introductory sociology 
students. Item analysis yielded a coefficient alpha 
of .84. 
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I enjoy trying to work out different approaches to 
solving problems and getting the group to go along 
with them. 


I feel uncomfortable in supervising other people’s 
work and telling them what to do. 


When people in the group are moving slowly, I go 
out of my way to encourage them. 


These items were ranked on a scale from 1 (not at 
all like me) to 5 (very much like me) with a mid- 
point of 3 (somewhat like me). (Copies of the full 
14-item scale are available from the first author.) 
Subjects were dichotomized at the median into high 
and low groups, 

A second variable, the stake factor, was intro- 
duced by having the students’ respective professors 
tell them at the beginning of the semester that they 
would be required to participate in outside work 
sessions as part of their course requirement. Half of 
the subjects in the experiment were told that they 
would be graded for their efforts and that their work 
in the session would be equivalent to one of six 
papers for the course. The other half of the subjects 
were simply told that their work was part of their 
course requirement; these latter subjects were told 
that mo actual grade would be given. Thus, subjects 
experienced either a high-stake or a low-stake situa- 
tion; in no case did a subject experience both. 

The final variable, the direction of the reciprocal 
altercast, represents a within-subjects measure. Each 
subject engaged in two separate work sessions with 
two different students in each session. The four stu- 
dents with whom the subject worked were confeder- 
ates of the experimenter. The behavior of these 
confederates was programmed for each session; in 
one, they were directed to altercast the subject into 
a superordinate role, in the other, a subordinate role. 
Each subject experienced both forms of behavior on 
the part of the confederates, 

The subject was appointed leader by the experi- 
menter (a teaching assistant). The subject was told 
he was responsible for seeing that the task was com- 
pleted and for evaluating the work before it was 
submitted. The two sessions differed experimentally 
only in the behavior of the two confederates. 

Each subject was scheduled to report to a particu- 
lar room to work on what he believed was an out- 
side class project. While he was waiting in the outer 
office, the two confederates arrived to participate in 
the project. The experimenter then brought the 
group into the room and directed them to seats 
around a table. The experimenter introduced the 
subject to the confederates as students from another 
section of the introductory course. 

The task in the experiment called for the group 
to make up a list of attitude items dealing with 
either social or academic aspects of campus life. The 
group was told to discuss the items jointly and to 
develop a single list by the end of the session. If the 
subject was in the high-stake condition, the experi- 
menter told the group that they would be graded 
on their work and the grade would be equivalent 
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to one of their course papers. If the condition was 
one of low stake, the experimenter stated that al- 
though no actual grade would be given, the work was 
part of a course requirement. At this point the ex- 
perimenter left the room, telling the group they had 
15 minutes to complete the task. 

At the end of the first session, the experimenter 
returned, collected the items, and excused the two 
confederates by telling them they had been scheduled 
to complete their second session at a later time. 
The subject was told that he was to complete his 
second session immediately. Two new confederates 
were brought into the room and introduced to the 
subject. Again, the group was given 15 minutes to 
complete the task. At the end of this session, all 
three participants were excused. 

The most crucial phase of the experiment was the 
programmed behavior of the confederates. Each pair 
of confederates had to give the behavior appropriate 
for the super- and subordinate altercasts. In any 
one session the two confederates had to act as a 
pair, pushing the subject in the same direction, In 
both sessions the subject had been told by the 
experimenter to lead; in one case he was pushed by 
the confederates not to lead, in the other he was 
pushed to lead. In no cases were the confederates 
aware of the subject’s self-conception concerning his 
role as leader. 

Regardless of the type of behavior employed by 
the confederates, they were instructed to embroil the 
subject in the interaction. The role-playing of the 
confederates was especially crucial and delicate in 
the nature of the remarks which were directed 
toward the subject. He was to be treated at least 
as an equal; the confederates were instructed not 
to attack the leader personally or make the altercast 
too strong. In both sessions, however, the confeder- 
ates’ behavior was designed to place the task in 
jeopardy; that is, they were programmed to discuss 
irrelevant or tangential matters in the work sessions 
and when they did contribute material, the material 
was of poor quality. Unless the subject exerted his 
leadership influence, the task was not likely to be 
completed successfully. 

The behavior employed by the confederates ‘in the 
active situation (subordinate altercast) was designed 
to keep the subject in the leadership role in name 
only, This was accomplished in several ways. The 
confederates took over the division of labor, with 
one directing the subject to take notes while the 
other offered unrequested assistance. This had the 
effect of altercasting the subject into a support- 
seeking role, conveying to him that he was perhaps 
incapable of directing the activities, 

In the passive situation (superordinate altercast) 
the confederates’ behavior was designed to project 
the subject into a leadership role. Here they ex- 
pressed bewilderment as to what should be done in 
the session and often acted passively. In this condi- 
tion, the confederates made no attempt to define 
the situation. By refusing to contribute to the dis- 
cussion without explicit direction, the confederates 
ceded to the subject maximum freedom to define the 
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situation. The job of the confederates here was not 
to force the subject to do all of the work, but rather 
to get him to make them do the work. By humbly 
telling the subject, “You're the leader, you tell us 
what to do,” they could also cede him maximum 
leadership. The contribution of the confederates to 
the group task was also controlled by haying con- 
federates draw from the common pool of uniformly 
poor items when making suggestions. Both the quan- 
tity and quality of the confederates’ suggestions were 
the same across the two conditions. 


Dependent Measures 


Responses of the appointed leaders were measured 
at two levels, a global, more macroscopic set of 
ratings of the entire session based on the subject's 
altercasting (Weinstein & Deutschberger, 1963) and 
a more microscopic act-by-act analysis based on 
Bales’s (1950) categories. 

Both sets of ratings were made from tape record- 
ings of each session independently by two judges 
who were ignorant of the experimental design and 
proposed hypotheses. The altercasting dimensions 
that were of specific interest were: (a) structural 
distance or leadership (the extent to which the sub- 
ject cast the confederates into a superordinate or 
subordinate role in the situation); (b) interdepen- 
dence versus autonomy (the extent to which the 
individual projected his co-workers as being tied to 
him by bonds of common fate, perspective, or con- 
currence of interests); and (c) degrees of freedom 
(extensiveness of the constraints the subject placed 
on his followers to assume appropriate roles in the 
situation). 

The expectations stated previously regarding the 
responses of appointed leaders can be expressed in 
terms of these dependent measures. The exercise of 
directive leadership should be seen in low altercasting 
ratings of structural distance (push for others to be 
subordinate) and low degrees of freedom. In terms 
of the Bales categories, there should be a relatively 
high concentration of acts in task answer categories: 
giving opinions, orientations, and suggestions. Focus 
on group cohesion should be seen in altercasting of 
interdependence and in the positive socioemotional 
Bales categories (tension release, showing solidarity, 
and agreement). Authoritarian, antagonistic leader- 
ship would be marked by extremely low degrees of 
freedom allowed and high negative socioemotional 
activity (showing tension, disagreement, and antago- 
nism). More passive redirection should be expressed 
in high levels of task questions (asking for opinions, 
suggestions, and orientations). Finally, withdrawal 
should be expressed in altercasting of autonomy and 
in reduced overall activity. 


Experimental Design and Analysis 


The design was a complete factorial plan with 
one within-subjects condition (the altercast received 
from the confederates) and two between-subjects 


` conditions (stake and leadership self-concept) yield- 


ing a 2 X 2 X 2 design with 16 observations per cell. 


561 


Because of potential redundancy in altercasting 
ratings and Bales categories and limitations on the 
degrees of freedom available in the design, analyses 
of these two types of measures were kept separate, 
Since the expected patterns of behavior were concep- 
tualized in terms of profiles defined by patterning 
in the dependent measures, multivariate analysis of 
variance was used in the data analysis. The order 
of the type of confederate’s altercasting and the 
assignment of specific confederates to roles were 
both counterbalanced, 


RESULTS 


The most powerful variable in the study 
was our experimental manipulation of fol- 
lower behavior. This is clear in both the holis- 


3 Certain difficulties occur in the use of multi- 
variate analyses of variance on a repeated-measures 
design involving several error terms. To avoid 
these, two separate multivariate analyses of variance 
were run. In the first case the observations were 
summed over the repeated condition. This gave all 
main effects and interactions which did not involve 
the manipulated variation in stooges’ behavior. To 
get the effects of confederates’ behavior, difference 
scores were computed for each of the individual 
dependent variables by subtracting the score in the 
session in which the confederates demanded leader- 
ship from the score from the session in which they 
attempted to seize leadership themselves. ‘The dif- 
ference scores were then analyzed in a between- 
groups design in which the difference between the 
grand mean and zero yielded the confederate behav- 
ior (altercasting) main effect, and the stake and 
leadership self-concept effects were aliases for the 
interactions of these variables with the confederate 
behavior variable. We are indebted to Richard F, 
Link for suggesting this solution. Two separate 
multivariate analyses of variance were run on the 
Bales data. One dealt with the Bales categories taken 
individually; the other used Bales’s four basic group- 
ings. Both for convenience and clarity of presenta- 
tion and for statistical reasons, we have chosen to 
present only the grouped analysis. In no case when 
the grouped analysis yielded multivariate significance 
was the probability level lower for the analysis based 
upon the ungrouped Bales categories, In a number 
of cases there were specific individual categories 
which did yield univariate significance although their 
group only approached significance. The probability 
values were universally more extreme in the un- 
grouped data. We had some question about the 
stability of the ungrouped analysis since, for higher 
order interactions, the number of dependent vari- 
ables using all 12 Bales categories, approach the 
number of observations per cell. The individual 
Bales categories are sometimes referred to in discus- 
sion when the pattern for the group is concentrated 
in one category and when they serve to illuminate 
the grosser findings. 
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tically based and the act-by-act based ap- 
proaches to assessing the subjects’ interaction. 
When subjects were altercast into a subordi- 
nate position within the group, they engaged 
in significantly less task activity and signifi- 
cantly more socioemotional activity, compared 
with the situation when they were confronting 
confederates who were altercasting them to 
exercise their assigned superordinate respon- 
sibilities. Both positive and negative socio- 
emotional activity were higher when the sub- 
jects were altercast as subordinates. The 
relevant data are shown in Table 1. 
(Although the Bales data are all in raw 
counts, comparison of the difference scores 
per se across the various categories may be 
misleading. Task activities, for example, pre- 
dominated heavily across both altercasting 
conditions. We adjusted for the overall level 
of activity in each Bales category by express- 
ing the change across conditions as a per- 
centage of the active condition mean for that 
category. Thus, in the bottom row of Table 1, 
we find negative socioemotional activity, for 
example, having a high relative change al- 


TABLE 1 


MEAN DIFFERENCES IN SUBJECTS’ BEHAVIOR BETWEEN 
SESSIONS OF ACTIVE AND PASSIVE 
CONFEDERATES’ BEHAVIOR 


Acts in Bales Categories 


Negative’ | Task | Task 
a answers | questions 


| emotional | emotional 


Mean change 2.55 64 —12.75 —4.33 
Change as % 

of active ? , 

mean 17.6% 32.8% 23.3% 36.9% 


Altercasting ratings* 


Inter- 
Structural | dependence rey a 
distance Dareus Degrees of freedom 
autonomy 
Mean change 406 AT7 —.078 
Change as % 
of active ; 
mean 12.4% 11.5% 18.7% 


Note. The subtraction is active minus passi that a 
positive sign indicates a higher mean when the subjects were 
being pushed to follow (active altercasting), while a negative 
sign indicates a higher mean when the subjects were being 
pushed to lead (passive altercasting). Both kinds of measures 
reach multivariate significance at the .05 level; each separate 
variable except degrees of freedom is statistically significant 
at that level or beyond. EA ): 

«For structural distance, high score indicates ceding au- 
thority to other; high score on interdependence indicates 
greater autonomy claimed; high score on degrees of freedom 
indicates fewer constraints placed on the other. 


though the absolute difference is the smallest 
in the table.) 

The altercasting findings parallel those 
from the more atomistic Bales data. When 
confronted with others who are themselves 
assuming positions of superordination, our 
subjects were less likely to insist on exercising 
the prerogatives of their structural position 
than when the others are assuming subordi- 
nate positions. This ceding of leadership did 
not occur without some cost, however. Far 
from showing a compliant acceptance of the 
subordinate position, our subjects showed an 
increased emphasis on themselves as inde- 
pendent agents in the situation as reflected in 
their greater projection of autonomy. 

The followers’ behavior affected not only 
the leaders’ behavior but their perceptions of 
the followers as well. In a postexperimental 
questionnaire, subjects were asked to rate the 
performance of their fellow workers (i.e., the 
confederates) in the dimensions of amount 
of contribution to the group task, its quality, 
and cooperativeness. Although the confeder- 
ates’ performances had been carefully cali- 
brated to eliminate differences in number and 
quality of items contributed, they are never- 
theless perceived differentially by the subjects 
depending on the altercasting condition. The 
multivariate analysis of variance of the differ- 
ence was significant (p < .005), with the 
principal contributions to significance coming 
from significant tendencies to rate both the 
amount and quality of the confederates’ con- 
tribution as greater when the confederates 
played an active role (in both cases p < .005). 
Followers who dragged their heels were not 
given equal credit for work equal to that of 
their more active counterparts. These findings 
not only indicate that the manipulation of 
follower’s behavior was successful but allows 
us to explore the interactive effects of other 
variables with followers’ behavior. 

Our second major manipulated condition 
was the stake that appointed leaders had in 
the outcome of the group task. We hypothe- 
sized that when there was more to lose, there 
would be more directive, heavier-handed 
leadership than when little was at stake. The 
findings did not bear out these predictions, 
No stake differences could be found in the 
altercasting dimensions, either individually or 


~ collectively. The two stake vectors on the 


grouped Bales category differed to an extent 
which approached but did not achieve sta- 
tistical significance (p < .10). In the signifi- 
cant univariate findings, there was approxi- 
mately one third more questioning activity in 
the high-stake condition (p < .025). (See 
Table 2.) There was slightly more answering 
activity and a near significant (p < .10) dif- 
ference in positive socioemotional acts as well. 
Only negative socioemotional acts were less 
frequent in the high-stake condition, and 
these were quite infrequent overall. Had our 
hypothesis been correct, the major difference 
should have come from the task answer group 
of Bales categories. Instead, higher stake led 
to an overall increase in activity. Subjects in 
the high-stake condition averaged approxi- 
mately one-eighth more acts than those in the 
low-stake condition, with the means being 
189.90 and 167.82, respectively, p < .05. 
Thus, when under the pressure of an outcome 
that was perceived to have real consequences 
for them, our appointed leaders generally had 
higher levels of activity, But these higher 
levels of activity were not directive alone. 
Indeed, the greatest relative differences were 
in positive socioemotional acts and in the task 
question category. These relative differences 
were obtained by adjusting for frequency by 
expressing the difference between leaders and 
nonleaders on each Bales category as a per- 
centage of the grand mean for that category. 

We had expected the subjects’ behavior to 
be more variable in the low-stake condition, 
that is, that it would depend more strongly 
upon the particular role demands of the fol- 
lowers when the stake was low than when it 
was high. Were this the case, a significant 
interaction should have emerged between 
stake and behavior of the confederates. No 
such interaction emerged at the multivariate 
level. There was an interaction at the uni- 
variate level in the task question category, 
but opposite in direction to that predicted. 
There was more difference between sessions 
marked by different confederate behavior in 
the high-stake condition than in the low-stake 
condition. (The mean differences—session 
with active altercast minus session with pas- 
sive altercast were —6.44 for high stake and 
—2.22 for low stake; p< .05.) Thus, it is 
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TABLE 2 
MEANS oF BALES CATEGORIES FOR VARYING VALUES 
or STAKE* 


(summed over both sessions) 


Category 
yanapi Positive | Negative | pag, | Task 
socio- socio- answers ques- 
emotional** | emotional "~~ | tions" 
High stake 29.27 3.03 | 127.46 | 31.27 
Low stake 23.65 3.48 117.21 |24.31 
Difference 5.62 —.43 10.25 | 6.96 
Difference as 
% of grand 
mean 21.2% 13.2% 8.4%] 25.0% 
* Multivariate significance .05 < p <.10, 
05 <p < .10. 
ap <.025, 


particularly the diffident, passive confederate 
that the leader with something at stake at- 
tempts to bring into play by asking him for 
orientation, suggestions, and especially opin- 
ions. 

Our third major independent variable was 
the individual’s conception of himself in the 
leadership role. We expected that the differ- 
ences in leadership self-concept would emerge 
in different responses to the context in which 
leadership was to be exercised. No main ef- 
fect for this variable was predicted nor was 
such an effect found for any of the altercast- 
ing ratings or grouped Bales categories. We 
did predict a series of rather complex three- 
way interactions in which leadership self- 
concept would modify the effects of stake and 
the confederates’ responsive behavior. Con- 
trary to our predictions, stake failed to affect 
the joint effect of leadership self-concept and 
confederates’ responsive behavior on the sub- 
ject’s behavior in his assigned leadership role. 
There were, however, important differences 
between the behavior of persons who saw 
themselves as leaders and those who did not 
in their reactions to the behavior of the 
stooges. The multivariate significance (p < 
.025) indicates that the profile of leadership 
activity differs between the two groups. An 
inspection of the data in Table 3 reveals the 
pattern of profile differences. 

It can be seen that the largest absolute dif- 
ference between the natural leaders and non- 
leaders comes in the task answers category, 
while the largest relative differences come in 
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TABLE 3 


MEANS For LEADERS AND NONLEADERS ON 


GROUPED Bares CATEGORIES BY 


DIFFERING BEHAVIOR OF CONFEDERATES? 
Positive | CRS Pan 
PEEN Sanau | Task answers Task* questions 
Variable s 
| Active | Passive | Active | Passive | Active | Passive | Active | Passive 
Leaders 14.08 12.19 2.42 | 1.42 54.23 70.41 12.24 13.80 
Nonleaders 14.93 11.72 148 | 120 55.36 64.68 11.21 18.32 
Difference —85 Aroi 94 22 S13 Sate S675 1,03 4.52 
Difference as % of grand 
mean 5.9% 3.9% 48.2%, 16.8% 21% 8.5% 8.8% 28:1% 


^ In active altercast, subjects are pushed to follow; in pa: 
to the confederates’ behavior, Multivariate significance of ti 
federates (deri 
* Univariately significant, p < .01, 


the task question and negative socioemotional 
categories, If we look further, a rather clear 
overall pattern emerges. The differences be- 
tween the leaders and nonleaders were rela- 
tively negligible when they were confronting 
confederates who tended to take over the 
situation and exert leadership themselves. The 
only exception to this pattern is in the nega- 
tive socioemotional area (to be discussed 
later). It was principally when confronted by 
co-workers who demanded active, directive 
leadership that the two groups diverged. For 
both leaders and nonleaders, directive activity 
(as measured by task answers) increased 
when such activity was demanded by the 
followers. But the absolute increase was nearly 
twice as large for the leaders. The sharpest 
difference, and one which achieved univariate 
significance (p < .01), was in the task ques- 
tion area, a measurement of directive activity 
being solicited from, rather than given to, 
other members of the group. Leaders showed 
almost no increase in such activity with in- 
creased passivity of their followers, while the 
nonleaders’ increase was very substantial (the 
respective percentages of the active means 
being 12.7% and 63.4%). 


Discussion 


We have examined the behavior of ap- 
pointed leaders in relation to: (a) their own 
conception of themselves as leaders; (b) the 
demands of the other group members that the 
leader assume either a superordinate and 
directive or a subordinate and nondirective 
role in the situation; and (c) the stake that 
the group had in the outcome of its activities. 


from the multivariate analysis of variance of differences) 


ey are pushed to lead. The altercast label of a session refers 
nteraction between leadership self-concept and behavior of con- 
< 025. 


We had no specific hypotheses regarding 
the main effects of leadership self-concept it- 
self, expecting that its effects would be condi- 
tioned differentially by the several levels of 
the other two factors. We discuss the fate of 
those hypotheses on interaction effects pres- 
ently. 

Our findings regarding the responsive be- 
havior of the followers confirm earlier leader- 
ship research: Leaders assumed roles comple- 
mentary to the pattern of altercasting by their 
followers. When pressed to lead appointed 


leaders led, asserting greater structural au- ` 


thority, increasing task activity, and reducing 
socioemotional activity while increasing their 
emphasis on group identification with the 
common task. 

Confronted with followers who attempted 
to take over the show, the appointed leaders 
shifted some of their concentration from prob- 
lems of task management to problems of 
group cohesion (or its lack), increasing socio- 
emotional activity, in the face of decreased 
overall activity. Inspection of the more de- 
tailed Bales categories gives some clue as to 
the nature of the interactive process that pro- 


duced this finding. When the confederates _ 
assumed de facto leadership, the appointed * 


leader was left either to accede to or to dis- 
agree with their proposed initiatives. Thus, 
the greater frequency of positive socioemo- 
tional acts was entirely from increased agree- 
ments; and similarly, the bulk of the in- 
crease in negative socioemotional activity 
came from increased frequency of expressing 
disagreement, : 
Perhaps the subjects were simply getting 


f 


ee ee! 
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the job accomplished by allowing leadership 
to remain with the usurping followers. But 
recall that the study was designed so that 
accepting such initiatives would place the 
task success in jeopardy. The confederates 
brought irrelevant material into their task 
activity and made Suggestions of uniformly 
poor quality (whether they were pushy or 
Passive followers). The conflict between task 
Success and role demands was more real than 
apparent, however, for the perception of the 
quality and quantity of the co-workers? con- 
tributions increased when the co-workers 
assumed a dominant position in the work sit- 
uation. Shades of Festinger (1959) and en- 
forced compliance in the resolution of cogni- 
tive dissonance! 

As hypothesized, in the higher stake condi- 
tion, our subjects hustled more, showing 
greater overall activity levels. The higher ac- 
tivity levels did not come from increased di- 
rectiveness as had been anticipated, but from 
more positive socioemotional activity and 
more task questions. 

Both of these categories can be seen as 
alternatives to directiveness and were most 
often employed when the task was seen as in 


` greatest jeopardy, the passive condition. Ask- 


ing questions functions to draw others out; 
agreement (the most common form of posi- 
tive -socioemotional acts) serves to reinforce 
participation on the part of followers.* Ney- 
ertheless, stake was not as powerful a variable 


: as we had expected. While 10 points on one’s 


final grade total made a difference, the differ- 
ence appeared to be marginal relative to the 
Pressures of the demands of the teaching as- 
sistant, of one’s co-workers, and of one’s self- 
conception. 

The interaction of leadership self-concept 
with the role demands of the confederates is 
‘important in its implications for the resolu- 
‘tion of role incongruity. Role incongruity oc- 
curred for the “natural leaders” when their 
subordinates seized active control of the situ- 


-ation. Their response to this incongruity is 


not particularly apparent in the data. In al- 


most every respect, they, as leaders, usually 
mi Ca 


#This is not to say that our subjects were non- 


‘directive: Task answegs account for ovg two thirds 


ef all acts (a common pattern in problem-solving 


` groups, cf. Bales & Hare, 1963). ti 


565 


went along with the role demands of the fol- 
lowers-turned-leaders, at least to the same 
extent that the “natural nonleaders” did. We 
had expected that the poor quality of the 
follower’s contributions would be difficult for 
our natural leaders to accept, forcing them to 
find some way of reasserting leadership. In- 
stead, cognitive processes colored perception, 
so that standardized contributions were seen 
as more numerous and useful when the con- 
federates were actively making it difficult for 
the appointed leaders to lead. And this re- 
duced task pressure to assert leadership. 

Nevertheless, the natural leaders were be- 
ing altercast in a way both incongruent with 
their self-conceptions and counter to their 
official role assignment. The discomfort we 
would expect to be engendered is reflected 
particularly in the category “shows antago- 
nism.” When natural leaders are forced not to 
lead, they act somewhat nasty about being 
deprived of their appointed place, This tech- 
nique, if used sparingly, is probably useful as 
a warning of interaction going wrong. It can 
only be used sparingly, however, for antag- 
onism is a very powerful tactic and any excess 
must cause interaction to break down. 

For natural nonleaders, role incongruity 
arose when they were being pushed by their 
followers to lead. The incongruity was more 
extreme than for the natural leaders, Not 
only were the role demands of the confed- 
erates incongruent, but the very appointment 
of a nonleader to a leadership position cre- 
ated incongruity. Moreover, there was no 
Cognitive reduction of the pressures of task 
jeopardy, 

It is reasonable, then, that the most pro- 
nounced behavioral adjustment to role incon- 
gruity is made by the nonleaders. How did 
they make this adjustment? Their personal 
predilections limited the extent to which they 
could increase directiveness, Engaging in 
much overt socioemotional activity would 
have placed them in center stage, a position 
which nonleaders usually avoid. So, as we 
reasoned, the principal avenue open to them 
Was an attempt to draw their followers back 
into the situation. This is the pattern seen in 
the multivariate Profiles (Table 3). When 
confronted with the incongruous (for them) 
situation of passive followers the nonleaders 
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increased their task answers, but less than 
did the natural leaders. But the nonleaders 
enormously increased their activity in the task 
question category (particularly asking for sug- 
gestions and opinions). 

Thus the nonleaders appeared to be nego- 
tiating for a comfortable role in the situa- 
tion. That they bargain more desperately than 
do natural leaders when faced with incon- 
gruity is certainly understandable in the light 
of the extra situational pressures on them. 
The findings may also reflect once again the 
lack of symmetry between natural leaders and 
natural nonleaders. Thos¢ who often lead find 
it easier to follow thay those who usually 
follow find it to lead. i 
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garding the use of aggression. In this 


A i Laboratory research has consistently dem- 
3> onstrated that social reinforcement (approval) 
“t+ from observers can induce intense levels of 
~ interpersonal aggression (Borden & Taylor, 
1973%, Geen & Pigg, 1970; Geen & Stonner, 
+7) 1971).*Clearly, however, the observers of an 
aggressive exchange do not always give overt 
indications of their approval or disapproval. 
T. Oftentimes, these people are only passive 
F spectators of the incident. While knowledge 
< of how aggressive tendencies are modified by 
* “the presence of spectators is of considerable 
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Two experiments investigated the effect: 
aggressive responding. In one experimen’ 
aggressed more than those observed by 
removed from the situation, subjects 
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s of the presence of an observer on 


t, male subjects observed by a male 


a female. When the male observer was 
> level of aggressiveness more closely 
matched the level manifested by the opponent. The removal of the female 
observer had little effect on the subjects’ behavior. In the second experiment, 
the male or female observer of the subject’s behavior was disguised as a 
member of an organization with explicit values (aggressive or pacifistic) re- 
case, significant differences in aggression 
were associated with the observer’s values but not the observer’s sex. Following 
„the departure of the observer, the shock settings of subjects in the two 
ageressive-value observer groups showed a significant decrease. The average 
shock settings of subjects in the two pacifistic-value observer groups remained 
at about the same level. In sum, the results indicated that the subjects’ ag- 
gressive behavior was apparently a function of their expectations of approval 
for such behavior, based on the inferred or explicit values of the observer. 
The results were further discussed in terms of social learning theory. 


theoretical and practical importance, research 
on the problem has not produced consistent 
findings. 

Zajonc (1965) proposed that the mere 
presence of others increases the subject’s gen- 
eralized drive, which in turn increases the 
emission of dominant responses. Dominant 
response facilitation has also been shown to 
occur when subjects anticipate positive or 
negative outcomes (Cottrell, 1968) or experi- 
ence evaluation apprehension (Henchy & 
Glass, 1968). 

In an aggressive situation, aggressive re- 
sponses are assumed to be dominant following 
personal insult or physical attack by another 
person (Baron, 1971b; Buss, 1966; Taylor, 
1967). Facilitation of such responding has 
béen shown to occur when subjects are aroused 
by exposure to loud noise (Geen & O’Neal, 
1969), erotic stimuli (Zillman, 1971), or 
physical exertion (Zillman, Katcher, & Mi- 
lavsky, 1972). 

Researchers who have used an audience as 
the source of arousal, however, have reported 
mixed results. In a study by Borden and Tay- 
lor (1973), for example, subjects who were 
silently observed by a small audience of male 
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and female peers delivered more intense elec- 
tric shocks to their opponent than subjects 
who were not observed. This increased ag- 
Sressiveness in the presence of observers was 
displayed by both angered and nonangered 
subjects. Baron (1971a), on the other hand, 
found that angered subjects delivered Jess in- 
tense shocks to their opponent in the presence 
of observers than subjects who were alone, 
while nonangered subjects were unaffected by 
the observer’s presence. In this instance, the 
observers were introduced to the subject as a 
visiting professor and his research assistant. 
Discussing his failure to support social facili- 
tation notions, Baron noted that the observers 
may have inhibited subjects from aggressing 
by arousing their fear of social disapproval or 
possible overt censure. 

An alternative interpretation is that the 
subjects’ responses in these studies were 
largely determined by characteristics of the 
observers (e.g., their sex, status, values, etc.), 
rather than their mere presence. According to 
Bandura’s (1973) social learning theory of 
aggression, the activation of aggressive behay- 
ior is “better explained in terms of anticipated 
consequences than in terms of drive” (p. 57). 
Thus, the inconsistency of the previous find- 
ings may be a result of differential anticipa- 
tions of approval or disapproval from the ob- 
servers. 

-One inseparable characteristic of any ob- 
server is his or her sex, It is well-documented 
that in this culture, physical aggression is 
taught to, engaged in, and largely approved 
by males (Bardwick, 1971). As Buss (1971) 
noted, “Our masculine role [ is] tough, com- 
petitive, hard-hitting, and in a word aggres- 
sive. Status as a male is to be achieved by 
being aggressive and masculinity is perhaps 
the most basic aspect of a man’s identity” 
(p. 17). 

The present research tested the notion that 
when an individual has an opportunity to 
aggress and he is in the presence of others, 
he will use whatever information is available 
concerning the observer’s values and expecta- 
tions in order to maximize the probability of 
favorable evaluation. In the first experiment, 
male subjects chose the intensity of electric 
shock they wished to deliver to an opponent 
who maintained a moderately aggressive 
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strategy toward them. The willingness of these 
subjects to hurt their opponent was measured 
in the presence of either a male or a female 
observer. If there exists a generalized ex- 
pectancy that other males approve of physi- 
cal assertiveness, subjects who were observed 
by males would be expected to deliver more 
intense shocks to their opponent than sub- 
jects who were observed by females. 
Following this manipulation, each subject 
also responded for a period without an ob- 
server present. Subjects’ subsequent behavior 
while alone was examined in view of recent 
findings which indicate that socially induced 
aggression may persist to a significant degree 
even after observers have left the situation 
(Borden & Taylor, 1973; Geen & Pigg, 1970). 


EXPERIMENT 1 
Subjects 


Subjects were 16 male Kent State University stu- 
dents enrolled in the introductory psychology courses, 
Each subject received experimental credits for his 
participation as partial fulfillment of the course re- 
quirement. 


Apparatus 


Each subject was seated at a task board (de- 
scribed in Taylor, 1967). A second subject (a con- 
federate) was escorted to an adjacent room. The 
confederate and his task board were visible to the 
real subject through a window which separated the 
two rooms. 

The experimenter described the use of shock in 
the experiment and explained the subject’s: rights to 
either refuse participation or to stop the procedures 
at any time. If the subject agreed to participate, he 
was asked to sign a statement that indicated his 
understanding of this arrangement, No subject re- 
fused to participate or asked to terminate the pro- 
cedures once they had begun. 

The experimenter attached a concentric shock 
electrode to the inside of the subject's left wrist 
(described in Tursky, Watson, & O'Connell, 1965) 
and delivered a series of shocks that were increased 
in milliampere steps. The subject was instructed to 
tell the experimenter when these reached an intensity 
that was definitely unpleasant and that he did not 
wish to exceed. When the experimenter was informed 
that this intensity had been reached, the shocks 
were discontinued, and the subject was permitted to 
rest while the threshold was ostensibly obtained 
from the opponent. A few moments thereafter, the 
tape-recorded task instructions were played through 
the subject’s intercom. 

Each subject was told that he would be competing 
on a task involving reaction time with the other 
subject in the adjoining room. At the beginning of 
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“trials. 


* the experimenter pointed 
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Me trial, he was instructed to select any 1 of 10 
‘nsities of shock he wished his opponent to re- 


Tive. He was informed that the shock would be 
administered to his opponent at the end of the trial 
if he were faster than his opponent and that he 
would receive the shock his opponent set for him if 
he were slower than his opponent. Thus, he realized 
that either he or his opponent would receive a 
shock, depending upon the outcome of the compe- 
tition, and that both could select the intensity of 
the shock the other would receive. 

Actually, the frequency of wins (50% for all sub- 
jects) and the shock settings by the opponent were 
preprogrammed and controlled by the experimenter. 
The shock-setting strategy employed by the pseudo 
opponent consisted of a random patterning of equal 
numbers of the intensities “three” and “four.” Previ- 
ous research using this apparatus (Borden & Taylor, 
1973) has shown that this level corresponds to the 
average intensity chosen by subjects on their first 
trial setting and was therefore employed by the 
“average” opponent. The shock level that subjects 
actually experienced when they lost a trial was 
either 65% or 70% of their predetermined un- 
pleasantness level. At the end of each trial, 1 of 
the 10 small lights on the top of the task board 
informed the subject of the intensity of shock the 
opponent had chosen and, if the subject was slower 
on the trial, the shock as well. 


Experimental Manipulations 


All subjects competed with their opponent for 13 
Half of them began the task in the presence 
of a male observer; the other half began the task 
while being observed by a female. Following a 
series of these observation trials, the observer left 


“the room, and the subject continued the task unob- 


served. 

A few moments after the instructions were over, 

the male or female observer was introduced into the 
situation. The experimenter entered the subject’s 
room and explained that a student from another 
psychology course had arrived while the instructions 
were being played and had requested to watch this 
experiment. The experimenter asked the subject 
Whether he would mind if this person came in and 
observed. All subjects complied with the request. 
_ The experimenter then escorted the observer into 
the room and asked whether he (she) had observed 
any other experiments or was familiar with this one. 
The observer always answered no. Consequently, 
to the opponent through 
the window and explained to the observer how the 
task board operated and the ostensible purpose of 
the study. The observer was then seated beside the 
subject such that his shock-setting behavior was 
prominent, Finally, before leaving the room, the ex- 
perimenter requested that the observer not talk with 
or distract the subject. 

For both groups, the observer was present for the 
first seven competition trials. The subject's first trial 
«shock setting was made without any knowledge of 
the opponent’s strategy. For the next six trials, the 
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subject was aware that his opponent was using a 
moderately low aggressive strategy (mean shock 
value of 3.5). Immediately following the feedback 
on the seventh trial, the experimenter interrupted 
the sequence of events and announced through the 
intercom that there would be a “brief rest period, 
after which the trials would again resume.” 

Momentarily, the experimenter reentered the sub- 
ject’s room to ask whether the subject had any 
questions or was experiencing any problems. In the 
course of their conversation, the observer interrupted 
to ask whether the remainder of the trials would 
be the same. The experimenter replied that they 
would. The observer then stated that he (she) was 
already late for an appointment elsewhere and re- 
quested to be excused. The experimenter excused the 
observer, returned his attention to the subject, briefly 
reemphasized some minor procedural aspects, and 
left the room. Each subject received six more “alone” 
trials, after which the electrodes were disconnected 
and he was debriefed. 

In sum, the complete session (13 trials) consisted 
of 7 observation trials and 6 alone trials; the first 
shock setting occurring prior to feedback from the 
opponent. The shocks were randomly distributed 
within each of the two blocks of trials, with the 
exception of trials on which the subject’s reaction 
time was especially fast or slow, in which case he 
won or lost, respectively. 


RESULTS 


The measure of aggression was the magni- 
tude of shock the subject set for his opponent 
to receive. Since the shock setting on the first 
trial occurred prior to any feedback from the 
opponent, Trial 1 shock settings were analyzed 
independently from the shock intensities on 
succeeding trials. 

At this point the only procedural difference 
between the groups was that in one group 
the subject was observed by a male, whereas 
in the other group the subject was observed 
by a female. The average first-trial setting by 
subjects with a male observer was 4.88, while 
the average for subjects with a female ob- 
server was 2.50. According to an analysis of 
variance, these means were significantly dif- 
ferent, F (1,14) = 5.97, p < .05. 

The shock settings for the remaining six 
observation trials (postfeedback) were then 
analyzed separately. The mean shock settings 
of subjects who were observed by a male was 
5.21 on these trials, while the mean for sub- 
jects observed by a female was 3.10. The 
effect of the observer’s sex on these trials 
was also significant beyond the .05 level (F 
= 4.80). 


On the subsequent alone trials, subjects 
who had been observed by a male reduced 
their shock settings to an average of 4.21. 
Conversely, the average shock setting of sub- 
jects who had been observed by a female in- 
creased slightly to 3.42 on the subsequent 
alone trials. An analysis of variance indicated 
that this interaction was significant, F (1, 14) 
= 8.45, p < .01. Subsequent Newman-Keuls 
tests of each group revealed that the reduction 
in aggressiveness following the removal of the 
male observer was significant (p< .01), 
whereas the increase following the removal of 
the female observer was not. 

Apparently, relatively subtle characteristics 
of the social situation surrounding an alter- 
cation do influence the expression of physical 
aggression by the participants. In this case, a 
generalized expectancy associated with the sex 
of an observer was shown to exert a directive 
influence on an individual’s responding in an 
aggressive situation. What would happen if 
the individual in such a situation had addi- 
tional, perhaps inconsistent, information about 
the observer’s values concerning the use of 
aggressive tactics? For example, how would a 
subject respond in the presence of an aggres- 
sive female or a pacifistic male? 

In Experiment 2 the observer’s values re- 
garding the appropriateness of aggressive be- 
havior were more clearly defined. The male 
or female observer of the subject’s shock-set- 
ting behavior was disguised as a member of 
an organization with expressed values regard- 
ing competitive, aggressive behavior. The ex- 
tent to which this type of information aug- 
ments, cancels, or interacts with the observer— 
sex effect was examined during the observer's 
presence and also on the alone trials which 
followed. 


EXPERIMENT 2 
Method 
Subjects and Procedure 


Subjects were 40 additional male Kent State Uni- 
versity undergraduates. There were four groups of 
subjects, each of which made the first seven shock 
settings in the presence of an observer (n = 10 per 
group). Following these observation trials, subjects 
in each group continued the competition with their 
opponent for six more trials during which their 
behavior was not observed. The apparatus and pro- 
cedure were the same as in Experiment 1. 
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Experimental Manipulations 


Observer’s sex. For half of the subjects, a D 
observer was present during the observation trials. 
The other half of the subjects were observed by a 
female. 

Observers values. The observers values were 
manipulated by having the observer mention in a 
contrived conversation with the experimenter that 
he (she) was an active member of a group which 
either condoned or opposed competition and aggres- 
sion, 

The competitive, aggressive observer was intro- 
duced into the situation immediately following the 
task instructions as a student from another psychol- 
ogy class. Along with his (her) books, the observer 
carried a faded jacket which was placed on a nearby 
chair as he (she) was seated beside the subject. On 
the shoulder of the jacket there was a colorful 
embroidered patch with the name of the univer- 
sity’s karate club, which circumscribed an em- 
broidered fist and a bolt of lightning. 

While explaining the ostensible purpose of the 
experiment to the observer, the experimenter glanced 
at this patch and casually asked the observer if he 
(she) was a member of a karate club. The observer 
responded affirmatively and briefly added that he 
(she) was an instructor, about to receive a brown 
belt, and greatly enjoyed the sport. The experimenter 
then redirected his attention back to the experiment, 
further described a number of procedural aspects 
of the task, and left the room. The competition 
trials commenced a few moments thereafter. 

The introduction of the pacifistic observer was 
comparable. The observer also carried a jacket (the 
same one) with a shoulder patch of similar color 
and shape. In this case, it consisted of a large peace 
symbol with the letters S.A.N.E. embroidered across 
the top half of the badge. While explaining the 
experiment to the observer, the experimenter noticed 
the patch and asked about the group. The observer 
explained that the letters represented the acronym 
for the Society Against Nuclear Expansion, a world- 
wide peace organization, added that he (she) was 
one of the organizers of the group’s local chapter, 
and briefly outlined its history and major activities. 
The experimenter then resumed his discussion of 
the procedure and left the room. 

Thus, there were four observer conditions: ag- 
gressive male observer, aggressive female observer, 
pacifistic male observer, and pacifistic female ob- 
server. Subjects in each of the groups competed in 
the presence of one of these observers during the 
first half of the session. During the second half of 
the session, all subjects competed with their op- 
ponent without an observer. 


1In the postexperimental interview, all subjects 
were able to recall accurately the observer’s affilia- 
tion. One subject in the aggressive male observer 
condition admitted suspicion regarding the obsery- 
er’s presence. His data were not included in the 
analysis, and an additional subject was run in this 
condition. 
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Resuits 


As in the preceding experiment, the sub- 
He... Ject’s first-trial shock setting was analyzed 
sior, Independently of the trials on which the sub- 
sub.ject had feedback from his opponent. For the 

} Soci first selection, subjects in the two groups with 
Pisan! an aggressive observer (aggressive male ob- 
age server, aggressive female observer) chose sig- 

At nificantly higher shocks for their opponent 

a Tayl than subjects in the two pacifistic observer 
are 8roups (pacifistic male observer, pacifistic 
der female observer). The respective means for 

i l% the aggressive versus pacifistic groups were 
4.80 and 2.80, F (1,36) = 6.43, p < .02. As 
` shown in Figure 1, subjects with aggressive 
observers continued to respond in this manner 
throughout the remaining observation trials, 
F (1,36) = 4.13, p < .05. However, on the 
subsequent alone trials, when the observer was 
no longer present, they assumed a strategy 
which more closely conformed to their oppo- 
nent’s behavior, F (1,36) = 5.14, p< .03. 
Newman-Keuls comparisons indicated that 
this interaction was primarily due to the large 
drop in aggressiveness associated with the 
removal of the aggressive observer. 

Subjects in the two groups with a male ob- 
server (aggressive male observer, pacifistic 
male observer) tended to choose higher initial 
shocks than subjects in the two groups with 
a female observer (aggressive female observer, 
pacifistic female observer), F (1,36) = 2.66, 

“p= .10, and responded somewhat more ag- 
gressively during the observation trials, F (1, 
36) = 2.26, p=.14. However, these differ- 
ences only approached significance. Appar- 
ently, knowledge of the observer’s values at- 
cenuated the observer-sex effect. This finding, 
of course, is reasonable, since the sex effect 
was presumably a function of values asso- 
ciated with a more generalized sex role of the 
observer. When the observer’s aggressive val- 
ues were made explicit and the appropriate- 
ness of aggressive responding more clearly 
| defined, the subjects responded accordingly. 

Finally, the observer-sex and observer- 
value factors showed no signs of interaction 
on the initial shock setting, F (1,36) = 1.18, 

_ ns, or during the observation trials when sub- 
jects were receiving feedback from their op- 


. ponent, F (1,36) = 1.32, ns. 
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Fıcure 1. Mean shock settings of subjects in the 
presence of either an aggressive observer (male and 
female) or a pacifistic observer (male and female) 
and on subsequent alone trials. 


DISCUSSION 


Until recently, the effects of nonparticipat- 
ing observers on aggressive responding have 
remained unexplored. The studies which have 
been conducted (Baron, 1971b; Borden & 
Taylor, 1973) have found differences in ag- 
gressive responding between subjects who 
were alone and in the presence of observers, 
but the differences were opposite in direction. 
In a number of respects, the composition of 
the observer groups used in these studies have 
not been comparable (e.g., the number of 
observers, their status, their sex, etc.). Based 
on these findings and recent statements by 
social learning theorists (Bandura, 1971, 
1973), the present study examined the possi- 
bility that anticipation of the observer’s ap- 
proval is the important component of ob- 
server—aggressor effects. In both experiments 
of the present study, when approval from the 
observer could be viewed as a likely outcome 
of increased aggression, subjects responded 
toward their experimental partner with sig- 
nificantly more intense levels of aggression 
than when approval was unlikely. Further- 
more, after the observer left the situation and 
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approval consequences were eliminated, sub- 
ject’s aggressive behavior returned to a level 
which more closely matched the strategy used 
by their opponent. It is important to note that 
subjects in this study never actually received 
overt indications of approval or disapproval 
from the observer. Thus, their behavior was 
modified merely by the anticipation of ap- 
proval from the observer. These findings, then, 
are in accord with a conclusion emphasized 
by Bandura (1971) in a recent review of the 
experimental and theoretical literature relat- 
ing to social learning theory: “As a result of 
prior experiences, people come to expect that 
certain actions will gain them outcomes they 
value . . . Actions are therefore regulated to 
a large extent by anticipated consequences” 
(p. 3). 

The results from studies using a variety of 
procedures have demonstrated that retaliative 
behavior is typically characterized by reci- 
procity. In general, subjects tend to respond 
toward their opponent with a level of aggres- 
sion which closely matches the level that the 
opponent has directed toward them (Baron, 
1971b; Dengerink, 1971; Geen, 1968; Pisano 
& Taylor, 1971; Taylor, 1967). In this case, 
it was found that subjects overretaliated 
toward their opponent when in the presence 
of a male with unknown values or a member 
of either sex with aggressive values. When 
the observer happened to be a female with 
unknown values or a pacifistic male or female, 
subjects tended to underretaliate. Apparently, 
then, the presence of observers serves to re- 
duce an otherwise important determinant of 
aggression, the behavior of the opponent. As 
soon as the observer leaves the situation, how- 
ever, reciprocity seems to be immediately 
restored. 

In the present study the opponents always 
maintained the same level of attack toward 
the subjects. It would be interesting to dis- 
cover the extent to which subjects would dis- 
tive and underretaliative ten- 
dencies with opponents whose level of attack 
strategy differed from the presently studied 
level. Also, subjects for whom a self-concept 
of manliness is less important might not be 
as influenceable as the subjects in this study. 
One such subculture, of course, might be fe- 
males. Furthermore, selection of subjects who 
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t 
differ in susceptibility to social influence or 


need for social approval represents additional” 
possibilities for future research on observer | 


effects. fl 

Finally, several studies have shown that the% 
presence of observers increases the emission» 
of dominant responses in learning situations į 
(Cottrell, 1968; Henchy & Glass, 1968; Za- * 
jonc & Sales, 1966). With respect to aggres- ' 
sive behavior, however, the effect of an ob- | 
server’s presence is apparently more a func- 
tion of wko is watching than simply whether 
or not anyone is watching. 
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objects along a 7-po 
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other studies also characterized these 
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differences aided the interpretation o! 


several common social decision scheme 
attitudinal decision distributions in or 


social process theory that might be 


scheme best pi 
items. A model incorporatin; 
erences also g 
semantic differential items. 


Attitudes are social cognitions usually re- 
garded as characteristic of individuals. How- 
ever, groups too are often observed to “take 
a position” on an issue that could be called 
attitudinal if an individual rather than a 
group had expressed it. In perhaps most cases, 
a group’s position along some attitudinal con- 
tinuum is not determined by a single person 
but is some function of the individual mem- 
bers’ preferences or predilections that have 
been aggregated through the social interaction 
necessary to establish the group judgment. 
The central question of this article is to ask 
how individual attitudes, perhaps initially 
quite disparate, may be translated in a “group 
attitude.” Clearly, such a question has a 
number of answers depending upon the kind 
of group and social context of interest. 

An individual’s position on the attitudinal 
continuum is typically estimated by asking a 
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1 scale and about several social 
differences (both 


made 


data, but inspection of entire distribu- 


f choice shift effects, Predictions from 
models were compared with the group 
der to ascertain the general kind of 


required. A majority-averaging decision 


roup attitudes for the 


subject to decide which one of 
mutually exclusive and exhaustive 
tives most nearly represents his or i 
tion, for example, agree very much, .. - 
disagree very much, etc. The selection of a 
response alternative in this way is i 
what is meant by a decision 
decision-making behavior has often 
ciated, experimentally, with the study of 
gambling, risk taking, and the like. Just as a 
subject’s specification of an alternative ( 
on a Likert scale) or location 
n-millimeter line) informs an ex; 
about an individual’s attitude towa: 
psychological object, so can the ; 
ment of one’s preference among 
inform the other group membe 
position. The ensuing group dis: 
serves to establish a position 
selection or location specification 
continuum that reflects a collective 

Until recently, there has been a 
general intuition that group discussion lea 
to decision would result in the group’s atti- 
tude being generally less extreme than that 
observed for similar individuals; social inter- 
action could thus be regarded as a pro 
ameliorating influence. The mitigation of ex- 
tremist views has even been regarded as an 


important virtue of group decision making, 
` in that it serves as a safeguard against pre- 
` cipitous action. On the other hand, the prac- 
o" gice» of entrusting important decisions to 
y “ groups sometimes appears to be an impedi- 
~ ment to desirable change, if indeed the group 
` tends to foster a middling position when an 
extreme solution or alternative seems most 
needed. 

Tt thus came as something of a surprise 
when Moscovici and Zavalloni (1969) sug- 
gested that for opinion and social judgment 
ratings, group decisions might be more ex- 
treme than those observed for individuals. A 
similar sense of surprise had been elicited at 
the beginning of the last decade when Stoner 
(Note 1), Marquis (1962), and Wallach, 
Kogan, and Bem (1962, 1964) reported that 
groups on the average were more willing to 
advise risky courses of action in miniature 
social situations (‘“choice-dilemmas” items) 
than were comparable individuals. Later, 
choice-dilemmas items were written which 
yielded cautious shifts, so the term choice 
shift is sometimes preferred in order to indi- 
cate the more general nature of the phenome- 
non. Thus, a small group has at least some- 
times been regarded as a “polarizer” of 
opinon (Fraser, Gouge, & Billig, 1971; 
Moscovici, Doise, & Dulong, 1972; Muscovici 
& Zavalloni, 1969; Myers & Bishop, 1970, 
1971) or a kind of “extremity-biasing” 
mechanism. The importance of choice shifts 
‘and polarizations seems to be that they 
are exceptions to the “averaging tendency” 
previously thought to mark group decisions 
and which had sometimes been thought to be 
! the appropriate interpretation of classic 
‘early experiments by Allport (1924), Sherif 
(1936), and a number of others. 

Actually, polarization and choice shifts are 
individual-group differences on some statistic 
of interest, usually means. Individual and 
group “differences” have been observed for 
many years in the area of problem solving 
(cf. literature summaries by Davis, 1969; 
Lorge, Fox, Davitz, & Brenner, 1958) and as 
such are perhaps not so novel as at first 
appears. What was new in the individual- 
group attitude and risk-taking studies was 
the enhancement through group discussion 
._, of a predilection observed in individuals. Ad- 
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mittedly, the exact definition of polarization 
is not especially clear, but. the important 
thing is that groups, at least some kinds of 
groups, cannot after all be depended upon to 
pool opinion in such a way as to yield 
middling decisions relative to individuals. 
And polarization is a convenient label for 
these revealing findings. 

Tt turns out that the seemingly counter- 
intuitive polarization and choice shifts effects 
mentioned above can be produced by appar- 
ently quite common and straightforward s9- 
cial processes. For example, Davis (1973) 
has shown that a (simple) majority process 
or a (simple) plurality process can, when the 
initial individual distribution is skewed oppo- 
site to the direction of change, produce the 
kind of differences described as polarizations 
or shifts at the group level. At the same time, 
averaging processes tend to move the group 
decision distribution nearer the scale middle 
relative to the individual distribution. Equali- 
tarian social processes (sometimes called 
equiprobability group decision models) tend 
to smooth in the group distribution what- 
ever perturbations were observed in the indi- 
vidual input distribution. When the indivdual 
distribution is symmetrically distributed, both 
averaging and majority (as well as plurality) 
processes would tend to exaggerate in the 
group distribution whatever peakedness oc- 
curred in the individual distribution, and the 
means would not be expected to differ. The 
point is that both individual input distribu- 
tion properties and the group social decision 
scheme influence the social output. In other 
words, group shifts, polarizations, and other 
kinds of individual-group differences are to 
be expected under certain circumstances. But 
their understanding requires a consideration 
of both individual and social processes. 

Vinokur (1969) and Johnson, Davis, and 
Kauma (Note 2) have observed that indi- 
vidual distributions on choice-dilemmas items 
tend to be skewed opposite to the direction of 
shift. Unfortunately, individual and group dis- 
tribution data are not available for similar 
direction-important individual-group differ- 
ences in the attitude polarization studies 
(e.g., Doise, 1969; Moscovici & Zavalloni, 
1969; Myers & Bishop, 1970, 1971), and thus 
the action of such well-defined social decision 
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schemes as a majority process cannot be ruled 
out, There are a number of potential hazards 
that attend the heavy concentration upon 
arithmetic means and analyses of variance 
(see, further, Abelson, 1973; Cartwright, 
1971; Davis, 1973) to the exclusion of other 
distributional properties of group and indi- 
vidual decisions. For example, it is possible 
to obtain a significant difference between 
individual and group means, but inspection of 
the two distributions would hardly justify 
much enthusiasm about the degree of shifting 
or polarizing that had occurred. The converse 
is also possible in principle. Surely some ef- 
fort should be devoted to scrutinizing entire 
data sets in addition to inspecting their means. 

We are not proposing a majority or plural- 
ity model (or any other model) as explana- 
tions for group polarization effects at this 
time but rather that such explanations cannot 
be ruled out given the research evidence now 
available. In fact, a primary goal of the 
present experiment was to gather appropriate 
individual and group data such that a social 
decision scheme analysis could be carried 
through. It would be a serious conceptual mis- 
take to imagine that all groups, regardless of 
their history, task, or social context may be 
characterized by the same social decision 
process or that all attitude objects yield the 
same distribution of initial individual opinion. 
Such parameter invariance cannot reasonably 
be assumed for most social psychological ex- 
periments. Recall that variations in either 
individual input or social decision schemes or 
both can alter the group decision output. For 
that reason, several attitude objects and two 
kinds of response scales have been included 
as decision tasks in this study. 


Social Decision Scheme Theory: A Summary * 


While much of a group’s discussion is 
generally devoted to eliciting member’s opin- 
ions, some of the time is spent in achieving 
consensus about the group position. In other 
words, following the initial individual input, 
some social process is required to pool, com- 
bine, or aggregate the individual attitudes 


1A more detailed exposition of social decision 
scheme theory, including several special case models 
and illustrative applications, may be found in Davis 
(1973). 
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into a group attitude. Of course, it is neces- 
sary to be more specific about the effects of 
such an aggregating process. Social decision 
scheme theory (Davis, 1973) was designed to 
provide a general framework for just such 
questions, a theoretical framework that easily 
admits to special case models for various ex- 
perimental situations. It is generally efficient 
to test several models simultaneously with a 
single data set. In the present experiment, 
several special decision schemes were con- 
sidered plausible, a priori, in varying degrees, 
and this led to several different sets of pre- 
dicitions assuming several special case models 
of the general social decision scheme theory. 

Let us suppose that there exists some 
probability, f» j=1,2,...,m, that an 
individual will endorse the jth response 
alternative, 4;, of an attitude item. Following 
attitude measurement custom, these n re- 
sponse alternatives are mutually exclusive and 
exhaustive. Similarly, the probability of a 
group choosing A; is given by P; j= 1,2, 
. .., n. Given the individual distribution, 
Pi, P2, - ++, Pn, the theoretical problem 
stated earlier may be recast as one of moving 
to the group distribution, Pa, Po, ..., Ph, 
by explicit assumptions about the nature of 
the social processes yielding the group deci- 
sion, namely, the social decision scheme. 

If groups are of size r, then it is possible 
to ask what are the logically possible ways r 
group members may distribute themselves 
over the n response alternatives on the item 
or issue at hand. There are 


Cae 
=m 
v 


such distinguishable distributions of 7 mem- 
bers into m alternatives, if alternatives but 
not members are regarded as distinguishable. 
The probability of the ith (i= 1,2, ..., 
m) distinguishable distribution occurring is 
given by 


r 
Ne ee Tay assy oe pot... Pate (1) 


a familiar term from the multinomial expan- 
sion (pı + po +. . .+ pn)”, where the occu- 
pancy numbers ry, 72, ... , fn are the fre- 
quences with which group members initially 
prefer alternatives Ay, Ao,..., An; 71 +72 + 
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Given that the members’ initial attitudes 
constitute the starting point, the next ques- 
tion is how the social process yields decisions 
for the various A;. More explicitly, we define 
di the conditional probability that the jth 
alternative, Aj, will be chosen by a group 
with the ith distribution of opinion. The 
parameter dj;, however, is determined by the 
social decision scheme, which is an assump- 
tion about the way the group deals with 
initial opinion such that alternative Ay repre- 
sents group consensus. (We are, of course, 
ignoring groups that disintegrate or do not act 
at all, intriguing and fascinating phenomena 
in their own right.) 

We may use the social decision scheme 
notion in a formal way to obtain the group 
distribution of opinion, (Pa, Ps, Ps,..., 
Pa), by postmultiplying the m-tuple, m = 
(1,72, » «+ 5 Tm), by the m X n matrix D 
having entries [d;;], that is, 


d (Py, Pa, aa, Pa) = 


dy diy din 
(Ei) Tay -> -9 Tm) dzi das: : “don ame) 
dinadms* + *dmn, 


‘The central issue is, of course, the precise 
form of D, the social decision scheme matrix. 
In a sense, D is a complete statement of 
“everything that can happen.” For every dis- 

{finguishable distribution occurring with some 
prohability, m; there exists some probability, 
dy, that each of the attitudinal response pos- 
sibilities (A;, A2, . . + , An) will be chosen 
by the group, Some familiar plans, such as 
“majority rules,” may indeed guide part of a 
social decision scheme, but what about dis- 
tinguishable distributions that are nonmajori- 
ties, cases where the primary scheme does not 
apply? Obviously, groups frequently deal 
with such situations, and often these are po- 
tentially among the most interesting groups 
because of the special opportunities for con- 
flict, divisive debate, etc. Thus, a subscheme 
must frequently be included along with the 


_ primary decision scheme so that all decision 


contingencies are considered. In a sense, the 
social decision scheme matrix is rather like 
a “picture” of the social process as it works 
through the decision task. It is the various 
forms of the social process that might plau- 
sibly be expected in the present experiment, 
to which we now turn. 


Polarizations, Shifts, and Social Decision 
Scheme Theory 


In the case of freely interacting ad hoc 
groups addressing attitudinal decisions, we 
may ask what social decision schemes are 
attractive candidates. A variety of social 
objects (Nixon, Frenchmen, etc.) were 
chosen in this study such that the individual 
distribution of evaluative decisions would 
be likely to take a number of different 
shapes over the response continuum. The 
question is, given the individual input form, 
what social process might act upon these 
member preferences to yield a group re- 
sponse? We have observed in several pre- 
vious experiments (e.g., Davis, Hornik, & 
Hornseth, 1970; Johnson & Davis, 1972) in 
which groups made decisions under substan- 
tial uncertainty (probabilities of various out- 
comes were unknown to subjects initially) 
that an equiprobability social decision scheme 
(any position advocated was as likely to be 
decided upon as any other alternative advo- 
cated) closely described the group decision 
distributions. The salient property of equi- 
probability models is that perturbations or 
departures from uniformity in the individual 
distribution tend to be smoothed in the group 
distribution. 

For attitudinal decisions, we mentioned 
earlier that the mean shifts observed by 
Moscovici and Zavalloni (1969) and others 
might have come about by majority or plural- 
ity social decision processes, because such 
models are “mode exaggerating” (more prop- 
erly, “maxima exaggerating”) in that depar- 
tures from uniformity in the individual dis- 
tribution are increased further in the group 
distribution.* Such a notion gains plausibility 


*Tt is a common intuition that “majority- 
conformity” processes have distributional effects like 
averaging, but this is not the case; recall that major- 
ity models will usually lead to exactly opposite 
consequences for skewed individual distributions (cf. 
Davis, 1973), namely, exaggeration of skewness. 
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from findings by Davis, Kerr, Sussmann, and 
Rissman (1974). In that experiment, indi- 
viduals and groups made decisions about 
duplex bets in which the outcome odds were 
clearly labeled which thus reduced subject 
uncertainty relative to that in the Davis et 
al. (1970) study mentioned previously. In 
the duplex bet experiment, group distribu- 
tions were generally exaggerations of the 
individual distribution, and hence mean shifts 
were observed for bets yielding skewed indi- 
vidual distributions, the direction of mean 
shift depending upon the original (individual) 
direction of skewness: Symmetrical unimodal 
individual distributions saw only group exag- 
geration of the peakedness with no significant 
change in the group mean. 

Attitudinal decision making may be much 
like that required for duplex bets. Group 
members have a fairly clear picture of the 
task in both cases, so that each person’s 
position during discussion is rather clearly 
defined and may be rather easily communi- 
cated. Thus, for attitudinal decisions, a more 
decisive (and perhaps more familiar) social 
decision scheme such as simple majority may 
be required for adjudication of conflict than 
was the case with the highly uncertain deci- 
sions of earlier risk-taking research (€.8., 
Johnson & Davis, 1972). Despite this intui- 
tively attractive line of reasoning and the 
results of other investigators suggesting atti- 
tude polarization, sufficient ambiguities remain 
(e.g., variations in local experimental condi- 
tions, lack of information about individual 
and group distributions) that it would be 
imprudent to hypothesize either a majority 
or plurality, or for that matter equiprobabil- 
ity, social decision scheme for the attitudinal 
processes in the groups we studied. 


METHOD 


Subjects 


The subjects were 500 undergraduate students 
(250 females and 250 males) who participated in 
the experiment as part of the course requirement 
for an introductory psychology class at the Uni- 
versity of Illinois. The data from 11 additional 
subjects were not included in any of the analyses: 
9 males were given the questionnaires in an incor- 
rect order; 2 females tested together were found to 
be roommates. 
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Tasks 


Experimental task. One set of experimental tasks 
required the subjects to judge Billy Graham, New 
Left, Frenchmen, and President Nixon on a set of 
three semantic differential scales. The 7-point scales 
were as follows: from 1 (bad) to 7 (good), from 
1 (passive) to 7 (active), and from 1 (strong) 
to 7 (weak). The cover sheet of the questionnaire 
included detailed instructions (Osgood, Suci, & 
Tannenbaum, 1957) on the use of semantic differen- 
tial scales. Only the evaluative scale (bad-good) 
was of interest in this study; the other two (actiy- 
ity and potency) were filler tasks. The evaluative 
judgment was intended to parallel in some degree 
the Moscovici and Zavalloni (1969) study in which 
French students expressed their attitudes toward 
General de Gaulle (Nixon in this study) and Les 
Americains (Frenchmen in this study). The selection 
of these four attitude objects was also derived from 
the desirability of subjects’ evaluative ratings, which 
were varied suitably as to both the means and 
shapes of distributions across the 7 response 
alternatives. 

For a second set of experimental tasks, subjects 
expressed their preferences in terms of a percentage 
on each of four social issues stated as questions. The 
questions were (a) “What percentage of the national 
budget ($150,000,000,000) should be spent on pollu- 
tion control?”; (b) “What percentage of control 
should students have in the planning and administra- 
tion of a university?”; (c) What percentage of an 
undergraduate’s waking hours should be spent study- 
ing?”; (d) “What percentage of a university’s re- 
search funds should be devoted to applied scientific 
research (as opposed to basic scientific research) ?” 
The percentage continuum was grouped so that sub- 
jects chose 1 of 10 class intervals which varied from 
0%-9% up to from 90%-100%. 

Each of the eight items was presented on a 
separate sheet of a booklet. 

Trrelevant task. When the design, an elaboration 
of the Solomon Four-Group Design (see Campbell 
& Stanley, 1963, p. 24), called for subjects to receive 
no pretesting, an irrelevant task was used. This task 
was similar to semantic differential items of the 
experimental task except that different objects were 
rated, namely, animals (e.g., snakes) and inanimate 
objects (e.g., shovels). 


Design 


The design (Solomon Four-Group) was intended 
to be especially sensitive to any effects from pre- 
testing. The four experimental groups and their re- 
spective task sequences are as follows: (a) pretest 
group condition, individual-group-individual deci- 
sions; (b) pretest individual condition, individual- 
individual-(population estimates) decisions; (c) 
no-pretest individual condition, (irrelevant task)— 
individual-(population estimates) ; and (d) o-pretest 
group condition, (irrelevant task)-group-individual 
decisions. The population estimation task is described 


subsequently. There was also a between-subjects 
sex factor. 


Experimental Procedure 


Each experimental session was divided into three 
periods. After ariving at a large, central experimental 
room, the group of same-sexed subjects (varying in 
number from 6 to 24) were welcomed and informed 
that the purpose of the study was to gain informa- 
tion concerning student opinions under various stan- 
dard conditions. Thereafter, the procedures described 
below were employed. 

Period 1, Subjects later assigned to the individual 
or group conditions were treated identically in 
Period 1: 


1. Pretest conditions: Subjects were asked to indi- 
cate their opinions on a number of social and na- 
tional issues and instructed how to complete the 
questionnaire booklets. They were assured of the 
confidentiality of their responses and informed that 
there were no correct or best answers. After proce- 
dural questions were answered, subjects responded 
to the attitude items. 

2, No-pretest conditions: Individual subjects were 
told that they would first receive a booklet which 
would familiarize them with the form of the ques- 
tionnaire. The booklet they received contained the 
irrelevant task previously discussed. They then 
received instructions on how to complete the 
questionnaire. 


Period 2. In the pretest group condition, subjects 
were informed that they had completed the earlier 
questionnaire (experimental task) in order to become 
familiar with the content of the items as well as 
to allow themselves to state their opinions. Now, 
they were to discuss the items as a four-person group 
and arrive at a unanimous group decision for each 
item, The above instructions were generally similar 
to those used by Moscovici and Zavalloni (1969) 
and identical to those used by Davis et al. (1974). 
The following instructions were also given: 


You might or might not want to change your 
opinion on some items. We are not concerned as 
to whether or not you are consistent with your 
former responses, or after thinking about some 
items, whether or not you decide to change your 
mind. We are interested, however, in obtaining 
the best estimate of a position or opinion. 


Next, the small adjoining rooms in which the 
groups would discuss the questionnaire items were 
described. A single booklet was provided in each of 


. these rooms for marking the group’s responses. On 


the table in front of each chair, there appeared a 
member-identifying letter (A, B, C, or D). Subjects 
were asked ‘to take note of which letter went with 
which group member. They were told then that 
they would be allowed approximately 15 minutes to 
make their group decisions, although slightly more 
time-was, in fact, allowed to accommodate the few 
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Jate-finishing groups. Groups that finished early were 
to remain seated in their rooms. 

Groups were randomly composed of those subjects 
in attendance. If any subjects were left over after 
the groups of four had been selected, they were 
taken to another room at the outset of Period 2 and 
run in the pretested individual condition. 

Finally, groups were taken to the smaller rooms 
to begin their discussion of the items. Throughout 
these discussions, each group’s progress was moni- 
tored by the experimenter through a one-way mirror. 
If a group was still far from completion after about 
15 minutes, the experimenter asked the subjects to 
finish as quickly as possible. After the last group had 
finished, all subjects were asked to return to the 
larger room, 

In the no-pretest condition, subjects were informed 
that they had completed the questionnaire (irrele- 
vant task) in order to become familiar with the 
form of the questionnaire items. They were then 
asked to discuss some social and national issues with 
others and arrive at unanimous group decisions, The 
remaining instructions were almost identical to those 
given during Period 2 in the pretest group condition. 

In the individual pretest condition, all subjects 
were told that they had completed the questionnaire 
(experimental task) in order to become familiar 
with the content of the items as well as to allow 
them to state their opinions. Also, as in the pretested 
group condition, now that they had seen the ques- 
tionnaire items, they might or might not want to 
change their opinions. Everyone then completed 
the experiment task individually for a second time. 

In the mo-pretest individual condition, as in 
Period 2 of the no-pretest group condition, subjects 
were told that they had completed the questionnaire 
(irrelevant task) in order to become familiar with 
the form of the questionnaire. They were then asked 
to give their personal opinions on social and national 
issues (the experimental task). 

Period 3. In all group conditions, after returning 
to the larger room, the former group members were 
asked to complete the experimental task individually, 
giving their own personal opinion on each item. 
Upon finishing the experimental task, subjects were 
to go ahead and complete a short postexperimental 
questionnaire. Approximately 8 minutes were allotted 
for the two tasks, but the questionnaires were not 
collected until everyone had finished. The session 
ended with a short debriefing. Subjects in all condi- 
tions were also given a handout explaining the 
purposes of the study and a list of relevant general 
readings. 

In all individual conditions, subjects were asked to 
go through the experimental task once more. How- 
ever, rather than indicate their own opinion, they 
were to estimate, for each of the 16 scales, the 
percentage of the general population of similar col- 
lege students who would check each of the response 
alternatives. Next, subjects completed a short post- 
experimental questionnaire. A debriefing concluded 
the experimental session as described previously. 
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RESULTS 


Three sets of analyses will be reported: 
(a) Two overall analyses of variance (within- 
subjects and between-subjects analyses) ex- 
ploited the design features in exploring mean 
differences that might bear in the traditional 
way upon “shift” and “polarization” effects. 
(b) Predictions under several social decision 
scheme models (Davis, 1973) were compared 
with the distribution of group decisions ob- 
served for each attitude object or social issue. 
(c) Postdiscussion sociometric and personal 
reaction data were summarized to provide 
additional information. 


Overall Analyses 


Subjects, both individual and group, re- 
sponded to a total of eight attitudinal items; 
there were two sets of four items each. For 
one set of four attitude objects (Billy 
Graham, New Left, Frenchmen, and Nixon), 
the required decisions followed a standard 
“semantic differential” format in that the 
individual or groups chose 1 of the 7 alterna- 
tives ranging from 1 (bad) to 7 (good). For 
the other 4 items (the percentage of the 
national budget to be devoted to pollution 
control; the percentage of university adminis- 
trative control to be invested with students; 
the percentage of time students should spend 
studying; and the percentage of university 
funds to be devoted to applied as opposed to 
basic research), subjects chose 1 of the 10 
class intervals, ranging from 0%-9% to 
90%-100%, that best expressed their opinion. 
Clearly, content varies considerably from item 
to item. Moreover, the decision distributions 
were observed to vary considerably from item 
to item, and the form of the sample distribu- 
tion has substantial implications for the na- 
ture of the individual-group relationship ac- 
cording to social decision scheme theory, an 
issue we will consider later. In response to 
these issues as well as the custom established 
for group shift and polarization research, we 
elected to treat each item as 4 separate de- 
cision task. 

Within-subjects comparisons. Recall that in 
one experimental condition, individual sub- 
jects responded privately in Period 1 and 
again made private decisions during Period 2. 


(During Period 3, these individuals completed 
a different task.) These judgments were ana- 
lyzed by a 2 (Periods 1 and 2) X 2 (Sex) 
analysis of variance. No significant main ef- 
fects were observed for any of the semantic 
differential items (Billy Graham, New Leit, 
Frenchmen, and Nixon). For the percentage 
items (pollution, university control, studying, 
and applied research), only the pollution 
item yielded a single significant effect; The 
average individual decision at Period 2 was 
to spend significantly less, than at Period 1, 
of the national budget for pollution control, 
F(1,48) = 5.186, p < .03. This would sug- 
gest for the set of all eight items in general 
that average individual preferences are not 
very strongly influenced by the two testings 
per se. 

The more interesting mean comparisons 
came from the experimental conditions in 
which the decision order was individual- 
eroup-individual. The private decisions of the 
four members were averaged at Period 1 and 
again at Period 3. These Period 1 and 
Period 3 values were treated as scores along 
with the group decision at Period 2 in a 3 
(Periods 1, 2, and 3) X 2 (Sex) analysis of 
variance. (See Davis, 1973, and Davis et al., 
1974, for a critical discussion of the some- 
what dubious, though widespread, practice of 
averaging prediscussion and/or postdiscussion 
responses of the individual group members for 
the purpose of a repeated-measures analysis 
of this type in which there is only a single, 
corresponding group datum. We follow this 
practice here only for the purpose of main- 


taining continuity with the literature.) Three 


of the four semantic differential items yielded 
a significant main effect for period: for New 
Left, F(2, 92) = 11.58, p < .001; for French- 
men, F(2,92) = 8.94, p < 001; for Nixon, 
F (2,92) = 5.04, P< 01; there was no sig- 
nificant main effect for Billy Graham. There 
was a significant Period X Sex interaction, 
F(2,92) = 6.71, p < .01, on the New Left 
item. The mean values for all eight items for 
each period are given in Table 1. Post hoc 
comparisons (by the Newman-Keuls test) on 
each of the three items giving a significant 
periods effect showed that the individual- 
group difference (Period 1 vs. Period 2) were 
significant for New Left ($ < .01), French- 


P 


TABLE 1 


Mean Varyes or Decisions ar EACH PERIOD FOR 
EACH ĪTEM AGGREGATED OVER SEX 


; F eriod i iod 3 
Attitude object | cyndividuaD | (Cross individu) 
Semantic differential | 
item* | 
Billy Graham 4.35, 4.15, 4.23, 
New Left 3.84, 4.29), 4.125 
Frenchmen 441, 4.06), 4.15, 
Nixon 3.45, 3.23, 3.47, 
Percentage item? 
Pollution control 2.47, 2.02), 
University control 3.44, 3.32, | 
Studying 2.84, 2.28, 
Applied research 3.274 3.68), 


Note. Periods are within subjects. Means within a row having 
a common subscript do not differ significantly at the .0S alpha 
level, (For many comparisons the actual probability under the 
null hypothesis was much smaller than .05), 

= On a scale from 1 (bad) to 7 (good). 

t On a scale from 0 (0%-9%) to 9 (90%-100%), 


men (p < .01), and Nixon (p < .05), respec- 
tively. On the average, groups liked the New 
Left more, Frenchmen less, and Nixon less 
than did individuals on each of these issues. 

Consider next the percentage items, Three 
of the four items yielded a significant effect 
for periods: for pollution, (2,92) = 11.75, 
p < .001; for studying, F(2,98) = 20.20, p 
<.001; for research, F(2,90) = 4.22, p< 
-02; and there was no significant periods ef- 
fect for university control. Sex exerted a sig- 
nificant average effect on the pollution and 
university control items, (1,46) = 23.38, 
$ < .001; F(1,45) = 4.29, p < .05, respec- 
tively, in that females preferred a higher 
percentage than males in both cases. No 
interactions were significant on any item. 

Post hoc comparisons between Periods 1 
and 2 (individual vs. group) for the three 
significant percentage items revealed that, on 
the average, groups endorsed percentages sig- 
nificantly lower (p< .01) for the pollution 
item, significantly higher (p < .05) for the 
research item, and significantly lower (p < 
.01) for the studying item than did the aver- 
age individual on those items. Thus, in 6/8 
of the comparisons, the average group deci- 
sion and average individual decision (Period 1 
vs. Period 2) were significantly different, but 
the direction of the difference was not always 
the same from item to item. 

A fairly general empirical finding is that 
the postdiscussion mean of ex-members’ indi- 
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vidual responses is rarely coincident with the 
immediately preceding group mean but rather 
falls between the group mean and the pre- 
dicussion individual mean: X, < Xs; < Xo, or 
X, > X; > Xe (see, e.g., Davis et al., 1974). 
Inspection of Table 1 shows that similar rela- 
tions hold for 7/8 of the attitude items. The 
Nixon item was deviant (i.e., the postdiscus- 
sion mean for that item fell outside the above 
interval) as was the university control item. 
Also, the Nixon item was the only one for 
which the difference between group and post- 
discussion individual means (Period 2 vs. 
Period 3) was significant (p < .05). 
Between-subjects comparisons. At Period he 
the full design was exploited by casting sub- 
jects into 2 (Group-Individual) x 2 (Sex) x 
2 (Pretested-Unpretested) between-subjects 
analyses of variance. The mean values for 
selected cells within each item are given in 
Table 2. There were significant sex main ef- 


TABLE 2 
MEAN VALUES OF Decisions ar PERIOD 2 


Condition 
Attitude object — - — 
Individual Group 
Semantic differential item 
Billy Graham*èe 
Pretest 4.14 4.16 
No pretest 5.10 4.24 
New Left 
Pretest 4.46 4.32 
No pretest 3.84 4.22 
Frenchmen" 
Pretest 4.46 4.06 
No pretest 4,54 4,23 
Nixon% 
Pretest 3,14 3.19 
No pretest 3.98 2.96 
Percentage items 
Pollution’: 
Pretest 2.92 2.02 
No pretest 1.98 2.16 
University control“ 
Pretest 4.14 3.32 
No pretest 3.53 3.60, 
Studying* 
Pretest 2.88 2.28 
No pretest 2.83 2.22 
Applied research*e 
Pretest 3.66 3.68 
No pretest 3.33 4.88 


Note, Rows and columns are between subjects, Each mean 
is based on 50 cases. 

a aiaeag main effect significant at the .05 alpha 
level, 

b Pretest-No Prestest main effect significant at the .05 alpha 
level. 

e Individual-Group X Pretest-No Pretest interaction signifi- 
cant at the .05 alpha level. 
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fects on 3/8 of the items (pollution, univer- 
sity control, and research), but sex never 
interacted with individual-group and inter- 
acted only once (New Left item) with level 
of pretesting. All three-way interactions were 
nonsignificant. However, as shown in Table 2, 
the Individual-Group X Pretest-No Pretest 
interaction was significant on five of eight 
items, Individual-group main effects on these 
items must therefore be interpreted with 
caution, 

Simple main effects tests revealed that with 
the exception of the applied research item, 
pretested and nonpretested groups did not 
reach significantly different decisions. Rather, 
the interactions were generally attributable to 
differences between pretested and nonpre- 
tested individuals at Period 2. These differ- 
ences do not appear to be due to the repeated 
testing of pretested individuals, since they 
made very similar decisions at Periods 1 
and 2. However, it is possible that the judg- 
ments of the nonpretested individuals were in 
some way influenced by the trivial Period 1 


freee GROUPS 
Oe INDIVIOUALS 


le, FRENCHMEN 


RELATIVE FREQUENCY OF DECISIONS 


ia MUY GRAWAM 
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filler task (rating inanimate objects and ani- 
mals on semantic differential scales). 

The direction of the mean difference of pri- 
mary interest (individuals vs. groups) was in 
every case the same as that observed for the 
within-subjects analyses. In this sense, the 
two perspectives yield completely consistent 
results. For the semantic differential items, 
groups as opposed to individuals liked Billy 
Graham significantly less, F(1, 191) = 3.94, 
p < .05; the New Left more, F(1, 191) = .33, 
ns; Frenchmen significantly less, F(1, 191) = 
6.16, p <.02; and Nixon significantly less, 
F(1, 190) = 4.88, p < .03. On the percentage 
items, groups as opposed to individuals de- 
cided for a smaller average amount on pollu- 
tion, F(1, 190) = 3.23, ns; a smaller amount 
on university control, F(1, 188) = 2.96, ns; a 
significantly smaller amount on studying F(1, 
187) = 8.33, p < .005; and a significantly 
larger amount on applied research, F(1, 184) 
= 10.18, p < .002. 

Although the direction of individual-group 
mean differences in the between-subjects 
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Ficure 1. The relative frequency with which individuals (solid line) and 
groups (broken line) chose each of the response alternatives on the evaluative 
continuum of each of the four attitude objects for the items with a semantic 


differential format. 


analyses were the same as for the within-sub- 
jects analyses across the set of eight items, it 
was not always the same item that yielded a 
significant individual-group mean effect in the 
two analyses. Nevertheless, the two analyses 
taken together support the idea that groups 
ind individuals generally differed on the aver- 
ixe, In this sense, these results are consistent 
with the “polarization-extremity shift” find- 
ings described at the outset. However, not all 
mean shifts were characterized by increased 
polarization, In searching for an explanation, 
it would seem that not only the decision dis- 
‘ribution’s central tendency but also its gen- 
‘ral shape should be considered, We turn next 
ta a consideration of these issues, 


Distribution of Decisions and Social 
Decision Schemes 

The individual (solid curve) and group 
(broken curve) decision distributions from 
Period 2 are displayed in Figure 1 for the 
semantic differential items and in Figure 2 
for the percentage items. In both cases, the 


relative frequency polygons represent data col- 
lapsed over the sex and pretesting variables, 


K horvi 


RELATIVE FREQUENCY OF DECISIONS 
È i 


Grour Posrrion as A Function or Memper ATTITUDES 
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Several of the distributions are anything 
but symmetrical, many are quite skewed, and 
some are irregular. None of this information, 
of course, was suggested by sample means 
alone, Inspection of Figure 1 along with the 
associated sample means (see Table 2) re- 
vealed that individual € > group £ (see Fig- 
ure la), and this mean relation is consistent 
with inspection in that the preponderance of 
the group decisions fell below the preponder- 
ance of the individuals; individual 2 < group 
£ (see Figure 1b), and this mean relation is 
not so evident from inspecting the graphs 
where the preponderance of the groups fell 
below the of the individuals; 
individual X > group X (see Figure te), yet 
the curves do not differ much by inspection in 
that both have very pronounced modes at Re- 
sponse Alternative 4; individual X > group £ 
(see Figure 1d) yet the individual curve is 
roughly bimodal, whereas the group curve is 
definitely unimodal and somewhat positively 
skewed, Figure 2 (percentage items) may be 
inspected in the same way. The point is that 
not only may conclusions based solely on 
means be incomplete but they even may lead 
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Fiourr 2. The relative frequency with which Individuals (solid line) and 
groups (broken line) chose each of the response alternatives on a percentage 
continuum for each of the social issues as labeled in the four subgraphs. 
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to unwarranted conclusions about the direc- 
tion and extent of group shifting and polariz- 
ing. In other words, if whole data sets are 
considered, then it seems by inspection that 
group-individual differences do not in most 
cases represent the kind of change implied by 
the terms polarization or shift which are ap- 
plied to mean differences. Again, taking each 
data set in toto, there appears to be a tend- 
ency for group decisions generally to fall 
back into the middle relative to individuals, 
and this is consistent with conventional wis- 
dom prior to the discovery of group extremity 
shifts (Cartwright, 1973). However, not all 
cases display this general middling tendency 
(e.g., see Figure 2c). 

What sort of social decision process can ex- 
plain these relations between individual and 
group distributions? To attack this question, 
we considered a variety of social decision 
schemes that were sufficiently well defined to 
permit explicit predictions. Assuming each 
social decision scheme model in turn, predic- 
tions from each were compared with the ob- 
served group distribution on each item. 

Social decision scheme models. Eight famil- 
iar social decision scheme models are de- 
scribed in the note to Table 3. In Table 3, 
4 of the 210 possible distributions of group 
members’ initial preferences on the seven 
alternatives of the semantic differential items 
are given as examples, and the model’s treat- 
ment of each distribution (i.e., the probability 
of a group, with members arranged in the 
manner of the selected example, choosing 
each alternative) is displayed. The full social 
decision scheme matrices under each model 
for the semantic differential items were 210 
x 7, and for the percentage items they were 
715 X 10; both were too large to be presented 
in toto within Table 3. 

Using the relative frequency with which 
pretest and no-pretest individuals at Period 2 
chose each alternative on an item as estimates 
of p; the theoretical probability with which 
a group chose each alternative, p; was calcu- 
lated using Equations 1 and 2, where the 
matrix, D, was defined by the particular so- 
cial decision scheme from Table 3. This was 
carried through for each model separately on 
each attitude item. The goodness of fit be- 
tween prediction and obtained distributions 
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was tested (throughout) by the Kolmogorov- 
Smirnov test using the test statistic Dmax 
(Siegel, 1956). The tests of the eight models 
were very severe, since (a) the group sample 
sizes were very large (» = 100), and (b) .20 
was chosen as the alpha level.’ With such 
powerful tests, this meant that quite small 
differences between any predicted and ob- 
served distribution resulted in the rejection 
of the entire theoretical social decision scheme 
model for that item. 

The equiprobability, proportionality, major- 
ity-equiprobability, and majority-proportion- 
ality models could be confidently rejected 
(i.e., p < .20) on all eight items. On the four 
percentage items, the median model provided 
an accurate account for the distribution of 
group decision on the pollution item (Dmax 
= .067, ns) but failed on the other three 
percentage items. On the four semantic differ- 
ential items, the median model accurately 
predicted the group decision distribution for 
the New Left item (Dmax = .049, ns) but 
was significantly discrepant on the other three 
items. Although somewhat less accurate, ac- 
ceptable fits were also given by the majority- 
averaging model (Dmax = .07, ns) and aver- 
ing model (Dmax = -09 ns) on the same item, 
but both were rejected confidently elsewhere. 
Finally, the plurality-proportionality model 
provided a statistically acceptable account of 
group decisions on the Frenchmen item. 

To summarize, the median model accu- 
rately accounted for (i.e., was not discrepant 
at the .20 level) the group decision distribu- 


3Note that in standard null hypothesis testing 
situations (e.g., t test or analysis of variance) it is 
“virtuous” that the probability of a Type 1 error 
be chosen small, a commendably conservative deci- 
sion, since rejection of the null hypothesis implies 
support for some psychological hypothesis dear to 
the researcher. In theoretical curve fitting, such as 
the case in hand, conservative practice suggests that 
“alpha level” be chosen large, since acceptance of the 
hypothesis being tested directly implies support for 
the psychological notions at issue. Model tests are 
severe in yet another way. The use of estimates, pj, 
in Equation 2 ignores the presence of any sampling 
error, and the subsequent significance tests thus tend 
to be “overly strong.” The use of very large indi- 
vidual estimation samples is of some aid in meeting 
this problem, but any bias present would tend to 
work against the acceptance of the social decision 
explanation we are proposing. 
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tions on 2/8 of the items, and the plurality— 
proportionality, majority-averaging, and aver- 
aging models for 1/8 of them, The remaining 
four models from those listed in the note to 
Table 3 were always significantly discrepant 
with observed group decision distributions. 
The lack of fit on the percentage items 
could plausibly be attributed to their am- 
biguity for many of the subjects (e.g., several 
did not understand the distinction between 
basic and applied research; the phrase “con- 
trol . . . in the planning and administration 
of a university” also allowed several interpre- 
tations). However, the poor fit on the 
semantic differential items was disappointing 
and unexpected. This might have been due 
to an unfortunate choice of decision schemes; 
however, the decision schemes applied here 
have been successful in previous research and 
span a rather wide range of decision-making 
rules. Another explanation is that the esti- 
mates of p; were biased or inaccurate. The 
between-subjects analysis of variance at 
Period 2 suggested that this may have been 
the case. Recall that the Group—Individual 
X Pretest-No Pretest interactions were gen- 
erally due to significant differences between 
pretest and no-pretest individuals. A distribu- 
tional comparison of pretest and no-pretest 
individuals at Period 2 revealed much the 
same pattern (e.g., on three of the four se- 
mantic differential items the distributions for 
the two conditions differed significantly if the 
alpha level was .20 in a Komogoroy-Smirnov 
two-sample test, see Siegel, 1956). A distribu- 
tional analysis indicated close similarity be- 
tween pretested and nonpretested groups; the 
conditions differed significantly (if « = .20) 
only on the applied research item. To the 
extent that the nonpretested individuals 
were unrepresentative of initial group member 
opinion, their inclusion in the estimation 
sample for p; would have led to poorer deci- 
sion scheme predictions. Other evidence exists 
to support this conjecture. The experimenter’s 
log reports that subjects frequently reacted to 
the trivial filler task with laughter and joking, 
often failing to take their assignment seri- 
ously. According to the log, those who went 
into groups in Period 2 usually became more 
conscientious, but subjects who remained in 
the individual condition often did not. Also, 
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large groups of individuals were run near the 
end of the experiment to fill up the indi- 
vidual conditions. This size factor made main- 
taining order and attention to the tasks in 
the nonpretested individual condition even 
more difficult. 

Therefore, predictions were once again gen- 
erated for each of the eight decision schemes, 
except that now only pretested individuals at 
Period 2 were used to estimate pj. Once again, 
the entire sample of groups was included in 
the criterion distributions. All of the models 
could be confidently rejected (p< .20) on 
each of the percentage items. The equiprob- 
ability, proportionality, and majority—equi- 
probability models failed for all the semantic 
differential items as well. The majority— 
proportionality (Dmax = .082, ms) and the 
plurality-proportionality (Dmax = .043, ns) 
models each gave acceptable fits for the 
Frenchmen item. The median model provided 
acceptable predictions for the Billy Graham 
(Dmax = 066, ns) and Nixon (Dmax = .052, 
ns) items. Finally, the averaging and major- 
ity-averaging models could be rejected only 
for the Frenchmen item (Dmax criteria for 
majority—averaging and averaging models, 
respectively, are as follows: for Billy Graham, 
Dmax = .090 and .105; for New Teh Dae = 
.089 and .089; for Nixon, Dmax = .022 and 
-052). In summary, although no model could 
successfully account for the decisions on the 
percentage items, the majority—averaging and 
averaging models successfully predicted three 
of the four semantic differential items, while 
the median model was acceptable for two 
items. Figure 3 illustrates the close correspon- 
dence between the predictions of the most 
successful model (majority-averaging) and 
the observed data. 

We might now ask how the pattern of 
group-individual distribution differences are 
informative relative to the kind of predictions 
made by the various models, Although we 
could hardly judge any of them to have been 
statistically very satisfactory, which theo- 
retical model’s account was generally closest 
to observed group decision distributions? This 
kind of information is an important guide not 
only to interpretation of results from this 
study but in the search for a more accurate 
theory of group attitudinal decision making 
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Ficure 3. The relative frequency with which groups (solid line) chose each 
of the response alternatives on the semantic differential items. (Predictions 
assuming a majority-averaging model are given by the broken line.) 


in the future. To this end, the relative effec- 
-tiveness of each social decision scheme model 
was assessed by determining the overall 
closeness of each model’s predictions (P1, P2, 
... , Py) to observations (Pi,P2,..., 
Pa) on each item, (Again, predictions were 
based only upon pretested individuals). Thus, 
for each model 


Ú a 
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where n = 7 (semantic differential items) or 
n=10 (percentage items) was calculated 
separately for each item, and these values 
appear in Table 4. Table 4 also shows the 
closeness rankings of models within each of 
the items (columns) and a sum of ranks and 
deviation scores over items of each type. It 
is evident from Table 4 that the majority- 
averaging, median, and averaging models are 
superior to the others for the semantic dif- 
ferential items. Thus, it would be prudent in 
the future to look to hypothetical explana- 


tions of this general sort in seeking descrip- 
tive models for social decision making of this 
kind. Although none of the models provided 
a statistically acceptable fit for any of the 
percentage items, from Table 4 it is clear 
that the plurality-proportionality model was 
closest, overall, on these items. 

Finally, we counted the number of times 
that a pretested group’s decision could be cor- 
rectly predicted by each of several decision 
schemes from a knowledge of the initial dis- 
tribution of member opinion. This analysis 
was undertaken only for the semantic dif- 
ferential items in light of the poor fit of all 
models on the percentage items. The number 
of times each distinguishable” distribution 
occurred was usually quite small (often 
zero), even after pooling across items; there- 
fore, no useful comparisons of the strictly 
“probabilistic” models (e.g., equiprobability, 
proportionality) seemed possible. We thus 
contrasted only those models which made a 
point prediction. Obviously, some of the 
models (e.g., majority, plurality) meet this 
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condition only for certain distinguishable 2 
32222923S distributions (eg, 3-1 or 2-1-1 splits). 
BSE EES Others (e.g., averaging) fail to meet it for oc- 
casional distinguishable distributions (eg. 
[2,2,0,0,0,0,0] on a semantic differential item). 
Generally, a decision scheme makes point 
prediction of the jth response alternative for 
the ith distinguishable distribution if and 
only if dy = 1 and d = 0, kj. Three of 
the decision schemes seemed to show this 
pattern with sufficient frequency to warrant 
comparison: the majority-averaging, aver- 
aging, and median decision schemes. For 4 
each, the percentage of “hits” was tabulated. 

For the four semantic differential items, the 
median model was somewhat superior to the 
majority-averaging and averaging models. 

The median model was accurate on 76 of 118 
(64%) “point predictable” groups decisions. 

The majority-averaging model hit on 93 of 

163 (57%) group decisions, while the aver- 
aging model was correct for 77 of 146 (53%) 4 
decisions. Again note that these comparisons 

are not based upon all of the pretested 
groups’ decisions due to the probabilistic 
nature of many of the decision scheme 
predictions. The accuracy of these three 
models is comparable to that of two extrem- 
ity-variance models tested by Burns (1967). 
Cartwright (1971) has commented that “the 
attempt to make specific predictions is com- 3 
mendable, and Burns’s percentage of success 
provides a baseline against which to compare 
alternative models” (p. 371). It should be 
noted that Burns’s models required two fitted 
parameters not estimatable from initial indi- 
vidual responses. The three models discussed 
above achieved similar accuracy without the 
use of any fitted parameter. 

Space permits consideration of only two 
results from the postexperimental question- 
naires designed to elicit personal impressions 
of group interaction. In order to assess the 
relationship between extremity of attitude 
and attributed persuasiveness, each subject’s i 
perceived persuasiveness score was calculated i 
by summing the ratings given to him by his i 
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three co-workers on a 7-point scale. This per- 
suasiveness score was correlated with the 
absolute value of his original attitude rating 
(i.e., negative scale values were reflected > 
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about the apparent midpoint of the contin- 
uum) aggregated across all eight attitude 
items. Across all subjects the relationship was 
negligible (r = —.08) and remained so even 
when sex and seating position were treated 
as moderator variables. 

Ex-group members also indicated which of 
five descriptions of social decision schemes 
(unanimity, majority, plurality, equiprobabil- 
ity, and dictator) was most similar to the 
manner whereby (a) their group kad reached 
a decision, (b) similar groups typically reach 
such decisions, and (c) similar groups should 
reach such decisions. (Such concepts as aver- 
ages or medians seemed to be either too dif- 
ficult for a verbal translation or too often 
used in a nontechnical sense to be meaningful, 
thus they were not included in the set of 
alternative decision schemes.) A majority plan 
was clearly the most popular answer to all 
three questions. Moreover, its popularity in- 
creased as one moved from reality to the 
ideal. The only competitor was the unanimity 
rule in the first question, and this may have 
been due to groups being instructed at the 
outset to agree unanimously on the position 
decided, which is, after all, always a neces- 
sary second step for reporting a decision ir- 
regardless of the social process that selected 
that alternative. 


Discussion 


The generally significant individual-group 
mean differences obtained in our analyses of 
variance (both between and within subjects) 
appear to confirm the notion that the choice 
shifts often found to chraacterize group risk 
taking are manifest in similar kinds of groups 
choosing an attitudinal position. Although 
similar differences in group problem solving 
have been known for years (cf. Lorge et al., 
1958; Davis, 1969), the idea that important 
decisions, especially those entailing risk, might 
be “biased” by group discussion was some- 
what unsettling. When Moscovici and his 
co-workers demonstrated similar individual- 


group differences in attitudinal decision 


making, not only was the more general nature 
of the phenomena recognized, but attention 
was then called to group decisions concerning 
attitudes, an important but neglected issue in 
its own right. 


The initial analyses of individual-group 
mean differences, which were generally sig- 
nificant in this study, could also lead to a 
similarly unsettling suspicion that group dis- 
cussion, and consequently the confrontation 
of different views, may not necessarily impose 
restraint or foster a more moderate position. 
But rather, interaction may incline members 
yet further along the attitudinal continuum in 
the direction of the position generally favored 
by group members. The important implica- 
tion of such an effect was clearly recognized 
at the outset by Moscovici and Zavalloni 
(1969), who labeled the effect “polarization.” 

Polarization was intended, of course, as a 
label and not an explanation. The complexity 
of the quest for an explanation is well illus- 
trated by considering the nature of the whole 
data set rather than sample means alone. 
When for some item we examined the indi- 
vidual and group mean decisions along with 
the relevant distributions, the descriptive in- 
adequacy of means alone was evident. From 
the distributional perspective, the theoretical 
question became how the group “got that 
way” in view of the individual distribution 
descriptive of member preferences at the be- 
ginning of interaction. In view of earlier re- 
ports, one might plausibly suspect some form 
of a plurality or majority process to be a 
likely candidate, since such interaction proc- 
esses would decrease any (unimodal) skew- 
ness in the group distribution which had been 
found in the individual distribution and 
would move means toward the mode. (Sym- 
metrically distributed individuals would result 
in the group distribution mode being exag- 
gerated at the same location, but individual- 
group means would not differ. However, we 
did find what appeared at first to be such 
differences due to imperfect symmetry in the 
two distributions on the Frenchmen item. 
And this again illustrates the importance of 
inspecting whole distributions.) However, the 
general movement of group modes back 
toward the center of the scale initially 
casts doubt on the majority or plurality 
explanations. 

The next step then was to find the best 
approximation to the group distribution from 
a comparison with predictions from several 
common social decision schemes. Although not 
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acceptably accurate as a descriptive model, 
a majority (if one exists)—averaging other- 
wise social decision scheme generally provided 
the closest account of the group distributions 
across the semantic differential items. Also, 
such a model squares with the subjects’ post- 
group perceptions that a majority plan was 
most often what their group followed. Fur- 
thermore, this result is consistent with other 
recent group decision-making research (e.g., 
Davis et al., 1974; Davis et al., in press) in 
which some form of a majority model was 
superior to several alternative models. 

Up to this point we have placed little em- 
phasis upon the proportion of logically pos- 
sible distinguishable distributions of opinion 
within a group in which a majority does not 
exist and a subscheme must therefore be em- 
ployed. Recall that there are 210 distinguish- 
able distributions of internal opinion possible 
on a semantic differential item, yet only 49 
of these are majorities. In other words, 77% 
of the possible arrays of member opinion 
must be adjudicated by the subscheme. For 
the percentage items, only 96 out of 715 of 
the distributions permit a majority primary 
social decision plan to operate on initial pref- 
erences, and thus in 87% of the possibilities 
the subscheme is in operation, This theoreti- 
cal emphasis on subschemes was actually 
reflected in our groups. On the semantic dif- 
ferential items, for example, a majority ex- 
isted at the outset of discussion for only 22% 
of the instances for which complete data were 
available. Thus, it is quite reasonable that 
while the majority scheme is pushing group 
decisions in one direction (viz., exaggerating 
departures from uniformity in the individual 
distribution), the averaging subscheme could 
be pushing in quite another (viz., toward the 
average). We have, of course, investigated 
only three subschemes acting in conjunction 
with the primary plan: averaging, proportion- 
ality, and equiprobability, all of which have 
quite different effects. We limited ourselves 
to these three because there is some justica- 
tion from past research with other kinds of 
tasks and because these subschemes are 
socially interpretable means of aggregating 
individual member preferences.* 

One additional possibility might be to con- 
sider that positions (alternatives) sometimes 
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differ in the degree to which an advocate of 
that position yields or acquiesces during the 
discussion producing the decision. Davis 
(1973, pp. 123-124) has suggested that some 
tasks might foster social processes of this 
sort and called them “pattern-weighting” 
models. The idea is that some positions or 
alternatives, when advocated, may be per- 
ceived by fellow members as more “impor- 
tant” or “deserving” of selection; or, some 
positions, when advocated by one or more 
persons, may embolden their advocators to 
yield less than others holding less “favored” 
alternatives. 

In the experiment at hand, the permissible 
alternatives were ordered in the sense that 
one alternative represented more or less 
of something than another alternative. The 
simplest possibility would be that the more 
extreme the position advocated, the more 
persuasive and/or unyielding its advocate 
could be. Yet recall that extremity of position 
was not significantly correlated with perceived 
persuasiveness. More importantly, as re- 
peatedly observed above, the group decision 
distribution did not generally move toward 
scale endpoints. 

One alternative to the simple extremity- 
persuasiveness hypothesis would be to invoke 
the idea that a position’s persuasive force is 
related to its perceived popularity in the 
population. This latter notion is consistent 
with Brown’s (1965) cultural value hypothe- 
sis. Of course, we are concerned here with 
perceived relative popularity of positions, not 
necessarily the true distribution of opinion. 
It is easy to imagine that most persons have 
engaged in some discussion or other that 
would give one some idea of the prevailing 
opinion among one’s peers concerning regard 
for Nixon, the New Left, Billy Graham, and 
Frenchmen. Unfortunately, it is not so easy 
to imagine a similarly well-informed subject 
who is able to respond with confident accu- 
racy about the population opinion distribution 
concerning the percentage of the university 


* It is obviously not difficult to find some social 
decision scheme matrix, D, such that when the 
calculations of Equation 2 are carried through, 
predicted and observed group decision distributions 
are suitably close. But any such empirical “exercise” 
could leave us with a meaningless but good fit. 
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budget to be spent on applied as opposed to 
basic research. The same ignorance seems 
likely to occur with regard to the pollution, 
university control, and studying issues. The 
problem is not so much that these latter 
issues are unimportant to the average under- 
graduate as that the form of the response 
required is unfamiliar. 

In order to dsecribe more formally the 
norm-weighted model just sketched, consider 
first the familiar arithmetic average. An aver- 
age model would assign the group decision, 
with probability near 1, to that alternative 
nearest the mean, 


M = (1/7) Z ritn 
ja 


where x, is the scale value of Aj; r=0, 
12,3, s a O r and 


Lier. 


ja 


Suppose now that each alternative has some 
weight, wy, reflecting the perceived impor- 
tance or popularity of that alternative, 
where .00 < wy < 1.00, and 


Z w = 1.00, 


ma 


We could then assume that the group chooses, 
with probability near 1,00, the alternative, Aj, 
that is nearest 


At = È rwa / È rav), 


called a norm-weighted average in this context. 

In this experiment, estimates of w, could be 
obtained for each item from the independent 
sample of individuals who at Period 3 marked 
in each space on the response scale their 
judgment of the proportion of the population 
who would have chosen that alternative (for 
the reasons elaborated above, only pretested 
individuals were considered). These judg- 
ments were normalized for each subject (thus 
ensuring that each set of responses would sum 
to one) and averaged across individual sub- 
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jects in that condition to give the estimated 
weights, ty. 

Two versions of the norm-weighted model 
were calculated for each of the eight items. 
In one version, the predictions were made 
assuming the expression given in the preced- 
ing paragraph. In the other, the norm- 
weighted model was treated as a subscheme, 
to apply only if no majority were present, 
although we have seen that a majority does 
not exist for most of the possible distinguish- 
able distributions of initial member opinion, 
For the semantic differential items, this ma- 
jority norm-weighted average otherwise model 
gave a very accurate description of the group 
distributions for the New Left and French- 
men items, but its predictions differed sig- 
nificantly for Billy Graham (Dmax = 125, 
p < 10) and Nixon (Dmax = 162, p < 05). 
The norm-weighted model was comparably 
accurate; its predictions differed from the 
observed data on both the Billy Graham and 
Nixon items (Dmax = .129 and .161, p < .01 
and p < .05, respectively). Based on the ini- 
tial distribution of group opinion, point predic- 
tions under the majority norm-weighted model 
were computed for every pretested group. For 
the four semantic differential items, this 
model correctly predicted the group decision 
for 110 of 194 (579%) decisions, Although 
this is slightly poorer in terms of the per- 
centage of hits than that for the median 
model in a similar analysis, the median model 
was unable to make a point prediction for 
many cases, and hence the direct comparison 
is not appropriate. When predictions were 
made under the majority norm-weighted 
model for only those groups for which the 
median model could make a point prediction, 
a hit rate (64% or 75 of 118 decisions) 
closer to the median model's performance 
was obtained. 

Since the normative weights for each item 
represented average values, a failure of the 
norm-weighted models could reflect (among 
other things) the inadequacy of such aver- 
aged weights to take into account large indi- 
vidual differences in perceived norms. To 
investigate this possibility, the mean value of 
each subject's perceived norm was calculated. 
Then the within-items variance of these 
means was computed. Consistent with the 
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results of the test of the norm-weighted deci- 
sion schemes, the pooled variance for the 
Nixon and Billy Graham items was signifi- 
cantly greater than the pooled variance of the 
New Left and Frenchman items, F (97,98) 
= 2.863, p < .01. Whether or not the poor 
fit for these items was due to the lack of 
common percevied norms as well as the more 
interesting question of the role of conflicting 
perceptions of reference group norms on 
group decisions should be the focus of further 
research. 

The results for the percentage items were 
quite different in that both norm-weighted 
models were confidently rejected for all items 
(p < .01, in all cases). It thus seems prudent 
to distinguish between the two kinds of items, 
perhaps along the lines described above. Of 
course, the reason for including two types 
of items in the first place was to ascertain 
the effects of variations in response format. 
Future research might profitably address it- 
self to studying variations in item type as 
well as the obvious variations in social con- 
tent or attitude object. For example, Fishbein 
(1967) has distinguished between the en- 
dorsement of belief statements and attitude 
statements. He has suggested that, although 
related, beliefs and attitudes reflect different 
psychological processes. In the present study, 
it may be that the semantic differential items 
require evaluations of psychological objects, 
while the percentage items stimulate expres- 
sions of belief. As a result, the social decision 
schemes underlying group responses to the 
two kinds of items might differ considerably, 
and it will be necessary in the future to in- 
clude such notions in providing a rationale 
for the relevant social process theory. 

The moderate success of the norm-weighted 
models on the attitude items is compatible 
with several theories of the decision-making 
process. For example, as suggested by Brown’s 
(1965) “value” hypothesis, group members 
may more willingly defer to advocates of 
positions they perceived to be popular, and/or 
advocates of popular positions may be more 
willing to state and defend their views. To 
examine these issues, the deviation of each 
pretested group member's opinion from the 
mean of the normative distribution was calcu- 
lated for each semantic differential item. The 
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sum across items of the absolute values of 
these scores indexed each subject’s overall 
closeness to the perceived norms. These scores 
were then correlated with the subjects’ per- 
suasiveness scores and with a similarly de- 
rived score reflecting each subject’s contribu- 
tion to discussion. Both correlations were 
quite small (r = .07 for persuasiveness and 
r= .01 for contribution) and nonsignificant. 
Thus, these specific hypotheses were not sup- 
ported. However, this pattern of results illus- 
trates how a social decision scheme analysis 
can help delimit the range of plausible 
psychological explanations of the choice shifts 
observed. 

Moscovici et al. (1972) ended their recent 
article on group polarization by concluding: 
Generally it is considered proper to end an article 
by pointing out that much remains to be done and 
that many problems remain to be explored. The 
truth of the matter is that such an effort would 
make little sense unless a specified model for group 
decisions can be elaborated. If one limits oneself to 
the existing models, notably the conformity models, 
applying them over and over again to phenomena 
that do not fall under their scope, one will be 


doomed to fruitless repetitions and the exploration 
of non-existent problems (p. 397). 


We find ourselves in substantial agreement, 
especially with regard to the desirability of 
specific models that permit actual comparison 
and contrast between prediction and data. 
One problem with such ideas as “diffusion of 
responsibility” or even general “conformity” 
has been that they are not sufficiently explicit 
to permit point predictions; it is thus diffi- 
cult to test them against data. Even an in- 
correct but explicit theory may be valuable 
in pointing the way toward more accurate 
accounts when treated as a baseline in this 
way. Moreover, some internal distributions 
of member opinion may possess theoretically 
more interesting implications for the inter- 
action—consensus process than other internal 
arrays. The experimental manipulation of 
group composition, while not attempted here 
might then permit not only more powerful 
tests of social decision scheme features but 
results that bear directly on such issues as 
minority acquiescence-conflict with a major- 
ity, degree of postgroup opinion change as a 
function of original opinion distribution type. 
and so on. ‘ 
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One hundred male and 83 female subjects were assessed on absorption (i.e. 
involvement in everyday imaginative activities), attitudes toward hypnosis, 
and hypnotic suggestibility. Significant positive correlations were obtained for 
both sexes between absorption and attitudes and between each of these varia- 
bles and hypnotic suggestibility. Multiple regression analyses indicated that 


most of the predicted variance in hypno 


tic suggestibility scores was accounted 


for in both sexes by the absorption variable. 


Tn the last few years investigators of widely 
differing theoretical orientations have begun 
to consider imaginative activities as impor- 
tant to an understanding of the cognitive 
processes underlying hypnotic suggestibility 
(Spanos & Barber, 1974). One of the empiri- 
cal mainstays of this theoretical convergence 
has been the demonstration that hypnotic 
suggestibility is related to subjects’ tendency 
to become “involved” or “absorbed” in every- 
day imaginative activities such as reading a 
novel, watching or acting in dramatic produc- 
tions, and the like (As, 1962; As, O’Hara, & 
Mungar, 1962; Barber & Glass, 1962; Hil- 
gard, 1970; Lee-Teng, 1965; Shor, Orne, & 
O’Connell, 1962; Tellegan & Atkinson, 1974; 
Spanos & McPeake, in press). Self-report 
questionnaires have been the most common 
procedures for assessing imaginative involve- 
ments of this type. 

A separate body of research has investi- 
gated the role of subjects’ attitudes toward 
hypnosis in determining hypnotic suggestibil- 
ity. Again, a common strategy for studying 
this issue has involved correlating self-report 
assessments of subjects’ attitudes toward hyp- 
nosis with hypnotic suggestibility. In general, 
the results of these studies have been con- 
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sistent in indicating small to moderate posi- 
tive correlations between the extent of favor- 
able attitudes toward hypnosis and hypnotic 
suggestibility (Barber & Calverley, 1966; 
London, Cooper, & Johnson, 1962; Melei & 
Hilgard, 1964; Shor, Orne, & O'Connell, 
1962). 

Despite the importance of both involvement 
in everyday imaginative activities and favor- 
able attitudes toward hypnosis in predicting 
hypnotic suggestibility, few studies have sys- 
tematically examined the interrelationships of 
these two predictor variables. It may be, for 
example, that involvement in everyday 
imaginative activities is related to hypnotic 
suggestibility only when subjects’ attitudes 
toward hypnosis are relatively favorable 
(Barber, Spanos, & Chaves, 1974). On the 
other hand, favorable attitudes toward hyp- 
nosis may predict hypnotic suggestibility 
only when subjects possess a set of imagina- 
tive skills that enables them to experience 
suggested effects. The present study was de- 
signed to make initial inroads into under- 
standing the interrelationships of these vari- 
ables. More specifically, it was designed to 
provide information regarding the following 
empirical questions: 

(1) Can earlier findings be replicated which 
demonstrated positive correlations between in- 
volyement in everyday imaginative activities 
and hypnotic suggestibility and between fa- 
vorable attitudes toward hypnosis and hyp- 
notic suggestibility? 

(2) Do involvement in everyday imagina- 
tive activities and favorable attitudes toward 
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EVERYDAY ĪMAGINATIVE Activities AND HYPNOSIS 


hypnosis constitute independent dimensions, 
or instead are scores on one variable system- 
atically related to scores on the other? 

(3) What is the combined power of in- 
volvement in everyday imaginative activities 
and favorable attitudes toward hypnosis for 
predicting hypnotic suggestibility? 


METHOD 
Subjects and Procedure 


One hundred eighty-three community college stu- 
dents (100 males, 83 females; ages 18—48) volun- 
teered to participate without pay in a group hyp- 
nosis experiment. Subjects were tested in classroom 
groups which ranged in number from 10 to 28. All 
subjects were tested in two separate sessions. During 
the first session they were administered a question- 
naire devised by Tellegan and Atkinson (1974) to 
assess involvement in everyday imaginative activities. 
In the second session they were administered a scale 
devised by Barber and Calverley (1966) to assess 
attitudes toward hypnosis and the Harvard Group 
Scale of Hypnotic Susceptibility (Shor & Orne, 
1962). The Harvard scale was administered via 
auditotape to ensure standardization of presentation.+ 


Assessment of Variables 


Tellegan and Atkinson involvement in everyday 
imaginative activity (absorption) questionnaire. 
Tellegan and Atkinson have devised a self-report 
questionnaire which asks subjects to indicate the 
extent to which they become involved or absorbed 
jn activities such as daydreaming, reading a novel, 
being moved by poetry, acting in a dramatic produc- 
tion, and the like. Scores on this questionnaire can 
range from a high of 40 to a low of 0. Thorkelson 
(1973) has reported that this instrument correlates 
to a moderate degree with several measures of 
creativity and fantasy activity, but is independent of 
introversion and neuroticism measures. 

Attitudes toward hypnosis scale. The scale used to 
assess subjects’ attitudes toward hypnosis was taken 
verbatim from Barber and Calverley (1966). These 
investigators modified an earlier attitude scale de- 
vised by Melei and Hilgard (1964). Barber and 
Calverley’s modified scale is a Likert-type question- 
naire consisting of six attitudinal subdimensions. 
Each of these subdimensions js scored on a 5-point 
subscale, and these subscale scores are summed to 
yield a single attitude score for each subject. Attitude 
scores can range from a high of 24 to a low of 0. 

Harvard Group Scale of Hypnotic Susceptibility. 
The Harvard scale (Shor & Orne, 1962) is a modifi- 
cation of the Stanford Hypnotic Susceptibility Scale 
(Weitzenhoffer & Hilgard, 1959) which can be self- 
scored by the subject and, thereby, administered in 
group settings. This instrument consists of a stan- 


1We thank Walter Carter for assistance in running 
subjects. 
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dardized hypnotic induction procedure immediately 
followed by the administration of 12 standardized 
suggestions which assess responses such as eye 
catalepsy, amnesia, and hallucination. Performance 
on this instrument correlates highly with perform- 
ance on individually administered hypnotic suscepti- 
bility scales (Shor & Orne, 1962). 


RESULTS 
Means on All Variables 


Table 1 presents the overall means for all 
subjects on each of the variables assessed. 
Also presented are the means for each varia- 
ble taken separately for male and female sub- 
jects. As this table shows, ¢ tests indicated 
that male and female subjects differed signifi- 
cantly from one another only on the absorp- 
tion variable. Females achieved significantly 
higher absorption scores than did males. Males 
and females did not differ from one another 
on hypnotic suggestibility or on their attitude 
toward hypnosis. 


Correlations Among All Variables 


Table 2 presents the correlations among the 
variables for the overall sample and for the 
sample broken down by sex. The most im- 
portant findings in this table relating to the 
overall sample indicate the following statisti- 
cally significant relationships: (a) a moderate 
positive correlation between absorption and 
hypnotic suggestibility, (b) a small positive 
correlation between attitude and hypnotic sug- 
gestibility, and (c) a small positive correla- 
tion between absorption and attitude. 

Table 2 also provides the correlations 
among the variables broken down by sex. The 
main findings for the two sexes are similar 
to one another and similar to the findings 
observed when the sexes are combined. Ab- 
sorption and hypnotic suggestibility correlated 


TABLE 1 


OVERALL MEAN SCORES AND MEAN SCORES 
Broken Down BY SEX 


Variable X Overall X Males ¥ Females 
Hypnotic 
suggestibility 6.5 6.4 6.7 
Absorption 28.1 27.0% 29.58 
Attitude 19.5 19.5 19.4 


a These means differ from one another at .05 level of signi- 
ficance. All tests are two-tailed. 
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TABLE 2 


CoRRELATIONS FOR THE OVERALL SAMPLE AND THE 
MALE AND FEMALE SAMPLES 


Variable | Absorption Attitude 
Hypnotic 43"* Tp 
Suggestibility TI fs 

(.43)"* (.23)* 

Absorption .26** 
“32** 

(.21)* 


Note. Within each row the top, underlined number represents 
lation for the overall sample, the middle number the 
for the male sample, and the lower number in 
ses the correlation for the female sample. 

.05, two-tailed. 
** 001, two-tailed. 


moderately and in a positive direction for 
both sexes. Attitude and hypnotic suggesti- 
bility also correlated positively with one an- 
other for both sexes. Finally, both sexes ex- 
hibited a positive correlation between absorp- 
tion and attitude. 

The correlations achieved by males and 
females between hypnotic suggestibility and 
the other two variables were compared. None 
of these differences achieved significance. In 
other words the magnitude of the correlations 
between hypnotic suggestibility and absorp- 
tion and between hypnotic suggestibility and 
attitude did not differ significantly between 
males and females. 


Multiple Correlations Among Hypnotic 
Suggestibility, Absorption, and Attitude 


In order to determine the combined ability 
of absorption and attitude for predicting 
hypnotic suggestibility, multiple correlations 
were carried out between these variables for 
the overall sample and for the sample broken 
down by sex. Table 3 presents the data rele- 
vant to these multiple correlations. As this 


TABLE 3 
MULTIPLE REGRESSION ANALYSIS 


Beta weight 


Sue Cumulative | 
Absorption | Attitude 
Overall” 46 38; aes 
Male? A5 29 | 26 
Female® 50 A5 | 13 


NıcHoras P. Spanos AnD Jonn D. McPEAKE 


table indicates, most of the predicted variance 
in hypnotic suggestibility scores was ac- 
counted for by the absorption variable. More 
specifically, the absorption variable accounted 
for 75% of the predicted variance in hyp- 
notic suggestibility scores for the overall 
sample, 55% of the predicted variance for 
the male sample, and 88% of the predicted 
variance for the female sample. Analyses of 
variance were carried out to determine 
whether there was a significant loss in ability 
to predict hypnotic suggestibility when atti- 
tude was eliminated from the regression equa- 
tions. As indicated in Table 3, elimination of 
this variable significantly lowered predictive 
power for the overall sample and also for the 
male sample. However, predictive power was 
not significantly reduced when this variable 
was eliminated from the female sample. In 
short, these results indicate that for the over- 
all sample and also for the male sample, both 
the absorption and attitude variables con- 
tributed significantly to the prediction of 
hypnotic suggestibility, with absorption as 
the stronger of the predictor variables. In the 
female sample, however, inclusion of the atti- 
tude variable did not significantly add to the 
power of the absorption variable in predict- 
ing hypnotic suggestibility.* 


Discussion 


The results of the present study replicated 
several important research findings. More 
specifically, these results supported the find- 
ings that: (a) a positive correlation occurs 
between measures of involvement in everyday 
imaginative activities and hypnotic suggesti- 
bility (As, 1962; As et al., 1962; Lee-Teng, 
1965; Shor et al., 1962; Tellegan & Atkinson, 
1974; Spanos & McPeake, in press); (b) the 
correlation between these two variables oc- 


2 Complementary results were obtained when these 
data were analyzed with partial correlations. The 
correlations between absorption and hypnotic sug- 
gestibility with attitude partialed out were significant 
for the total sample (r= .38, p < 001) and also for 
the male sample (r=.30, p< .01) and female 
sample (r= .45, p < .001). The correlations between 
attitude and hypnotic suggestibility with absorption 
partialed out were significant for the total sample 


(1, 181) = 7.62, p <.01. 
4 98) 18, p < 01. 


sF 
bP}, = 
e F(1, 81) = 1.82, ns. 


(r=.20, p < .01) and the male sample, (r=.26, 
$ < .01), but were nonsignificant for the female sam- 
ple (r = .15, p > .05). 
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curs in both male and female samples (Telle- 
gan & Atkinson, 1974); (c) a positive corre- 
lation is obtained between the degree of 
favorable attitudes toward hypnosis and 
hypnotic suggestibility (Barber & Calverley, 
1966; London et al., 1962; Melei & Hilgard, 
1964; Shor, Orne, & O’Connell, 1962); and 
(d) male and female college students do not 
differ in the level of hypnotic suggestibility 
which they achieve (Hilgard, 1965). 

The original findings of this study more 
fully explicate the relationships between in- 
volvement in everyday imaginative activities, 
favorable attitudes toward hypnosis, and 
hypnotic suggestibility. First, these findings 
indicate subjects high on absorption also tend 
to have favorable attitudes toward hypnosis 
whereas low absorption subjects tend to hold 
less favorable attitudes toward hypnosis. One 
interpretation of these findings indicates that 
subjects who enjoy exercising their imagina- 
tive skills in a wide variety of daily life situ- 
ations also look forward to, and do not feel 
threatened by, the opportunity to become in- 
volved in more unusual psychological activi- 
ties (i.e., the hypnotic situation). 

Second, absorption continues to predict 
hypnotic suggestibility when the influence of 
‘attitude toward hypnosis has been statistically 
eliminated. This result means that the often 
replicated finding of a relationship between 
involvement in everyday imaginative activi- 
ties and hypnotic suggestibility cannot be 
explained away in terms of subjects’ attitudes 
toward hypnosis. These findings also add 
weight to the more general notion propounded 
by numerous investigators, that imaginative 
processes play an important role in determin- 
ing subjects’ response to test suggestion 
(Spanos & Barber, 1974; Spanos, DeMoor, & 
Barber, 1973). The fact that the correlations 
between absorption and hypnotic suggestibil- 
ity were significant for both sexes after elimi- 
nation of the attitude variable further indi- 
cates that similar cognitive processes underlie 
at least part of the hypnotic performance of 
males and females. 

The relationships between attitude toward 
hypnosis and hypnotic suggestibility are more 
complex. When only males are considered, the 
attitude variable adds significantly to the 
power of the regression equation in predicting 
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hypnotic suggestibility. However, when fe- 
males are considered alone, addition of the 
attitude variable does not significantly affect 
the regression equation. Although this finding 
may indicate that attitudes play a less potent 
role in determining the hypnotic suggestibil- 
ity of females than of males, such an inter- 
pretation should be considered highly tenta- 
tive until the finding is replicated. The im- 
portance of exercising caution before accept- 
ing such an interpretation is highlighted by at 
least two other findings. First, Melei and Hil- 
gard (1964) found attitudes toward hypnosis 
to predict hypnotic suggestibility in females 
but not in males, Second, Barber and Calver- 
ley (1964a, 1964b) found that female sub- 
jects inculcated experimentally with negative 
attitudes about the test situation exhibited 
lower levels of suggestibility than did sub- 
jects given positive attitudes. 

In summary, the findings of the present 
study support the contention that both in- 
volvement in everyday imaginative activities 
and attitudes toward hypnosis are important 
variables in determining hypnotic suggestibil- 
ity. These findings further indicate that these 
two variables do not constitute independent 
dimensions but are, instead, positively cor- 
related with one another. Nonetheless, each of 
these variables plays some role in determin- 
ing hypnotic suggestibility. 


REFERENCES 


As, A. Non-hypnotic experiences related to hypno- 
tizability in male and female college students. 
Scandinavian Journal of Psychology, 1962, 3, 112- 
121. 

As, A., O'Hara, J. W., & Munger, N. P. The mea- 
surement of subjective experiences presumably 
related to hypnotic susceptibility. Scandinavian 
Journal of Psychology, 1962, 3, 47-64, 

Barber, T. X., & Calverley, D. S. Empirical evi- 
dence for a theory of “hypnotic” behavior: Effects 
of pretest instructions on response to primary sug- 
gestions. Psychological Record, 1964, 14, 457-467. 
(a) 

Barber, T. X., & Calverley, D. S. The definition of 
the situation as a variable affecting “hypnotic- 
like” suggestibility. Journal of Clinical Psychol- 
ogy, 1964, 20, 438-440. (b) 

Barber, T. X., & Calverley, D. S. Hypnotizability, 
Suggestibility and personality: II. Assessment of 
previous imaginative-fantasy experiences by the 
As, Barber-Glass, and Shor questionnaires. Jour- 
nal of Clinical Psychology, 1965, 21, 57-58. 


598 


Barber, T. X., & Calverley, D. S. Toward a theory 
of hypnotic behavior: Experimental evaluation of 
Hull’s postulate that hypnotic susceptibility is a 
habit phenomenon. Journal of Personality, 1966, 
34, 416-433. 

Barber, T. X., & Glass, L. B. Significant factors in 
hypnotic behavior. Journal of Abnormal and So- 
cial Psychology, 1962, 64, 222-228. 

Barber, T. X., Spanos, N. P., & Chaves, J. F. 
Hypnosis, imagination, and human capabilities, 
New York: Pergamon Press, 1974. 

Hilgard, E. R. Hypnotic susceptibility. New York: 
Harcourt, Brace & World, 1965. 

Hilgard, J. Personality and hypnosis. Chicago: Uni- 
versity of Chicago Press, 1970. 

Kinney, J. C. M. Modification of hypnotic suscepti- 
bility. Unpublished doctoral dissertation, Stanford 
University, 1969. 

Lee-Teng, E. Trance-susceptibility, induction-sus- 
ceptibility, and acquiescence as factors in hypnotic 
performance. Journal of Abnormal Psychology, 
1965, 70, 383-389. 

London, P., Cooper, L. M., & Johnson, H. J. Sub- 
ject characteristics in hypnosis research: II. At- 
titudes toward hypnosis, volunteer status, and 
personality measures. III. Some correlates of 
hypnotic susceptibility. Jnternational Journal of 
Clinical and Experimental Hypnosis, 1962, 10, 
13-21. 

Melei, J. P., & Hilgard, E. R. Attitudes toward hyp- 
nosis, self-predictions and hypnotic susceptibility. 
International Journal of Clinical and Experimental 
Hypnosis, 1964, 12, 99-108. 


Nıcmoras P. Spanos AnD Joun D. McPEAKE 


Shor, R. E., & Orne, E. C. The Harvard Group Scale 
of Hypnotic Susceptibility: Form A. Palo Alto, 
California: Consulting Psychologists Press, 1962. 

Shor, R. E., Orne, M. T., & O'Connell, D. N. Vali- 
dation and cross-validation of a scale of self-re- 
ported personal experiences which predicts hypno- 
tizability. Journal of Psychology, 1962, 53, 55-75. 

Spanos, N. P., & Barber, T. X. Toward a converg- 
ence in hypnosis research. American Psychologist, 
1974, 29, 500-511. 

Spanos, N. P., DeMoor, W., & Barber, T. X. Hyp- 
nosis and behavior therapy: Common denomi- 
nators. American Journal of Clinical Hypnosis, 
1973, 16, 45-64. 

Spanos, N. P., & McPeake, J. D. The interaction of 
attitudes toward hypnosis and involvement in 
everyday imaginative activities on hypnotic sug- 
gestibility. Amerian Journal of Clinical Hypnosis, 
in press. 

Tellegan, A., & Atkinson, G. A. Openness to ab- 
sorbing and self-altering experiences, a trait re- 
lated to hypnotic susceptibility. Journal of Ab- 
normal Psychology, 1974, 83, 268-277. 

Thorkelson, K. E. The relationship between hypnotic 
susceptibility and certain personality, physiologi- 
cal and electroencephalographic variables in mono- 
zygotic and dizygotic twin pairs. Unpublished 
gaira dissertation, University of Minnesota, 
1973. 

Weitzenhoffer, A. M., & Hilgard, E. R. Stanford 
Hypnotic Susceptibility Scale, Form A, Palo Alto, 
Calif.: Consulting Psychologists Press, 1959. 


(Received September 14, 1973) 


(Continued from page 573) 


Personality Research: Components of Variance Attributable to the Person and the Situa- 
tion. Irwin G. Sarason (Department of Psychology, University of Washington, Seattle, 
Washington 98105), Ronald E. Smith, and Edward Diener. 4 

Salience of Reward and Intrinsic Motivation, Michael Ross (Department of Psychology; 
University of Waterloo, Waterloo, Ontario, Canada N2L 3G1). $ 

Hypothetical and Actual Moral Reasoning in a Situation of Civil Disobedience, Norma 
Haan (Institute of Human Development, University of California, 1203 Edward Chace 


Tolman Hall, Berkeley, California 94720). 


Evaluation Apprehension in Verbal Conditioning: A Test of Four Subject Effects Models. 
Steven J. Kingsbury, David P. Stevens, and Edward J. Murray (Department of 
Psychology, University of Miami, Coral Gables, Florida 33124). 

The Importance of Extremity. Peter Warr (Medical Research Council Social and Applied 
Psychology Unit, Department of Psychology, University of Sheffield, Sheffield S10 2TN, 


England) and Paul Jackson. 


ODERN LIFE 


Patricia Niles Middlebrook, Central Connecticut State College 


“Yt has remained for Pat Middlebrook to show the way in which humanism, social 
ance and the research tradition can be effectively integrated in an introductory social 
only sorry that her text was not available to introduce me to this 
exciting field when I was a student.” 
Phillip Zimbardo, Stanford University 


relev: 
psychology text. I’m 


Topically organized, this comprehensive book provides in-depth treatment of all the 
arch areas including attitude formation and change, affiliation, aggression, 


major rese 
ior. Middlebrook discusses a great many of 


altruism, person perception, and group behav 
the major experiments in Social Psychology; she develops the student’s faculties for 

evaluating research and encourages him to think in research terms. Her research is up-to- 

date with many 1973 and “in press” references. She also emphasizes areas of interest to 


students and instructors in, for example, her chapters on the dyad and on altruism. 


for the book are 150 illustrations that are closely related to the 
ade in the text; discussion questions at the 
over 1,000 references; 


Included in the apparatus 
text; numerous boxes expanding on points m 

end of each of the eleven chapters; a glossary; a bibliography with 
summaries at the end of each chapter and at the end of each chapter within chapters. 


Please write for an examination copy giving course title, enrollment, and decision date. 


1973 700 pages. Order Code 31726. 


ALFRED A.KNOPF 


The College Department 400 Hahn Road, Westminster, Maryland 21157 


Thesaurus of 
Psychological 
Index Terms 


Contains over 4,000 terms representing an 
authoritative, unambiguous, and hierarchically 
structured vocabulary used by psychology and 
related disciplines. This vocabulary will facilitate 
precise content representation of concepts de- 
scribed in psychological literature and allow for 
more flexibility in automated search and retrieval 
operations, These two factors promote more ef- 
fective communication of psychological concepts 
in original works and research, ultimately aiding 
the content of publications to be focused in accord- 
ance with the interest patterns of psychologists. 


This Thesaurus, developed by the Psychological 
Abstracts Information Services, is divided into 
three sections: 


The relationship section shows the relation- 
ships among concepts that are associated with 
each other. Term relationships noted are: 
synonymous, broader, narrower, or related. 


The alphabetical section presents each prefer- 
red term in alphabetical order, enabling rapid 
identification of terms used for indexing. 


The hierarchical section contains 17 broad con- 
tent areas in which terms are presented in de- 
scending order according to breadth of concept 
represented. The 17 categories correspond to 
major classifications used in Psychological 
Abstracts. 


The relationship, alphabetical, and hierarchical 
sections will assist the user in selecting appro- 
priate terms for different uses. 

Administrators 

© determine areas of research for an organization 

e assure representative expertise in personnel 
selection 

Researchers and Authors 

e find synonyms for a term representing a concept 

è select terms that represent a concept 

e define subject matter topics for a paper or book 

Educators 

© structure a curriculum 

© use as a tutorial aid 

Consultants 

© search the literature 

e aid in retrieval of information 

Librarians 

e provide access to psychological literature 

e aid in cataloging psychology material 

e aid in ordering new psychological literature 

Students 

e locate concepts relevant to psychology 

e focus areas of study 


The Thesaurus contains 350 pages with a hard- 
cover and costs $12. Please note that all orders 
amounting to $15 or less must be accompanied 
by full payment except for those orders submitted 
on institutional purchase order forms. 


American Psychological Association 
Publication Sales Department X 
1200 17th Street, N.W. 

Washington, D.C. 20036 


(Contents continued) 


Positive and Negative Reactions to Male and Female Success and Failure in Relation 
to the Perceived Status and Sex-Typed Appropriateness of Occupations/N, T. 
cs T 

Defensive Externality and Its Relation to Academic Performance/Terry J. Prociuk 
inte OTS T. Prociuh 


... And Some Men Have Leadership Thrust upon Them/Lawrence Beckhouse, 
Judith Tanur, John Weiler, ond Eugene Weinstein aee irana, y o OUS, 


Witnessed Aggression: Influence of an Observer's Sex and Values on Aggressive Re- 
ee Cr ee i. Esayo Re 


Group Position as a Function of Member Attitudes: Choice Shift Effects from the 
Perspective of Social Decision Scheme Theory/Norbert L. Kerr, James H. Davis, 


Gaile Meth onaid. KentRismany ET E Pra 
Involvement in Everyday Imaginative Activities, Attitudes Toward Hypnosis, and 
Hypnotic Suggestibility/ Nicholas P. Spanos and John D. McPeake ........,... 


List of Manuscripts T e a eE a eE a a ene 


536 


549 


557 


567 


574 


27) & 


"A generally suporion textbook... ` 
hie fon li cups in social psychology: 


*from a review of the first edition in the American Sociological Review 


C 


t 
P ROIO 
Social Payshollogy 


James A. Schellenberg, Western Michigan University 


A concise, basic text which explores and reviews the major topics in « 
social psychology. 


è Builds on a framework of case histories and experiments, providing basic 
grounding in social psychology. 


Theoretical and methodological pointers, separated from the main body of the 


text, provide background and highlight key ideas at the most significant point 
in the presentation. 


® General organization — Seven topics. The material is grouped around seven rm © 
general topics: affiliation, aggression, social influence, attitudes, groups, socia! 
norms, and personality in society. Each section contains four chapters and 
concludes with a review and discussion chapter. A glossary of key concepts 
is included along with a bibliography and Instructor’s Manual with a large 
supply of essay, multiple choice, and true-false questions. 


© Major points of revision. The section on “Culture and Personality” is expanded 
into a full, four-chapter concluding section with treatment of culture, T 
psychological ecology, and contemporary issues of individual identity. A brie $ 
final chapter on “Research: Theory and Practice” has been added jand major 
new materials are found in chapters on altruism (“Helping People’) ; attraction P 
(“Liking People”); interpersonal perception (“Perceiving Persons”); and ¢ 
social identity (“Individuality in Contemporary Society”). 


1974 paperbound 350 pages Order Code 31829 


For an examination copy, please write, giving course title, enrollment, and 
decision date. 


Random House M: 


4 
THE COLLEGE DEPARTMENT 400 HAHN ROAD, WESTMINSTER, MARYLAND 21157 


ournal of Personality 
and Social Psychology 


VOLUME 31, NUMBER 4 APRIL 1975 


Bellet peepee of Intrinsic and Extrinsic Motivation/Bobby J. Calder and Barry M. 
FID saan state SMe a ee ca ete ey MN Ponce a NE Soe pe 


Components of “Authority” as Determinants of Compliance/H. Andrew Michener 
and Martha R. Burt ...... a N E, T Poe alle Siociy « 


Cooperation and Competition in Coacting Groups/Morris A. Okun and Francis J. 
Di Vesta 


Development of Integrative Solutions in Bilateral Negotiation/Dean G. Pruitt and 
EN die TEUS E coer cid Aa rE At CE Rae aga Micha ra 


Sex Role Adaptability: One Consequence of Psychological Androgyny/Sandra L. Bem 
Nonverbal Communication of Affect in Children/Ross Buch gage sic cscs cece veces 


Antipollution Measures and Psychological Reactance Theory: A Field Experiment/ 
Michie MBAM Giese Readers eh ie ta cte es bs (tee E a baie esos oe ve 


The Role of Situational Restrictiveness on Self-fulfilling Prophecies: A Theoretical and 
Empirical Extension of Kelley and Stahelski’s Triangle Hypothesis/Dale T, Miller 
CUE IAC, TNT, Ani ais oa gm ie idee Sie en rrr 


The Role of Expectations and Attributions in the Alleviation of Learned Helplessness/ 
Carol Deck’... arn «Waite taints ita uate On a occ ic carn oan 


The Influence of a Communicator’s Body Position on Opinion Change in Others/Hugh 
McGinley, Richard LeFevre, and Pat McGinley n.n.. oona ces. 


Helping a Motorist in Distress: The Effects of Sex, Race, and Neighborhood/Stephen 
G. West, Glayde Whitney, and Robert Schnedler o... aooaa 


Effects of Task-Intrinsic Rewards upon Extrinsic and Intrinsic Motivation/Arie W. 
Kruglanski, Aviah Riter, Daniella Arazi, Raphael Agassi, Joseph M onteqio, Isaac 
RET ONG MMP A EAEEREN abt ede MT ea ea ara Hic aes et salves why 


Gain and Loss in Attraction: Attributions from Nonverbal Behavior/Gerald L. Clore, 
Nancy Hirschberg Wiggins, and Stuart [thin ..... 0... c cece cece eee c ee cusi 


Heider’s Structural Balance Principle as a Conceptual Rule/Nickolas B, Cottrell .... 


The Facilitation of Aggression by Aggression: Evidence Against the Catharsis Hy- 
pothesis/Russell G. Geen, David Stonner, and Gary L. Shope ... 


Information Properties of Descriptive Words/Robert K. Gifford ................., 


Effect of Cognitive Appraisal on d’ and Response Bias to Experimental Stress/ 
BRAC IMVVa wy LN CLL) e eo alsicse E = S ov pie als asst nla'ese OM ae 


Can Money Enhance Intrinsic Motivation?: A Test of the Content—Consequence 
Hypothesis/Arie W. Kruglanski, Aviah Riter, Asher Amitai, Bath-Shevah Mar- 
golin, Leorah Shabtai, and Daliah Zaksh ....... 060. cc cece ccc c cece veceneees 

Conjugate Lateral Eye Movements as an Index of Hemispheric Activation/Raquel 


ES GUY MeN RE ioon AL I OTE E RUMEN. 0) sroliars Lalajs es lejacas deen cee 


599 
606 
615 


621 
634 
644 


654 


661 
674 
686 


691 


699 


706 
713 


721 
727 


735 


744 


751 


(Continued on inside back cover) 


PUBLISHED MONTHLY BY THE AMERICAN PSYCHOLOGICAL ASSOCIATION 


P SPRINGER books for fall classes. 


NEW! 


Bertram P. Karon. Updated from the perspective 
of 1975, this is a reissue of Dr. Karon’s classic 
study of the effects of discrimination on blacks and 
whites, Includes a new appendix on the Southern 
white. Foreword by Silnan Tomkins. 220p., 
1975, $5.95 (cloth $8.95) 


Black Scars 


Experimental Psychology 
of Alfred Minet 


Robert Pollack-Margaret Brenner, eds. In 14 
papers, translated into English, Binet speaks as 
an experimental psychologist with’ a message for 
research today—in psychophysics, perceptual de- 
velopment, cognition, and memory. With com- 
plete Binet bibliography. 252p., 1969, $5.95 
(cloth $8.95) 


The Family’s Role NEW! 


in Soviet Education 


Ludwig Liegle. Foreword by Urie Bronfenbren- 
ner.. Based on original Soviet research. 190 p., 
New, $5.95 (cloth $9.95) 


Testing Freudian Concepts 


Irving Sarnoff. “One of the best introductions: of 
the methodology involved in studying the impact 
of dynamic variables on a range of significant 
human behaviors.” —Philip Zimbardo. 276p. 
1971, $5.95 (cloth $9.50) 


Conflict Among Humans 


Robert Nye. “The determinants, dynamics, out- 
comes, and reduction modes of hostility... a 
good undergraduate introduction to one area of 
the literature of conflict.” —Contemporary Psy- 
chology. 220p., 1973, $5.50 (cloth $8.50) 


For examination copies on 30-day approval: include course title, present text, and enrollment. 
Charges cancelled on class-size orders or if book is returned. 


SPRINGER PUBLISHING COMPANY /200 Park Avenue South, New York 10003 ——— 


o 


Volume 31 


EDITOR: 
ASSOCIATE EDITORS: 


IRWIN ALTMAN 

University of Utah 

_ Norman H. ANDERSON 

University of California, San Diego 
ALBERT BANDURA 

Stanjord University 

Daryt J. BEM 

Stanford Uningrsity 

Seymour M. BERGI 
University of Massachusetts 
Jonn C. BRIGHAM 

Florida State University 
Unie BRONFENBRENNER 
Cornell University 

J. MERRILL CARLSMITH 
Stanford University 

Isipor CHEIN 

New York University 
Ronert B. CIALDINI 

Arizona State University 
Tuomas CooK 3 
Northwestern University 
Warter H, CROCKETT 
Univesity of Kansas 
James E, Dirtes 

Yale University 


Norman T, FEATHER 
Flinders University oj South 
‘Australia, Bedford Park, Australia 


KENNETH GERGEN 
Swarthmore College 


Aran E. Gross £ 
University oj Missouri 


Barry COLLINS 
University of Calijornia, 

Los Angeles 
MicnaeL G. EFRAN 
University of Toronto, 

Toronto, Canada 
SHELDON FELDMAN 
Swarthmore College 
R. Gorsuch es 
Texas Christian University 
Harrison G. GOUGH 
University of California, Berkeley 
Epwarp E. Jones 
Duke 
GEORGE LEVINGER 
University of Massachusetts 


THE AMERICAN PSYCHOLO 


Copyright © 1975 by the American 


April 1975 


Journal of Personality 
and Social Psychology 


oo oan Tamar aa ea 


Number 4 


WO) os: Ss eee E 


25% 


27. 4 


Joun T. LANZETTA, Dartmouth College 
Auice H. EAGLY, University of Massachusetts— 


Amherst 


ANTHONY G. GREENWALD, Ohio State University 
ALBERT Pepirone, University of Pennsylvania 
Wu1am Scorr, James Cook University, Townsville, 


Australia 


CONSULTING EDITORS 
Wiiuiam HAYTHORN 

Florida State University 
Ciype HENDRICK 

Kent State University 
Davio S. HOLMES, 

University of Kansas 
Louise Kipper 

Temple University 
Róbert E. Kueck 

Dartmouth College 
Minton E. Liretz 

University of Colorado 
Crirrorp E. LUNNERORG 

University oj Washington 
R. Crank MCCAULEY 

Bryn Mawr College 


Achert MEHRADIAN 5 

University of California, Berkeley 
Geral A, MENDELSON 

University of California, Berkeley 
Davin Messick 7 

University of California, 

Santa Barbara 

Jupson Muis 

University of Maryland 
Heevert L, Mimers 

Ohio State University 
Wintam N, Morris 

Dartmouth College 


Ricnarp Nispert 
University of Michigan, Ann Arbor 


JANE A, PILIAVIN a 
University. of Wisconsin—M adison 


ISSUE CONSULTANTS 


Brexice E. Lott 
University of Rhode Island 


Epwin, MEGARGER 
rida State University 


Sruart Oskamp 
Claremont Graduate Sc hool 


Berxarp C, ROSEN 
Cornell University 


PUBLISHED BY 


Dean Pruirr 
State University of New York 
at Bufalo 
Miron E, ROSENBAUM 
University of Towa 


Davin J. SCHNEIDER 
Brandeis. University 
Marvin Suaw 
University of Florida 
LAWRENCE STRICKER 
Educational Testing Service, 
Princeton, New Jersey 
Luoyn H, STRICKLAND 
Carleton University, Ottawa, Ontario 
Frep L. STRODTBECK 
University oj Chicago 
Perer SUEDPELD 
University oj British Columbia, 
Vancouver, Canada 
James T, Teoescmi 
State University of New York, 
Albany 
Haney Ursnaw 
University oj Minois at Chicago 
Circle 
t WALSTER 
sity of Wisconsi 
Lapp WHEELER 
University of Rochester 
Ronerrt S. Wyer, Jr. 
University of Ilinois at 
Urbana-Champaign 
Ronenr Zien 
University of Florida 
Puri ZIMBARDO 
Stanford University 


Madison 


Jury B. ROTTER 
University oj Connecticut 
Zick Runin 
Harvard University 
Brennan RUL 
University of Alberta, 
Edmonton, Canada 
F. ŞAMELSON 
Kansas State University 
Cani SCHOOLER 
National Institute of Mental Health, 
Bethesda, Maryland 
, STAUB 
rsity of Massachusetts 
Nancy Widens 
University of Illinois 


GICAL ASSOCIATION, Inc. 


Psychological Association, Inc. 


a! 


The Journal of Personality and Social Psychology publishes original research reports in the areas 
of social psychology and personality dynamics. The Journal includes articles on the following topics: 
social motivation; attitudes and attitude change; social interaction; verbal and nonverbal communi- 
cation processes; group behavior; person perception ; conformity and deviation ; and personality structure 
and dynamics. Methodological articles, studies primarily concerned with the development of measuring 
instruments, replications, and reports of negative results or failures to replicate published work are 
acceptable if they are judged to make a substantial contribution to knowledge, Low priority is given to 
papers concerned with personality assessment or the development and validation of assessment instru- 
ments. Manuscripts in these latter categories should be brief, not more than nine typewritten pages. 
The Journal does not include studies of abnormal processes or populations. 


Manuscripts: Submit manuscripts in duplicate to John T. Lanzetta, Editor, Journal of Personality and Social Psy- 
chology, Department of Psychology, Dartmouth College, Hanover, New Hampshire 03755, according to instructions 
provided below. 

Manuscripts must conform in style to that described in the Publication Manual of the American Psychological 
Association (2nd ed.), (a) An abstract of 175 words or less, typed on a separate page, must accompany each manu- 
script. Instructions for preparation of the abstract are in the Publication Manual (p, 15), (b) The title should be 15 words 
or less. If this length is exceeded, the Editors will use their judgment in shortening the titles of accepted reports. (c) 
Duplication of information in text, tables, and figures should be avoided. (4d) One set of figures should be glossy prints 
and must not exceed 8} X 11 inches. Figures will generally be reduced to single-column width, Symbols and letters 
should be large enough to be readable after reduction. Metric equivalents must be given for all nonmetric measurements, 
Authors should keep a copy of the manuscript to guard against loss. Accepted papers are normally published in order 
of receipt. The length of manuscripts should normally not exceed 25 typewritten pages. Longer papers will be considered 
for pubiication if the length is judged necessary for effective communication, 

Because the reviewers have agreed to participate in a blind reviewing system, authors submitting manuscripts are 
requested to include with each copy of the manuscript a cover sheet stating the title of the manuscript, the name of 
the author or authors, the author's institutional affiliation, and the date the manuscript is submitted. Any footnotes 
identifying the author or his affiliation should also appear on the cover sheet. The first page of the manuscript should 
omit the author's name and affiliation but should include the title and the date submitted, Every effort should be 
made by the author to see that the manuscript itself contains no clues as to his identity. 


Copyright and Permissions: All rights reserved, Written permission must be obtalned from the American Psycho- 
logical Association for copying or reprinting text of more than 500 words, tables, or figures. Permissions are normally 
granted contingent upon like permission of the author, Inclusion of the APA copyright notice on the first page of repro- 
duced material, and payment of a fee’of $10 per page of APA material, Permission fees are waived for authors who wish 
to reproduce their own articles, Address requests for permission to the Permissions Office, American Paychological 
Association, 1200 Seventeenth Street, N.W., Washington, D.C. 20036. 


Subscriptions: Subscriptions are available on a calendar gan basis only (January through December), Nonmember 
rates for 1975: $48 domestic, $49 foreign, $5 single issue. APA member rate : $20, Write to Subscription Section, APA, 


Reprints: Authors may order reprints of their articles from the printer when they recelve galley proofs, 


Back Issues and Back Volumes: For information regarding back issues or back volumes write to Order Depart- 
ment, APA, 


Microform Editions: For information regarding microform editions write to any of thefollowing: Johnson Associates, 
Inc., P.O, Box 1017, Greenwich, Connecticut 06830; University Microfilms, Ann Arbor, Michigan 48106; or Princeton 
Microfilms, Princeton, New Jersey 08540, 


Change of Address: Send change of address notice and a recent malling label to the attention of the Subscription 
Section, APA, 30 days prior to the actual change of address, Undelivered copies resulting from address changes will not 
be replaced; second-class mail is not forwarded unless subscribers notify the local post office in writing that they will 
guarantee second-class forwarding postage. 


Published monthly in two volumes per year, six issues per volume, by the American Psychological Association, Inc., 
1200 Seventeenth Street, N.W., Washington, D.C. 20036, Printed Ín the U.S.A. Second-class postage paid at Washing- 
ton, D.C., and additional mailing offices. 


Journal Staff 
Anita DeVivo, Executive Editor and ARTHUR GORDON AND RITA JASKOWITZ, 
General Manager Technical Editors 
Mary GriBALL, Supervising Ly D. Sprer, Manager, Publications 
Technical Editor Business Services 


Anne RepMAn, Subscription Manager 


E 


n 


er ONEN 
- 1975, Va No. 4, 599-605 


Ens Py J 
and Social Psychology 


Self-Perception of Intrinsic and Extrinsic Motivation 


Bobby J. Calder and Barry M. Staw 
University of Illinois at Urbana-Champaign 


Self-perception theory predicts that intrinsic and extrinsic motivation do not 
combine additively but rather interact. To test this predicted interaction, 
intrinsic and extrinsic motivation were both manipulated as independent vari- 
ables. The results revealed a significant interaction for task satisfaction and a 
trend for the interaction on a behavioral measure, These results are discussed 
in terms of a general approach to the self-perception of motivation. 


Research on motivation has frequently 
drawn a distinction between intrinsic and ex- 
trinsic motivation (eg. Atkinson, 1964; 
Hunt, 1965; Koch, 1956; Young, 1961). If 
a situation contains a specific goal which 
provides satisfaction independent of the ac- 
tual activity itself, behavior is said to be ex- 
trinsically motivated. On the other hand, if 
the activity is valued for its own sake and 
appears to be self-sustained, behavior is said 
to be intrinsically motivated (Young, 1961, 
p. 171), Although this distinction is concep- 
tually appealing, it raises difficult questions. 
There are two major problems confronting 
the account of any behavior in terms of in- 
trinsic and extrinsic motivation (Cofer & 
Appley, 1967). The most serious problem is 
that the phenomenon is merely named, not 
explained. Labeling a behavior as intrinsically 
motivated begs the question of the theoretical 
nature of the process through which the be- 
havior has become a motive, The second 
problem is that there are other theories 
which might plausibly explain the phenome- 
non. No doubt the most common alternative 
explanation involves secondary reinforcement. 
Secondary reinforcement refers to a process 
by which an originally neutral stimulus 
acquires reinforcing properties through its 
association with a primary reinforcer. In 
these terms, an intrinsically motivated activ- 
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study. Support for the study was provided by the 
University of Illinois Graduate Research Board. The 
order of this authorship is alphabetical. 
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ity is simply one in which the reinforcement 
value of the goal has associatively rubbed off 
on the behavior itself. Thus, it is difficult to 
use the notion of intrinsic motivation be- 
yond the descriptive level. 

Although the status of intrinsic motiva- 
tion as a psychological construct is unclear, 
we would argue that the concept is of consid- 
erable interest from still another perspective. 
Instead of asking what intrinsic motivation 
is and how it operates, it may be viewed as a 
perception on the part of individuals, That is, 
suppose that individuals attempt to label 
their behavior in motivational terms much as 
do motivational theorists. The seeds of such 
an approach have been developed by de- 
Charms (1968) as part of his work on per- 
sonal causation as an affective determinant 
of behavior. deCharms argued as follows: 


As a first approximation, we propose that when- 
ever a person experiences himself to be the locus of 
causality for his own behavior (to be an Origin), he 
will consider himself to be intrinsically motivated, 
Conversely, when a person perceives the locus of 
causality for his behavior to be external to himself 
(that he is a Pawn), he will consider himself to be 
extrinsically motivated (p. 328). 


For deCharms, the crux of the distinction 
between intrinsic and extrinsic motivation 
stems from the feeling or perception of per- 
sonal causation, Satisfaction derives from an 
activity which is perceived as intrinsically 
motivated because of a person’s need to feel 
a sense of personal causation in his or her 
actions, 

deCharms’ ideas may be readily extended 
to a more general approach to intrinsic moti- 
vation by means of Bem’s (1967a, 1967b, 
1970, 1972) self-perception theory, Accord- 
ing to this theory, a person infers his internal 
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states by observing his own behavior and the 
context in which it occurs. Thus a person 
may label his behavior as intrinsically moti- 
vated under some conditions and as extrinsi- 
cally motivated under others. The environ- 
ment provides cues as to whether a person’s 
internal motivation is intrinsic or extrinsic. 
deCharms’ (1968) discussion of intrinsic 
motivation poses an interesting question. 
Common sense would lead one to expect that 
intrinsic and extrinsic motivation summate to 
produce satisfaction, and most organizational 
theories of job attitudes have made this as- 
sumption (e.g., Porter & Lawler, 1968; 
Vroom, 1964). However, deCharms argued 
that intrinsic and extrinsic motivation inter- 
act. Specifically, the introduction of extrinsic 
rewards for a behavior may decrease motiva- 
tion rather than enhance it, because the re- 
wards decrease the perception of intrinsic 
motivation. He also predicted, conversely, that 
motivation may be enhanced if a reward is 
withheld. 
Several recent studies have tended to sup- 
port the hypothesis that intrinsic and ex- 
trinsic motivation are not additive. Deci 
(1971) found that the introduction of con- 
tingent monetary rewards for an interesting 
puzzle-solving task apparently decreased sub- 
jects’ subsequent desire to work on the puz- 
zles, while a noncontingent monetary reward 
did not (Deci, 1972a). Calder and Straw 
(1975), however, have pointed out a number 
of methodological problems that render this 
work very difficult to interpret. Three other 
studies provide firmer evidence. Lepper, 
Greene, and Nisbett (1973) demonstrated 
that children who expected a reward for an 
interesting task, playing with magic markers, 
subsequently played with the markers in a 
free-time situation less than subjects who did 
not expect the reward or who received no 
reward. Kruglanski, Friedman, and Zeevi 
(1971) showed that children who received an 
extrinsic reward rated an experimental task 
as less enjoyable and were less likely to vol- 
unteer for similar experiments. Finally, Ross 
(in press) found that preschool children 
given a salient contingent reward displayed 
less interest in a target activity than sub- 
jects in a nonsalient reward or a control con- 
dition. (See Staw, 1975, for an extensive dis- 
cussion of this literature.) 
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All of the previous studies of the interac- 
tion of intrinsic and extrinsic motivation have 
used the same experimental design: An ex- 
‘trinsic reward is introduced to an interesting 
task, and some dependent variable measure 
of intrinsic motivation is assessed. The pur- 
pose of the present study is to test this inter- 
action hypothesis more directly by manipu- 
lating both intrinsic and extrinsic factors as 
independent variables and measuring their 
effects on relevant dependent variables. Task 
satisfaction and a behavioral measure of task 
persistence are employed as dependent. varia- 
bles in the present study because of their 
obvious practical importance, their use in 
other studies, and their relevance to the inter- 
action prediction. 

Since both intrinsic and extrinsic factors 
are manipulated independent variables in this 
design, one can clearly test the assumption 
of additivity versus the interaction of intrinsic 
and extrinsic motivation. Theoretically, the 
interaction prediction may be derived as fol- 
lows. When a task involves high intrinsic 
interest, introduction of extrinsic rewards may 
lead to the self-perception that one is per- 
forming the activity primarily to obtain the 
extrinsic reward. Thus, for an intrinsically 
interesting task, extrinsic rewards may lead to 
a decrease in satisfaction and persistence on 
a task. On the other hand, when a task in- 
volves less intrinsic interest, the self-percep- 
tion effect is not expected to apply. One would 
expect, for a task not high in intrinsic in- 
terest, a direct (or reinforcement) relation- 
ship between extrinsic rewards and task satis- 
faction and persistence. Although it is not 
made explicit in the work of deCharms, the 
interaction hypothesis may thus be predicated 
upon two effects: an inverse (or self-percep- 
tion) effect when a task is initially high in 
intrinsic interest and a direct (or reinforce- 
ment) effect when there is initially less in- 
terest in a task.? 


PRELIMINARY EXPERIMENT 


The design of this study, then, calls for 
varying intrinsic as well as extrinsic motiva- 


1 Theoretical discussions of the interaction of 
intrinsic and extrinsic motivation tend to parallel 
those in the dissonance literature in that there are 
often elaborate dissonance (or self-perception) ef- 
fects while there are simply reinforcement effects. 


re 


INTRINSIC AND EXTRINSIC MOTIVATION 


tion. As suggested by our earlier discussion, 
manipulating intrinsic motivation reduces to 
finding a task which may be readily labeled 
as inherently pleasurable in one instance but 
not in another. Moreover, for our purposes, 
this difference must not be confounded with 
other factors which could lead to alternative 
explanations. Two such factors are crucial: 
The task should always consist of the same 
overt behavior, and subjects’ perceptions of 
the task should differ primarily on an affec- 
tive dimension as opposed to a cognitive or 
behavioral dimension. To operationalize such 
a manipulation, considerable pilot testing was 
necessary. 

The basic experimental task consisted of 
solving 15 jigsaw-type puzzles. The manipu- 
lation of intrinsic motivation was accomp- 
lished by having the puzzles blank for one 
group of subjects versus having interesting 
pictures on the puzzles of another group. Fif- 
teen pictures were carefully selected from 
back issues of magazines (chiefly Life), 
mounted on a large piece of poster board and 
laminated. There was considerable variety in 
the content of the pictures, ranging from 
sporting events to the President, and they all 
involved some unique topic of interest. (To 
ensure high general interest for all subjects, it 
was found necessary to include three Playboy 
centerfolds.) All pictures were mounted on 
the same size board and cut into five pieces 
to form picture puzzles. Each blank board 
was cut in exactly the same way as a corre- 
sponding picture board. The blank and pic- 
ture puzzles were thus matched except for 
the picture. 

Overt behavior on the task was equated 
through the experimental procedure. Each 
puzzle, whether it contained a picture or was 
blank, contained only five large pieces. How- 
ever, the simplicity of the puzzles was not 
sufficient to eliminate differences in perform- 
ance across groups or subjects. Hence, sub- 
jects were given a board with each puzzle 
which contained an outline of the parts of 
the puzzle, and to solve each puzzle, subjects 
only had to pick up a piece and place it over 
the corresponding outlined shape. The task 
was presented as a test of “cognitive infor- 
mation-processing” in which the researchers 
were interested in the order in which the puz- 
zle parts were selected. Subjects recorded this 
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order by writing down the symbol appearing 
on each puzzle piece as they placed them on 
the pattern board. Thus, solving the puzzles 
was extremely routine. There was no possibil- 
ity of making an error, so the extra cues pro- 
vided by the picture made no difference in 
performance. Subjects were literally forced 
into the same pattern of actions by the re- 
quirements of the task. 

Although this procedure controls for any 
differences in overt behavior between the 
blank and picture puzzles, it is still possible 
that subjects might perceive unintended dif- 
ferences between the two. To check on this 
possibility, 15 subjects were run on each 
puzzle type and then given a series of se- 
mantic differential scales regarding the task, 
An attempt was made to include cognitive, 
behavioral, and affective scales. The mean 
ratings for the blank and picture puzzles are 
presented in Table 1. The only significant 
differences for the two groups are on the 
more affective scales connoting intrinsic moti- 
vation. The picture task is rated as signifi- 
cantly more interesting, good, exciting, and 
pleasurable. It is, of course, impossible to 
prove that there are no unintended differ- 
ences in overt behavior or perceptions, but 
our blank versus picture manipulation would 
seem to minimize this possibility. 


METHOD 


Procedure 


The subjects were 40 undergraduate males ful- 
filling a requirement for an introductory course in 
organizational behavior, Ten subjects were randomly 
assigned to four experimental conditions. Half of 
the subjects worked on the blank puzzles, while the 
other half worked on the picture puzzles, For half 
of the subjects, payment was never mentioned, 
while the other half were told that since the task 
would take about 20 to 30 minutes, an equitable 
payment for their time would be $1. The experi- 
mental task was the same as that for the prelimi- 
nary study. The 15 disassembled puzzles and their 
corresponding pattern boards were placed on rows 
of tables in a long room as if to form an assembly 
line. For subjects in the payment condition, a one 
dollar bill was placed at the end of the room after 
the 15th puzzle. In order to make sure this reward 
was salient (Ross, in press, Note 1), before each 
subject began, the experimenter pointed to the 
money and said, “When you finish, you can have 
the dollar over there.” The experimenter then left 
the room and returned after the subjects had fin- 
ished the puzzles. To summarize, it should be noted 
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TABLE 1 
MEAN RATINGS OF THE Puzzte TASK 
Variable Blank puzzle Picture puzzle 
Boring-interesting 3.47 4.73** 
Bad-good 3.60 4.67** 
Monotonous- 
exciting 2.60 CEE ag 
Painful- 
pleasurable 4.00 4.67* 
Easy-hard 1.47 1.20 
Complex-simple 6.47 6.67 
Active-passive 4.73 3.73 
Slow-fast 5.07 5.47 
Constrained-free 4.80 4.40 
Intuitive-rational 4.33 4.20 
Ordinary-novel 3.07 4.00 
Ambiguous-clear 6.47 5.87 
Rigid-loose 3.47 4.40 


Note. The semantic differentials were scored from 1 to 7 
in the direction of the adjective on the right. 
* p <.05, one-tailed ¢ test. 
#* p < 01, one-tailed ¢ test. 


that the reward was equitable, salient, expected, non- 
contingent, and given at the end of the task.* 


Dependent Variables 


The major dependent variable was task satisfac- 
tion. When a subject finished, he was told that the 
experimenter needed “to get some information about 
people’s reactions to the task” in order to see if this 
affected the sequencing of puzzle parts. Subjects rated 
the extent to which they found the “puzzle task 
itself” enjoyable on a 17-point scale ranging from 
extremely unenjoyable to extremely enjoyable. 
Several other questions on 11-point scales concerned 
subjects’ perception of the situation. 

It was also desired to obtain a behavioral mea- 
sure of motivation. Since the procedure did not 
lend itself to the typical free-time measure (Deci, 
1971, 1972a, 1972b; Lepper et al., 1973), subjects 
were asked to volunteer for future experiments of a 
similar nature without payment. The amount of time 
for which subjects volunteered was coded in minutes. 


RESULTS 


A least-squares analysis of variance of sub- 
jects’ ratings of how enjoyable they found 
the task revealed a significant Blank—Pic- 
ture X Money interaction (see Table 2). As 


2Jt should be noted that care was taken in de- 
signing this study to avoid any confusion with in- 
sufficient justification effects. The low intrinsic 
motivation condition was designed not to be very 
negative. Subjects were required to perform a very 
routine task as part of their normal research require- 
ment. This procedure had none of the trappings 
associated with dissonance effects (such as choice, 
negative consequences, or the withdrawal of incen- 
tives). Our low intrinsic motivation condition is more 
similar to the reinforcement condition of the “disso- 
nance studies.” 
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predicted by the self-perception hypothesis, 
the manipulations of intrinsic and extrinsic 
motivation were not additive in their effect on 
task satisfaction. The form of this interaction 
is displayed in Figure 1. For the low intrinsi- 
cally motivating blank puzzle task, the en- 
joyable ratings increase with the introduction 
of the extrinsic monetary reward. However, 
for the high intrinsically motivating picture 
puzzle task, the enjoyable ratings decrease. 

The mean amounts of time subjects volun- 
teered for future experiments of a similar 
nature are given in Table 3. The pattern of 
these means exactly parallels those of the 
enjoyable ratings, although the interaction 
in the analysis of variance (see Table 2) is 
not significant by conventional standards. 
This trend suggests that the effects of in- 
trinsic and extrinsic motivation were not 
additive for the behavioral measure either. It 
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FIGURE 1. Mean ratings of task satisfaction for 
the interaction between intrinsic and extrinsic 
motivation. 


INTRINSIC AND EXTRINSIC MOTIVATION 


603 


TABLE 2 
ANALYSES OF VARIANCE 
Source 
Variable Blank picture (A) Money (B) AXB 

F b F t? F ? 
Enjoyable 1.85 NS Si NS 8.58 <.006 
Time volunteered* <1 NS <1 NS 2.31 <.138 
Work vs. leisure 6.77 <.013 1.90 NS 5.27 <.028 
Perceived effort <1 NS 5.44 <.025 <1 NS 
Awareness of extrinsic motivation 4.26 <.046 <1 NS <1 NS 
Awareness of intrinsic motivation 1 NS <1 NS <1 NS 


"Since the distribution of time scores tends to be skewed, a natural log transformation was used for this analysis of variance, 


should be noted that this measure may have 
been weakened by the fact that the experi- 
ment occurred near the end of the semester. 
This factor may have increased individual 
variability in subjects’ willingness to volun- 
teer. 

The remaining dependent variables concern 
subjects’ perceptions of various aspects of 
the experimental situation. One question 
asked whether the puzzle task was more like 
work or leisure-time activity. As shown in 
Table 3, the means for this variable increase 
for the blank puzzle and decrease for the 
picture puzzle with the introduction of 
money. This is the same pattern shown by 
the enjoyable ratings except that the blank 
puzzle is always more worklike than the pic- 
ture puzzle. This pattern is reflected in Table 
2 by the significant blank-picture main effect 
as well as the significant interaction. 

Subjects were also asked about perceived 
effort, the extent to which they “tried” to do 
well on the puzzle task. Though not specifi- 
cally predicted, the payment of money ap- 


TABLE 3 
MEAN TIME VOLUNTEERED AND PERCEPTIONS 
Blank Picture 
Variable v 
No o 

money | Money | money | Money 
Enjoyabl —1.80 | 1.40 | 230 | —.10 
Time yoliinteered 22.50 | 28.50 | 40.50 | 24.00 
Work vs. leisure =.70| 1.70 | 250 | “1.90 
Perceived effort 7.60 | 5.30 | 650 | 5.40 
peice ary 7.10} 6.30 | 4.70 | 4.30 
RRA ee 4.90) 5.40 | 5.90 | 5.40 


à The higher the number is, the more leisurelike the task, 
the more perceived effort, etc. 


parently decreased the perception of trying 
on both the blank and picture task as indi- 
cated by the significant main effect for money 
in Table 2. Two other questions asked to 
what extent the subject perceived that he was 
motivated by “external factors (like course 
credit, the researcher, etc.)” and to what ex- 
tent the subject perceived that he was moti- 
vated by his own “intrinsic interest in the 
puzzle task itself.” As revealed in Table 2, 
the only significant result was a blank—picture 
main effect for extrinsic motivation. Subjects 
in the blank puzzle task always saw them- 
selves as more externally motivated. This 
finding suggests that subjects may have been 
more aware of the effects of external factors 
on their level of motivation than intrinsic 
factors. Other questions concerned how well 
the subject thought he performed, beliefs 
about the researcher’s opinion of the task 
and, in the payment condition, how valuable 
the payment was and whether it Was more a 
bribe or a reward. These questions revealed 
no significant effects. 


Discussion 


As predicted, an interaction was found 
between intrinsic and extrinsic motivation for 
the task satisfaction variable. There was also 
a trend for this interaction on the behavioroid 
measure of volunteering. Although subjects’ 
perceptions of their motivation as extrinsic or 
intrinsic did not indicate that they were aware 
of this effect, their ratings of whether the 
task was more like work or leisure-time actiy- 
ity did display an interaction between in- 
trinsic and extrinsic motivation. It should 
also be noted that the form of the interaction 


604. 


for task satisfaction in Figure 1 is stronger 
than theoretically required, The picture puz- 
zle actually became less enjoyable than the 
blank puzzle with the introduction of money. 

While the present results demonstrate the 
interaction of intrinsic and extrinsic motiva- 
tion, it is not clear to us that such an inter- 
action need always be obtained. A more 
prudent hypothesis might be that under some 
conditions, self-perception may produce an 
interaction between intrinsic and extrinsic 
motivation. Consider a description by Wood- 
worth (1918) of the phenomenon of intrinsic 
motivation: 


. while a man may enter a certain line of 
business from a purely external economic motive, 
he develops an interest in the business for its own 
sake . . . and the motive force that drives him in 
the daily task, provided of course this does not 
degenerate into mere automatic routine, is precisely 
an interest in the problems confronting him and in 
the processes by which he is able to deal with those 
problems. The end furnishes the motive force for 
the search for means but once the means are found, 
they are apt to become interesting on their own 
account (p. 104, italics added). 


Taking a lead from Woodworth, a convenient 
way of viewing the self-perception process is 
to assume that an individual performs an 
intuitive means-ends analysis of his behavior. 
As shown in Figure 2, different self-percep- 
tions may result according to the affect as- 
sociated with the means and the ends of an 
action. Intrinsic motivation can be attributed 
most clearly when the means are positive and 
the ends are negative or neutral. Extrinsic 
motivation can be attributed when the means 
are negative or neutral and the ends are posi- 
tive. When both are positive, the attribution 
may be unstable. 

In the present experiment, the attributional 
instability of the picture-money combination 
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Ficure 2. A means-ends analysis of the self-percep- 
tion of motivation. 
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was apparently resolved by a decrease in 
intrinsic motivation, which produces the inter- 
action effect. This effect, however, may well 
depend on two classes of variables: (a) 
parameters of the task environment itself and 
(b) other variables which were held constant 
in this study. In terms of task parameters, 
previous studies by Lepper et al. (1973) and 
Ross (in press) have already provided 
boundary conditions for the decrease in in- 
trinsic motivation with the introduction of 
a reward. The reward must be salient and 
expected. These are boundary conditions in 
the sense that any stimulus should in general 
have less effect on self-perception to the 
extent that it is nonsalient and unexpected. 
There are parameters of even greater theo- 
retical interest. For example, Deci (1971, 
1972a, 1972b) has suggested that noncon- 
tingent and verbal rewards do not produce 
the interaction effect, but the evidence for 
this proposition is most ambiguous (cf. Cal- 
der & Staw, 1975). Other properties of re- 
wards are almost certain to be important 
though. Extremely large rewards may become 
disassociated from the task or simply out- 
weigh other factors. It ought not be assumed 
that the interaction is Linear X Linear just 
because we tend to conduct two level experi- 
ments. Along the same lines, it should be 
noted that the blank puzzle task was designed 
not to be extremely negative (see Table 1). 
It is known that increased satisfaction can 
result when an individual chooses to perform 
a costly behavior which is insufficiently justi- 
fied (Staw, 1974; Weick, 1964). Although 
self-perception and dissonance are competing 
explanations here, similar manipulations could 
bear on deCharms’ interaction hypothesis. It 
is necessary to explore such parameters to 
determine the nature and conditions of the 
interaction. 

Of particular theoretical interest are other 
variables which may determine reactions to 
attributional instability. An individual may 
simply assume that he is intrinsically or ex- 
trinsically motivated. Which of these he as- 
sumes may well depend on personality factors 
such as Rotter’s (1966) dimension of internal 
versus external control, situational norm 
about how one ought to be motivated, or t 
effectiveness of one’s task performance (on 
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is usually intrinsically motivated on the 
things he or she does well). Additionally, an 
individual may attempt to clarify one’s self- 
perception, such as by considering the impli- 
cations of how the reward was presented (e.g., 
Steiner’s, 1970, bribe vs. bonus distinction). 

The effect of intrinsic and extrinsic factors 
on satisfaction and task persistence obtained 
in this study offers strong support for the 
interaction of intrinsic and extrinsic motiva- 
tion. It remains for future research to specify 
the necessary conditions for this interaction. 


REFERENCE NOTE 


1. Ross, M. Personal communication, 1973. 
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Components of “Authority” as Determinants of Compliance 
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This study investigated factors affecting compliance to orders from a formal 
authority. The design created a two-level status hierarchy in which subjects 
occupied identical low-status positions and responded to demands from a 
simulated high-status leader. Four components of authority—normativity, 
coercive power, collective justification, and success—failure—were manipulated 
as independent variables. Another component, the endorsement accorded the 
leader, was included in the design as a measured variable. Results indicated 
that compliance increased significantly when coercive power was high (rather 
than low), when justification was collective (rather than partisan), and when 
demands were normative (rather than counternormative). Contrary to theo- 
retical expectation, endorsement did not affect compliance by low-status mem- 
bers. The findings show that the normative aspect of legitimacy serves as a 
compliance-gaining base even when stripped of enforcing sanctions and under- 
lying goals and that the distinction between normativity and endorsement is 


valid for research on social power. 


Compliance to orders cannot be taken for 
granted. High-status authorities customarily 
attain acquiescence to their directives, but on 
some occasions low-status members may re- 
fuse to comply. Over a decade ago, French, 
Morrison, and Levinger (1960) asked the 
question, “When do high-status authorities 
succeed in gaining compliance, and when 
not?” This remains an open issue today. 

Authority, as it operates in natural settings, 
involves numerous components. Theorists 
differ in their conceptions (Peabody, 1964, 
chap. 7), but treatments of authority-based 
influence include certain commonalities. Most 
approaches maintain that authorities have the 
“legitimate” right to request compliance, as 
well as some other resource base (usually 
coercive power), to back up their directives. 
Since these components covary in natural 
settings, it is difficult to ascertain which cause 
compliance and which do not. 
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The present article analyzes the compo- 
nents of authority as they bear on compliance 
by low-status members. Specifically, this re- 
search investigates (a) whether two forms of 
legitimacy, normativity and endorsement, 
have separate effects on compliance and (b) 
whether either form of legitimacy serves as a 
compliance-gaining base when divorced from 
supportive sanctions. The experimental de- 
sign includes normativity, endorsement, coer- 
cive power, and justification as independent 
variables and uses a collective setting in 
which a high-status authority issues direc- 
tives to low-status members, 


Legitimacy 


French and Raven (1959) discussed legiti- 
mate power in their classic statement on bases 
of social power. But Schopler (1965), in his 
extensive review, concluded that the available 
empirical evidence does not substantiate the 
hypothesis that legitimacy affects compli- 
ance. As Schopler correctly noted, “The 
superiority of legitimate power in its ability 
to obtain conformity, even when contrasted 
to nonlegitimate power, has not been satis- 
factorily established” (p. 197). 

One difficulty with previous research on 
legitimacy is that “legitimate authority” is 
not a simple concept but a multidimensional 
one. Numerous theorists (Etzioni, 1965: 
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% Jackson, 1964; Scott, Dornbusch, Busching, 
& Laing, 1967; Weber, 1947; Michener & 
Burt, 1974) have found it useful to dis- 
tinguish two components: normativity and 
endorsement. 

Normativity is position-specific legitimacy, 
and it designates the prerogatives inherent in 
an organizational role. This concept, which is 
equivalent to French and Raven’s (1959) 
“legitimate power” and to Simon, Smithberg, 
and Thompson’s (1970) “authority of legiti- 
macy,” denotes influence based on under- 

* standings regarding the demands an authority 
can impose on a low-status member. Norma- 
tivity arises from a group base, and coalitional 
support from group members customarily 
stabilizes and validates the high-status au- 
thority’s prerogatives. The present design 
varies normativity by manipulating the norm 
itself (ie., the level of demands the group 
formally permits a leader to impose). 

Compliance to an authority’s directives is 
greater when his or her orders are within 
the normatively prescribed prerogatives than 
when they are not. Empirical studies by 
Frank (1944) and by French, Morrison, and 
Levinger (1960) have shown that when a 
high-status person’s orders are out-of-bounds, 
compliance dwindles and attitudinal resist- 
ance increases. These results remain ambigu- 
ous, however, because no research on norma- 
tivity to date has assessed its effects while 
simultaneously controlling both endorsement 
and the coercive power available to en- 
force normative demands. The present design 
corrects this deficiency. 

Compliance to authority may stem from 
role obligations (i.e., normativity), but it may 
also arise from endorsement. Endorsement, or 
person-specific legitimacy, refers to a low- 
status member’s sentiments regarding a par- 
ticular high-status person as a role occupant. 
As used in previous studies (Julian, Hol- 
lander, & Regula, 1969; Michener & Lawler, 
1974; Michener & Tausig, 1971), endorse- 
ment indexes the extent to which low-status 
members support the high-status person and 
want him or her to continue in a position of 
influence and control within the group. 

Theoretical discussions (Hollander, 1964, 
chap. 14-19; Homans, 1961) have argued 
that the greater the endorsement accorded a 
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high-status person, the more the compliance 
he can obtain from group members. Previous 
experimental research provides little support 
for this proposition, despite theoretical argu- 
ments in its favor. In a field study, French 
and Snyder (1959) found no differences in 
compliance as a function of the endorsement 
accorded a formal authority. A laboratory 
study by Raven and French (1958a) reported 
a weak tendency for compliance to increase 
with endorsement, but a related study (Raven 
& French, 1958b) failed to replicate this 
effect. Overall, these findings suggest that 
endorsement is not a base of influence but 
a conservative critic would note that the data 
analyses in these earlier investigations were 
deficient. Although all three studies specified 
explicit causal models, the procedures for 
estimating effects were inappropriate, and 
thus the endorsement hypothesis did not re- 
ceive an adequate test. In this sense, one must 
conclude that empirical work has not yet 
adequately investigated the theoretical predic- 
tions. The present research attempts to deter- 
mine whether endorsement affects compli- 
ance under conditions where normativity and 
coercive power are simultaneously controlled. 


Coercive Power and Justification 


In addition to normativity and endorse- 
ment, the study design includes coercive 
power as a factor. Research in dyadic situa- 
tions has demonstrated that compliance is 
greater as the magnitude and the credibility 
of threats increase (Horai & Tedeschi, 1969; 
Tedeschi, 1970). Coercion acts as a base of 
power in larger social groups as well (French 
et al., 1960; Michener & Zeller, 1972; Raven 
& French, 1958b). Since coercive power fre- 
quently underlies authority relationships, re- 
search investigating legitimacy must ensure 
that any compliance obtained is indeed due 
to normativity or endorsement, not merely to 
coercion. This study manipulates coercive 
power independently of normativity and en- 
dorsement to avoid the confound frequently 
present in naturally occurring authority situa- 
tions, where these factors covary. 

In addition to normativity, endorsement, 
and coercive power, the present design in- 
cludes an additional manipulated variable, 
justification. Authorities frequently justify 
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their orders by underscoring the collective 
interests of the low-status targets. They say, 
in effect, “I need your compliance to further 
the goal you and I are both committed to.” 
A request from a high-status person that 
serves the collective interest will usually re- 
ceive more compliance than one that serves 
selfish or partisan interests. Research sup- 
porting this proposition has apparently not 
been conducted in a large collective setting, 
but studies using dyads (Berkowitz, 1969; 
Lerner & Lichtman, 1968) found that the 
less selfish the apparent motive behind a 
normative invocation, the greater was the 
compliance to the request. Wager (1971), 
using hypothetical situations, reported that 
respondents were more likely to resist an in- 
fluence attempt when the motivation behind 
the attempt appeared self-serving. 

Justification is an important variable to 
include in any study of legitimacy. One may 
be tempted to assume that whenever a high- 
status person makes a normative demand, he 
or she intends to serve collective interests. 
Conversely, an out-of-bounds request may 
connote that the high-status person is exploit- 
ing the members for his or her own benefit. 
This alignment of normativity with justifica- 
tion need not always occur, however, because 
the legality of a request and the underlying 
goal served by a request are not the same 
thing. By varying the justification for the 
request (collective interest vs. self-interest) 
while simultaneously varying the normatively 
permissible demand, the present study avoids 
this potential confound. 


METHOD 
Procedure 


The experimental framework created a formal 
hierarchical structure, within which components of 
authority were varied in a 2X2X2X2 design 
(Success-Failure X Coercive Power X Normativity X 
Justification). The design varied success-failure to 
affect endorsement, which cannot be manipulated 
directly, 

Subjects were 163 male University of Wisconsin 
undergraduates who were paid volunteers. They 
were promised $1.50 for participation, plus whatever 
they could earn during the experiment. (The actual 
payment averaged $2.30.) Three subjects were 
dropped due to an error in experimental procedure. 
The remaining 160 subjects were randomly assigned 
to the 16 cells of the design. Subjects were run in 
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groups of 3, 4, or 5 persons, one of whom was a 
confederate. 

When subjects entered the laboratory, they took 
seats and read a set of instructions describing the 
basic framework. Subjects learned that they were 
to participate in a simulation of several (mythical) 
countries organized as a federation with a central 
administration. Each low-status subject would repre- 
sent a nation and be responsible for the prosperity 
of that country’s economy over four rounds of activ- 
ity. Each would control resources to invest for his 
country’s benefit, and, if successful, these invest 
ments would return profits that subjects could 
convert into money (to retain for themselves) at the 
end of the experiment. 

When subjects finished reading, the experimenter ¢ 
reviewed the instructions and then distributed a 
second set. These new instructions described the 
federation’s central administration and indicated thai 
one person would be chosen to occupy the high- 
status role of administrator. This person was to be 
responsible for public welfare functions (including 
public health, air transportation, and pollution co. 
trol) which were cross-national in scope and bene 
ficial to everyone. Successful performance of these 
functions would require financial support from all 
low-status subjects, and consequently the adminis- 
trator would be empowered to assess tax demands 
against the subjects. 

The experimenter then told the subjects that, to 
assure success in these collective functions, all persons 
in their group would take a “skill test” so that the 
most competent member could be assigned to the 
high-status role. This test consisted of a practice 
trial on an investment task that paralleled the job 
the high-status person would have to perform for 
the group. The test validated the status hierarchy 
and provided a rationale for installing someone in 
the high-status role. When subjects completed this 
test, the experimenter collected their sheets. 

As he left to grade the tests, the experimenter 
directed the subjects to establish an assessment agree 
ment, which was a written contract regulating the 
tax demands that a high-status person could levy 
against members when collecting funds for the 
federation. The assessment agreement manipulated 
normativity, one of the independent variables. To 
assure that low-status persons would construe a 
given demand from the high-status member either 
as normative or as counternormative (depending on 
experimental treatment), the procedures induced 
some groups to set a high limit on the authority’s 
assessments, while they led other groups to set a low 
limit. To guarantee that groups established the norm 
randomly assigned to them by experimental treat- 
ment, each group included a confederate who used 
a series of standard arguments to sway opinion in 
the intended direction. This procedure always suf- 
ficed to establish the appropriate norm. Members 
indicated assent by publicly signing the formal 
agreement. 

After the subjects signed this agreement, the 
experimenter returned to the room and announced 
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Fy that one subject (the confederate) had emerged as 


the most competent group member on the skill test 
and therefore had earned the right to occupy the 
high-status position. All the real subjects were 
assigned to identical low-status positions and placed 
in separate rooms, where they could not communi- 
cate with one another. 

From this point on, each subject was run inde- 
pendently of the others. Although subjects believed 
they were working in a collective context, the group 
actually had no high-status member, Bogus com- 
munications from the leader to the low-status sub- 
jects manipulated the independent variables. The 
experimenter did not know the content of these 
communications, and thus he remained blind to 
three manipulations (success-failure, coercive power, 
and justification), although he did perforce know 
the normativity treatment received by each subject. 


Description of a Round 


Group activity proceeded for four similar rounds. 
At the beginning of every round, each subject 
received an assessment for public welfare, ostensibly 
from the high-status person. 

Each subject informed the leader in writing how 
much he would pay and then made investment 
decisions for his country’s economy by allocating 
whatever resource units remained after his tax pay- 
ment. As each subject realized, the fewer resources 
he gave the high-status person, the more he had left 
for profitable investments of his own, and this 
provided an important reason for not paying the 
assessments. 

‘At the end of every round, the experimenter returned 
to inform each subject how profitable his country’s 
investments had beeen. He also delivered a report 
from the high-status person which indicated the 
success or failure of the leader's efforts on the 
confederation’s public welfare problems. 

After the second round, and again after the fourth, 
each subject completed a questionnaire tapping his 
perception of the collectivity’s operations and his 
attitudes toward the high-status authority. 


Manipulation of the Independent Variables 


Success-failure. At the end of every round, each 
subject received a message (ostensibly from the 
high-status person) reporting his success or failure at 
maintaining the confederation’s public welfare. This 
report included a scale ranging from +10 (maximum 
success) to —10 (maximum failure); subjects knew 
that an average group could expect to obtain a score 
near zero (i.e, from +2 to —2). In the success 
condition, feedback from the high-status person 
averaged +8; in the failure condition, it averaged 
—s. Subjects cared about the leader’s performance 
on this task because, under collective success, their 
own profits jumped approximately 20% (above what 
they earned on their private investments), while 
under failure their profits dropped by approximately 


20%. 
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Coercive power. The high-status authority had the 
power to fine members if they refused to comply 
with his tax demands. In the high-coercive-power 
condition, the instructions informed subjects that the 
high-status person could amerce each subject up to 
90% of his profits on any round where noncompli- 
ance occurred; in the low-coercive-power condition, 
subjects learned that the high-status person could 
fine no more than 10%. The high-status person had 
this prerogative whether or not his demands were 
legitimate. Even if the leader's demands exceeded the 
limits set by the group’s assessment agreement, he 
could use his fining capacity if the subject refused 
to pay. In fact, no fine was ever levied against 
any subject (regardless of his behavior). Punishment 
always remained a possibility, however, and this was 
reaffirmed by a handwritten note from the high- 
status person during Round 4 admonishing the 
low-status subject to pay the full assessment and 
threatening punishment for noncompliance. 

Normativity. Each group established a formal 
agreement limiting the high-status person’s assess- 
ments against low-status members. In the high- 
normativity condition, the agreement permitted the 
high-status person to request no more than 50% of 
each member's resources on each round; in the low- 
normativity condition, it stipulated the limit of 20%. 
In conjunction with the assessments sent by the 
leader, these differences in norm level constituted the 
normativity manipulation. During Rounds 1 and 2, 
the high-status person asked for 20% of each sub- 
ject’s resources. During Rounds 3 and 4, he increased 
the demands to 50%. Thus, in combination with a 
norm level of 50% (high normativity), his demands 
on Rounds 3 and 4 were within bounds. But in 
combination with a norm level of 20% (low 
normativity), his demands were illegitimate. 

Justification. A handwritten note ostensibly from 
the high-status person accompanied the assessments 
on Rounds 3 and 4. In the collective justification 
condition, the message explained that environmental 
pressures had increased, which made the confedera- 
tion’s public welfare problems more serious. Specif- 
ically, the note on Round 3 said, “The chart makes 
me think this round will be very difficult, so PI 
have to ask for 50% of your resources.” A similar 
note for Round 4 read, “Another bad round, so I'll 
need 50% for welfare. Don’t put me in a position 
where I have to fine you.” In the selfish justification 
condition, the high-status person demanded the extra 
assessments for his own aggrandizement. The note 
on Round 3 read, “I think I should be paid more, 
so I’ve raised the Public Welfare Assessment so 
I can take the extra for myself.” On Round 4 the 
note stated, “I’ll need 50% of your resources this 
round so I can jack up my salary again. Don’t put 
me in a position where I have to fine you.” Nor- 
mally, the high-status person was to receive a salary 
comparable to what the members could make on 
their individual investments, Subjects realized, how- 
ever, that nothing in the situation would prevent 
the leader from diverting part of the public wel- 
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fare assessments to his own pocket. The selfish 
justification messages indicated this was happening, 


Measurement of Endorsement 


In addition to the four manipulated independent 
variables, this study incorporated a measured pre- 
dictor variable, the endorsement accorded the high- 
status person. Endorsement was measured on the 
questionnaire administered after the second round 
of activity. The endorsement scale, which has been 
developed in previous research (Michener & Lawler, 
in press; Michener & Tausig, 1971), contained the 
following five items: (a) “Consider the person 
occupying the position of leadership. How legitimate 
is it for him to occupy this position?” (b) “How 
satisfied are you with the leader’s use of his power 
in arriving at group decisions?” (c) “How willing 
would you be to have the person who served as 
leader this time head the group again?” (d) “How 
satisfied are you with the performance of the leader 
in directing the group?” and (e) “To what extent 
do you support or oppose the leader?” Subjects 
responded to each item on a scale running from 1 
(strong dissatisfaction) to 9 (strong satisfaction). 
To obtain an endorsement score, the five items 
were summed. 


Measurement of Compliance 


The major dependent variable was the subject’s 
compliance to the public welfare assessments on 
Rounds 3 and 4, when the high-status person raised 
his demands from 20% to 50%. Each subject 
recorded his payment on a form sent to the leader 
on each round, Compliance by subjects ranged from 
0% (i.e., a flat refusal to Pay any assessments) to 
50% (full compliance to the demands). 

After Round 4 the experimenter terminated the 
procedures, explained the study’s purpose to the 
subjects in detail, revealed the deceptions, answered 
any questions, and paid the subjects. 


RESULTS 
Checks on Experimental Manipulations 


Items on the questionnaire administered 
after Round 4 checked the effectiveness of the 
experimental manipulations. 

Success—failure, An item asked, “In your 
opinion, how successful have the high-status 
person’s public welfare decisions been so far?” 
Subjects’ responses (on a 9-point scale) cor- 
related highly (r = .931, p < .001) with the 
success—failure manipulation, which attests to 
its effectiveness. 

Coercive power. The questionnaire asked, 
“Tf you refuse to pay the full public welfare 
assessment demanded by the high-status per- 
son, what percentage of your profits can he 
penalize you?” Responses by subjects cor- 
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related highly (r = .930, p < .001) with the 
actual level of the high-status person’s coer- 
cive power (i.e., 10% or 90%). 

Normativity. The questionnaire asked, 
“Has the high-status person abided by the 
Assessment Agreement that everyone signed?” 
As expected, subjects indicated that the high- 
status person had broken the agreement more 
under the 20% norm than under the 50% 
norm (r = .907, p < .001). 

Justification. A question asked, “To what 
extent has the high-status person been moti- 
vated to serve the collective interest instead 
of his own selfish interest?” Responses (on a 
9-point scale) indicated the expected effect 
(r = .608, p < .001). 

Endorsement. The 5-item endorsement scale 
proved highly reliable. The index of internal 
consistency (Cronbach’s alpha) is 884, which 
compares with figures of .923 and .924 ob- 
tained in previous studies (Michener & 
Lawler, in press; Michener & Tausig, 1971). 
A second measure of endorsement (taken 
on the postexperimental questionnaire after 
Round 4) shows that subjects did not sub- 
stantially change their sentiments toward the 
high-status person during Rounds 3 and 4. As 
discussed below, success-failure proved the 
only significant determinant of the first en- 
dorsement measure and remained the only 
determinant of the second measure as well. 
Since no more than a slight drop in endorse- 
ment (consistent across all conditions) oc- 
curred between the first and second measures, 
test conditions remained intact during the 
critical rounds. 

Overall, the checks indicate that the ex- 
perimental procedures established appropriate 
conditions to test the research hypotheses. 


Basic Causal Model 


Table 1 reports the mean compliance 
scores summed over Rounds 3 and 4. The 
data were analyzed in terms of the causal 
model depicted in Figure 1, which portrays 
compliance as a function of the four manipu- 
lated variables. These independent variables 
are orthogonal due to equal cell ns, and they 
do not correlate with the error term (U) 
because subjects were randomly assigned to 
treatments. The causal path from justifica- 
tion to compliance is shorter than the others, 
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TABLE 1 


Mean VALUES FOR COMPLIANCE (ROUNDS 3 AND 4) As A FUNCTION OF SUCCESS-FAILURE, 
COERCIVE POWER, NoRMATIVITY, AND JUSTIFICATION 


Weak coercive power (10%) 


Strong coercive power (90%) 


Independent variable 


Selfish Collective Selfish Collective 
justification justification justification justification 
High normativity (50%) 
Success 45.2 72.0 69.0 84.3 
Failure 34.0 55.0 70.1 81.0 
Low normativity (20%) 
Success 32.0 56.0 51.5 78.0 
Failure 33.0 4.5 52.0 59.5 


since justification was manipulated later in 
the experimental session. The model assumes 
that normativity was manipulated when the 
groups established their assessment agreement 
rather than on Rounds 3 and 4 when the 
high-status person increased his demands. 

Multiple regression was used to estimate 
the parameters (beta coefficients) for the 
independent variables. The results indicate 
that three independent variables are signifi- 
cant determinants of compliance. Subjects 
comply more when the high-status person’s 
assessments are normative (rather than out-of- 
bounds), when his coercive power is high 
(rather than low), and when he uses a col- 
lective (rather than selfish) justification. The 
findings for coercive power and justification 
achieve significance at the .0001 level, while 
the finding for normativity reaches signifi- 
cance at the .027 level. Success—failure has no 
significant effect on compliance. Ancillary 
analyses indicate no significant interaction 
effects among the independent variables. 

The regression equation for compliance, 
with all variables expressed in standardized 


form, is as follows: 


C = 061S + .301P + .155N 


+ 3637 + .866U, (1) 


x u 
SUCCESS~FAIL 61 (NS: 
-866 


COERCIVE POWER BAE ee COMPLIANCE 
ASS a 
NORMATIVITY 
JUSTIFICATION 


Ficure 1. Basic causal model for compliance. 


where C is compliance, S is success~failure, 
P is coercive power, W is normativity, J is 
justification, and U is the error term. The 
multiple correlation coefficient for compliance 
is .5004, and the four independent variables 
account for 23.11% of the variance in compli- 
ance (this figure is corrected for degrees of 
freedom). 


Expanded Causal Model 


Analyzing the data in terms of the basic 
causal model (see Figure 1) illuminates the 
effects of the manipulated independent vari- 
ables on compliance, but it provides no 
information about the impact of endorsement 
(which is measured). 

In response to this problem, Figure 2 pre- 
sents an expanded causal model that tran- 
scends the model in Figure 1 by incorporating 
endorsement as a cause of compliance. In 
addition, endorsement itself stands as a de- 
pendent variable caused by the prior ma- 
nipulated variables (success-failure, coercive 
power, and normativity) as well as by other 
unknown factors (summarized by the error 
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n 
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COERCIVE POWER COMPLIANCE 
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NORMATIVITY 
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Ficure 2. Expanded causal model for compliance. 
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TABLE 2 

ZERO-ORDER CORRELATIONS AMONG VARIABLES 
Variable Success-failure Punishment Normativity Justification Endorsement Compliance 

Success-failure 1.000 
Punishment -000 1.000 
Normativity -000 -000 1.000 
Justification -000 -000 -000 1.000 
Endorsement 713 010 —.012 —.020 1.000 
Compliance .061 301 155 363 180 1,000 


term, V). This representation coincides with 
the temporal structure of the experiment. 

Table 2 shows that the observed zero- 
order correlation between endorsement and 
compliance is .180. This correlation could 
occur if endorsement causes compliance, but 
it could also result if both endorsement and 
compliance are caused by some outside vari- 
able(s) not explicitly entered in the causal 
model. Since subjects were not randomly 
assigned to “levels” of endorsement, the 
model in Figure 2 explicitly assumes that 
the correlation between the error terms, U 
and V, might be nonzero. When obtaining 
parameter estimates, one must estimate both 
the direct path from endorsement to compli- 
ance (representing the causal impact of en- 
dorsement on compliance) and the correlation 
between the error terms U and V (represent- 
ing the “spurious” effect of outside variables). 
Since ordinary least-squares procedures can- 
not provide consistent and unbiased estimates 
for the coefficients in this case, an alternative 
technique is needed. 

Two-stage least-squares, a procedure intro- 
duced by the econometricians Basmann (1957) 
and Theil (1961), can provide parameter 
estimates that while still biased, have the 
desirable property of consistency. A detailed 
exegesis of the two-stage least-squares tech- 
nique appears in Johnston (1963, chap. 9) 
and in Miller (1971). When applied to the 
model in Figure 2, this technique yields 
the following parameter estimates in the 
structural equation for endorsement (E): 


E= .7135 + .010P 
—.012N + .701V (2) 


and for compliance (C): 


C = 300P + .156N + -3637 
+ .061Ê + .866U, 


(3) 
with ryy = .236. 


To meet the conditions for identification, 
this model assumes that success—failure has 
no direct causal impact on compliance. Thus, 
no coefficient for success-failure (S) appears 
in the structural equation for compliance 
(see Equation 3). The endorsement term is 
replaced in Equation 3 by a surrogate vari- 
able (£), which is the value of endorsement 
estimated from the manipulated independent 
variables. 

Overall, the results of the two-stage least- 
squares procedure indicate that success— 
failure is the only significant determinant 
(p < .0001) of endorsement, while coercive 
power ($ < .0001), normativity (p < .03), 
and justification (p < .0001) determine com- 
pliance. Endorsement does not cause compli- 
ance, which means that the zero-order cor- 
relation between endorsement and compliance 
is spurious. In substantive terms, this implies 
that some unknown variable(s) cause both 
endorsement and compliance. The model sum- 
marizes this effect by depicting a correlation 
of .236 between the error terms, U and V. 


Discussion 


This study has investigated the proposi- 
tion that endorsement, normativity, coercive 
power, and justification exert independent ef- 
fects on compliance to orders from an author- 
ity. Results indicate that three of the four 
factors—normativity, coercive power, and 
justification—do significantly affect compli- 
ance. These findings emerge in a laboratory 
design that eliminates the potential confounds 
of naturally occurring authority situations. 
Ancillary analyses reveal no significant 
interaction effects among the independent 
variables. 

The relative strengths of the independent 
variables differ in these data. The path co- 
efficients demonstrate that coercive power and 
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justification have much greater impact on 
compliance than does normativity. Of course, 
this comparison applies only under the fixed 
parameters of the experiment. 

The finding that normativity determines 
compliance in the absence of supporting sanc- 
tions is new and important. Independent of 
coercive sanction, without active group sup- 
port, and against their immediate self- 
interest, low-status members still complied 
more under the 50% norm than under the 
20% norm. This is a stringent test of norma- 
tivity’s effectiveness, and it voices a positive 
answer to Schopler’s (1965, p. 197) question 
whether legitimacy affects compliance. 

Another interesting finding concerns en- 
dorsement. The experimental procedures cre- 
ated the theoretically important distinction 
between normativity (position legitimacy) 
and endorsement (person legitimacy). The 
data indicate that these aspects of compliance 
operate differently: Normativity affects com- 
pliance, but endorsement does not. A high 
level of endorsement did not aid the high- 
status person in gaining payments from the 
subjects. This disconfirms the hypothesis ad- 
vanced by Homans (1961), and it reinforces 
the negative findings of earlier, indirect at- 
tempts to test this hypothesis (French & 
Snyder, 1959; Raven & French, 1958a, 
1958b). 

What directions do these results suggest 
for future research on legitimacy? The diver- 
gent effects of endorsement and normativity 
dictate separate avenues of investigation. 

First, endorsement may not determine the 
effectiveness of a leader’s influence attempts, 
but it does affect influence usage. In a related 
study, Michener and Burt (Note 1) found 
that leaders used more coercive influence 
under low endorsement than under high en- 
dorsement. Future research might investigate 
endorsement’s effect on a high-status person’s 
choice among influence modes (persuasion, 
coercion, etc.), as well as its impact on his 
or her choice between policy change and so- 
cial control. As noted by Gamson (1968), 
authorities can respond to influence from 
members by either acquiescing or exerting 
counterinfluence in the form of social control. 
The level of endorsement accorded the high- 
status person may affect this decision. 
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Second, future research might profitably 
investigate how variations in group structure 
affect normative influence. Norms are proper- 
ties of collectivities, and their efficacy depends 
on the probability that group members will 
form a conservative coalition to support them 
in the face of challenge. By separating the 
subjects, the present experiment precluded 
any collective response to the high-status per- 
son’s transgressions. Nevertheless, one might 
predict that if the low-status persons were 
able to communicate, they would respond as 
a coalition to the leader’s out-of-bounds be- 
havior (counternormative tax demands, usur- 
pation of collective funds). Thus, by com- 
paring an open-communication structure with 
that used in the present experiment, future 
research could test the hypothesis that the 
opportunity for coalitional activity strength- 
ens the normativity effect. 
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This investigation was conducted (a) to determine the generalizability of the 
inverse relation between differential reward structure and group efficiency in 
high task-interdependence situations from intragroup settings to cogroups, and 
(b) to compare the distribution of information in groups with equal and 
unequal reward structures. Reward structures (equal and unequal) were 
orthogonally crossed with two levels of cogroup relationships (independently 
and dependently related). The results clearly indicated that competition and 
cooperation were functions of the reward structures and dependency relation- 
ships between cogroups. Furthermore, magnitude of the inverse relation be- 
tween differential reward structure and group efficiency decreased when groups 
were dependently related; and the distribution of information was more equal 


in cogroups with an equal than with an unequal reward structure. 


The concepts of cooperation and competi- 
tion can be used to refer to the orientation of 
group members or, as in this article, to 
situational characteristics (Deutsch, 1949). 
(Obviously, the notions of orientation and 
situational characteristics are not mutually 
exclusive since the situational characteristics 
must inevitably modify the orientation of the 
group member and become manifested in 
what is described as “group dynamics.”) 
Thus, a situation is cooperative when the out- 
comes are distributed equally, that is, when 
gain by an individual group member contrib- 
utes to the gain of all group members; it is 
competitive when the outcomes are distrib- 
uted unequally, that is, when the gain of one 
member interferes with the gain of others 
(see, e.g., Kelley & Thibaut, 1969). 

Miller and Hamblin (1963; also see 
Thomas, 1957) concluded from their review 
of the literature on group cooperation and 
competition that the effects of these contrast- 
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ing social situations are further influenced by 
the degree of means or task interdependence. 
In a high-task interdependence situation, all 
group members work together thereby assur- 
ing that a problem is solved efficiently by the 
group as a whole. In contrast, a low task- 
interdependence situation is characterized by 
each person performing the task individually 
and independently. In its extreme form there 
is no opportunity for one member to facilitate 
or impede the efforts of the other persons 
present. Inasmuch as individuals are not 
interdependent, this situation is psycho- 
logically equivalent to a “nongroup.” 

Miller and Hamblin’s (1963) analysis led 
to an experiment in which they examined 
the effect of the interaction of outcome dis- 
tribution by task-interdependence on group 
efficiency. In high task-interdependence situ- 
ations, differential reward structure and group 
efficiency were found to be inversely related. 
Thus, cooperation was superior to competition 
in situations in which individual success was 
dependent on the performance of other mem- 
bers of the group. In low task-interdependence 
situations, cooperation had no distinctively 
superior effects; competition and cooperation 
had similar effects on “group” productivity. 

Since only single groups were employed in 
Miller and Hamblin’s study, their results 
relate mainly to situations involving intra- 
group cooperation and competition. It is also 
apparent that the presence of one group, in a 
coaction relation with another group (co- 
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groups), may clearly have an impact on the 
performance of the other group (Vinacke, 
Wilson, & Meredith, 1964, p. 479). Never- 
theless, the nature of this influence is not 
a simple one and requires an extension of the 
generalizations concerning intragroup cooper- 
ation and competition to more complex 
settings involving intergroup relationships. 

The present study was directed to the latter 
concern: Equal outcome distributions within 
a cogroup (i.e., all members of a given group 
receive equal reward) were assumed to influ- 
ence the adoption of cooperative strategies; 
unequal outcome distributions (i.e., members 
of each group receive different rewards) were 
hypothesized to influence competitive actions. 
However, the primary purpose of this study 
was to consider the relationship between the 
cogroups in order to explicate further the 
dynamics of cooperation and competition. 
Thus, it was hypothesized that when cogroups 
are independently related to one another the 
external group should have only minor effects, 
if any, on the social situation created since 
the reward structure (outcome distribution) 
for each group remains unaffected. Accord- 
ingly, the inverse relation between reward 
structure and group efficiency attained in a 
single group setting was predicted to gen- 
eralize to each of the independently related 
cogroups. 

When cogroups are dependently related, 
the external group can influence the kind of 
process adopted by the group. Thus, with 
equal intragroup outcome distribution (i.e., 
equal reward structures), the result is co- 
operation regardless of the intergroup rela- 
tionship since the allocation of rewards 
dictates only that the productivity of each 
group be optimized by its respective members. 
On the other hand, in groups with unequal 
reward structures the relationship of one 
group to the other becomes important in 
determining social interaction. As discussed 
above, independently related cogroups with 
unequal intragroup reward structures adopt a 
competitive strategy analogous to the one 
created in single group settings. But when 
such groups are dependently related, members 
of each group must maximize their gain 
within their respective group as well as the 
gain of their group relative to the other 
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cogroup. This consequence is partially co- 
operative and partially competitive. It re- 
quires a complex strategy if group efficiency 
is to be maximized: Initially the group 
member must cooperate with his partners to 
establish his group’s position. Once superior- 
ity of the group is attained, he then competes 
to establish his own position within the group. 
The employment of this strategy should re- 
sult in greater productivity of such cogroups 
relative to other cogroups which have unequal 
reward structure and which are independently 
related. 

On the basis of the rationale above, it was 
hypothesized that the difference in efficiency 
between cogroups with equal and unequal 
reward structures would be significantly less 
when cogroups are dependently related than 
when they are independently related. This hy- 
pothesis would be supported by a significant 
effect of the interaction of Reward Structure 
X Intergroup Relationship on the number of 
redundant interactions (transactions). 

The rationale above also implies that, with 
equal reward structures, members of each 
cogroup develop cooperative strategies (e.g., 
teamwork and pooling of information) which 
facilitate the locomotion of their partners 
toward a jointly shared goal. Conversely, with 
unequal reward structures, each member of 
a cogroup develops competitive strategies 
(e.g., hoarding of information) which fa- 
cilitate individual advancement toward a 
mutually, or partially, exclusive goal. This 
reasoning suggests that the distribution of 
information by individuals in groups with 
equal reward structures is significantly more 
nearly equal than the distribution of informa- 
tion by individuals in groups with unequal 
reward structures. This hypothesis would 
be supported by a significant main effect 
on the dispersion of information due to 
reward structure. 


METHOD 
Design 


The overall design of the present experiment was 
an adaptation of Samuels’s (1970) Reward Distribu- 
tion Model. As used in this study it implied a 
factorial analysis of variance with two between- ` 
subjects and one within-subjects factors. The 
between-subjects factors were (a) reward structure 
(equal and unequal) and (b) intergroup relation- 
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ship (dependent cogroups and independent cogroups). 
The within-subjects factor was problems (or trials) 
since each group solved four problems during the 
course of the experiment. 


Subjects 


The subjects were 96 students enrolled in an 
introductory educational psychology course. Par- 
ticipation in the experiment was voluntary. However, 
differential credit was given toward the subject’s final 
course grade, according to prestated performance 
criteria, in order to induce treatments. 


Apparatus and Materials 


The apparatus was simply two sets of partitions 
with three compartments in each. The partitions ap- 
peared to have a Y shape when viewed from above. 
A slot at the bottom of each compartment permitted 
cards to be exchanged among partners within a 
group, 

The information (i.e., numbers) was passed from 
one group member to another on 3” X 4” cards, 
Poker chips were used as tokens to represent points 
earned at given stages of play. Record sheets were 
provided on which subjects recorded (a) the in- 
formation received, (b) who the source of the 
information was, and (c) to whom the information 
was passed. 


- Task 


The task was a modification of one employed by 
Miller and Hamblin (1963). It required subjects to 
determine which of 10 numbers, from the series 1 
through 10, had been omitted by the experimenter. 
Each member of a cogroup (triad) knew three of 
the numbers, each of which was presented on a 
3” X 4" card. Since each subject had different num- 
bers, the correct answer could be determined only 
by sharing information, that is, by passing cards 
from one player to another through the slots. 

Play was guided by a constrained schedule in 
which a subject called for information from either 
of his fellow group members in a regular order. 
To permit flexibility in play and to permit the 
subject to capitalize on whatever information he 
had about the numbers other players controlled, 
the subject was permitted the choice of selecting the 
group member who was to provide the information. 
A problem was considered “solved” by a group 
when all subjects within that group knew the answer. 


Procedure 


During each “experimental session, subjects were 
assigned to the two groups by reference to a table 
of random digits. After each subject was seated at 


1 Two experimenters were used, one for each group, 
to administer the remaining instructions. The second 
experimenter, trained by the investigators, interacted 
with groups during the period when the two practice 
g were conducted. 
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his or her compartment, both the general instructions 
and a practice trial for learning how to acquire and 
pass information according to the rules of the game 
were administered.! The treatments were then in- 
duced by instructions and by manipulations of 
rewards (number of points earned toward final 
course grade). The treatments were as follows: 

Unequal reward-independent. Each time a request- 
ing player received a card which was not previously 
seen, a poker chip was received from the experi- 
menter. When a player received a card which had 
been previously seen, the experimenter took a chip 
from the requesting player. The player in each co- 
group who accumulated the most chips at the end 
of the experiment received 12 points, the person with 
the second highest total in each group received 10 
points, and the player with the lowest total in each 
group received 8 points. These instructions were 
intended to induce individual competition among 
group members, Rewards were distributed differen- 
tially among the members of a given cogroup, 
Furthermore, rewards were assigned independently to 
each group. 

Unequal reward-dependent. This treatment was 
similar to the immediately preceding one except that 
each subject competed in two ways: (a) against 
members of his/her group and (b) as a member of 
a group competing against the other group. The 
performance of one group, relative to the other, 
was as much a consideration in the allocation of 
rewards as the performance of an individual within 
aà group if the performer was to maximize his gain, 

The player in the better of the two groups, that 
is, the cogroup which accumulated the most chips, 
received 15 points. The other two players in this 
group received 10 points. All players in the poorer 
of the two cogroups, that is, in the cogroup which 
accumulated a lower total number of chips, received 
8 points. 

Equal reward-independent. In this treatment, each 
subject competed solely as a member of a group 
against a criterion. All subjects within each group 
received the same reward, In addition, rewards were 
assigned independently to each group. Each time a 
requesting player received a card which he/she had 
not seen previously each member of the group re- 
ceived a poker chip from the experimenter. When 
anyone within a group received a card which he/she 
had already seen, then the experimenter took a chip 
from each of the other members of the group. All 
players in the cogroup which accumulated at least 
55 poker chips received 12 points. All players in 
the cogroup which failed to accumulate at least 55 
chips received 8 points, 

Equal reward-dependent. This treatment was in- 
tended to result in one group competing against 
the other cogroup. It differed from the immediately 
preceding one by the allocation of rewards. Thus, 
while all subjects within a given group received the 
same (equal) reward, the amount of reward differed 
for the “winning” and “losing” groups. All players 
in the cogroup which possessed the most chips at 
the end of all problems received 12 points, while 
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all players in the group which had the lower total 
number of chips received 8 points. 

Once the subjects in a given treatment understood 
the rules of play, another practice trial was admin- 
istered to make certain the rules were employed 
properly. Then, a total of four different problems 
were played to solution by all members of both 
groups within a given experimental session. The 
number of chips accumulated by each subject and/or 
group (depending on the experimental treatment) 
was posted on a blackboard upon completion of 
each problem. The nature of each interaction be- 
tween the initiator of a request and the contributor 
of information was recorded by the experimenter, 
as well as by each subject, during the course of 
problem solution. 

When the experiment was completed, the experi- 
menter announced the final results and awarded 
points to group members according to the rules 
under which the subjects in given treatments were 
participating. An induction questionnaire was admin- 
istered and a debriefing period followed. 


RESULTS 
Induction Measures 


An initial analysis was conducted to deter- 
mine whether subjects in groups with equal 
reward structures had perceived themselves as 
engaging in behaviors facilitating intragroup 
cooperation to a greater extent than subjects 
in groups with unequal reward structures. 
Responses to the following three items from 
the postexperiment questionnaire were used: 
(a) “To the best of my ability, I attempted 
to help the other players in my group get the 
missing number,” (where 1 = all the time and 
5 = never); (b) “I tried to pass cards which 
the other players had already seen,” (where 

= never and 5= all the time); and (c) 
“When I passed cards to the other players 
in my group, I passed cards which helped 
them” (where 1 = always and 5 = never). 
The ratings for the three items, appropriately 
reversed where necessary, were summed to 
obtain a measure of intragroup cooperation.? 

The analysis revealed that subjects in 
equal reward conditions (X = 4.02) perceived 
themselves to be more cooperative with re- 
spect to the other members of their group 
than did subjects in unequal reward structure 
groups (X = 9.27), #(94) = 15.91, p < .001. 


? Possible ratings ranged from a minimum of 3 to 
a maximum of 15. Low ratings represented a trend 
toward intragroup cooperation. 
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The results of this analysis imply that the 
manipulations were successfully induced. 


Group Efficiency 


The effects of reward structure, intergroup 
relationship, and problems (or trials) on 
group efficiency, as measured by the number 
of redundant interactions, were examined by 
a mixed analysis of variance (see Design 
section above). A redundant interaction was 
defined operationally as an exchange of in- 
formation in which the initiator (requestor) 
received a piece of information which he had 
seen previously. Since “dependently related” 
comprised one level of the intergroup rela- 
tionship factor, scores of individual groups 
could not be employed as independent obser- 
vations. Instead, in this analysis, as well as in 
the one which follows it, the unit of observa- 
tion was the pooled scores of the paired 
groups within a session. 

The analysis yielded (1,12) = 60.17, 
< .001 for the effect due to reward struc- 
ture; F(1, 12) = 26.11, p < .001 for the effect 
due to intergroup relationship; and F (3,36) 
= 5.66, p < .001 for the effect due to prob- 
lems. The mean redundancy score for the 
equal reward groups was lower (X = 4.17) 
than that for the unequal reward groups 
(X = 14.75). The mean redundancy score for 
the dependent treatments was lower (X = 
6.09) than that for the independent treat- 
ments (X = 12.97). For all groups there was 
a significant decrease in the number of 
redundant interactions over problems. 

While these main effects are related to and 
provide general support for the initial hy- 
pothesis, they must be qualified further by 
the significant (p < .05) interaction between 
reward structure and intergroup relationship 
which is graphically depicted in Figure 1, 
There it can be seen that the difference in 
efficiency between the equal and unequal re- 
ward conditions is substantially less when co- 
groups are dependently related than when 
they are independently related. Follow-up 
multiple comparison tests using Tukey’s 
(Games, 1971) wholly significant difference 
test revealed that the unequal rewards- 
independent cogroups were significantly 
($ < .05) less efficient than both the unequal 
rewards-dependent and equal rewards-indepen- 
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Ficure 1. Efficiency of cogroups as a function of 
dependency relations and reward structure. (High 
scores are related to low efficiency and low scores to 
high efficiency.) 


dent cogroups. This finding provides support 
for the initial hypothesis. 


Distribution of Information 


The distribution of information, that is, 
the extent to which information was shared 
by group members, was indexed by taking 
the absolute value of the proportion of inter- 
actions in which each subject was engaged 
and subtracting it from a constant (.67).* 
These scores were algebraically summed 
across all subjects within a session. The mag- 
nitude of the deviation of the score from zero 
indicated the degree of inequitable distribu- 
tion of information. 

The data based on the dispersion index 
were analyzed via a mixed analysis of vari- 


3 This procedure in no way alters the results. 
Scores were subtracted from the constant (.67) only 
to permit the index of dispersion to sum to zero if 
all group members participated equally. 
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ance (see the Design section). Only the first 
nine interactions, among group members, for 
each problem were used. Although each 
group member had been required to complete 
the task, some subjects were able to retain 
their clue cards initially because they partici- 
pated infrequently or because of the strategy 
they used. Eventually, however, the retained 
clue cards had to be circulated with the con- 
sequence of balancing the overall degree of 
participation, Accordingly, data for the first 
nine interactions were chosen as representa- 
tive of the dynamics involved in the distribu- 
tion of information. (Although the last half 
of the total number of interactions might 
also have been used, the data would have 
been less conveniently analyzed since different 
numbers of trials were taken to complete the 
problems by different subjects.) 

The analysis of the dispersion index scores 
yielded F(1,12) = 19.20, p< .001 for the 
effect due to reward structure. The mean dis- 
persion index for the equal reward condition 
was .28 and for the unequal reward condition 
it was .70. These results provide support for 
the second hypothesis. In addition, there 
was a significant decrease in dispersion of 
information over problems, F (3,36) = 3.00, 
p < .05. None of the other main or interactive 
effects were significant. 


Discussion 


The results of this study pointedly suggest 
an inverse relation between differential re- 
warding and group efficiency in cooperation— 
competition. Cooperation led to more efficient 
problem-solving activity than competition 
when participants are engaged in a high task- 
interdependent situation. 

The significant effect of the interaction be- 
tween reward structure and intergroup rela- 
tionship indicates that the interpretation of the 
inverse relation between differential rewarding 
and group efficiency must be qualified with 
coacting groups according to the nature of the 
intergroup relationship. When cogroups are 
independent, the relation between differential 
rewards and group efficiency is substantially 
the same as it is in single groups (see Miller 
& Hamblin, 1963). Inasmuch as these groups 
are independent as defined by the reward 
structure, this result is to be expected. On the 
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other hand, the difference in group efficiency 
between the unequal rewards-dependent 
and unequal rewards-independent conditions 
clearly indicates that the magnitude of the 
relation between differential rewarding and 
group efficiency is substantially less when 
groups stand in a dependent relation to one 
another. The mere presence of another group 
is insufficient to influence the performance of 
a given group. It is the relationship between 
them that determines their dynamics. 

When cogroups are dependently related, the 
allocation of rewards to members of one 
group is dependent on performance relative 
to the other group. Thus, the results of this 
study lead to the conclusion that partners of 
one group must develop sufficient commonal- 
ity of interest if they are to outperform the 
members of the other group. In order to at- 
tain a common goal, it is necessary that they 
engage in relatively efficient problem-solving 
processes. This is the state of affairs which 
mediates the conflicts of interest facing mem- 
bers of groups in the unequal reward-depen- 
dent conditions. Not only must they co- 
operate with their partners in order to out- 
perform the other group, they must also 
maximize their personal reward by striving 
to outperform their partners. The generaliza- 
bility of the relation between differential 
rewards and group efficiency in single groups 
is limited when we consider dependently re- 
lated cogroups. These conclusions also imply 
that the efficiency of groups in which out- 
comes (rewards) are distributed differentially 
can be increased by introducing elements of 
intergroup competition into task situations. 

Cogroups in equal reward conditions were 
found in the present study to distribute infor- 
mation in a more equal fashion than groups 
in unequal reward conditions. An examination 
of the records of play together with the data 
described in the Results section strongly indi- 
cated that partners in unequal reward condi- 
tions used the strategy of hoarding their ini- 
tial information (clues) by passing redundant 
information. This is a blocking strategy which 
prevents the other members of the group from 
achieving their goal. Since partners exposed to 
the blocking strategy were faced with the 
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threat of receiving redundant information, ii 
was necessary to decide who possessed the 
needed information. Then the player might 
have employed the strategy of engaging the 
partner with the fewest cards. Other times two 
members might consort to defeat the remain- 
ing player by reducing his supply of cards 
and increasing the likelihood that the initiator 
would receive novel information. These par- 
ticular strategies became sharpened over 
trials as the subjects learned not only the 
strategies more precisely but also learned how 
to implement the strategy to their advantage. 

In conclusion, it is of interest to note that 
most investigators in this area of research 
have attempted to determine the relation be- 
tween group cooperation and competition on 
productivity (outcomes). On the other hand, 
the present study has indicated that an ex- 
perimental approach is amenable to investi- 
gating the dynamics of the groups in terms of 
the strategies (processes) employed. Clearly, 
cooperation and competition are not all-or- 
none processes which apply equally to all 
situations. They are consequences of relation- 
ships among members and between or among 
groups. Which consequence or combination -of 
consequences is achieved depends, at least in 
part, on reward structures and dependency 
relationships. 


REFERENCES 


Deutsch, M. A theory of cooperation and competi- 
tion. Human Relations, 1949, 2, 129-152. 

Games, P. A. Multiple comparison of means. Ameri 
can Educational Research Journal, 1971, 8, 531 
565. 

Kelley, H. H., & Thibaut, J. W. Group problem 
solving. In G. Lindzey & E. Aronson (Eds.), The 
handbook of social psychology (2nd ed.) (Vol. 
4). Reading, Mass.: Addison-Wesley, 1969, 

Miller, L. K., & Hamblin, R. L. Interdependence, 
differential rewarding and productivity. American 
Sociological Review, 1963, 28, 768-778. 

Samuels, F. The intra- and inter-competitive group. 
Sociological Quarterly, 1970, 11, 390-396. 

Thomas, E. J. Effects of facilitative role interde- 
pendence on group functioning. Human Relations, 
1957, 10, 347-366. 

Vinacke, W. E., Wilson, W. R., & Meredith, G. M. 
Dimensions of social psychology. Chicago: Scott, 
Foresman, 1964, 


(Received September 12, 1973) 


Journal of Personality and Social Psychology 
1975, Vol. 31, No. 4, 621-633 


Development of Integrative Solutions in Bilateral Negotiation 


Dean G. Pruitt and 
State University of Ni 


Steven A. Lewis 
ew York at Buffalo 


Two studies of bilateral negotiation are reported. Both were designed to 
reveal the conditions under which bargainers reach agreement involving high 
joint profit. The results suggest the importance of three approaches to negoti- 
ation: (a) the distributive approach, which reduces joint profit; (b) the 
heuristic trial-and-error approach, which enhances joint profit; and (c) the 


information-exchange approach, which 


enhances joint profit when adopted 


by bargainers with high cognitive complexity. Bargainers who start with a 
problem-solving orientation tend to adopt one of the latter two approaches, 


The most integrative solutions appear 
members have this orientation together 
limits or levels of aspiration. 


The research reported here was concerned 
with the processes and conditions that lead to 
the development of more integrative agree- 
ments in bilateral negotiation. An agreement 
is said to be more “integrative” the greater 
its joint utility, that is, the more valuable it is 
to the two bargainers taken together. The 
term derives historically from Follett’s (1940) 
notion of “integration,” which she viewed as 
the creative process by which bargainers dis- 
cover new options that are better for both 
parties than those currently under considera- 
tion. Such options can be thought of as inte- 
grating the needs of the two bargainers. Fol- 
lett contrasted integration with “compromise” 
in which agreement is reached on an obvious 
option by a series of mutual concessions. 
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T These authors call the process or processes by 
which such solutions are adopted “integrative bar- 
gaining.” 
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to be developed by dyads whose 
with high and somewhat inflexible 


Our concept is broader than Follett’s in 
that it is not limited to new options. Often in 
an array of known options, some have greater 
joint utility than others. The process by which 
such options are adopted seems equally as 
important as that by which new options are 
forged. Furthermore, in practice, it is often 
difficult to tell whether bargainers were 
aware from the start of the solution they 
finally adopted and hence whether it is new. 
We also prefer to talk in terms of an overt 
event, agreement on a more or less integrative 
option, rather than a covert event, the sup- 
posed process of integration, because our re- 
sults suggest that more than one process is 
involved. 

There are several reasons for studying the 
development of highly integrative agreements 
in negotiation. Such agreements are tanta- 
mount to achieving the greatest good for the 
greatest number. Furthermore, where both 
parties have high aspirations or limits (mini- 
mum profit requirements), agreement can 
only be reached if highly integrative options 
are found. In addition, Follett points out that 
nonintegrative agreements often fail to resolve 
long-run conflict. A compromise that one or 
both parties finds barely satisfactory may set- 
tle an issue for the time being. But the result- 
ing truce is likely to be shaky, 

Walton and McKersie (1965) have postu- 
lated several conditions that lead to the adop- 
tion of more integrative options:+ (a) Several 
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issues are considered simultaneously. This 
permits the development of trade-offs in 
which one party’s concessions on one issue 
are exchanged for the other party’s conces- 
sions on another issue. (b) The bargainers 
view the issues as problems to be solved. 
Such perceptions ordinarily produce flexible, 
inventive behavior in which the bargainers 
seek out and discuss a variety of options. (c) 
There is a free exchange of accurate and be- 
lievable information about utility structures 
(preferences and needs), so that the bar- 
gainers have enough information to gain in- 
sight into how to integrate one another’s util- 
ity structures. Some degree of trust is likely 
to be a prerequisite for such an exchange of 
information. (d) The bargainers avoid “dis- 
tributive” behavior, which involves efforts to 
persuade each other to make concessions, by 
the use of threats and positional commitments 
or the withholding of information about one’s 
own preferences. 

Walton and McKersie also stress the im- 
portance of motivation, but the point has 
been most clearly made by Follett, who argues 
that integrative solutions are most likely to 
be adopted when (e) the bargainers “firmly 
uphold” their own viewpoints unless and un- 
til they are convinced that these are not vi- 
able. In her words, “Mushy people are no 
more good at this than stubborn people” 
(1940, p. 48). This makes sense because bar- 
gainers who maintain a high level of aspira- 
tion and are slow to make concessions avoid 
premature compromises and take the trouble 
that is often needed to seek an agreement 
that provides high joint benefit. 

Two sets of researchers (Froman & Cohen, 
1970; Walker & Thibaut, 1971) have con- 
firmed Walton and McKersie’s first postulate, 
but the other four have not been directly 
tested. 

Kelley and Schenitzki (1972) have ad- 
vanced and tested an interesting hypothesis 
about the special case in which a highly inte- 
grative option is adopted from a set of known 
options. It holds that integrative agreements 
develop when the bargainers make a system- 
atic series of concessions, in which each party 
reduces the profit level he is seeking only 
after mentioning every known option at that 
level. This hypothesis resembles Follett’s in 
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its emphasis on maintaining level of aspira- 
tion, but it adds the novel notion of varying 
one’s proposal within each level of aspiration. 
The beauty of this hypothesis is that it can 
be derived quite logically, without making 
psychological assumptions, from a simple geo- 
metrical analysis of what happens to joint 
utility when bargainers take such a systematic 
approach. These authors have demonstrated 
empirically that bargainers who take this ap- 
proach usually end up with highly integrative 
agreements. 

Two studies are reported in this article. One 
involved an effort to test the second, third, 
and fifth postulates from the writings of Fol- 
lett and Walton and McKersie and to repli- 
cate, with a different experimental task, Kel- 
ley and Schenitzki’s basic finding. The other 
study tested a proposition about the impact 
of bargainer personality derived from an as- 
sumption underlying Walton and McKersie’s 
third postulate. In both studies, extensive ob- 
servations were made of the actual bargaining 
events, which permitted development of an 
understanding of the processes underlying the 
adoption of integrative agreements. 

The task employed in both studies involved 
simultaneous consideration of three issues. As 
is so often true in multiissue negotiation, 
these issues had differing importance for the 
two negotiators. Hence in order to find the 
more integrative agreements, it was necessary 
to develop trade-offs, with each bargainer 
making substantial concessions on the issue or 
issues of lesser importance to him.” 


EXPERIMENT 1 


The hypotheses tested in this study were 
the following: 

(1) More integrative solutions are achieved 
under a problem-solving orientation than un- 
der an individualistic orientation. By “prob- 
lem-solving orientation” is meant a desire to 
solve the problem of how to satisfy both 
parties’ needs; and by “individualistic orien- 
tation,’ a desire to locate a solution that 
satisfies only one’s own needs. This hypothe- 
sis was derived from the second postulate 
presented above. 

(2) Provided that some of the available 


2 Walker and Thibaut (1971) call the process by 
which trade-offs are developed “logrolling.” 
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TABLE 1 
BUYER AND SELLER PROFIT SHEETS 
Buyer Seller 
Tron Sulfur Coal Tron Sulfur Coal 
Price | Profit | Price | Profit | Price | Profit | Price | Proft | Price | Proft | Price | Profit 
A $2,000 A $1,200 A $800 A $000 A $000 A $000 
B $1,750 B $1,050 B $700 B $100. B $150 B $250 
C $1,500 Ç $900 Cc $600 [e $200 Cc $300 C $500 
D $1,250 D $750 D $500 D $300 D $450 D $750 
E $1,000 E $600 E $400 E $400 E $600 E $1,000 
F $750 F $450 F $300 F $500 F; $750 F $1,250 
G $500 G $300 G $200 G $600 G $900 G $1,500 
H $250 H $150 H $100 H $700 H $1,050 A, $1,750 
I $000 I $000 E $000 I $800 I $1,200 I $2,000 


Note. Prices were referred to by letters. The numbers under each commodity represents profits to be made on that commodity at 


the particular price. 


options satisfy both parties’ limits, higher 
limits lead to more integrative solutions. A 
bargainer’s “limit”? is the level of utility be- 
low which he will not concede. This hypothe- 
sis was based on the fifth postulate above. 

(3) More integrative solutions are found 
to the extent that. the bargainers exchange 
truthful and trusted information about their 
utility structures. This hypothesis reflected 
the third postulate. 2 


Method 
Subjects 


Ninety-two male undergraduates from the State 
University of New York at Buffalo took part in 
this study. All participated in order to fulfill an 
imtroductory psychology course requirement and 
were volunteers in the sense of being permitted to 
write a short paper in lieu of this requirement. 


Task 


The task was adapted from one used by Kelley 
(1966). It required pairs of subjects, taking the roles 
of buyer and seller, to agree on prices for three 
commodities: iron, sulfur, and coal. The buyer could 
gain his highest profits on iron and the seller on 
coal, so that mutually profitable trade-offs could 
be negotiated if the buyer made heavy concessions 
on coal and the seller on iron. 

The profit schedules given to buyer and seller are 
shown in Table 1. They listed nine prices, repre- 
sented by letters from A to I. Next to each price on 
the seller’s sheet is his profit for selling each com- 
modity at that price. Next to each price on the 
buyer’s sheet is his profit for reselling each com- 
modity if purchased at that price. The reader can 
note that the buyer receives his highest profits on 
iron and the seller on coal. 


Although specifically told that any combination 
of letters is permissible, many dyads approach this 
task by proposing and agreeing on the same or nearly 
the same price for all three commodities. Such 
“horizontal” options yield a joint profit in the vi- 
cinity of $4,000 for the dyad as a whole. A joint 
profit in excess of this level can only be achieved by 
means of a “diagonal” option, involving a lower 
price (closer to A) for iron than for coal. The 
greater the difference in price between iron and 
coal, the larger is the joint profit. Thus D-F-F 
totals $4,400, B-C-H totals $4,900, and A-E-I totals 
$5,200 (as do A-B-I, A-F-I, and A-I-I, since the 
price of sulfur is immaterial to the joint profit). 
Hence the most integrative options lie on the 
diagonal. 


Design 


A 2X22 factorial design was employed, with 
two levels of bargaining orientation (individualistic 
ys. problem solving), two levels of limit (high vs. 
low), and two levels of communication freedom 
(free vs. truthful). Pairs of subjects were randomly 
assigned to the eight cells of this design, with five or 
six pairs per cell. The dyad (pair of interacting sub- 
jects) served as the unit of statistical analysis, 


Procedure 


Upon arrival at the laboratory, subjects were told 
that they were going to participate in a simulated 
negotiation task. When both members of the dyad 
had arrived, they were led to separate rooms and 
randomly assigned the role of either buyer or seller. 
Each was given the appropriate profit sheet and a 
position paper which provided him with general 
information about his company. All subjects were 
told to feel free to use part or all the information 
provided in the position paper in shaping their bar- 
gaining stance and to create additional arguments to 


~ bolster their position if they desired. After answering 


any questions that might have arisen by this time, 
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further instructions were presented that differed 
from condition to condition. 

In presenting the orientation instructions, all sub- 
jects were told that their company had “developed a 
bargaining technique which all of its negotiators have 
been instructed to use.” The instructions for the 
problem-solving orientation were, “This approach 
views the bargaining situation as a solvable problem. 
Attempt to play down the conflict nature of the 
task and view it as a problem situation. Naturally 
you want to make as much profit as you can, but 
you are interested in the needs of the other com- 
pany.” For the individualistic orientation (10), the 
instructions were, “Your job is to make as much 
profit for your company as possible. The needs and 
profits of the other negotiator are unimportant to 
you.” 

Limits were manipulated by telling the subjects 
that they were expected by their firms to make a 
certain minimum profit: $2,000 in the low-limit con- 
dition (LoL), a sum that could be achieved by both 
parties with the horizontal option E-E-E, and $2,300 
in the high-limit condition (HiL), a sum that could 
only be achieved on the diagonal. 

In the truthful communication condition (TC), 
subjects were told that any information they chose 
to give about their profit schedule had to be truth- 
ful and that the experimenter would enforce this 
restriction. This instruction was designed to ensure 
that communications about profit schedules would 
be valid and trusted. In the free communication con- 
dition (FC), subjects were told that they could say 
anything they desired and could, if they wished, 
bluff with respect to their profit sheet. All subjects 
were told that they could not show their profit 
schedules to one another. 

Both members of a dyad were given the same 
orientation, aspiration, and communication instruc- 
tions. After these instructions had been completed, 
both subjects were brought into a single room. They 
were told that they both had received the same 
orientation and communication instructions but noth- 
ing was said about the other bargainer’s limit Jevel. 
Then face-to-face negotiations began. All sessions 
were tape-recorded. The experimenter remained in 
the room to enforce the TC instruction and to 
observe the negotiation. No definite time limit was 
set for a session, although subjects were told they 
would have about 30 minutes to reach agreement. At 
the completion of a session, subjects were inter- 
viewed, given a postsession questionnaire, and fi- 
nally completely debriefed. 


Process Measures 


Process measures were developed by coding the 
discussions between the bargainers. Eight coding 
categories were employed: 

Asks for truthful information (ATI) is a request 
by one bargainer for information about the other's 
profit schedule or limit. 

Gives truthful information (GTI) is a statement 
of truthful information about one’s own profit 
schedule or limit. 
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Gives false information (GFI) is a lie about one’s 
profit schedule or limit. 

Calls for concession (CALL) is a demand for a 
unilateral concession from the other bargainer. 

Uses pressure tactic (PRESS) is an effort to 
persuade the other bargainer to concede. Included in 
this category were threats, positional commitments, 
and statements that brought in extraneous issues. 
This category was modeled after Walton and Mc- 
Kersie’s description of distributive behavior. 

Proposes general approach (GENAPP) is the sug- 
gestion of a general approach that might be used to 
produce agreement. 

Shows concern (CONCERN) is a statement indi- 
cating interest in the other bargainer’s welfare. 

Proposes coordination (COORD) is a proposal for # 
mutual concessions or a trade-off of one commodity 
for another. 

The coding unit was a “statement” from a bar- 
gainer, defined as anything said between the time 
when the other bargainer stops speaking and starts 
again. More than one category could be assigned to 
a statement (e.g, ATI and CONCERN) but the 
same category could not be assigned twice to a 
statement. All protocols were coded twice. The 
coders initially agreed on 76% to 94% of the state- 
ments, depending on the category. They conferred. 
about all disagreements and resolved them as ra- 
tionally as possible. 

Each dyad received a score for each of the coding 
categories consisting of the number of statements 
assigned to that category divided by the total time 
spent in negotiation. This was essentially a measure 
of the rate of emission for that category. 

Two other process measures were employed, both 
based on the proposals made by the bargainers dur- 
ing their discussion. One (NUMDIF) was the num- 
ber of different three-letter proposals made by each 
bargainer summed over the two bargainers and 
divided by total time spent in negotiation. The 
other (SYSCON), was an adaptation of Kelley and 
Schenitzki’s index of systematic concession making. 
A count was made of the number of offers that (a) 
provided a profit within $200 of the same bargain- 
er’s previous offer and (b) involved a demand for 
greater profit on one commodity and less on another, 
This count was divided by the total number of times 
either bargainer’s offer was changed during the ses- 
sion. The higher the resulting index, the more use 
was being made of the technique of exploring 


various proposals at one profit level before moving 
to another. 


ar 


Results 
Outcome Measures 


Of the 11 dyads assigned to a combination 
of high limits with an individualistic orienta- 
tion (IO-HiL), 5 broke off negotiations with- 
out agreement, whereas all other dyads 
reached agreement, Exact tests of significance 
between the results in this condition and each 
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of the other three formed by combinations of 
limit levels and orientations showed that such 
a difference could occur by chance less than 
5% of the time. Essentially this suggests the 
existence of an interaction between orientation 
and limit, such that dyads have more diffi- 
culty reaching agreement under high than 
under low aspiration unless their members 
adopt a problem-solving orientation. 

The obvious measure of the integrativeness 
of the final agreement, joint profit (buyer 
profit plus seller profit), presented problems 
because of the failure of so many dyads to 
reach agreement in the IO-HiL condition. The 
variance under this condition was much 
greater than under the others, which created 
statistical difficulties. Furthermore, it did not 
seem right to assign a score of 0 to dyads 
that failed to find a diagonal solution under 
high limits, when similar dyads would have 
reached agreement and received a score of 
$4,000 under low limits. These problems were 
resolved by awarding a corrected joint-profit 
score of $4,000 to dyads that failed to reach 
agreement. In interpreting the results, we 
recognized that this approach was somewhat 
arbitrary. Mean values for this variable are 
shown in Table 2, with the values in parenthe- 
ses indicating separately the means for the 
dyads that reached agreement in the IO-HiL 


~ condition. 


Hypothesis 1 (and hence the second postu- 
late attributed to Walton and McKersie) was 
clearly supported, in that dyads achieved 
higher joint profits under the problem-solving 
than under the individualistic orientation (F 
= 20.17, p < .01). This result would have 


TABLE 2 
- _ JOINT PROFIT ACHIEVED IN EXPERIMENT 1 


Problem-solving Individualistic 


orientation orientation 
Limit 

Truthful | Free | Truthful | Free 
communi- | communi- | communi- | communi- 

cation cation cation cation 

High 5,050 5,000 4,390 4,450 
(4,650) | (4,900) 

Low 4,725 4,450 4,450 4,175 


Nole. Scores can range from 4,000 to 5,200. Numbers in 
parentheses refer to the dyads that reached agreement in 
1O-HiL, There were three such dyads in each cell of this con- 
dition. 
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been even more marked had we assigned the 
raw score of O to those dyads that failed 
to reach agreement. 

A statistically significant main effect was 
also found for limit (F = 8.41, p < -01), but 
it is difficult to take this seriously in the 
case of the individualistic orientation, where 
the effect was small and would have been re- 
versed had we employed uncorrected joint- 
profit scores. A consistent limit effect can be 
seen if we take into account only dyads that 
reached agreement (and consider only the ma- 
terial in parentheses for the IO-HiL condi- 
tion). Hence Hypothesis 2 (Follett’s postu- 
late, labeled “e” above) was supported in the 
case of dyads that reached agreement, where 
higher limits consistently led to more inte- 
grative results, 

The interaction between orientation and 
limit was nearly significant (F = 2.52, p = 
-12). A comparison among the means based 
on those dyads that reached agreement sug- 
gests that it is entirely due to the previously 
mentioned failure of some of the dyads in the 
IO-HiL condition to reach agreement. 

The communication-freedom variable failed 
to yield any significant results, though the 
main effect was in the predicted direction. 
Hence Hypothesis 3 (and the third postulate 
attributed to Walton and McKersie, from 
which it was derived) received no support. 

An analysis of variance was also performed 
on a measure of the time taken to reach 
agreement or break off negotiation; but there 
were no significant results except for a diffi- 
cult to interpret, barely significant, second- 
order interaction. 


Process Measures 


Under the problem-solving as opposed to 
the individualistic orientation, significantly 
more truthful information was transmitted 
(GTI, p< .01), more general approaches 
were proposed (GENAPP, p= .09), more 
interest was expressed in the other party’s 
welfare (CONCERN, p= .06), more pro- 
posals for mutual coordination were made 
(COORD, p< .05), less false information 
was transmitted (GFI, p < .01), less use was 
made of pressure tactics (PRESS, p < 105); 
and there were fewer calls for concession 
(CALL, $ = .08). In addition, a greater vari- 
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ety of proposals was made per unit of time 
(NUMDIF, p < .01), and concessions were 
made more systematically, in the Kelley- 
Schenitzki sense (SYSCON, p<.01). In 
summary, the problem-solving instructions 
appear to have produced an open, communi- 
cative atmosphere in which the bargainers 
were concerned about one another’s outcomes, 
tried out in a systematic way a variety of 
general and specific approaches, and refrained 
from distributive behavior. 

Limit level only had an effect on SYSCON, 
with more systematic concession making un- 
der high than under low limits ($ < .01). 

In the truthful as opposed to the free 
communication condition, more requests were 
made for information about the other party’s 
profit structure (ATI, p < .01) and less false 
information was transmitted (GFI, p < .01). 
However only somewhat more truthful infor- 
mation about profit structure was transmitted, 
and this difference was not statistically sig- 
nificant (GTI, p = .13). Hence it is not clear 
that our TC instructions were successful in 
encouraging bargainers to exchange enough 
information about their profit structures for 
insight to develop into how to reconcile these 
structures. This may account for the failure 
of the communication-freedom variable to 
affect joint profit. 

Tn an effort to evaluate the mechanisms 
underlying the attainment of integrative s0- 
lutions, the process measures were correlated 
with corrected joint profit. Within-cell co- 
efficients were computed so as to avoid con- 
founding the effects of the independent varia- 
bles with the effects of these processes. No 
systematic differences were found among these 
coefficients as a function of the experimental 
variables, so average within-cell coefficients 
were computed.* These are shown in Table 3. 


3 The formula employed computes the correlation 
between two variables after the scores have been 
adjusted for their cell means. In this connection, 
the cells refer to the eight treatments produced by 
the 2 X 2 X 2 design. 

4It is possible that the positive correlation be- 
tween NUMDIF and SYSCON is simply a self- 
correlation, since both are measures of rates of pro- 
posal emission. But this possibility seems remote in 
light of the fact that SYSCON was corrected by 
dividing by the number of proposal changes—a mea- 
sure that turned out to be practically the same thing 
as the numerator of NUMDIF. 
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TABLE 3 


WITHIN-CELL CORRELATIONS OF Process MEASURES 
WITH JOINT PROFIT IN EXPERIMENT 1 


Measure Integrativeness* 

Asks for truthful information 04 
Gives truthful information 24 
Gives false information =.26 
Calls for concession —.09 
Uses a pressure tactic —.57** 
Proposes general approach E rhi 
Shows concern .02 
Proposes coordination 15 
Number of different proposals 40* 
Systematic concession making 28 
adf = 37. 

*p <.05. 

**p <01 


The results indicate that integrative solu- 
tions were more often achieved in dyads 
whose members proposed more general ap- 
proaches to reaching agreement (GENAPP), 
proposed a larger variety of concrete options 
(NUMDIF), were more systematic in their 
concessions (SYSCON), and made less use 
of pressure tactics (PRESS). The first three 
of these indices were significantly positively 
intercorrelated (7 = .40, p < .05).* The last 
index was inversely but not significantly re- 
lated to the other three (7 = —.18, ms). 

Correlations were also computed between 
the process indices for buyer and seller across 


all dyads, within cells. Significant positive, 


correlations were found for all of the indices 
that were predictive of joint profit: GENAPP 
(r= 44, p < .01), NUMDIF (r= .70, p< 
01), SYSCON (r=.42, p<.01), and 
PRESS (r = .62, p < .01). Also CONCERN 
(r= 35, p < .05) and COORD (r=.35, p 
<.05) showed positive correlations. These 
results indicate that the two members of a 
dyad exhibited somewhat similar levels of the 
index in question. 


Discussion 


Given the existence of some options that 
satisfy both parties’ limits, the results sug- 
gest that a combination of high limits and a 
problem-solving orientation is particularly 
likely to lead to highly integrative agree- 
ments. How can this be explained? 

For the purpose of explanation, this find- 
ing can be divided into three separate parts: 


(1) Agreements reached where both bar- 


gainers have higher limits are, on the average, 


ge? 
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more integrative than those reached where 
both have lower limits. This is easy to in- 
terpret, since it is self-evident that the mini- 
mal level of joint profit necessary for reaching 
agreement goes up as limits increase. 

(2) Given an individualistic orientation of 
the kind usually found in bargaining, higher 
limits mean that fewer dyads reach agree- 
ment. This can be explained in two ways: 
(a) Higher limits imply higher minimum 
joint-profit requirements and hence the avail- 
ability of fewer minimally acceptable options. 
Hence dyads whose members have higher 
limits have greater diffculty finding options 
upon which they can agree. (b) Where, as in 
the task used in the present study, joint profit 
is a function of the extent to which the bar- 
gainers trade concessions on different issues, 
higher limits mean that each bargainer must 
make deeper concessions on at least one issue 
in order to reach agreement. Deeper conces- 
sions are more difficult to make (Kelley, 
Beckman, & Fischer, 1967); hence, it is more 
difficult to reach agreement. 

(3) A problem-solving orientation over- 
comes the difficulty in reaching agreement 
produced by higher limits. Presumably this 
is because such an orientation makes it possi- 
ble to find more integrative options, if and 
when higher limits require the bargainers to 


~ adopt such options. But why should it have 


this effect? To answer this question, we must 
turn to the results for the process measures. 

Four of our measures were related to the 
level of joint profit in the final agreement. 
Hence they may provide clues to the process 
by which some dyads reached more integrative 
agreements than did others. 


Three of these measures were significantly 
intercorrelated, suggesting that they may re- 
flect different aspects of a single process, 
which appears to have something to do with 
the way bargainers change their proposals. 
One way of describing this process is to say 
that members of dyads that adopted more 
integrative agreements engaged in a sophisti- 
cated form of trial and error. They varied 
their proposals more frequently, both at the 
specific (NUMDIF) and general (GENAPP) 
levels; and they made concessions in a more 
systematic way, proposing more options at 
one profit level before moving to another 
(SYSCON). Because such behavior was pro- 
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ductive of the better agreements, we have 
tentatively labeled it heuristic trial and error. 

The success of the systematic-concession 
index (SYSCON) in predicting level of joint 
profit replicates Kelley and Schenitzki’s re- 
sult. It also adds generality to their result, 
since we used a different task and procedure: 
Those authors employed the Siegel and 
Fouraker (1960) task, in which all options 
are presented at the beginning of negotiation; 
while in our task, options must be constructed 
by combining the possible solutions to three 
issues. Furthermore their subjects communi- 
cated by passing notes rather than face-to- 
face conversation as in our study, Our results 
transcend those of Kelley and Schenitzki in 
suggesting that systematic concession mak- 
ing, when it arises spontaneously, may 
be part of a broader trial-and-error strategy, 
which also includes frequent varying of pro- 
posals at both a specific and a general level. 

The fourth index that was related to joint 
profit, PRESS, can be thought of as a measure 
of the amount of distributive behavior, be- 
cause it reflects the use of tactics aimed at 
persuading the other party to concede, such 
as positional commitments, threats, and argu- 
ments about why he should yield. Our finding 
supports Walton and McKersie’s contention 
(the fourth postulate presented above) that 
distributive behavior hinders the development 
of integrative agreements. Just why this 
should be so is not clear from our data, but 
it is certainly not because distributive be- 
havior rules out concessions. In the low-limit 
condition, dyads with high PRESS scores 
were perfectly able to reach agreements in- 
volving simple compromise, such as E-E-E or 
D-F-E. Rather, their problem seemed to be 
one of failing to make the sophisticated con- 
cessions required to reach more integrative 
agreements such as A-D-H, 

All four of the measures just discussed were 
affected by the problem-solving instructions. 
Hence it is possible that these measures re- 
flect the processes by which this orientation 
enabled bargainers to locate highly integrative 
options. In short, the problem-solving orien- 
tation may have had its positive effect by 
encouraging heuristic trial and error and in- 
hibiting distributive behavior. 

Our problem-solving instructions also pro- 
duced other effects: valid communication 
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about profit structures (GTI and GFI) and 
a warm (CONCERN) and undemanding 
(CALL) atmosphere. But these can be 
thought of as “excess baggage,” having no 
apparent relationship to the level of joint 
profit finally attained. 

The ideas just presented may help to ex- 
plain why approximately half of the dyads in 
the condition that involved high limits and an 
individualistic orientation (IO-HiL) suc- 
ceeded in reaching integrative agreements. As 
in all other cells of the design, these success- 
ful dyads exhibited relatively high levels of 
NUMDIF, GENAPP, and SYSCON and 
relatively low levels of PRESS. Hence it 
appears that the members of these dyads 
spontaneously adopted those elements of the 
problem-solving orientation that lead to inte- 
grative agreement in our task. 

We were surprised to find that our com- 
munication process measures (ATI, GTI, and 
GFI) were not significantly related to the 
level of joint profit in agreement.’ This chal- 
lenged the seemingly reasonable assumption 
(the third postulate attributed to Walton and 
McKersie) that integrative solutions emerge 
from an exchange of accurate and believable 
information about preferences and utility 
structures. It also seemed to challenge the 
results of some earlier pilot work which 
showed that highly integrative solutions were 
quite easily found when bargainers were per- 
mitted to communicate fully about their 
profit schedules by exchanging them. 

One possible way of resolving this anomaly 
is to assume that a certain minimum amount 
of information must be received before the 
typical bargainer can gain enough insight 
into his opponent’s profit structure to devise 


5 This finding is supported by the failure of our 
communication manipulation to have a significant 
effect on outcome. But it is possible to discount this 
latter result on the grounds that the TC manipula- 
tion was not successful in inducing subjects to trans- 
mit more truthful information about their payoff 
structures. 

6This variable has also been called “integrative 
capacity,” “conceptual complexity,” and the “ab- 
stract-concrete” dimension. 

TDespite these differences the variable has not 
been found to be significantly related to IQ or aca- 
demic achievement (Streufert & Driver, 1967). 
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an integrative option. Presumably most of 
our subjects were below this minimum, even 
at the highest levels of information exchange. 
This point can be dramatized by comparing 
the case of a bargainer who knows two of his 
opponent’s payoffs, randomly chosen, with a 
bargainer who knows four such payoffs. The 
chance of either gaining insight into his oppo- 
nent’s profit schedule on the basis of such 
information is close to zero. Hence the fact 
that one bargainer has more information than 
the other does not reflect a difference that 
makes a difference. But there must be some 
minimum number of payoffs which, if known, 
would permit almost any bargainer to gain 
such insight—that minimum being lower the 
more perceptive he is. 


EXPERIMENT 2 


A personality variable, cognitive complex- 
ity, was employed in this study. According 
to Schroeder, Driver, and Streufert (1967), 
more complex individuals entertain more 
alternative conceptions of a situation and 
gather and integrate more information when 
they have to make a decision.’ Reasoning 
from the thinking underlying the third postu- 
late presented above, we hypothesized that 
more complex bargainers would achieve more 
integrative solutions, because they would 
gather more information about one another’s 
utility structures and achieve more insights 
into how to integrate these utility structures, 

This study was also designed to follow up 
the process analysis that had proved so useful 
in the first experiment. We wanted to break 
the PRESS code down into finer categories 
so as to determine which components were 
most efficient for predicting behavior. We 
also felt a need to augment the coding scheme 
in the realm of trial-and-error problem solving 
so as to understand more fully the processes 
by which integrative solutions are found. 
Hence we devised some new “integrative” 
codes to measure phenomena that had been 
identified in a qualitative examination of the 
data from Experiment 1. 

We were particularly interested in the con- 
dition involving high limits and an individu- 
alistic orientation, because it represents a 
common real-life situation; so we assigned 
quite a few subjects to it. But we also ran a 
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,.&» few subjects with low limits and the same 


orientation, to see if we could replicate the 
limit effects that had emerged in Experiment 1. 


Method 
Subjects 


Eighty-two male undergraduates were used as 
subjects. All participated in order to satisfy a re- 
quirement of the introductory psychology course 
which could also be fulfilled by writing a paper. 
They were chosen on the basis of their scores on 
the Streufert and Driver (1967) impression-forma- 
tion test, which was administered in class several 
weeks before the beginning of the experiment. In 
this test, an individual is first asked to write a 
paragraph describing a person with the traits 
“intelligent, industrious, and impulsive.” He is then 
asked to describe a person with the traits “critical, 
stubborn, and envious.” Finally he is asked to de- 
scribe a person who has all six of the traits just 
listed. The last paragraph is the one scored. Respon- 
dents are classified into five categories that reflect 
the extent to which they have combined the traits 
into a unified and consistent picture. Those who 
integrate the traits more successfully are considered 
more complex. Our sample included 42 high-complex 
subjects, with scores of 4 or 5, and 40 low-complex 
subjects with scores of 1 or 2. 


Design 


A 2X2 design was employed, with the dyad as 
the basic unit. The members of a dyad were cither 
high complex (HiC) or low complex (LoC). Both 
members were given either a high limit (HiL) or a 
low limit (LoL). Assignment to HiL versus LoL 
was randomly determined. The number of dyads 
in each cell of the design was as follows: HiC-HiL, 
12; HiC-LoL, 9; LoC-HiL, 13; LoC-LoL, 7. 


Procedure 


Exactly the same procedure was used as in the 
conditions of Experiment 1 that involved an indi- 
vidualistic orientation and free communication. Also 
the same limits were used as before: $2,300 in the 
high-limit condition and $2,000 in the low-limit 
condition. 


Process Measures 


The discussions were coded on the basis of 18 
categories, some old and some new. Of these, only 
the following turned out to be useful: 

Asks for truthful information (ATI) is the same 
as in Experiment 1. 

Gives truthful information (GTI) is a truthful 
statement about the profit associated with a particu- 
lar option. 

In-role patter (PATTER) is an argument that the 
other should concede for reasons that relate to the 
commercial setting hypothetically surrounding the 
negotiation. This was one component of the former 
PRESS category. 
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Makes a positional commitment (COMMIT) is a 
strong statement that the speaker cannot or will not 
move from his current offer—another component 
of the former PRESS category. 

Threatens (THREAT) is an if-then statement in 
which the “then” portion refers to an action by the 
speaker that would harm the real or supposed inter- 
ests of the listener—a third component of the former 
PRESS category. 

Requests reaction to own proposal (REQREAC) 
is a request for the other bargainer to indicate how 
he feels about a proposal. 

Unfortunately, through an oversight, a useful cate- 
gory from the former study, GENAPP, was not 
included in the coding scheme. 

The same approach to coding was used as before, 
with all protocols coded twice, The two coders were 
in initial agreement on an average of 76% of the 
codes which each assigned. Of those on which they 
disagreed, an average of 15% were identified as 
lapses, that is, cases in which a coder indicated 
that he had simply not noticed the appropriateness 
of a category used by the other coder. Lapses are a 
problem with such a long schedule of categories 
because it is easy to lose sight of one or more cate- 
gories for a period of time. This problem makes 
double coding a necessity in order to keep the num- 
ber of omitted codes down to an acceptable mini- 
mum, Lapses do not reflect on the cogency of the 
coding system. Rather this is called into question 
when the coders disagree on the assignment of a 
category. An average of 9% of the codes assigned 
by each of our coders were challenged by the other 
coder, which would appear to be an acceptable 
percentage, especially since all disagreements were 
resolved by means of discussion. 

Unlike Experiment 1, a count was kept of the 
total number of statements made during each dis- 
cussion, Hence it was possible to develop a per- 
centage score for each dyad on each category, con- 
sisting of the number of statements fitting that cate- 
gory divided by the total number of statements. 
These scores were used in the analysis as they 
seemed more cogent than the rate-of-emission scores 
used in the first experiment. 

Two other process measures were employed, both 
based on the proposals made by the bargainers 
during their discussion. One (NUMDIF) was the 
number of different three-letter proposals made by 
each bargainer summed over the two bargainers and 
divided by the total number of statements made by 
the dyad. The other (SYSCON), an adaptation of 
the index of systematic concession making em- 
ployed by Kelley and Schenitzki, was defined as in 
Experiment 1. 


Results and Discussion 


As in the 10-HiL condition of the earlier 
study, about half (14 out of 25) of the dyads 
in the high-limit condition reached agreement, 


g the other half broke off negotiation. In 


contrast, all dyads reached agreement in the 
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TABLE 4 
Joint Prorrr ACHIEVED IN EXPERIMENT 2 


High-complex Low-complex 


Limit bargainers bargainers 
High 4,675 4,510 
(5,012) (5,100) 


Low 4,466 4,406 


Scores can range from 4,000 to 5,200. Numbers in 
ntheses refer to the dyads that reached agreement, There 
were eight such dyads under high complexity and six under 
low complexity, 


low-limit condition. The difference between 
these two conditions was significant beyond 
the .01 level (two-tailed exact test). 

The same corrected joint-profit score was 
employed as in the first study, with the results 
shown in Table 4. As before, dyads that 
reached agreement under high limits adopted 
more integrative options than those that 
reached agreement under low limits. But our 
hypothesis about the effect of complexity was 
not supported, 

More statements were made during the 
discussions under high as opposed to low 
limits (107 vs. 68; U= 87, z= 2.35, p< 
.05). This accords with the findings of Kelley 
et al. (1967) and makes sense because the 
greater number of viable solutions under 
low limits should make it possible to locate 
a mutually satisfactory solution with less 
communication, 

Of the 60 tests of significance relating the 
process measures to the variables in our 
2 X 2 design, only 5 were significant beyond 
the .05 level. Hence a hypothesis of chance 
cannot easily be eliminated and the results 
do not seem worth reporting, with one notable 
exception: The same relationship was found 
between limit and SYSCON as in Experi- 
ment 1, that is, the bargainers explored more 
options at one profit level before shifting to 
another level when they had high as opposed 
to low limits (p < .05). This may reflect the 
fact that substantive concessions are easier 
to make when limits are lower (Kelley et al., 
1967), so that bargainers are not so moti- 
vated to stay for a while at each profit level 
exploring the various possible combinations 
that add up to that level. 

Six of the process measures showed a sub- 
stantial, unidirectional relationship to joint 


profit in all four cells of the design. Hence x 
it is possible to report for these measures 
average within-cell correlations based on all 
four cells. 

Three of these correlations replicated re- 
sults found in Experiment 1. As before, sys- 
tematic concession making (SYSCON) was 
predictive of integrative solutions (r= .47, 
Pb < 01). In addition, lower joint profit was 
achieved by dyads that employed many ex- 
traneous arguments in an effort to achieve 
concessions (PATTER, r= —.56, p < 01) 
and those that made frequent use of posi- 
tional commitments (COMMIT, r= —.41, 
p < .01)—tesults which are compatible with 
earlier findings, in that patter and positional 
commitments were components of the earlier 
code PRESS, which was inversely related to 
joint profit. 

A positive correlation was found between 
REQREAC and joint profit (r = .41, p< 


OR 
-01), indicating that dyads whose members y “W” 


frequently seek one another’s reactions to the 
proposals they are advancing are more suc- 
cessful than those that do not. This result 
may be interpreted in either or both of two 
ways: (a) Feedback from the other bargainer 
helps a practitioner of heuristic trial and 
error to develop a suitable proposal, and/or 
(b) the fact that such feedback is sought 
reassures the other bargainer that a problem- * 
solving approach is being taken and thus 
encourages him also to take such an approach, 
In contrast to the first experiment, two of 
the communication measures were positively 
related to joint profit: asking for information 
about the other bargainer’s profit schedule 
(ATI: r= 38, p < .01) and giving truthful 
information about one’s own profit schedule 
(GTI: r= 39, p < .01). These correlations 
surprised us until we looked more closely and 
found that they were mainly due to the high- 
complex dyads. For ATI, r= .46 (p< .05) 
for HiC and .28 (ns) for LoC; for GTI, 
r= .54 for HiC (p < .05) and .22 for LoC 
(ns). Hence it is possible to conclude that 
communication is more of a road to inte- 
rative agreement for dyads composed of 
two cognitively complex bargainers (as in 
HiC) than for dyads composed of two non- 
complex bargainers (as in LoC) or of a mix- 
ture of complexity levels (as presumably in 
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most of the dyads of Experiment 1). Our 
results suggest, in addition, that the success of 
dyads composed of more complex bargainers 
is not because such bargainers exchange more 
information. Rather, in the terms used earlier, 
it is because their minimal information re- 
quirements are lower, so that they can gain 
insight into one another's profit structures on 
the basis of minimal amounts of information. 
However, all of this speculation must be 
tempered by a realization that the HiC-LoC 
correlation differences were not statistically 
significant. 

Two other process measures were related to 
joint profit for the dyads composed of high- 
complex subjects (HiC) but not for those 
composed of low-complex subjects (LoC): 
(a) Variation in offers (NUMDIF), with 
r= .49 for HiC (p < .05) and r = .06 (ns) 
for LoC (ns). The result for HiC is similar 
to that for all subjects in Experiment 1. (b) 
The use of threats (THREAT), with r= 
—.44 for HiC (p< .05) and —.11 for LoC 
(ns). Again the result for HiC is similar to 
that for all subjects in Experiment 1, since 
threats were part of the PRESS code in that 
study. 

It should be noted that THREAT, COM- 
MIT, and PATTER represent the three main 
categories of distributive behavior: the use of 
threats, the making of positional commit- 
ments, and the presentation of arguments de- 
signed to persuade the other bargainer to 
concede. Hence the results of this experiment 
suggest that all categories of distributive be- 
havior impede the development of integrative 
agreements. 

In Experiment 1, we found positive cor- 
relations among three process measures that 
were related to joint profit: NUMDIF, 
SYSCON, and GENAPP. In the present 
study, it was possible to check the reliability 
of the correlation between the first two of 
these measures. While positive, this correla- 
tion was unexceptional (r = .10, ns), casting 
some doubt on the integrity of the behavioral 
syndrome we had tentatively labeled heuristic 
trial and error. A significant correlation be- 
tween NUMDIF and a successful measure 
that was only used in this study, REQREAC 
(r = 35, p < .05), may indicate that seeking 
feedback from the other party is another ele- 
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ment of heuristic trial and error. But the ab- 
sence of a significant correlation between 
REQREAC and SYSCON (r = .14, ns) calls 
this interpretation into question. Clearly more 
cases need to be collected so as to have a 
sufficient data base for factor analysis of these 
and other measures that show a relationship 
to joint profit. 

Two new clusters of process measures that 
were predictive of joint profit emerged from 
the present study: (a) The three measures of 
distributive behavior were closely related 
for high-complex subjects (7 = .59, p < .01) 
though inexplicably not for low-complex sub- 
jects (F=.10, ns). This suggests that, at 
least for more complex bargainers, we can 
speak of a unitary “distributive approach” 
with several different manifestations. (b) The 
two communication measures (ATI and GTI) 
showed an intercorrelation of .72 (p < .01).* 
This suggests that we can also speak of a 
“communication approach” to bargaining. 


GENERAL CONCLUSIONS 


In summarizing the results of these two 
studies, it seems useful to distinguish between 
the terms “approach” and “orientation.” An 
approach is a behavioral syndrome, a set of 
interrelated actions or behavioral dimensions, 
An orientation is more like a world view, a 
set of interrelated motives and perceptions. 
Of course, an approach can constitute the 
expression of an orientation. 

In our process analysis, we have distin- 
guished three approaches to bargaining that 
seem to influence the level of joint profit in 
the final agreement. Our data are not univocal 
about the internal consistency of the first two 
of these approaches, so that this list should 
be seen as a preliminary statement pending 
further research. (a) The distributive ap- 
proach, which entails the use of threats, posi- 
tional commitments, and arguments about 
why the other party should concede. (b) Heu- 
ristic trial and error, in which the other 
party’s reactions are sought to a large variety 
of tentatively advanced options and general 
proposals. People who take this approach also 
tend to concede systematically, exploring vari- 


8 On looking back at the data from Experiment 1, 
we also found a high correlation between ATI and 
GTI (r= .67, p < 01). 
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ous options at one profit level before moving 
to a lower level. (c) Information exchange, 
in which bargainers request and provide valid 
information about their profit structures. 

In the kind of bargaining situation we 
have studied, the distributive approach re- 
duces the likelihood that integrative agree- 
ments will be adopted, while heuristic trial 
and error increases this likelihood. Our data 
are not altogether clear about the impact of 
information exchange but suggest that it may 
lead to more integrative solutions for bar- 
gainers with higher cognitive complexity. 

The approaches just described can be 
usefully compared with two “processes” de- 
scribed by Deutsch (1973). The distributive 
approach seems closely related to Deutsch’s 
“competitive process,” which he describes as 
involving an effort to impose one’s viewpoint 
on the other party. Heuristic trial and error 
and information exchange seem to involve dif- 
ferent elements of what Deutsch calls the 
“cooperative process.” While he treats this 
process as a unitary syndrome, we found it 
necessary to distinguish our two types be- 
cause: (a) They lead to improved joint 
profit in two different ways, heuristic trial and 
error by a convergence of the two bargainers’ 
proposals and information exchange by in- 
sight into the other bargainer’s profit struc- 
ture and how to combine it with one’s own. 
(b) They differ in the conditions under which 
they affect joint profit. 

The assumption just stated that insight 
into the other bargainer’s profit structure is 
not an element of solutions by heuristic trial 
and error has an important implication; 
namely, that this approach is mot useful in 
situations in which integrative solutions re- 
quire that totally new options be invented. 
The creative thinking that leads to new inven- 
tions presumably requires such insight, which 
necessitates some form of information ex- 
change. But heuristic trial and error should be 
quite effective whenever (a) highly integra- 
tive options are among those whose physical 
identities are initially known, as in the 
Siegel-Fouraker task used by Kelley and 
Schenitzki, or (b) highly integrative options 
can be created by combining known options 
in trade-offs, as in the task used in the 
present study. 
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We learned a good deal in this research 
about the problem-solving orientation, which 
involves the goal of satisfying both parties 
and a view of the situation as a problem to 
be solved. This orientation produces heuristic 
trial and error and information exchange and 
diminishes distributive behavior. 

Where limits and/or levels of aspiration are 
low, agreement can usually be reached with- 
out developing or locating the more integra 
tive options. Hence there is little motivation 
to look beyond obvious options, and integra- 
tive solutions are often not forthcoming. 
While a problem-solving orientation may 
make such solutions a little more likely, it is 
not very effective. On the other hand, where 
limits and/or levels of aspiration are high, 
agreement can only be reached if the more 
integrative options are located. This provides 
a challenge to the bargainers. If they take 
a distributive approach, they are not likely to 
discover such options and hence will fail to 
reach agreement. But if they engage in heu- 
ristic trial and error or (in the case of more 
complex individuals) exchange valid informa- 
tion about their needs and priorities, they 
may be able to rise to this challenge and 
locate or create options that are sufficiently 
integrative to satisfy both parties. Since the 
problem-solving orientation produces these 
forms of behavior, we can conclude that 
integrative solutions are most likely to be 
developed under a combination of high (and 
relatively inflexible) limits or aspirations and 
a problem-solving orientation. Of course, 
limits must not be so high as to obviate 
agreement. 

It should be noted finally that, in recom- 
mending the maintenance of high limits or 
aspirations, we are in a sense endorsing con- 
flict as a road to integrative agreements. A 
period of conflict is often necessary before 
people look beyond the easy, obvious options 
in search for those that provide more joint 
profit. But we are endorsing conflict within 
the framework of an integrative mode of 
thought, the problem-solving orientation. 
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Sex Role Adaptability: 
One Consequence of Psychological Androgyny 


Sandra L. Bem 
Stanford University 


This article advances the hypothesis that psychologically “androgynous” indi- 
viduals might be more likely than either masculine or feminine individuals to 
display sex role adaptability across situations, engaging in situationally effec- 
tive behavior without regard for its stereotype as more appropriate for one 
sex or the other. Two experiments with college students provide support for 
this hypothesis. Together, they demonstrate that androgynous subjects of both i 


sexes display “masculine” independence when under pressure to conform, and 
“feminine” playfulness when given the opportunity to interact with a tiny 


kitten. In contrast, all of the nonandro; 


gynous subjects were found to display 


behavioral deficits of one sort or another, with the feminine females showing 


perhaps the greatest deficit of all. 


For years, American society has considered 
masculinity to be the mark of the psychologi- 
cally healthy male and femininity to be the 
mark of the psychologically healthy female. 
Recently, however, the women’s liberation 
movement has been arguing that our current 
system of sex role differentiation has long 
since outlived its usefulness, and that it now 
serves only to prevent both men and women 
from developing as full and complete human 
beings. Supporters of the movement insist that 
people should no longer be socialized to con- 
form to outdated standards of masculinity and 
femininity, but that they should be encour- 
aged to be “androgynous.” That is, they 
should be encouraged to be both instrumental 
and expressive, both assertive and yielding, 
both masculine and feminine—depending 
upon the situational appropriateness of these 
various behaviors. 

The purpose of this article is to begin to 
evaluate the validity of this proposal. What 
are the behavioral consequences of sex role 
stereotyping? Is there any sense in which the 
androgynous person is “better off” than the 
person who is sex typed? 

Tt is our general hypothesis that a non- 
androgynous sex role can seriously restrict 
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the range of behaviors available to an indi 
vidual as he or she moves from situatio 
situation. Consider, first, the person who is 
sex typed. According to both Kagan (1964) 
and Kohlberg (1966), the highly sex typed 
person becomes motivated—during the course 
of sex role socialization—to keep his or her 
behavior consistent with an internalized sex 
role standard; that is, he becomes motivated 
to maintain a self-image as masculine or 
feminine, a goal which he presumably ac- 
complishes by suppressing any behavior that 
might be considered undesirable or inappro- 
priate for his sex. Similarly, although it is 
much less clear what developmental or moti- 
vational processes might be responsible for a 
person’s ever coming to hold a self-con 
which is “sex reversed,” the feminine male 
and masculine female can nonetheless be seen 
as differing from their more conventional 
counterparts primarily in their being “in- 
appropriately” rather than “appropriately” 
sex typed; their sex role concepts would noi 
necessarily appear to be any less narrow or 
restrictive, however, and so we would antici- 
pate that the behavior of the sex reversed 
person—like that of the sex typed person— 
would also be limited in accordance with his 
or her self-definition as masculine or feminine. 
In contrast, because his or her self-defini- 
tion excludes neither masculinity nor femi- 
ninity, the androgynous individual should be 
able to remain sensitive to the changing con- 


to 
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Sex ROLE ADAPTABILITY 


straints of the situation and engage in what- 
ever behavior seems most effective at the 
moment, regardless of its stereotype as ap- 
propriate for one sex or the other. What I 
am hypothesizing, then, is that whereas a 
narrowly masculine self-concept may inhibit 
so-called feminine behaviors, and a narrowly 
feminine self-concept may inhibit so-called 
masculine behaviors, a mixed or androgynous 
self-concept allows an individual to engage 
freely in both masculine and feminine behav- 
lors. 

Although there is no previous research 
which bears on this hypothesis directly, a 
review of the relevant literature nevertheless 
corroborates my underlying assumption that 
a high level of sex typing may not be desir- 
able. For example, high femininity in females 
has consistently been correlated with high 
anxiety, low self-esteem, and low social ac- 
ceptance (e.g., Cosentino & Heilbrun, 1964; 
Gall, 1969; Gray, 1957; Sears, 1970; Webb, 
1963); and, although high masculinity in 
males has been correlated during adolescence 
with better psychological adjustment (Mus- 
sen, 1961), it has been correlated during 
adulthood with high anxiety, high neuroti- 
cism, and low self-acceptance (Harford, Wil- 
lis, & Deabler, 1967; Mussen, 1962). In ad- 
dition, greater intellectual development has 
been correlated quite consistently with cross 
sex typing, that is, with masculinity in girls 
and with femininity in boys. Boys and girls 
who are more sex typed have been found to 
have lower overall intelligence, lower spatial 
ability, and lower creativity (Maccoby, 1966). 

The current investigations seek to demon- 
strate both the behavioral adaptability of the 
androgynous individual, as well as the behav- 
ioral restriction of the person who is not 
androgynous. The first study is designed to 
evoke a stereotypically masculine behavior. It 
uses a standard conformity paradigm to test 
the hypothesis that masculine and androgy- 
nous subjects would both remain more inde- 


: pendent from social pressure than feminine 


subjects. The second study is designed to 
evoke a stereotypically feminine behavior. By 
offering subjects the opportunity to interact 
with a tiny kitten, it tests the hypothesis that 
feminine and androgynous subjects would 
both be more nurturant or playful than mas- 
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culine subjects. Taken together, these two 
studies offer one test of the hypothesis that 
nonandrogynous subjects would “do well” 
only when the situation calls for behavior 
which is congruent with their self-definition as 
masculine or feminine, whereas androgynous 
subjects would do well regardless of the sex 
role stereotype of the particular behavior in 
question. That is, they would perform as well 
as masculine subjects on the masculine task, 
and they would perform as well as feminine 
subjects on the feminine task. 


Operational Definition of Androgyny 


In order to select subjects for this research, 
it was first necessary to develop a new type 
of sex role inventory, one which would not 
automatically build in an inverse relationship 
between masculinity and femininity. That 
inventory has been described in detail else- 
where (Bem, 1974), and so it is described 
here only briefly. 

Most importantly, the Bem Sex Role In- 
ventory (BSRI) contains both a Masculinity 
Scale and a Femininity Scale, each of which 
contains 20 personality characteristics se- 
lected on the basis of sex typed social desir- 
ability. That is, a characteristic qualified as 
masculine if it was judged by two indepen- 
dent samples of undergraduates to be more 
desirable in American society for a man than 
for a woman (e.g., ambitious, dominant, self- 
reliant), and it qualified as feminine if it 
was judged to be more desirable in American 
society for a woman than for a man (e.g. 
affectionate, gentle, understanding). 

When taking the BSRI, a person is asked 
to indicate on a 7-point scale how well each 
of these masculine and feminine personality 
characteristics describes himself. The scale 
ranges from 1 (“Never or almost never true”) 
to 7 (“Always or almost always true”) and is 
labeled at each point. On the basis of his re- 
sponses, each person receives an “Androgyny 
Score” defined as Student’s ¢ ratio for the 
difference between his or her endorsement of 
masculine and feminine personality character- 
istics. That is, the Androgyny Score is the 
difference between a person’s endorsement of 
masculinity and femininity standardized with 
respect to the standard deviations of his or 
her masculinity and femininity scores. The 
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use of a ¢ ratio as the index of androgyny— 
rather than a simple difference score—has at 
least one important conceptual advantage: 
namely, it allows us to ask whether a person’s 
endorsement of masculine attributes differs 
significantly from his or her endorsement of 
feminine attributes, and, if it does, lé] (38) 
2 2.025, p <.05, to classify that person as 
significantly sex typed or sex reversed. 

In general, then, the greater the absolute 
value of the Androgyny Score, the more the 
person is sex typed or sex reversed, with high 
positive scores indicating femininity and high 
negative scores indicating masculinity. A mas- 
culine sex role thus represents not only the 
endorsement of masculine attributes, but the 
simultaneous rejection of feminine attributes, 
just as a feminine sex role represents not 
only the endorsement of feminine attributes, 
but the simultaneous rejection of masculine 
attributes. In contrast, the smaller the abso- 
lute value of the Androgyny Score, the more 
the person is androgynous, An androgynous 
sex role thus represents the equal endorse- 
ment of both masculine and feminine attri- 
butes. 

Initial normative data for the BSRI were 
obtained from 444 male and 279 female stu- 
dents in Introductory Psychology at Stanford 
University during the winter and spring 
quarters of 1973. Table 1 presents the per- 
centage of subjects within that sample who 
were classified as masculine, androgynous, or 
feminine. Subjects were classified as mascu- 
line or feminine if the Androgyny Score 
reached statistical significance, and they were 
classified as androgynous if the absolute value 
of the Androgyny Score was less than or 
equal to 1. Table 1 also indicates the percent- 
age of subjects who fell between these various 
cutoff points. It should be noted that, as ex- 
pected, a much higher percentage of subjects 
scored as sex typed than as sex reversed, but 
also that a high percentage scored as an- 
drogynous. These results are virtually iden- 
tical to those obtained from both earlier and 
later samples of Introductory Psychology stu- 
dents at Stanford. 

Although the two experiments in this article 
represent the first attempt to validate the 
BSRI empirically, psychometric analyses 
(Bem, 1974) have already revealed that, as 
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TABLE 1 


PERCENTAGE OF SUBJECTS IN THE STANFORD 
NORMATIVE SAMPLE CLASSIFIED AS 
MASCULINE, ANDROGYNOUS, 


OR FEMININE 
Group Gay 

% feminine 6 34 
(t > 2.025) 

% near feminine > 20 

(1 < ¢ < 2.025) 

% androgynous 34 27 
C1sts+)) 

% near masculine 19 12 f 
(—2.025 <4 < — 1) 

% masculine 36 8 
(t< — 2.025) 


predicted, the Masculinity and Feminini 
Scales are empirically as well as logically i: 
dependent (average r= —.03), and ever 
more importantly, that the Androgyny Score 
itself is internally consistent (average « 
-86), reliable over a 4-week interval (averay: 
r= 93), and uncorrelated with the tendency 
to describe oneself in a socially desirable 
direction (average r = —.06), 


Validation of the Experimental Tasks as 
Stereotypically Sex Typed 


In order to validate my assumption that 
independence and kitten playing represent 
stereotypically masculine and feminine be- 
haviors, respectively, 23 male and 19 female 
undergraduates at Stanford University we: 
asked to judge a series of behaviors for thei 
sex role connotations. Specifically, these stu- 
dents—none of whom served as subjects in 
either of the two studies to be reported—were 
asked to indicate on a 7-point scale how 
masculine or feminine each of 12 activities 
would be considered “by American society 


generally.” The scale ranged from —3 (“Very ” $ 


Masculine”) through 0 (“Neutral”) to +3 
(“Very Feminine”) and was labeled at each 
point. Embedded in the list of activities were 
the following two behaviors of interest: “Play- 
ing with a six-week-old baby kitten” and 
“Saying what you believe, even when you 
know those around you disagree.” 

Table 2 presents the mean sex role ratings 
for each of these two behaviors, As can be 


Sex ROLE ADAPTABILITY 


seen in the table, both males and females 
rated independence as significantly masculine 
and kitten playing as significantly feminine.* 
In addition, only 1 out of 42 judges rated 
independence as even “slightly feminine,” and 
no judge rated kitten playing as at all mascu- 
line. 


EXPERIMENT 1° 


As we have just seen, both males and fe- 
males rate independence from social pressure 
as significantly masculine. Accordingly, Ex- 
periment 1 uses a standard conformity para- 
digm to test the hypothesis that masculine 
and androgynous subjects would both do 
better at a stereotypically masculine behavior 
than feminine subjects. That is, they would 
both be more likely to express their own 
opinions and less likely to conform. 


Method 
Subjects 


Nine masculine, nine androgynous, and nine femi- 
nine subjects of each sex participated in the experi- 
ment. All were students in Introductory Psychology 
at Stanford University during the spring of 1972, and 
all had taken an early version of the BSRI during 
the first week of the quarter. Their average An- 
drogyny Scores are shown in Table 3.4 


Procedure 


Subjects were solicited for an experiment on humor 
and were run in same-sex groups of four. When they 
arrived at the laboratory, they were immediately 
placed into individual soundproof booths equipped 
with microphones and earphones. They were told 
that the experiment would consist of rating a series 
of cartoons for funniness, that the order in which 


1 All probability levels in this article are based 
. on two-tailed tests of significance. 

2The author would like. to express her sincerest 
appreciation to Karen Rook and Robyn Stickney 
for their invaluable assistance during all phases of 
this study. Together, they prepared the stimuli, ran 

e subjects, and analyzed the data. Had this study 
been published by itself, they would most certainly 
have been listed as coauthors. 

3 Fifteen subjects were eliminated from the analy- 
sis because they indicated on a postexperimental 
questionnaire that they had become aware of the 
experimental manipulation, an awareness due in 
most cases to the fact that they participated in the 
study quite late in the quarter, after they had read 
about conformity experiments in the textbook. 
Analysis of the data including these subjects yielded 
results which were quite comparable to those pre- 


sented here. 
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TABLE 2 


Sex ROLE RATINGS or INDEPENDENCE 
AND KITTEN PLAYING 


Variable TAN pomala 
Independence 
M —1,13* —1,05* 
SD 87 91 
Kitten playing 
M 87* 1,05* 
SD 16 are 


Note. These ratings represent judgments made on the follow- 
ing scale: —3 = very masculine, ~2 = mov tely masculine, 
—1 = slightly masculi ) = neutral, 1 = slightly feminine, 
2 = moderately feminine, 3 = very feminine, 

p <.001, 


they would be called upon would vary from trial to 
trial, and that they would all be identified by the 
subject number printed in their respective booths. 
When called upon by the experimenter, each subject 
indicated his or her rating by stating it aloud into 
the microphone and by pressing the appropriate one 
of seven buttons situated on a panel in front of the 
booth. These buttons automatically triggered an 
Esterline-Angus recorder located in the control room, 
and they were each marked from 1 to 7, with the 
two extremes additionally labeled as “Very Unfunny” 
or “Very Funny.” From the subject’s point of view, 
a typical trial thus proceeded as follows: A new 
cartoon would appear on the screen, and the ex- 
perimenter would call upon each subject in turn, 
Each subject would listen, via earphone, to the 
responses of the other three subjects and would give 
his or her own response when called. 

Although the subjects believed that they were 
hearing each others’ voices, in fact, the booths were 
wired so that the only possible communication was 
between control room and subject; all subjects were 
identified as “Subject #3"; and what the subjects 
were hearing throughout the experiment was a 
preprogrammed tape which not only called upon 
each subject by number, but which also provided 
responses for all but Subject #3. In order to induce 


TABLE 3 


Anprooyny Scores FOR SUBJECTS IN THE 
Conrormity EXPERIMENT 


Group Female 
Masculine 
M —3.65 —2.13 
SD 2.11 1.92 
Androgynous 
y, —.09 63 
SD 10 60. 
Feminine 
M 1.79 3.90 
SD 1.40 2.08 
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conformity, the tape included a number of trials 
during which all three voices gave false responses. 
That is, they all agreed that a particular cartoon 
was funny when it was not, or that it was not funny 
when it was. Subjects were given the somewhat 
subjective task of judging cartoons for funniness— 
rather than the length of lines or the like—so that 
these false norms might impose pressure to conform 
without appearing to be bizarre. It should be noted 
that two separate tapes were used, one for males 
and one for females, so that all of the voices, in- 
cluding that of the experimenter, would be of the 
same sex as the subject. 


Stimuli 


Tn order to select cartoons which could be reliably 
classified as funny or unfunny, 195 cartoons were 
pretested for funniness by 11 males and 11 females, 
none of whom served as subjects in the conformity 
experiment itself. The scale ranged from 1 (“Very 
Unfunny”) to 9 (“Very Funny”) and was labeled 
at both ends. The goal was to select cartoons which 
evoked a reasonable amount of agreement on the 
part of the raters and which were either high or 
low in funniness. Accordingly, the 63 cartoons 
(33%) with the highest variance were eliminated 
and, of those which remained, the 46 with the 
highest ratings were selected as funny cartoons 
(mean rating = 5.27), and the 46 with the lowest 
ratings were selected as unfunny cartoons (mean 
rating = 3.85).4 


Experimental Series 


During the course of the experiment, subjects were 
asked to rate 92 different cartoons, half of which 
were funny and half of which were unfunny. Thirty- 
six of these represented “critical” trials, that is, 
trials on which a false consensus was presented to 
the subject in an attempt to induce conformity. On 
these trials, the subject was always the last to be 
called on, and the three voices on the tape all rated 
the unfunny cartoons as funny (“6” or “7”), and 
the funny cartoons as unfunny (“1” or “2”). False 


4%t should be noted that there were 14 trials 
during the course of the conformity experiment it- 
self when the subject rated the cartoon first, that is, 
before his response could be influenced in any spe- 
cific way by the voices on the tape. Although it 
might not be reasonable to expect subjects to re- 
spond in exactly the same way under the pressure 
of the conformity situation as they would under 
relaxed pretest conditions, these trials nevertheless 
serve as a double check on the pretest funniness 
ratings. Analysis of subjects’ responses on these 
trials revealed that the funny cartoons received an 
average rating of 4.11, whereas the unfunny cartoons 
received an average rating of 3.88. This difference 
is significant (t= 2.56, p < .05) and validates our 
classification of cartoons as funny or unfunny. 

5 An analysis of subjects’ actual ratings yielded 
quite comparable results. 
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norms were also presented for an additional 10 car- 
toons when the subject was not the last to be called. 
These trials served primarily to prevent the subjects 
from becoming too suspicious about the experimental 
manipulation, and they were not included in any 
measure of conformity. Finally, the remaining 46 
cartoons represented “noncritical” trials. The sub- 
ject could be the first, second, third, or fourth to be 
called on, and mo false norms were presented. In 
fact, the ratings which the subject heard represented 
the true judgments of the research assistants who 
prepared the tape. 


Results 


It can be recalled that there were 36 trials 
on which a false consensus was presented to 
the subject in an attempt to induce conform- 
ity. On these trials, the subject was always 
the last to be called upon, and the three 
voices on the tape always agreed that the 18 
funny cartoons were all unfunny CIDE 
“2”) and that the 18 unfunny cartoons were 
all funny (“6” or “7”). A subject was said to 
conform if he responded to a funny cartoon 
with a rating of 1, 2, or 3, or if he responded 
to an unfunny cartoon with a rating of 5, 6, 
or 7.° 

Because this experiment was designed to 
test a particular set of hypotheses, the method 
of planned comparisons was used to analyze 
the data. The two orthogonal comparisons to 
be tested were the following: (a) that mascu- 
line and androgynous subjects would both 
conform on fewer trials than feminine sub- 
jects; and (b) that masculine and androgy- 
nous subjects would not differ from one an- 
other in amount of conformity. 

Table 4 presents the mean number of trials 
conformed by the masculine, androgynous, 
and feminine subjects of each sex. As pre- 
dicted, masculine and androgynous subjects 
did conform on fewer trials than feminine 
subjects. This comparison was significant for 
males (f= 2.62, p < .02), for females (t = 


1.95, p = .06), and for the two sexes com- _ & 


bined (¢ = 3.27, p < 01). In addition, and 
again as predicted, masculine and androgy- 
nous subjects did not differ from one another 
in amount of conformity (¢ < 1 in all three 
comparisons). Finally, although it might ap- 
pear from the table that females conformed 
on more trials than males, it should be noted 
that an analysis of variance did not find a 
significant main effect for sex (F = 1.58, ns). 
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TABLE 4 


NUMBER OF TRIALS CONFORMED OUT OF 36 BY MASCULINE, ANDROGYNoUS, 
AND FEMININE MALES AND FEMALES 


Masculine Androgynous Feminine Planned £ 
Sex 
M SD M SD M SD 1s 2 
Male 15.3 6.5 16.8 7A 22.7 4.1 2:62** <1 
Female 18.2 6.7 19,3 6.9 23.7 5.0 1.95* <I 
Total 16.8 6.6 18.0 Fit 23.2 4.5 e BeN bna <1 
Note, Cell ns = 9, 
® Masculine/androgynous versus feminine, 
2 Masculine versus androgynous, 
> < 102, 
we) SOE 
EXPERIMENT 2 © Procedure 


The results of Experiment 1 demonstrate 
that masculine and androgynous subjects of 
both sexes remain independent from social 
pressure on significantly more trials than do 
feminine subjects. That is to say, they both 
do well at a stereotypically masculine behay- 
ior. Experiment 2 is designed to evoke a 
stereotypically feminine behavior. 

Folklore states that females are more likely 
than males to cuddle tiny babies, puppies, 
and kittens. In addition, as we saw earlier, 
both males and females rate kitten playing as 
significantly feminine. Accordingly, Experi- 
ment 2 offers subjects the opportunity to in- 
teract with a tiny kitten in an attempt to 
test the hypothesis that feminine and an- 
drogynous subjects would both do better at a 
stereotypically feminine task than masculine 
-subjects. That is, they would both be more 
j likely to be playful or nurturant. 


Method 
Subjects 


Sixty-six undergraduates from Stanford University 
served as subjects. Half were male and half were 
female. One-third of the subjects of each sex were 
masculine, one-third androgynous, and one-third 
feminine. All were students in Introductory Psychol- 
ogy during the winter of 1973, and all had taken 
the BSRI at the beginning of the quarter. Their 
average Androgyny Scores are shown in Table 5, 


6 The author would like to express her gratitude, 
first, to Jenny Jacobs, ‘Jody Lindell, Jan Feldman, 
and Susan Castillo, who not only coded the sub- 
jects’ behavior, but who also helped to analyze the 
data and, in addition, to Mykol Hamilton, Dorothy 
Ginsberg, Tove Jenson, and Carol Orshan, who 
served as experimenters. 


Subjects were solicited for an experiment on mood 
and were run in individual sessions by one of four 
female experimenters, all of whom were blind with 
respect to the subject’s sex role and all of whom 
ran an approximately equal number of masculine, 
androgynous, and feminine subjects of each sex. 
When the subjects arrived at the laboratory, they 
were told that the experiment concerned the effects 
of different activities on a person’s mood, and that 
they would therefore be asked to engage in four 
different activities over the course of the hour and to 
fill out a mood questionnaire after each, They were 
also told that they would be left alone in the room 
with enough time so that they could “really get in- 
volved with each activity,” and that the experi- 
menter would “keep an eye on things from behind 
the one-way mirror, just to make sure that nothing 
was going wrong.” Finally, as a precaution to pro- 
tect their clothing, the subjects were given a standard 
lab coat to wear. The four activities were as fol- 
lows: 

(1) Subjects were first. given a box of approxi- 
mately 60 plastic geometrical disks (“Crystal Climb- 
ers”) and were asked to build something with them. 
The disks varied in shape and color and could be 
attached to one another in a large variety of ways. 
Subjects were given 8 minutes for this activity and 
it served primarily to equalize mood across subjects, 


TABLE 5 


AnproGyny Scores ror Sup JECTS IN 
THE KITTEN EXPERIMENT 


Subjects Male Female 
Masculine 

M —4,58 —2.67 

SD AT 1.51 
Androgynous 

M —.03 V 

SD AZ -68 
Feminine 

M 4.78 

SD 1.19 
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(2) After the subjects filled out the first mood 
questionnaire, the kitten was brought into the room 
and placed into a playpen that was completely 
enclosed by chicken wire. At this point, the subjects 
were explicitly instructed to interact with the kit- 
ten; they were further told that they could do so in 
any way that they wished; and they were shown 
how to open the playpen so that they could take 
the kitten out if they wished. This “forced-play” 
period lasted 5 minutes, 

(3) For their third activity, subjects were given 5 
minutes to play with a somewhat challenging game 
of skill variously called “Hit the Spot” or “Shoot the 
Moon.” The game requires the player to place a 
metal ball between two metal rods and, by open- 
ing and closing the rods, to roll the ball as far away 
as possible without letting it drop between the two 
rods, 

(4) Finally, the kitten was once again placed in 
its playpen, and subjects were told that they could 
now do anything that they liked. The room was 
filled with a number of interesting things to do, 
including various magazines, puzzles, and games, 
perhaps the most interesting of which was a three- 
dimensional tilting maze. This “spontaneous play” 
period lasted 10 minutes. 

It should be noted that two kittens were used. in 
the experiment, both female, both black, both 8- 
weeks-old, and both from the same litter. In order to 
control for whatever differences in personality they 
may have had, the two kittens were alternated be- 
tween subjects in each of the six different sex roles. 
In addition, because the moods of the kittens seemed 
to vary over the course of the day, time of day 
(i.e. morning, afternoon, or evening) was also 
counterbalanced across the various sex roles. 


Coding the Subject’s Behavior 


During the periods of forced and spontaneous 
play, the subject’s interaction with the kitten was 
time sampled every 10 seconds by one of four 
female coders, all of whom were blind with respect 
to the subject’s sex role and all of whom observed 
an approximately equal number of masculine, an- 
drogynous, and feminine subjects of each sex. For 
each subject, the coder made 30 2-second observa- 
tions during forced play and 60 2-second observa- 
tions during spontaneous play. Ten behaviors were 
coded as present or absent: Was the subject looking 
at the kitten? Speaking to the kitten? Petting the 
kitten? Nuzzling the kitten? Was the subject playing 
with the kitten? If so, was he holding it in his 
hand, on his lap, on his chest, or face-to-face? 

In order to establish the reliability of these 
various measures, two of the four coders simultane- 
ously observed 12 subjects. These double coding ses- 
sions were scheduled so that all possible pairs of 
coders were together twice and so that 2 subjects 
from every sex role received double coding. The 
reliability of a given behavior was determined by 
combining forced and spontaneous play for a total 
of 90 observations per subject and then by calculat- 
ing the percentage of observations on which the two 


coders agreed perfectly. The results indicated very $ 
high reliability (over 95% perfect agreement) for” 
all 10 behaviors. 


Dependent Variables 


The coders’ observations were consolidated to 
yield three basic dependent variables, all of which 3 
were measured independently. The first was derived 4 
from spontaneous play and the second and third jj 
from forced play. 

(1) Given the opportunity either to interact with ) 
the kitten or to engage in any one of a number of , 
other interesting activities, how much did the ee | } 
spontaneously choose to interact with the kitten ?, 
That is, on how many of the 60 observations duri 
spontaneous play was the subject engaged in inter 
action with the kitten? 

(2) Given that the subject was forced to interatt \ 
with the kitten in some way, to what extent did he 
choose to touch it? That is, on how many intervals 
during forced play did the subject come into physical 
contact with the kitten, either by holding it, petting 
it, or touching it during play? 

(3) Given that the subject was forced to interact 
with the kitten, how much did he enjoy the interac- } 4 
tion? It can be recalled that subjects filled out a f5 
mood questionnaire at the end of each activity. One 4% | 
question asked subjects to indicate on a 5-point 
scale how much they had enjoyed their last activ- | 
ity. The third dependent variable was the subject’s { 
enjoyment rating at the end of forced play. f 

As a supplement to these individual measures, a i! 
summary score was also derived for each subject by» 
averaging together the standard score equivalents of 
each of his or her three separate scores. This sum- 
mary or composite score can be seen as best repre- d 
senting the subject’s overall degree of involvement: fe 
with the kitten. (es 


{ 


RESULTS 


Because this experiment was designed to 
test a particular set of hypotheses, the 
method of planned comparisons was again 
used to analyze the data. The two orthogonal 
comparisons to be tested across all four 
dependent variables were the following: 
(a) that feminine and androgynous subjects f 
would both demonstrate greater involvement g 
with the kitten than masculine subjects; and 
(b) that feminine and androgynous subjects 
would not differ from one another in amount 
of overall involvement. These predictions | 
were meant to apply equally to the two sexes, 
but because the pattern of their results was 
so different, their data are presented sepa- | 
rately. It should be noted, however, that there | 
were no overall sex differences in playfulness: ` 
which reached significance. 
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Table 6 presents the mean scores of the 
masculine, androgynous, and feminine males 
for all four dependent variables. Consider the 
“ summary measure first. As can be seen in the 
table, #feminine and androgynous males did 
demonstrate significantly greater overall in- 
volvement with the kitten than did masculine 
males (£ = 3.39, p < .002). This difference is 
as predicted. In addition, and again as pre- 
dicted, feminine and androgynous males did 
not differ from one another in amount of in- 
volvement (¢ <1, ns). Our hypotheses with 
respect to male kitten playing thus receive 
full eoipraaton: 

_ A look at the three separate measures pro- 
vides a close-up of the actual behaviors which 
differentiated the three groups of subjects. 
‘Interestingly, feminine and androgynous males 
did not touch the kitten significantly more 
during forced play than did masculine males. 
. That is, regardless of their sex role, all of the 
males seemed to interact with the kitten 
_ during forced play in more or less the same 
¿ way. Despite this one similarity in their be- 
havior, feminine and androgynous males 
nevertheless indicated at the end of forced 
_. play that they had enjoyed playing with the 
kitten significantly, more than did masculine 
males, and they proceeded during’ spontane- 
ous play to actually interact with the kitten 
on significantly more trials than did mascu- 
line males. These differences are, of course, 


_ as predicted. 


TABLE 6 
MEANS AND STANDARD DEVIATIONS FoR MALE KITTEN PLAYING 
Masculine Androgynous Feminine Planned ¢ 
Variable 
M SD M SD M SD 1a 2b 
Summary measure —.55 ays 06 50 07 44 3.39% <1 
Total touch—forced play 
(max = 30) 7.1 6.6 8.9 4.5 11.5 9.6 1.16 <1 
Total interaction— 
spontaneous play 
(max = 60) 9.6 14.4 22.9 19.5 23.8 23.4 1.91* <i 
` Enjoyment—forced play |. 
ise =) 3.0 1.0 4.0 1.0 3.6 92 PINA e aal <1 
Nole. Cell aS = 11. 
<  Feminine/androgynous versus masculine, 
b Feminine versus androgynous, 
*p œ .06. 
ED < 05, 
‘bp < 002. 
Males Females 


Table 7 presents the mean scores of the 
masculine, androgynous, and feminine females 
for all four dependent variables. Once again, 
consider the summary measure first. Contrary 
to our prediction, feminine and androgynous 
females did not show significantly greater 
overall involvement with the kitten than did 
masculine females. Indeed, and again con- 
trary to our prediction, feminine females were 
found to show significantly less overall in- 
volvement with the kitten than were androgy- 
nous females (¢ = 2.08, p < .05). 

This same pattern of results was found for 
two of the three individual dependent mea- 
sures as well. That is, feminine females were 
found to touch the kitten during forced play 
and to interact with it during spontaneous 
play on significantly fewer observations than 
were androgynous females, 


The ntem of results for the masculine 
females Was ambiguous. Although they did 


interact with the kitten on significantly more 
trials during spontaneous play than did the 
feminine females (¢ = 2.11, p < .05), they 
did not differ significantly from the feminine 
females on any of the other three dependent 
measures—including the summary measure. 
Hence, no firm conclusions can be drawn 
about the level of playfulness of this partic- 
ular group of subjects. 


GENERAL DISCUSSION 


In the introduction of this article, it 
was hypothesized that androgynous subjects 
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TABLE 7 
MEANS AND STANDARD DEVIATIONS FOR FEMALE KITTEN PLAYING 
Masculine Androgynous Feminine Planned ¢ 
Variable 
M SD M SD M SD 1s 2b 

Summary measure 16 12 49 1.16 —.24 AO <i 2.08* 
Total touch—forced play 

(max = 30) 9.9 da 15.5 11.3 73 $3 <1 2,30* 
Total interaction— 

spontaneous play 

(max = 60) 28.4 19.9 30.8 24.2 12.5 15:1 <1 2.15* 
Enjoyment—forced play 

(max = 5) 3.9 1.1 41 1.3 3.8 9 <i <1 


Note. Cell ns = 11. 

a Feminine/androgynous versus masculine. 
ò Feminine versus androgynous. 

*p <.05. 


would be more likely than nonandrogynous 
subjects to display behavioral adaptability 
across situations, engaging in whatever behav- 
ior seems most appropriate at the moment, 
regardless of its stereotype as appropriate for 
one sex or the other. The experiments re- 
ported here provide support for this hypothe- 
sis. Androgynous subjects of both sexes dis- 
played a high level of masculine independence 
when under pressure to conform, and they 
displayed a high level of feminine playfulness 
when given the opportunity to interact with 
a tiny kitten, 

The results for the nonandrogynous sub- 
jects show a nearly, but not completely, 
complementary pattern. Consider first the 
results for males. As predicted, the non- 
androgynous male did well only when the 
behavior was congruent with his particular 
self-ascribed sex role as measured by the 
BSRI. Thus, masculine males displayed 
masculine independence, but not feminine 
playfulness, and feminine males displayed 
feminine playfulness, but not masculine 
independence. 

The females displayed a very different 
pattern of results. For example, the hypothe- 
sis that masculine females would display only 
masculine behavior was only partially sup- 
ported. Like the androgynous females, and as 
predicted, the masculine females did display 
greater independence than the feminine fe- 
males, but they also displayed a moderate 
amount of playfulness, falling ambiguously 
between the androgynous and the feminine 
females on this dimension. Accordingly, it 


remains unclear as to whether the behavior 
of the masculine females conforms to our 
hypothesis. 

But it is the results of the feminine females 
which are the most surprising. It is true that, 
as predicted, the feminine females failed to 
display masculine independence in the face of 
pressure to conform, but it is also true that 
they failed to display feminine playfulness 
when given the opportunity to interact with 
a tiny kitten. Thus, across the two experi- 
mental situations, the feminine females can 
be said to have “flunked” both critical tasks, 
and consequently, it is they who seem to have 
the most serious behavioral deficit. This re- 
sult violates our initial hypothesis, but it is 
not inconsistent with previous findings noted fi 
in the introduction that femininity in females 
is generally associated with high anxiety and 


poor social adjustment. Of course, it is also" | 


possible that feminine females might simply 
be frightened of animals and that they could 
therefore be expected to display much greater 
nurturance if given the opportunity to inter- 
act with another human being rather than 
with a kitten. 

This complex pattern of results suggests 
that several different processes may be re- 
sponsible for the different behavioral deficits 
displayed by the various types of nonan- 
drogynous subjects. For example, their sex 
role socialization might have served to pre- 
vent the sex typed males and females from 
ever having learned those behaviors which are 
stereotypically classified as sex inappropriate, 
or it might even have led them to actively 


Poe Se 
beat, had 


- avoid those behaviors. Such a motivational 
process would seem particularly applicable to 
the masculine males who may well have been 
punished or shamed for displaying overtly 
feminine behavior. In contrast, a second set 
of processes would seem to be required to 
explain the failure of the feminine women to 
display even feminine behavior, processes in- 
volving such factors as low self-confidence, 
low self-esteem, and timidity. Finally, it 
would appear that still other processes are 
probably required to explain the behavior 
of the sex reversed masculine females and 
feminine males. 2 

But it is clear that whatever psychological 
barriers may turn out to be responsible for 
the behavioral rigidities óf the sex typed and 
sex reversed subjects, the current set of 
studies nevertheless provides the first empiri- 
cal demonstration that there exists a distinct 
class of people who can appropriately be 
termed androgy®us, whose sex role adapta- 
bility enables them to engage in situationally 
effective behavior without regard for its 
stereotype as masculine or feminine. Accord- 
ingly, it way well be—as the women’s libera- 
tion movement has urged—that the an- 
_drogynous individual will someday come to 
‘define a new. and more human standard of 
à pel health. r 
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Nonverbal Communication of Affect in Children 


Ross Buck 
University of Connecticut 


A paradigm was tested for measuring the tendency of children to send accu- 
rate nonverbal signals to others via spontaneous facial expressions and gestures. 
This paradigm was derived from studies on adults that suggest that women 
are more accurate nonverbal “senders” than men in certain situations, Eighteen 
male and 11 female preschoolers (aged 4 to 6 years) watched a series of 
emotionally loaded color slides while they were observed via a hidden tele- 
vision camera by their mothers. Videotapes of the child were later observed by 
groups of undergraduates. Results indicated that significant overall communi- 
cation occurred, with large individual differences in “sending ability” between 
children. There was no evidence of a large sex difference in sending ability in 


children, although on one measure girls were more accurate senders than boys 


when viewed by undergraduates. 


Sending ability was positively related to 


teacher’s ratings of activty level, aggressiveness, impulsiveness, bossiness, so- 
ciability, etc., and negatively related to shyness, cooperation, emotional inhi- 


bition and control, etc. 


There is evidence that nonverbal behavior 
may be a particularly important factor in the 
development of emotional expression and so- 
cial behavior. Nonverbal behaviors accom- 
pany emotional states in many species of ani- 
mals, and then often appear to play a role 
in the regulation of social behavior by aiding 
the process of communication between indi- 
viduals (cf. Vine, 1970). Abnormalities in 
nonverbal behavior may disrupt normal social 
behavior. For example, the aberrant social 
behavior of deprived rhesus monkeys may be 
related to their inability to accurately “send” 
and “receive” the nonverbal messages appro- 
priate to their species (Miller, Caul, & 
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Communication Division U85, University of Con- 
necticut, Storrs, Connecticut 06268. 

1This article uses the following terminology: 
“Nonverbal expressiveness” is a general concept 
defined in terms of “sending ability,” which is mea- 
sured primarily by “communication accuracy” and 
also “role-playing ability.” 


Mirsky, 1967). Among humans, nonverbal 
behaviors thought to signal “intimacy” have 
been shown to have considerable intraindi- 
vidual stability over time and have been 
related to low social anxiety (Patterson, 
1973). 

Measurement is a major problem in the 
study of nonverbal expression. Nonverbal 
behavior occurs in a complex flow, and the 
meaning of a particular response is often de- 
pendent upon the behaviors preceding and 
following it, as well as on the situational con- 
text. It is impossible to obtain an absolute 
measure of expressiveness: Any measure is 
necessarily a function of the particular mea- 
suring system, and there are many ap- 
proaches that one might take in developing a 
measuring system. 

This experiment involves an attempt to 
develop a paradigm for measuring nonverbal . 
expressiveness in young children. This para- 
digm measures nonverbal expressiveness in 
terms of “sending ability,” an individual's 
ability to send accurate and appropriate non- 
verbal messages to others. It was developed 
from studies in which an adult “sender” 
viewed and described his- subjective reaction 
to a series of emotionally loaded color slides 
(Buck, Savin, Miller, & Caul, 1972; Buck, 
Miller, & Caul, 1974). There are five distinct 
kinds of slides: sexual, scenic, pleasant peo- 
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ple, unpleasant, and unusual. Unknown to the 
sender, an observer watched via a hidden 
television camera the facial expressions and 
gestures the sender made while reacting to 
the slides. For each slide, the observer at- 
tempted to judge what kind of slide the 
sender had watched and how pleasant or 
unpleasant the sender’s emotional reaction 
was. These judgments were compared to the 
actual slide seen by the sender and the 
sender’s own rating of his emotional reaction. 

Results indicated that statistically signifi- 
cant nonverbal communication occurred: The 
observers could correctly judge what kind of 
slide was shown more often than chance, and 
there were significant positive relationships 
between the sender’s and observer’s ratings of 
the sender’s emotional responses to the slides.” 
There were also large individual differences 
between senders in the amount of accurate 
communication. Women were consistently 
better senders than men, and accurate senders 
gave a more personal verbal report of their 
subjective response to the slides (see Buck 
et al., 1974). Also, more accurate senders 
had smaller skin conductance and heart rate 
responses to the slides than did less accurate 
senders. Following Jones (1960), accurate 
senders with small physiological responses 
were termed “externalizers,” while individuals 
showing the opposite pattern of response 
were labeled “internalizers.” “Externalizers 


` scored higher in extraversion and self-esteem 


than did internalizers, and females tended to 
be externalizers and males, internalizers. 

.+ The study of the development of nonverbal 
expression in children may do much to illumi- 
nate the meaning of these findings. It may be, 
for example, that the sex difference observed 
in adults is in part a result of socialization 
pressures that bear down. more heavily on 
boys than on girls in our culture. Several 


2Tt should be noted that these two measures of 
communication. accuracy are complementary rather 
than independent measures, since the judgments of 
the observers are based upon the same nonverbal 
cues. Also, the precise nature of the cues involved 
in the communication are not analyzed in these 
experiments, and they might well differ from sender 
to sender. Thus, ‘one sender might be more expressive 
with his eyes than his mouth, another might be the 
reverse, etc. 
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investigators have suggested that nonverbal 
expression may become inhibited in the proc- 
ess of socialization (e.g., Izard, 1971), and 
both Jones (1960) and Lanzetta and Kleck 
(1970) have theorized that such inhibition is 
in some way associated with increased physio- 
logical responding. It may be that young boys 
are more likely to come to use an internalizing 
mode of emotional expression because they 
are generally discouraged more than girls 
from overtly expressing many emotions. 

If this is true, it is possible that young 
children will not show the sex difference in 
nonverbal expression and physiological re- 
sponding found in adults. Furthermore, it 
may be possible to study the development and 
determinants of this sex difference in progres- 
sively older samples of children. This experi- 
ment was designed as a first step in this 
direction. 


EXPERIMENT 1 


Method 
Subjects 


Ten males and 5 females constituting a class from 
the Carnegie-Mellon University nursery school served 
as senders. Their ages ranged from 4 to 6 years, 
with a mean of 67 months. Two other children in 
the class were not run because of scheduling prob- 
lems. The child’s mother was the primary observer. 
In addition, videotapes of the child were judged by 
five male and five female undergraduate observers. 


Procedure 


‘A series of 16 slides was created comprising 4 slides 
in each of four content categories. The first category 
was “familiar people,” which showed the child and 
his friends at the nursery school. Each child was 
photographed in the school a month before the ex- 
periment. During the experiment, the child saw two 
slides of himself and two general views of the school. 
The second category of slides was “unfamiliar 
people,” which depicted pleasant scenes of children 
and adults unknown to the child. The third slide 
category was “unpleasant,” which consisted of pic- 
tures judged to be mildly unpleasant to most chil- 
dren. These included the grotesque, clownlike face 
of a fashion model, a frightened character from a 
Dr. Suess book, a close-up of a grasshopper, and the 
painting “Echo of a Scream” by D. A. Siqueiros, 
which depicts a crying child. The fourth slide cate- 
gory was “unusual,” which included double expo- 
sures and strange photographic effects. It was hoped 
that the children would regard the familiar people 
slides as relatively pleasant, the unpleasant slides as 
unpleasant, and the other slides as in between. The 
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ordering of the slides was chosen by a randomly: 
selected 4 X 4 Latin square design. 

The child was brought to the experiment by his 
mother. They were greeted by the experimenter (the 
author) and an undergraduate assistant, Ms. Janet 
Worthington. The experimenter reminded the child 
of the time the child’s picture was taken at school 
and asked him if he wanted to see his pictures along 
with some others. All of the children brought to the 
laboratory complied, most with enthusiasm. 

The child was seated in a room with the experi- 
mental assistant facing a 12 X 12 inch (30.5 X 30.5 
cm) back-lighted screen on a small table. A slide 
projector focused on the screen was placed on a 
larger table against the wall opposite the child. A 
3 inch (7,6 cm) diameter pipe went through the 
wall under the table. A Sony 3200K camera in the 
next room was focused on the child’s face through 
this pipe. The camera was virtually undetectable, 
and none of the children noticed it. 

Initially, the assistant asked the child about his 
game preferences to provide data on sex role identi- 
fication. She asked him first if he liked, disliked, or 
didn’t care about a series of 20 activities. Six of 
these activities had been previously categorized as 
typical of the traditional female role (playing with 
dolls, cooking), and 5 had been judged as appropri- 
ate to the male role (playing soldier, airplanes). The 
assistant then asked the child to choose which he 
liked most of a pair of activities, one of which was 
“masculine” and the other “feminine.” There were 
10 such forced choices. 

While the child was responding to these questions, 
the experimenter escorted the mother to another 
room. The mother’s room was separated from the 
child’s room by a third room which contained the 
camera, videotape equipment, and controls for the 
slide projector. The two walls, the closed doors, and 
the distance involved eliminated most of the trans- 
mission of sound between the child’s room and the 
mother. 

The mother sat facing an 18-inch (45.8 cm) tele- 
vision receiver which was turned on, but the screen 
was blank. The experimenter explained that the 
camera was focused on the child but that he could 
be seen only when the videotape recorder was turned 
on. He gave the mother a prepared rating form 
which was divided into 16 rows, 1 for each slide. 
For each slide, there were places to mark defined 
by the slide categories familiar people, unfamiliar 
people, unpleasant, and unusual. To the right of 
these places to mark defined by the adjectives very 
pleasant, pleasant, neutral, unpleasant, and very 
unpleasant. The experimenter stated: 


[Name] will be looking at a series of 16 slides. 
There are four kinds of slides [pointing to the 
form]—Familiar people are pictures of [name] 
and his friends at school, unfamiliar people slides 
are pleasant enough pictures of people he doesn’t 
know, unpleasant slides are mildly unpleasant, and 
unusual slides show strange photographic effects, 
For each of these slides, I'd like you to watch 
[name’s] face over TV and try to guess what 
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‘Neutra Untwooy Very Unhappy 
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Ficure 1. Scale used by the child to rate his 
emotional experience. 


kind of slide he’s watching just by watching his 
facial expression. I’d also like you to try to rate 
how pleasant or unpleasant you think each slide 
made him feel by putting a mark in one of these 
columns [pointing]. Do you have any questions? 


The experimenter answered any questions and then 
proceeded to describe what would be seen on the 
television set: 


TIl turn on the TV just before the slide is 
turned on. You'll see [name] just before the slide 
comes on and see his initial reaction. Janet will 
ask him questions about the slide—you won’t be 
able to hear his answers because the audio is 
turned down, but keep watching his face for clues 
about what he’s watching. I'll turn off the TV just 
after the slide is turned off, so your screen will 
go blank. At that point, make your ratings about 
what kind of slide you think he saw and how 
pleasant or unpleasant you think his reaction to it 
was. Do you have any questions? 


The experimenter went to the control room and 
briefly turned on the Sony AV3650 videotape 
recorder to make sure that the camera was focused 
properly and to give the mother an example of what 
she would see. He then waited for the child to finish 
responding to the questions about game preferences. 

When the game preference test was over, the 
assistant turned on the slide projector, positioned 
her chair so that she sat beside he child facing the 
screen, and explained to the child about the rating 
scale he would use. The rating scale consisted of 
drawings of five faces that went from very happy to 
very unhappy (see Figure 1). She stated: 


After the picture goes off, you tell me what 
you thought of it by pointing to one of these 
faces. If you really liked the picture and it made 
you feel really good, point to the smiling face 
[demonstrating]. If you really didn’t like it at all, 
point to this unhappy face. If you liked or didn’t 
like it a little, point to one of these faces. If 
you didn’t feel anything one way or the other, 
point to this face. 


The experimenter then presented the first trial. 
A trial proceeded as follows: The experimenter 
turned on the videotape recorder and then presented 
a slide. The child looked at the slide for about 
5 sec and then the assistant asked, “What do you 
think of that?” or “Who's that?” if the child had 
not yet reacted. After 15-20 sec more, the experi- 
menter turned off the slide and then the videotape 
recorder. The assistant then showed the child the 
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TABLE 1 
Summary RESULTS FROM Four MEASURES OF COMMUNICATION ACCURACY 
Mother's rating Mean values from ratings by undergraduates 
Sender z 
Categorization Pleasantness Categorization Pleasantness 
measure measure measure measure 
Study 1 
Girls 
(n= 5) 34% +.57* 34% +.16 
Boys 
(n = 10) 36%* +.32** 29% +.27 
Study 2 
Girls P 
(n = 6) 41%" +.33** 40% +.28 
Boys 
(n= 8 37%" +.27"* 33% +.16 


Nole. Probability figures below refer to mother’s ratings only. 


*p <.05. 
D> < 01. 


series of faces, asked him how the slide made him 
feel, and recorded his answer. Throughout the trial, 
the assistant kept looking at the slide and not the 
child. This helped to keep the child’s attention 
centered on the slide. 

After the 16 trials, the child was brought to the 
mother’s room, the experiment was explained to 
both, and the child was shown a portion of the 
videotape of his facial expressions. 

The rating sessions with the undergraduate 
observers were run as follows: The observers were 
gathered in a classroom with the videotape recorder 
and 18-inch (45.8 cm) receiver in front of the 
room, They were given prepared rating forms and 
instructions similar to those given to the mother and 
were asked to remain silent while making their 
ratings. They were shown the videotapes with brief 
pauses between the trials so they could make their 
ratings. They were not given feedback about which 
judgments were correct. Two 50-minute sessions were 
required, each involving from 7 to 8 children. 


Dependent Variables 


On each trial, the observer judged what kind of 
slide the child had seen and rated on a 5-point scale 
how pleasant or unpleasant the slide had made him 
feel. This yielded two measures of communication 
accuracy that are similar to the measures used in 
studies with adults: (a) the number of slides that 
the observer could correctly categorize (categoriza- 
tion measure) and (b) the Pearson product-moment 
correlation coefficient between the observer's ratings 
of the child’s emotional reaction and the child’s own 
ratings of his emotional reaction along the pleasant- 
unpleasant scale (pleasantness measure). 


Results 


Ratings of the Slides 


The children rated each slide along a 
5-point scale by indicating which of the faces 


in Figure 1 corresponded with their reaction. 
With larger ratings indicating a negative re- 
action, the mean rating of the familiar people 
slides was 1.41, the rating of the unfamiliar 
people slides was 2.48, that of the unpleasant 
slides was 2.79, and that of the unusual slides 
was 2.46. Mann-Whitney U tests indicated 
that the familiar people slides were rated sig- 
nificantly more positive than any of the other 
three categories of slides ($ < .05). One of 
the unpleasant slides, that of the Dr. Suess 
character, was rated as being pleasant (1.93), 
and one of the unfamiliar people slides 
depicting two men was rated as rather un- 
pleasant (3.07). The rest of the slides were 
rated approximately as anticipated. One child, 
a boy, failed to give usable ratings: He rated 
all of the slides as neutral. The overall pat- 
tern of ratings suggested that most of the 
children could use the rating scale and that 
their reactions to most of the slides were 
similar to those intended. 


Communication Accuracy 


The communication accuracy data are sum- 
marized in the upper part of Table 1. By 
chance, an observer should correctly identify 
four slides (25%) if the child communicated 
nothing about them. Chi-squares were com- 
puted for each sender-observer pairing and 
combined to arrive at overall probability fig- 
ures. This procedure revealed that marginally 
significant communication occurred when the 
mother was observing, x°(15) = 24.74, p < 
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06. A Mann-Whitney U test computed 
on the raw data indicated that there was no 
significant difference between boys and girls 
in the number of slides correctly categorized 
when the mother was observing. 

The chi-squares for the pairings with 
undergraduates were combined for an overall 
probability figure for each child. This re- 
vealed that 3 of the 5 girls and 2 of the 10 
boys communicated a significant number of 
slides to the undergraduates (p < .05). This 
suggested that girls might be superior senders 
on this measure, but a Mann-Whitney U test 
indicated that this difference did not attain 
significance (U = 14). 

The Pearson product-moment correlation 
coefficients between the child’s ratings and 
each observer’s ratings were averaged to ob- 
tain overall probability figures. The boy who 
rated all of the slides as neutral was not 
included in this analysis. As Table 1 indi- 
cates, the average correlations between the 
child’s ratings and his mother’s ratings were 
significant. The correlation coefficients for the 
undergraduates, combined to give a probabil- 
ity figure for each child, revealed that 6 of 
the 14 children showed significant (p < .01) 
communication. There were no significant dif- 
ferences between male and female senders on 
this measure. 


Sex Role Identification 


No significant sex differences in communi- 
cation accuracy were found in this study. 
However, clear differences emerged between 
boys and girls in the game preference data. 
On the free-choice measure, where the chil- 
dren were asked whether or not they liked 
a series of activities, male activities were 
liked equally by boys and girls, while female 
activities were apparently rejected by boys. 
Of the 5 male activities, boys said they 
liked a mean of 3.89, while girls liked 3,40 
(U = 24, ns). Of the 6 female activities, 
boys liked 1.33 and girls liked 5.20 (U = 1, 
p<.001). On the forced-choice measure, 
both sexes were likely to choose sex-appropri- 
ate activities. Boys chose a mean of 8.89 male 
activities of the 10 choices, while the girls 
chose a mean of only 3.00 male activities 
(0 = 3, p< OL). 


Discussion 


Although the sample of subjects in this 
experiment is small, the data indicate that 
it is possible to measure significant nonverbal 
communication in young children. The data 
also indicated that there are large individual 
differences between children in noverbal ex- 
pressiveness. A second experiment was de- 
signed to replicate the results of the first 
study and to relate individual differences in 
expressiveness to other measures of emotional 
expression. 


EXPERIMENT 2 


Method 
Subjects 


Eight males and six females constituting the 
succeeding class at the nursery school served as 
senders. Their ages ranged from 5 to 6 years, with 
a mean age of 66 months. Two additional children 
could not be scheduled. Again, the child’s mother 
was the primary observer, and the videotapes of 
the child were judged by seven male and seven 
female undergraduates. 


Procedure 


The experimental procedure was identical to that 
in Experiment 1 with the following exceptions. A 
different person, Ms. Marguerite Zetwo, served as 
undergraduate assistant. Also, the unsuccessful un- 
pleasant slide of the Dr. Suess character was replaced 
with a slide showing a crying woman taken from 
the Thematic Apperception Test, and the unfamiliar 
people slide seen as unpleasant was replaced. Finally, 
an attempt was made to measure the child’s ability 
to role play the facial expressions involved in 
different emotions. 

The measure of role-playing ability was taken 
after the child had responded to the 16 slides. The 
assistant then moved the small table on which the 
back-lighted projection screen was placed and put 
her own chair in that position so that the child 
would face toward the camera when he looked at 
her. She then said: 


I want you to show me what your face looks 
like when you feel different ways. What does your 
face look like when you’re happy and glad about 
something? [Pause, with encouragement when 
necessary.] How about when you’re sad and un- 
happy? [Pause] How about when you're scared 
and afraid? [Pause] How about when you're sur- 
prised at something? [Pause] How about when 
you’re mad and angry about something? 


The child’s responses to these questions were 
videotaped and shown to the undergraduate ob- 
servers after they had drafted his responses to the 
slides. The audio was turn on so that the observers 
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could tell exactly when the questions were asked. 
They rated each response on a 7-point scale defined 
« by the bipolar adjectives “appropriate facial re- 


sponse”—“inappropriate response or no response.” 


They also rated the child’s overall role-playing abil- 
ity along a 7-point scale. 


Results 
Ratings of the Slides 


The children rated the modified series of 
slides as intended. The mean rating of the 
familiar people slides was 1.62, that of the 
unfamiliar people slides was 2.68, that of 
the unpleasant slides was 3.31, and that of 
the unusual slides was 2.34. There was no 
overlap in the distributions of mean ratings 
of the familiar people slides, the unpleasant 
slides, and the rest of the slides. The familiar 
people slides were rated as more pleasant 
than the unfamiliar people and unusual slides 
(U =0, p = .002), and the unpleasant slides 
were rated as more unpleasant than the un- 
familiar people and unsual slides (U = 0, 
p = .002). 


Communication Accuracy 


The communication accuracy data are sum- 
marized in the lower part of Table 1. For the 
- categorization measure, the chi-squares of the 
child-mother pairing were combined as be- 
fore, showing that significant overall com- 
muncation occurred when the mother was ob- 
serving, y2(14) = 34.73, p < .01. There was 
again no significant difference between boys 
and girls on this measure when the mother 
was observing. However, when the under- 
graduates were observing, girls were better 
senders on this measure than boys. As before, 
the chi-squares for each pairing were com- 
bined to give an overall probability figure 
for each child, which revealed that five of the 
six girls versus only three of the eight boys 
- showed significant (p < .05) communication. 

A Mann-Whitney U test indicated that this 

this sex difference was significant (U = 10, 

p = 041). 
-. The correlation coefficients between the 

child’s ratings and each observer's ratings 
*vere averaged as before. As Table 1 indicates, 
the -4iverige correlations between the child's 
‘ratings and his. mother’s ratings were again 
significant, Theaverage correlation coefficients 
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of the undergraduates for each child indicated 
that 8 of the 14 children showed significant 
(p < .01) communication. As before, there 
were no significant differences between the 
girls and boys on this measure of communi- 
cation. 


Sex Role Identification 


The pattern of results on the game pref- 
erence measure was similar to that in the first 
study. Girls liked the female activities more 
than did boys (U = 0, p < .001), boys liked 
male activities somewhat more than did girls 
(U = 8, p = .021), and boys chose more male 
games than did girls (U = 0, p < .001). 


Role Playing 


The mean ratings by the undergraduates of 
the appropriateness of the child’s facial re- 
sponses when asked to role play are presented 
in Table 2. Girls and boys showed a similar 
ordering in their related ability to enact the 
expressions appropriate to different emotions, 
They were best at showing happiness and 
progressively worse at enacting sadness, sur- 
prise, fear, and anger. Multiple sign tests, 
done with the criterion of significance set at 
the .01 level, indicated that the expressions 
of happiness were rated significantly more 
appropriate than those of fear and anger. 
There were no significant differences between 
boys and girls on rated role-playing ability. 


Individual Differences 


As in the previous experiment, marked dif- 
ferences between children in their ability to 
send accurate nonverbal messages were noted. 


TABLE 2 
MEAN RATINGS OF THE APPROPRIATENESS OF 
NS’ ENACTMENTS OF 
NT EMOTIONS 


= -= > —S — = 
Overall 
ja k ey 3 rating: 
Sari Happy | Sad Sa Afraid | Angry | role 
playing 
ability 
Girls 3.67 | 3.36 | 2.98 | 2.68 | 1.77 | 288 
Boys 3,99 3.49 2.92 2.69 2.44 3.12 
Average | 3.870 | 3.4445 | 2.940» | 2.69, | 2.185 
Note. A high rating indicates appropriate enactment. Differ- 


ences between emotions were tested by multiple sign tests. 
Cells with subscripts in common were not significantly different 
from one another at the .01 level of confidence, 
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TABLE 3 


TNTERRELATIONSHIPS BETWEEN Four MEASURES OF COMMUNICATION ACCURACY 
AND RATED ROLE-PLAYING ABILITY 


Mother's Mother's Undergraduate’s | Undergraduate's | Overall rating: 
Measure categorization pleasure categorization | pleasure role playing 
Mother’s categorization — +.49* +.55** +40) +32) 
Mother’s pleasantness x — +.24 +74" +54 
Undergraduates’ categorization — +.58 a ae 


Undergraduates’ pleasantness 


sp S08. 
wkp < 01. 

These differences were consistent across the 
different measures of sending ability used in 
this study. Table 3 shows that all of the 
intercorrelations between the different mea- 
sures were positive and most were significant, 
so that a child who was a poor sender when 
observed by undergraduates also tended to be 
a poor sender to his mother and tended to be 
rated as poor in role playing. 

These measures of sending ability were 
correlated with the child’s scores on the Min- 
nesota Child Development Inventory. Few 
significant relationships were found, perhaps 
because all of the children in the sample at- 
tained high scores on the inventory and the 
range on that variable was restricted. 

To further explore the meaning of the indi- 
vidual differences in sending ability, a 36-item 
scale was constructed from the results of pre- 
vious experiments that investigated internal- 
izing versus externalizing modes of response 
to emotion (Block, 1957; Buck et al., 1972, 
1974; Jones, 1935, 1960; Lanzetta & Kleck, 
1970; Learmonth, Ackerley, & Kaplan, 1959). 
The scale consisted of a series of statements 
which referred to attributes previously found 
to distinguish internalizers and externalizers. 
For example, different items asked whether 
the stimulus person “plays alone most of the 
time,” “often shows aggression,” “is coop- 
erative,” etc. The rater was asked to judge 
whether each attribute was always, often, 
sometimes, rarely, or never characteristic of 
the stimulus person. 

This scale was given to three teachers at 
the nursery school who were familiar with all 
of the children but who did not know the 
results of the communication experiment. 
Their ratings of the 14 children on each item 


were correlated with the childrens’ sending 
ability. These correlations were then averaged 
across the three teachers, and probability fig- 
ures were computed. This procedure indicated 
that, analyzing boys and girls separately and 
using all measures of expressiveness, sending 
ability was significantly related to a number 
of other kinds of emotional expression. In 
particular, sending ability was positively re- 
lated to ratings of a high activity level, a 
direct expression of hostility, and being an 
extravert. Sending ability was negatively re- 
lated to rated cooperation, emotional control, 
and solitary play. 

The relationships between the teachers’ 
ratings and three of the measures of sending 
ability are summarized in Table 4. One fea- 
ture stands out from this data. Sending abil- 
ity was related to a number of characteristics 
that might potentially disturb social relation- 
ships, such as bossiness, impulsivity, aggres- 
siveness, and low cooperation. However, send- 
ing ability was also related to having many 
friends at the school, and it was negatively 
related to shyness and solitary play. 


GENERAL DISCUSSION 


The experiments establish that it is possi- 
ble to measure nonverbal expressiveness or 
sending ability in young children in a brief 
testing session that was enjoyed by most 
children, and that it is possible to use special 
rating scales with children of this age in the 
process of this measurement. The experiments 
did not find evidence of a large sex difference 
in expressiveness in these children. They did 
find initial indications that the present mea- 
sures of nonverbal expressiveness are related 
to other kinds of emotional behavior. 


| 
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TABLE 4 
RELATIONSHIPS BETWEEN TEACHERS’ RATINGS AND MEASURES OF NONVERBAL SENDING ABILITY 
Girls Boys 
(n = 6) (n = 8) 
Teacher's rating 
Overall Mother's Rater's Overall Mother's Rater's 
rating: | pleasantness | pleasantness| rating: | pleasantness | pleasantness 
role playing measure measure role playing measure measure 
Items negatively correlated with sending ability 
Plays along much of the time. —47 —44 —34 —66*** GEL: N ag =50%* 
Is shy, fears strangers. —54 =64** —62** —54 —14 —09 
Keeps to himself. —A4l —43 —34 =62""* = —54** 
Is emotionally inhibited. —42 —52* —50* —44** —19 =23 
Controls his emotions. —42 =76* —84** mie —51* —34 
Is quiet and reserved —35 —56** —61** —57*** —12 =23 
Is cooperative. =27 —65** =i" —70*** —42* =25 
Ts an “introvert.” —46 =59%* —61** —65*** —33 —41* 
All negatively correlated items —43 —58** —56* —78*** —52** —41* 
Items positively correlated with sending ability 
Often shows aggression. +43 +68*** +81*** +66*** +31 +22 
Has a high activity level. +12 +78*** +82" +54*** +12 +01 
Has many friends at the school +80*** +78*** +73*** 35 44* +32 
‘Expresses his feelings openly. +62* +64" +68** +51""* +07 —01 
Is impulsive. +31 +47* +46 =09"* +23 +24 
Ts often difficult to get along with. +43 +71*** +78*** +53** +21 +27 
Expresses his hostilities directly. +35 +55 +54* +83*** +-68""" +48** 
Tends to be rebellious and non- 
conforming. +30 +52** +68*** +-72*** +11 —11 
Ts bossy. +15 +60** +07""* +09*t* +18 +16 
Often dominates other children. +24 +52* +65*** 54** 14 +19 
Ts an “extravert.” +41 59%" +63** +57"* +38 +11 
All positively correlated items +47 +71""* +76*** +-83""* +40 +27 
Total: positively correlated minus 
negatively correlated items +43 +67** +69** +8254" +46* +34 


Note. Correlations in the table are the averaged correlations of three teachers, 


* p <.05. 
** p <.025. 
p < .005. 


Sex Differences 


The game preference measure was included 
in these experiments to determine whether 
these children were in fact influenced by the 
traditional sex role in the face of possible 
current concern about sexual stereotyping 
among their parents. The data indicate that 
these children showed 4 very strong tendency 
to choose sex-appropriate games, which sug- 
gests that they have indeed been affected by 
the traditional sex roles, at least in this re- 
spect. Thus whatever the reason for the lack 
of a large sex difference in sending ability in 
these experiments, it was probably not because 
of a lack of traditional sex role identification 


among these children. 


It may be that the children in this experi- 
ment were simply too young to show a sex 
difference in expressiveness. Assuming that 
the learning of sex-appropriate patterns of 
expressive behavior is in fact the cause of the 
sex difference in sending ability among adults, 
it might be expected that such learning is 
more complex and takes more time than the 
learning of sex-appropriate games. Also, in 
Experiment 2, the undergraduate observers 
guessed the kind of slide being viewed by the 
girls more accurately than they did the boys. 
Perhaps this indicates the initial emergence 
of the eventually greater sending ability of 
women. This can be investigated in experi- 
ments using older groups of children. 
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Role-Playing Ability 


The fact that the undergraduate observers’ 
ratings of role-playing ability was related to 
the measures of communication accuracy is 
interesting, although it should be noted that 
the observers made these ratings after they 
had seen the child respond to the 16 slides, 
and it is possible that their impressions of the 
child’s communication accuracy may have 
influenced their rating of his role-playing 
ability. The lower ability of the children to 
role play the emotions of fear and anger was 
not predicted, but it is consistent with the 
general contention that socialization inhibits 
the expression of socially undesirable emo- 
tions. A similar finding has been reported by 
Odom and Lemond (1972), although these 
investigators found that children could pro- 
duce appropriate expressions of anger. 

One interesting aspect of the role-playing 
data might be noted. Figure 4 shows that 
among girls, the teacher’s ratings were related 
more closely to the measures of spontaneous 
facial expressiveness than they were to the 
ratings of role-playing ability, while among 
boys the opposite was the case. This pattern 
of relationships was not predicted, and any 
attempt at explanation must be tentative. 
However, one might speculate that the be- 
havior the teacher saw in the nursery school 
was based more upon natural expressiveness 
than role playing in girls and more upon role 
playing than natural expressiveness in boys. 
This would be consistent with the hypothesis 
that the socialization of emotional expression 
is more severe among males in our culture. 


Individual Differences 


The pattern of relationships between send- 
ing ability and the teachers’ ratings depicted 
expressive children as being sociable and hav- 
ing many friends, even though they are also 
often impulsive, aggressive, bossy, and hard to 
get along with. This picture of the outgoing 
but impulsive externalizer and the responsible 
but inhibited internalizer is consistent with 
the results of other studies using very dif- 
ferent procedures and populations of sub- 
jects. For example, Jones (1960) found 
among adolescents (ages 12-18) that “high 
galvanic skin response reactors” (internaliz- 
ers) were quiet, reserved, good-natured and 
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cooperative, while “low galvanic skin response 
reactors” (externalizers) were talkative, ani- 
mated, active, and impulsive. Block (1957) 
found among medical school applicants that 
high reactors were, among other things, man- 
nerly, submissive, and withdrawing; low re- 
actors were independent, ingenious, and re- 
bellious. Recently, Crider and Lunn (1971) 
found electrodermal lability to be negatively 
related to extraversion and impulsivity. 
The fact that these attributes taken from 
the literature were found to relate to the 
present measures of sending ability in chil- 
dren suggests that these measures can be 
meaningfully used to study the development 
of internalizing and externalizing modes of 
emotional expression. The present results 
also suggest a tentative hypothesis to guide 
such a study. We have seen that sending 
ability in adults has been related to person- 
ality differences involving intraversion—extra- 
version and self-esteem and to sex differences. 
In young children, a relationship with emo- 
tional behaviors which seem relevant to these 
personality differences is present, while a sex 
difference is not. If one assumes that the 
behavior of these young children is based 
more upon innate temperamental propensities 
and less upon socialization than is the be- 
havior of adults, the hypothesis is suggested 
that the personality difference in facial ex- 
pressiveness may be based upon innate fac- 
tors, while the sex difference is based upon 
socialization. This is, of course, offered as a 
tentative hypothesis pending more data. 


Measurement of Sending Ability 


As the present paradigm for measuring 
sending ability is, like all others, dependent 
upon the particular procedures of measure- 
ment, it might be well to discuss and give the 
rationale for these procedures. Frijda’s (1969, 
pp. 172-176) analysis of studies of emotional 
recognition accuracy can be applied to this 
problem. Frijda points out that studies of 
recognition accuracy differ according to sev- 
eral dimensions: (a) the nature of the stimuli 
employed, (b) the range of those stimuli, (c) 
the nature of the task confronting the ob- 
server, (d) the responses permitted to the 
observer, and (e) the computation of accu- 
racy scores. The following paragraphs de- 
scribe and give the rationale for the approach 
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that the present paradigm makes in each of 
these areas. 


1. Regarding the nature of the stimuli, the 
present approach shows observers the tele- 
vised sequences of the spontaneous expres- 
sions of senders to different kinds of emo- 
tionally loaded slides. The use of such stimuli 
avoids the criticisms leveled against studies 
using posed expressions and/or single photo- 
graphs. 

2. In regard to the range of stimuli em- 
ployed, this approach samples a number of 
different stimulus emotions by showing the 
sender different kinds of slides. Thus, there 
is at least some attempt to approach the 
problems of sampling different kinds of emo- 
tions that have been pointed out by Brunswik 
(1956) and others. 

3. Frijda has distinguished three kinds of 
tasks that have been used in recognition stud- 
ies. “Situational reference” consists of iden- 
tifying the kind of situation that evokes the 
stimulus emotion, “action anticipation” con- 
sists of anticipating the sender’s behavior, and 
“emotional experience” involves labeling or 
rating the stimulus emotion directly (Frijda, 

* 1969). The present approach provides both a 
-situational reference task, in that the observer 
: chooses the kind of slide that evoked the 
sender’s response, and an emotional experience 
~ task, in that the observer rates the sender’s 
emotional experience directly. 

4. The responses permitted to the observer 
include choosing which kind of slides were 
presented on each trial and rating the pleas- 
antness of the sender’s emotional response. 
Observers have been able to grasp readily the 
meaning of these tasks and to respond quickly, 
and these responses have been found to be 
useful in measuring communication accuracy 
in several studies. 

5. The scoring system allows for the com- 
putation of accuracy scores whose significance 
can be tested. The categorization measure of 
the number of correct categorizations of slides 
relative to chance can be evaluated for each 
subject by chi-square tests and then combined 
for group chi-squares (McNemar, 1955). The 
pleasantness measure consists of a Pearson 
product-moment correlation coefficient com- 
puted between the sender’s and observer’s 
ratings of the sender’s emotional experience. 


653 


The significance of the individual correlation 
coefficients may be tested, and they can be 
transformed to z scores and averaged to ob- 
tain group correlation indices (McNemar, 
1955). 
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Antipollution Measures and Psychological Reactance Theory: s 
A Field Experiment F 


Michael B. Mazis 
College of Business Administration, University of Florida 


A field experiment was conducted to determine if hypotheses derived from 
psychological reactance theory could explain response to the implementation 
of an antiphosphate ordinance. Deprived housewives expressed more positive 
attitudes toward the eliminated alternative than did control subjects thereby 
supporting reactance theory predictions. Within the experimental group, sub- 
jects were divided into two groups based on their degree of choice depriva- 
tion. As predicted, subjects forced to switch from their preferred detergent 
brand expressed less favorable attitudes about the effectiveness of no-phosphate 
versus phosphate detergent than subjects who could maintain brand continu- 
ity. Reduced attractiveness of forced alternative rather than enhancement of 
forbidden alternative was the principal mode of response resulting from psy- 


chological reactance. 


Psychological reactance theory asserts that 
when a person believes himself free to engage 
in a given behavior and his freedom is elimi- 
nated or threatened with elimination, the indi- 
vidual experiences psychological reactance, a 
motivational state directed toward reestablish- 
ment of the threatened or eliminated freedom 
(Brehm, 1966). One major determinant of 
reactance, external pressure applied through 
social influence or persuasive communications, 
has been studied extensively. Laboratory ex- 
periments have typically manipulated social 
influence by having confederates restrict sub- 
jects’ freedom of choice through freedom 
threatening statements (Brehm & Sensenig, 
1966; Worchel & Brehm, 1971). 

Recent research exploring the influence of 
persuasive communications has focused on 
factors attenuating reactance effects. Con- 
formity pressures (Grabitz-Gniech, 1971; 
Pallack & Heller, 1971), the involvement of 
peers in eliminating alternatives (Worchel & 
Brehm, 1971), and individual difference 
variables, including manipulated felt compe- 


This study was supported by a grant from the 
College of Business Administration, University of 
Florida. 

The author thanks Robert B. Settle for aiding 
conceptualization of the study and Dennis C. Leslie 
for assisting in data analysis. 

Requests for reprints should be addressed to 
Michael B. Mazis, College of Business Administra- 
tion, University of Florida, Gainesville, Florida 32611. 


tence (Wicklund & Brehm, 1968), feelings of 
inadequacy (Grabitz-Gniech, 1971), and locus 
of control (Biondo & MacDonald, 1971) have 
been found to influence reactivity. 

While the loss of options may be a major 
determinant of psychological reactance also, 
it has not been fully explored in reactance 
research. Grabitz-Gniech (1971) and Brehm, 
Stires, Sensenig, and Shaban (1966) examined 
the effect of reducing the choice of records 
and paintings, respectively, among college 
students in laboratory experiments. While 
statistically significant reactance effects were 
found, both experiments failed to present an 
ego-involving situation in which subjects lost 
an important freedom. The possibility of de- 
mand characteristics producing a reactive re- 
sponse must be considered. Perhaps a more 
appropriate laboratory for experimentation is 
the marketplace where consumers’ freedom 
of choice has been frequently restricted 
through governmental action such as requiring 
safety devices on automobiles; recalling 
soups, cranberries, tuna, drugs, and automo- 
biles; and restricting the use of energy. Un- 
fortunately, the only field study (Weiner & 
Brehm, 1966) examining reactance effects 
was debilitated by the failure to apply unob- 
trusive pressure to subjects, thereby negating 
a major advantage of field experimentation. 

The elimination of free choice by authority 
sources has been explored in reactance re- 
search, but the only authority source studied 
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has been the psychological experimenter 
(Hammock & Brehm, 1966). For research 
involving authority sources to have generaliz- 
ability, particularly for social issues, a wider 
sampling of authority sources is needed. For 
example, Davis and Eichhorn (1963) have 
studied the issue of compliance and noncom- 
pliance with physicians’ regimens. The cur- 
rent experiment is directed toward determin- 
ing if responses to the imposition of a “so- 
cially beneficial” governmental action (anti- 
phosphate ordinance) might be predicted 
from psychological reactance theory. 

Obviously, not all individuals react against 
all restrictions of free choice. It remains for 
research to specify which subjects and under 
which restrictions psychological reactance is 
manifested when options are eliminated or 
curtailed by authority sources. 

The principal consequence of reactivity 
needs to be specified more precisely also. The 
present study explores two major manifesta- 
tions of reactivity, enhanced attractiveness of 
restricted alternatives and reduced desirability 
of alternatives forced upon subjects. Most 
reactance studies have failed to separate these 
two effects. 

The present investigation concerns response 
to an event affecting many households 
throughout the United States—imposition of 

` an antiphosphate law. On Jaunary 1, 1972, 
Dade County (Miami), Florida, began pro- 
hibiting the sale, possession, or use of laun- 
dry detergents and other cleaning products 
containing phosphates. Since only a small 
number of popular brands were available in 
no-phosphate formulations, shoppers found 
their choice of laundry detergent drastically 
diminished. These dramatic changes in the 
number of choice alternatives provided an 
opportunity to examine psychological reac- 
į tance theory predictions in a field setting. 
2] According to reactance theory, Miami 
households would be motivationally aroused 
¿ de to the phosphate restrictions and feel an 
?Spcreased desire to have the forbidden de- 
4.» tergent as contrasted with households in a 
*:* gontrol city (Tampa) whose freedom of choice 
ws unrestricted. The reactance aroused in 
Méami subjects should result in higher effec- 
figengss attributed to phosphate detergents as 
aired with Tampa subjects and more 
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negative attitudes toward governmental regu- 
lation of environmental matters which should 
influence the product evaluations of Miami 
consumers. 

Since Miami households could be classified 
into three major subgroups, additional reac- 
tance theory predictions may be made. The 
first group consisted of housewives who were 
able to continue purchasing products labeled 
by their favorite brand names, but which 
were now being sold without phosphates; 
these households are referred to as “non- 
switchers.” Since one leading detergent manu- 
facturer quickly began distributing no-phos- 
phate reformulations of all its existing laundry 
detergent brands, users could continue to 
purchase their regular brands after the anti- 
phosphate law went into effect. 

Several other detergent manufacturers did 
not begin distribution of no-phosphate re- 
formulations for several months and there- 
fore users were forced to switch brands after 
the phosphate restrictions went into effect. 
These consumers, who were forced to switch 
from their regular brands, are referred to as 
“switchers.” 

Based on their dissimilar motivational 
states, it is anticipated that (a) switchers 
would express more negative attitudes about 
the effectiveness of no-phosphate products 
and/or more positive attitudes about the 
effectiveness of phosphate products than 
would nonswitchers and (b) switchers would 
have a more negative attitude toward the 
antiphosphate law than would nonswitchers. 

Switchers should feel a greater restriction 
of their freedom of choice than should non- 
switchers as a result of being unable to pur- 
chase a laundry detergent sold under their 
favorite brand label. It must be remembered 
that both switchers and nonswitchers were 
using a totally new product; however, in one 
case consumers were using a product with a 
“new” label or brand name, while in the other 
they were able to use a reformulated product 
being sold under their favorite brand name. 

The third group of Miami households are 
those who defied the antiphosphate ordinance 
by smuggling detergent into Miami from sur- 
rounding counties or who accummulated large 
amounts of phosphate detergent before the 
January enforcement date. Since these smug- 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR EFFECTIVENESS 
RATINGS cr PHOSPHATE DETERGENTS 


Miami Tampa 
(n = 76) (n = 45) 
Characteristic 

M SD M SD 
Whiteness 8.68 1.87 8.27 1.56 
Freshness 8.77 | 2.08 7.87 | 1.51 
Cleans in cold water | 8,52 | 2.15 7.47 | 1.83 
Brightness 8.31 | 2.09 7.84 | 1.80 
Stain removal 8.00 | 2.43 | 6.96 | 2.05 
Pours easily 9.45 2.05 9.07 1.77 
Gentleness 8.81 1.80 | 8.71 1.51 


Note, Based on an 11-point scale with 11 labeled ‘absolutely 
perfect” and 1 labeled “poor.” 


glers and hoarders may have had very favor- 
able attitudes toward phosphate detergents 
before the enactment of the no-phosphate 
restrictions, no predictions can be made con- 
cerning their attitudes about the effectiveness 
of phosphate brands based on reactance the- 
ory. Their predisposition toward phosphate 
brands may have been instrumental in their 
being categorized as “violators.” On the other 
hand, switchers and nonswitchers were placed 
into those categories as a result of detergent 
manufacturers’ decisions about whether to 
produce no-phosphate reformulations of de- 
tergent brands.* 


METHOD 


An instrument was designed to determine attitudes 
about laundry detergents and related products, 
opinions about laws regulating the use of phos- 
phates, and demographic characteristics. Seventy-six 
interviews were completed by four female interview- 
ers in Miami and 45 completed questionnaires were 
returned by the two female interviewers in Tampa. 
Nine questionnaires were discarded as a result of 
incomplete responses. Interviews were conducted 
from a period from 7 to 9 weeks after the anti- 
phosphate statute became effective. 

Since only limited funding was available to defer 
the cost of interviewing, sample selection in both 
Tampa and Miami emphasized subject homogeneity 
to restrict the impact of extraneous variables. To 
enhance the similarity between Tampa and Miami 
samples, all respondents interviewed were: (a) Eng- 
lish-speaking Caucasian women; (b) with family 
incomes of $7,500 to $15,000 per year; (c) who had 
at least one child under 16 years of age living at 


1 No subjects classified as nonswitchers switched to 
another manufacturer’s brand after enactment of the 
antiphosphate ordinance. 


home; (d) who resided in single-family dwelli 
and (e) who used primarily phosphate det 
brands during the preceeding 6 months. Comparisons 
between Tampa and Miami samples revealed no sig- 
nificant income, age, or educational differences. 

Two middle-income census tracts were chosen in 
both cities and blocks were randomly selected within 
tracts. Two sampling points were randomly desig- 
nated within each block and interviewing com- 
menced from each point in a clockwise direction 
until two respondents were found who met thi 
teria enumerated above. 


RESULTS 
Miami versus Tampa 


At the beginning of each interview, sub 
jects were asked to rate the effectiveness oj 
the phosphate laundry detergent they he 
used most during the previous 6 month 
Seven brand characteristics were evaluated an 
an 11-point scale labeled “absolutely p 
fect” and “poor” at the end points, Salier 
characteristics were determined through a 
free association procedure similar to that used 
by Fishbein (1967). 

Reactance theory predictions are supported 
by the data in Tables 1 and 2. On all seven 
characteristics, Miami subjects gave higher 
mean effectiveness ratings to phosphate 
brands than did Tampa subjects. 

The two way Groups X Trials analysis of 
variance (Veldman, 1967, pp. 247-257) in 
Table 2 provides statistical support for the 
Proposition that Miami housewives were in a 
reactive motivational state. Overall, Miami 
subjects rated phosphate detergents as bei: 
more efficacious than did Tampa subje 
(F = 5.42, p < .05). 

The significant interaction between cities 
and characteristics (F = 2.27, p < .05) show: 
that the reactivity of Miami housewives was 
not manifested uniformly across all character- 


TABLE 2 


ANALYsIS OF VARIANCE FOR EFFECTIVENESS RATINGS 
OF PHOSPHATE DETERGENTS 


Source MS F 
Groups: Miami vs. Tampa 77.74 5.42* 
Error (Groups) 14.34 
Trials: Characteristics 35.06 20.07** 
Groups X Trials 3.97 2.27* 
Error (Trials) 1.75 


*p <.05. 
** b < 001. 
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TABLE 3 


ATTITUDE TOWARD GOVERNMENT AND 
ANTIPHOSPHATE Laws 


Miami Tampa 
Wariable m + 
t M | SD | u |-SD 


“Fhe government 
should play an im- 

. portant role in pro- 
tecting our water 
from pollution. 

Legal restrictions 
should be imposed 
against the sale of 
detergents contain- 
ing phosphates. 


2.39 | .90 | 1.64 | 1.20 | 3.62* 


2.76 |- .93 | 2.20} 1.17 | 2.72* 


Note. Responses on Likert-type scale with ‘1’ indicating 
strong agreement and *“‘5” indicating strong disagreement. 
*p < 01, 


istics. Also, there was a significant main effect 
for characteristics indicating that subjects did 
discriminate among various detergent attri- 
butes by not rating detergent brands as being 
equally effective on all characteristics (F = 
20.07, p < .001). 

Miami subjects expressed a less optimistic 
view about the success of governmental action 
in solving water pollution problems and 
toward the usefulness of phosphate content 
regulation than did Tampa subjects (see 
Table 3). 

Tampa residents stated a stronger degree of 
agreement with the statement, “The govern- 
ment should play an important role in pro- 
tecting our water from pollution,” than did 
Miami subjects (¢ = 3.62, p < .01). In addi- 
tion, Tampa housewives had greater expecta- 
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tions about the usefulness of laws restricting 
the sale of detergents containing phosphates 
than did Miami respondents (¢ = 2.72, p < 
.01), thereby sustaining the view that Miami 
housewives were experiencing psychological 
reactance. 


Switchers versus Nonswitchers 

Brand ratings. According to psychological 
reactance theory, the amount of freedom 
eliminated directly influences psychological 
reactance. Since switchers were experiencing 
more psychological choice deprivation, they 
should provide lower mean effectiveness rat- 
ings to no-phosphate brands than should non- 
switchers. 

The data in Table 4 shows that of the 76 
Miami households responding (9 of which had 
incomplete data), the 44 switchers provided 
lower average effectiveness ratings across all 
seven detergent attributes than did the 23 
nonswitchers. The Groups X Trials analysis 
of variance in Table 5 reveals significantly 
lower attractiveness estimates for no-phos- 
phate products by switchers than by non- 
switchers (F = 4.48, p < .05). 

The greater degree of psychological reac- 
tance being experienced by switchers may 
take the form of increased attractiveness of 
the restricted alternative (phosphate deter- 
gent), as well as decreased attractiveness of 
the alternative forced upon subjects (no- 
phosphate detergent). Similar ratings of phos- 
phate detergent effectiveness between Tampa 
and Miami subjects are shown in the middle 
sections of Tables 4 and 5 (F = .00, p< 


TABLE 4 
MEANS AND STANDARD DEVIATIONS FOR EFFECTIVENESS RATINGS OF DETERGENTS 


No-phosphate detergents Phosphate detergents Difference scores 
Characteristic Switchers Nonswitchers Switchers Nonswitchers Switchers Nonswitchers 
M SD M SD M SD M SD M SD M SD 

Whiteness 7.07 | 1.60 | 7.87 | 1.72 | 8.59 | 1.63 | 8.61 | 1.51 79 | .74 | 1.70 
PERES 7.11 | 1.70 | 7.85 | 1.85 | 8.75 | 1.61 | 8.78 | 1.41 1.17 | .91 | 1.88 
Cleans in cold water | 6.36 | 2.12 | 7.61 | 2.37 | 8.72 | 1.93 | 8.43 | 1.42 1.43 | .83 | 2.43 
Brightness 6.30 | 2.00 | 7.43 | 2.13 | 8.41 | 1.78 | 8.17 | 1.61 1.51 | .74 | 2.26 
Stain removal 6.14 | 2.53 | 741 | 1.96 | 7.79 | 2.00 | 8.13 | 1.95 1.27 | .70 | 2.01 
Pours easily 8.45 | 2.01 | 9.17 | 2.37 | 9.30 | 1.41 | 9.78 | 1.89 1.05 | .61 | 1.62 
Gentleness 7.23 | 1.92 | 8.04 | 2.08 | 8.93 | 1.70 | 8.74 | 1.21 1.23 | .70 | 2.01 


a 


f: + Note, Based on an 11-point scale labeled “absolutely perfect” 


* and “‘poor” at end points. 
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TABLE 5 
ANALYSIS OF VARIANCE FOR EFFECTIVENESS RATINGS OF DETERGENTS 


No-phosphate detergents 


Phosphate detergents Difference scores 


Source 7 - 
df MS F df MS F df MS | F 
Groups: Switchers vs. P 
nonswitchers 1 99.06 4.48* 1 05 .00 1 99.27 Tial bzs 
Error 65 22.09 65 12.88 65 13.21 
Trials: Characteristics 6 BO. ml eat: 6 15.33 13.92*** 6 8.31 5 i 
Groups X Trials 6 94 67 6 1.32 1.20 6 3.22 , 
Error 390 1.40 390 1.10 390 1.60 
*p <05, 
wD <.01. 
wee bd <.001, 


-99). These nonsignificant findings are ex- 
plored in the final section of this article. 

In a within-subjects experimental design, 
the “total reactance effect” (Hammock & 
Brehm, 1966) can be examined by analyzing 
difference scores generated by subtracting 
each subject’s rating of no-phosphate deter- 
gent from her rating of phosphate detergent 
for each characteristic. In this way both hy- 
potheses derived from reactance theory (en- 
hancement of forbidden alternative and dero- 
gation of forced alternative) can be tested 
simultaneously. The last sections of Tables 4 
and 5 indicate that switchers felt there was a 
substantial difference in efficacy between phos- 
phate and no-phosphate formulations whereas 
nonswitchers felt that no-phosphate brands 
were only slightly less effective than phos- 
phate variations (F = 7.51, p < .01). 

Attitudes toward detergent use. In addition 
to rating phosphate and no-phosphate deter- 
gents on seven characteristics, subjects evalu- 
ated the potency and cost of alternative 


formulations. As a result of being in a reac- 
tive motivational state, subjects who were 
forced to switch from their favorite brands 
would be expected to express more negative 
attitudes about the amount of detergen 
extra ingredients required for each washloa 
and about the cost of washing clothes when 
using no-phosphate detergent than would non 
switchers. . À 
Table 6 indicates that while nearly a 
of switchers felt they had been using more 
no-phosphate detergent, x? (2) = 9.45, % 
.01, and additional ingredients (e.g., b 
or fabric softener), x? (2) = 4.15, p <. 
with each washload than they did with phos- 
phate detergent, less than 10% of nonswitch- 
ers felt this was the case. There is a similar 
disparity about perceptions of the relative 
cost of using phosphate and no-phospha‘ 
products. Over 71% of switchers beli: 
that no-phosphate brands were more expec: 
to use than products containing phosphates, 
but only 30% of nonswitchers expressed neg- 


TABLE 6 
ATTITUDE TOWARD DETERGENT USE > 


Amount of A t of ext si 
ant an ead E Taare 
Group — 
More Less Same More ores More ame 
Switchers 14 4 26 14 30 31 3 = 
7 (32%, (10%) (58%) (32%) (68%) (71%) (29%) 
Nonswitchers is 5 17 2 21 7 16 
(4%. (28%) (68%) (8%) (92%) (30%) (70%) 


a x2(2) = 9.45, p < 01. 
b X = 4.15, p < .05. 
e x?(1) = 9.58, p < .001. 


4 Categories combined to provide sufficient expected value per cell, 


= 
$ 


i et. 
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ative feelings about the cost of using their 
reformulated no-phosphate brands, x? (1) = 
9.58, p < .001. 


DISCUSSION 


While student subjects in laboratory ex- 
periments have often behaved as predicted 
by psychological reactance theory, the current 
study provides strong support for the theory’s 
predictions among housewives in a field set- 
ting. As predicted, Miami subjects rated 
phosphate detergents as being more effective 
than did Tampa subjects. Hypotheses derived 
from psychological reactance theory concern- 
ing Miami versus Tampa subjects’ attitudes 
toward government’s role in protecting the 
public from water pollution and in imposing 
legal restrictions against the sale of products 
containing phosphates were sustained as well. 

Since adaptation effects (Helson, 1964) 
may have contributed to the enhanced at- 
tractiveness of phosphate detergent as a re- 
sult of expanded use of the less effective no- 
phosphate brands, the most compelling evi- 
dence supporting a reactance theory interpre- 
tation is found in the attitudinal differences 


; of switchers and nonswitchers, Analysis of 


effectiveness ratings revealed that there were 
no significant mean differences in the evalu- 
ation of alternative brands of no-phosphate 
detergent. As a result, the differential atti- 
tudes expressed by switchers and nonswitch- 
ers can be attributed to psychological reac- 
tance rather than to physical differences in 
the brands used. 

This research does make several contribu- 
tions to the psychological reactance literature. 
First, it demonstrates that psychological re- 
actance is not solely a laboratory phenomena. 
Housewives did feel more psychological choice 
deprivation when they were forced to switch 
from their favorite phosphate brands to un- 
familiar brands of no-phosphate laundry de- 
tergent. 

Second, greater insight into the conse- 
quences of reactance were obtained. Negative 
attitudes about the effectiveness of the alter- 
native forced upon housewives rather than 
enhancement of the eliminated alternative 
was found. 

Since most subjects expressed satisfaction 
with their laundry detergents, psychological 
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reactance would be less likely to take the form 
of increased desirability of the eliminated al- 
ternative as compared with decreased attrac- 
tiveness of the forced alternative due to ceil- 
ing effects. The mean rating for control group 
(Tampa) subjects was 8.02 on an 11-point 
scale. While the circumstances surrounding 
this experiment may be unique, it is possible 
that reactance often takes the form of a re- 
duction in the attractiveness of the forced al- 
ternative since people are satisfied with the 
products they use as a result of dissonance 
reduction. More valuable than such specula- 
tion would be further research on this topic. 
Third, public policy implications of psycho- 
logical reactance research have been made 
apparent. From a societal perspective, the 
most appropriate laboratory for psychological 
reactance research might be the consumer 
market. While governmental agencies and leg- 
islators issue administrative decisions and en- 
act ordinances which restrict freedom of 
choice, consumer assessment after restriction 
is rarely undertaken. Too much restriction 
may result in extensive noncompliance. How- 
ever, additional research is needed to assess 
the degree of consumer reactivity as a result 
of: (a) the issue, (b) the proportion of 
choice alternatives remaining and number of 
substitutes available, (c) the authority sources 
(e.g., federal vs. local and state government; 
legislative vs. executive decisions) used to 
eliminate freedom of choice, and (d) the per- 
manence of a reactive motivational state. 
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Self-Fulfilling Prophecies: 
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Triangle Hypothesis 
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In an attempt to assess the generalizability of the perceptual “triangle hy- 
pothesis” proposed by Kelley and Stahelski, the expectations, perceptions, and 
behaviors of 70 subjects were measured in two conflict situations. Support 
for the triangle hypothesis came from data gathered in a situation similar to 
that employed by Kelley and Stahelski. In this situation, individuals who 
adopted a competitive orientation tended (a) to expect more homogeneity in 
the orientations of their opponents than did individuals with a cooperative 
orientation and (b) to dominate behaviorally individuals with a cooperative 
orientation. Neither of these effects were found in a situation which provided 
subjects with a more flexible behavioral repertoire. The importance of the 
impact of structural factors on conflict processes was further shown by an 
examination of the self-fulfilling prophecy effects found in the two situations, 
Preinteraction expectations had a pronounced effect on both the course of 
the interaction and opponent evaluations in the first situation, while only 


restrictive, situation. 


In a provocative theoretical article, Kelley 
and Stahelski (1970b) derived and elaborated 
the concept of the “triangle hypothesis.” 
Briefly stated, the triangle hypothesis postu- 
lates that people falling at different ends of 
a cooperative-competitive continuum hold 
diffferent views of what others are like with 
respect to cooperativeness versus competitive- 
ness. More specifically, the authors argued 
that competitors tend to believe others are 
uniformly competitive, whereas cooperators 
believe others are heterogeneous in this re- 
spect, some being competitive and some being 
cooperative. (The triangle hypothesis derives 
its name from the fact that a graph of the 
expected orientations of others for cooperators 
and competitors shows a triangular pattern.) 
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weak effects of preinteraction expectations were found in the second, less 


Tt is the contention of Kelley and Stahelski 
(1970b) that the differential perceptual 
schema of cooperators and competitors are 
due primarily to different types of interper- 
sonal experiences. The competitor’s belief 
that others are homogeneously competitive is 
thought to arise through the interplay of two 
factors: (a) The nature of the competitor’s 
interpersonal style typically induces those he 
interacts with to adopt similar behaviors, and 
(b) the competitor fails to recognize his 
power of influence, inferring not unreasonably 
that the pseudocompetitive behavior of those 
he observes reflects competitive dispositions. 
Cooperators, on the other hand, do not exert 
such interpersonal influence and consequently 
“accurately” identify the dispositions of 
others, whether cooperators or competitors. 

In addition to formulating the triangle 
hypothesis in their article, Kelley and Stahel- 
ski (1970b) pursued a number of intriguing 
implications derived from their hypothesis. 
Their arguments are most compelling, but 
their speculations, as well as their basic 
theoretical formulation, stem from tenuous 
empirical roots. Most of the data Kelley and 
Stahelski marshaled in support of their 
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hypothesis were collected from the Prisoner’s 
Dilemma (PD) game. Kelley and Stahelski 
(1970b) argued, nevertheless, that this does 
not represent a serious limitation to the gen- 
eralizability of their hypothesis and main- 
tained that 


there is a rough isomorphism between the PD game 
setting for social interaction and certain real-life 
settings rich in person information, which permits 
us to generalize from our laboratory results and 
draw the inference about the nature of the general 
relationship between the person’s orientation and his 
expectation about others’ orientation (pp. 77-18, 
italics added). 


From the viewpoint of the present authors, 
however, such an assumption warrants care- 
ful scrutiny. First, as has been frequently 
noted (e.g., Sermat, 1970; Terhune, 1968; 
Vinacke, 1969), the two responses available 
in the PD leave players with an impoverished 
behavioral and symbolic repertoire. One of 
the major effects of this structural restrictive- 
ness is to render a competitive orientation 
much more viable than a cooperative orienta- 
tion, A cooperative orientation has only a 
contingent viability in the PD; that is, it 
can only succeed if reciprocated (Kelley & 
Stahelski, 1970a; Rapoport, 1967; Braver & 
Barnett, Note 1). In the absence of reciproca- 
tion, a cooperatively oriented person is 
likely to react defensively which, in the PD, 
is usually indistinguishable from reacting 
competitively. 

Considering the nature of the game, there- 
fore, it seems inevitable that cooperators 
should be more influenced by competitors 
than vice versa. Moreover, with an aware- 
ness of the fact that a defensive move and a 
competitive move are confounded in the PD, 
it seems quite understandable that com- 
petitors should have difficulty in correctly 
identifying the intentions or orientation of an 
assimilated cooperative opponent. Accord- 
ingly, it is difficult for the present authors to 


1While PD data provide the foundation for 
Kelley and Stahelski’s argument, these authors also 
refer to a small number of bargaining studies which 
employed other game structures. With the possible 
exception of the game employed by Kelley et al. 
(1970), however, these games can all be characterized 
as having primarily PD-like properties (i.e., they 
favor competition over cooperation). 
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accept Kelley and Stahelski’s (1970b) asser- 
tion that the PD is analogous to those types 
of situations where: 


situational constraints on behavior are minimal and 
in which, therefore, the participants can feel they 
learn something about each other rather than merely 
about social roles and structure (p. 77). 


It would appear, on the contrary, that the 
PD represents an analog of a very specific 
type of social situation where situational con- 
straints are maximal, affording participating 
individuals little person information, with 
the consequence that interaction between 
participants generally evolves into spiraling 
competition. 

The question of differential goal viability 
has implications not only for game behavior 
but also for Kelley and Stahelski’s coopera- 
tive-competitive classification system. Kelley 
and Stahelski’s index of disposition was the 
strategy the subjects chose to adopt in the 
PD. Specifically, subjects were first taught 
the game and interaction procedure, then 
asked what they wanted to achieve in the 
relationship. This “goal choice” was made 
from two alternatives: (a) cooperation, para- 
phrased as, “I will try to cooperate with the 
other person and will be concerned with my 
own score and the other person’s score,” and 
(b) competition, paraphrased as, “I will work 
for myself against the other player and will 
be concerned only with my own score.” 

Although this classification schema is the 
only one employed by Kelley and Stahelski, 
the authors assumed that there is a close cor- 
respondence between individual dispositions 
toward cooperation-competition and the strat- 
egy adopted in the PD. By not securing a 
more direct dispositional measure, however, 
Kelley and Stahelski are left with a second- 
order approximation, or what Fishbein 
(1967) has termed a “behavioral intention.” 
In this connection, it has been demonstrated 
by previous research that dispositions or atti- 
tudes are only a subset of a variety of factors 
which enter into the determination of behav- 
ioral intent (cf. Cook & Selltiz, 1964; 
Triandis, 1967). Among the other factors 
believed to affect an individual’s behavioral 
intentions are the individual’s perceptions of 
relevant situational parameters as well as the 
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individual’s perceptions of the expectations 
of other people (Ajzen & Fishbein, 1969; 
Fishbein, 1967). It seems plausible, therefore, 
that Kelley and Stahelski’s index of person- 
ality (goal choice) may reflect the subjects’ 
perceptions of the structural bias of the PD, 
in addition to some underlying dispositional 
or personality trait.” 

To summarize briefly, there would seem to 
be substantive grounds for questioning the 
theoretical formulations of Kelley and Stahel- 
ski. First, the triangle hypothesis was devel- 
oped and validated primarily within a struc- 
ture (PD) where a competitive orientation 
is more viable than a cooperative orientation 
and where little person information can be 
inferred. The finding that competitors behav- 
iorally influence cooperators more than co- 
operators influence competitors in such a 
structure is less than surprising; it remains 
an empirical question, however, whether or 
not this pattern of interaction would occur 
in relationships where participants are less 
structurally and behaviorally constrained. 
Second, since Kelley and Stahelski based their 
cooperative-competitive categorization solely 
on a behavioral intention (goal choice), they 
employed a contaminated dispositional index. 
It is entirely possible that the direction of 
causality is from situational expectations to 
goal orientation rather than from goal orien- 
tation (i.e., disposition) to situational expecta- 
tions, as suggested by Kelley and Stahelski. 

To explore the possibility that the findings 
of Kelley and Stahelski may be, to some ex- 
tent, a function of the PD structure and their 
measurement indices, the present study at- 
tempted to determine the extent to which the 
triangle effect is generalizable to a game 


2Some data relevant to this point come from one 
of Kelley and Stahelski’s studies (1970a) where sub- 
jects were required to indicate whether their goal 
choice was characterized more by “rationality” or 
“principle.” Marked differences were observed be- 
tween cooperators’ and competitors’ responses to 
this question. Sixty-six percent of the competitors 
checked the rationality alternative, while only 40% 
of the cooperators checked this alternative. Assuming 
that checking the rationality alternative indicates a 
relative perceived absence of dispositional influence 
in the choice of a goal, this finding may be sug- 
gesting a differential correspondence between goal 
choice and real-life orientation for cooperators and 
competitors. 
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Ficure 1. The two game matrices. 


which shares considerable method variance 
with the PD but which possesses certain 
critical differences. The structure developed 
for this task is presented in Figure 1. 

Conceptually, there are many similiarities 
between the PD Game and the Expanded 
Prisoner’s Dilemma (EPD) Game. In both, 
one player can obtain maximum payoffs only 
at the expense of the other player. (In fact, 
as is evident from a comparison of the PD 
matrix and the corner cells of the EPD 
matrix, the payoff values for competitive-co- 
operative interchanges as for mutual competi- 
tion and cooperation are identical in the two 
games.) Thus, both games have similar 
mixed-motive characteristics, However, in the 
EPD, the players have one more response 
than players in the PD, a difference which 
renders the two games quite conceptually 
distinct. 

We have called this additional response a 
defensive or withdrawal response. The ratio- 
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nale for the introduction of this response was 
to provide a move which would enable players 
with cooperative orientations to avoid both 
behavioral assimilation and exploitation when 
confronted by a player with a competitive 
orientation.’ In other words, the dominance 
structure in the EPD does not require that a 
player choose a competitive response when he 
expects competition from his opponent. Of 
course, the competitive player can employ 
this additional response to his advantage as 
well, but its main function should be to pro- 
vide cooperators with a defensive move which, 
unlike in the PD, is distinguishable from an 
exploitative move. It is the present authors’ 
contention that a response with such a function 
is often available to people in everyday social 
relationships. 

The structural differences between the two 
games suggest a number of predictions con- 
cerning the expectations and behaviors of 
participants: 

Hypothesis 1. The preinteraction expecta- 
tions of EPD players will show less of a tri- 
angular pattern than the expectations of PD 
subjects. This prediction is based on the 
assumption that EPD subjects, because of 
their additional response alternative, will be 
less likely to choose the competitive goal for 
defensive (countercompetitive) reasons than 
will PD subjects. If Kelley and Stahelski’s 
index accurately reflects dispositional trends, 
then no differences would be expected between 
the two games. 

Hypothesis 2. EPD cooperators will be less 
behaviorally influenced by competitive oppo- 
nents than will PD cooperators. This predic- 
tion derives from the assumption that the 
additional response alternative in the EPD 
will enable players confronting competitors to 
avoid exploitation without becoming behay- 
jorally similar to the competitor. On the 
other hand, if competition “dominates” co- 
operation generally, as suggested by Kelley 
and Stahelski, the minor differences in tasks 


#In certain ways, the additional response is con- 
ceptually similar to Thibaut and Kelley’s (1959) 
comparison level for alternatives (CLalt). A CLalt 
Permits either person in a dyadic relationship to 
partially or completely “withdraw” if the relation- 
ship is not providing satisfactory outcome levels. 
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should not affect the interaction pattern of 
cooperator-competitor pairs. 

Hypothesis 3. Irrespective of goal orienta- 
tions, players in the EPD will be less behav- 
iorally influenced by their opponents than 
will PD players. The relevant assumption 
here is that the flexible behavioral repertoire 
of EPD players will enable them to pursue 
their initial goals in a less constrained manner 
than will the more impoverished repertoire 
of PD players. 


METHOD 
Overview 


The experimenter first explained and demonstrated 
the rules of either the PD or the EPD to the sub- 
jects. The subjects then independently indicated 
whether they would prefer to play the game 
“cooperatively” or “competitively.” After making 
their goal choice, the subjects answered a number 
of questions concerning their reasons for their 
decision as well as their expectations of the other 
player’s goal choice. 

On completion of these preliminary questions, the 
subjects, who were separated by a partition and not 
Permitted to communicate, began playing. The sub- 
jects were led to believe that they were playing 
against each other, but in fact, they were playing 
against programs controlled by the experimenter, The 
concealment of this manipulation was made possible 
by reporting the trial-by-trial outcomes to each 
subject privately. 

After each block of 10 trials, subjects were quizzed 
about their perceptions of the other player’s motives 
and orientation. Play was discontinued after the 30th 
trial. The subjects at this time answered questions 
devised to elicit information about the subject’s 
attributions of behavioral responsibility as well as 
information about the subject’s Perceptions of the 
other player. 

When the subjects completed the postgame ques- 
tionnaire, the experimenter described and demon- 
strated a second experimental game. Until this time, 
the subjects were unaware that a second game would 
be introduced. The ensuing procedural sequence and 
the questions asked of the subjects exactly paralleled 


where the game was 


instead of beginning play, the subjects were queried 
about the relationship 


games and “everyday social relationships.” 


Experimental Programs 


To ensure an adequate number of subjects in all 


therefore, did not play against each other but against 
a controlled program. 


Tue ROLE OF SITUATIONAL RESTRICTIVENESS 


For both games, the cooperative programs con- 
sisted of a tit-for-tat treatment for 7 out of every 
10 trials with the other 3 responses being randomly 
introduced cooperative responses. (A tit-for-tat 
treatment, see Komorita, 1965, is a strategy in which 
the other player, or program, consistently recipro- 
cates the subject's behavior with a one-trial lag.) 
The competitive program also consisted of a tit-for- 
tat treatment for 7 of every 10 trials, but instead of 
3 cooperative responses, 3 competitive responses were 
randomly introduced every 10 trials. 


Subjects and Design 


The subjects were 70 female undergraduates from 
York University. None of the subjects had ever 
been enrolled in a psychology course. They were 

“recruited on a volunteer basis in response to notices 
circulated throughout their classes. 

Eight groups were involved in the 2X%2X2 
design. The independent variables were the game 
played (PD. vs. EPD) and the program encountered 
(cooperative vs. competitive). Combined factorially 
with the two manipulated variables was the self- 
classification variable of goal choice. 

The game the subjects played was randomly 
determined, and cooperative and competitive sub- 
jects were randomly assigned to cooperative or 
competitive programs. After 10 cooperative subjects 
in each game had been assigned to the cooperative 
program, however, all subsequent cooperative players 
were paired with competitive programs. This pro- 
cedure was carried out because of our primary inter- 
est in the interaction of cooperators playing against 
competitors. 

An even distribution of subjects in each of the 
eight cells was not possible because of the greater 
proportion of subjects choosing the cooperative goal 
option over the competitive option. (The two goal 
options from the two games were chosen with 
the following frequency: for cooperators, PD = 28, 
EPD = 26; for competitors, PD =8, EPD = 8.) 


Procedure 


On their arrival at the laboratory, the two sub- 
jects were seated at tables separated by a partition. 
The experimenter’s opening remarks described the 
experiment as a bargaining study and were, for the 
most part, identical to those used by Kelley and 
Stahelski (1970a). When the experimenter had 
completed his introductory statement, the subjects 
were asked to classify themselves on a serics of 

. adjective scales. The subjects were told that 
this information would enable the experimenter to 
determine how different types of people bargain. 

Nine semantic differential scales, adapted from a 
previous bargaining study (Kelly et al., 1970) were 
employed for the present self-classification task. 
Included among these scales were the following 
dimensions: cooperative-competitive, flexible-inflexi- 
ble, dominant-submissive, passive-active, strong- 
weak, friendly-unfriendly, aggressive-defensive, de- 
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manding-conciliatory, and hostile-peaceful. The 
purpose of these scales was to provide an index of 
a general orientation which was measured indepen- 
dently of behavioral commitment. 

The experimenter then handed the subjects either 
the instructions for the PD or EPD. The instruc- 
tions for both games closely paralleled those used 
by Kelley and Stahelski. It was next explained that 
each of the two players would have a goal to work 
toward during the game and that the players were to 
choose their goal from the following two options: (a) 
Cooperation with the other player; “I will work to 
cooperate with the other player. I will be concerned 
with my score and the other player's score.” 
(b) Work for myself: “I will work independently, 
for myself, against the other player. I will be 
concerned only with my score,” These two goal 
orientations are identical to those employed by 
Kelley and Stahelski. The subjects alse indicated 
their confidence in adhering to their selected goal, 
as well as the rationale for their goal choice: 
(a) “A general approach I have to social relation- 
ships” and (b) “my idea of the most sensible way 
to play this particular game.” Expectations concern- 
ing the goal choice and rationale of the “typical 
person” were also elicited from the subjects at this 
time. 

At no point before or during the game were the 
subjects aware of how many trials would be played, 
After 10 trials, the experimenter asked each subject 
to judge the other player's goal choice, to indicate 
how confident she was in this judgment, and what 
she expected the other player’s next move to be. 
The game continued for another 10 trials, at which 
time the same questions were asked again. It was 
emphasized to all subjects that their answer could be, 
but did not necessarily have to be, the same as they 
had been the first time. After another 10 trials, a third 
set of judgments was made and a postgame question- 
naire was distributed. These questions concerned the 
degree to which the subject thought her and the 
other player's intentions were accurately reflected by 
their behavior, the subject’s sa ction with the 
game, and the subjects perceptions of the other 
player. 

On the completion of these questions, the experi- 
menter introduced the second game to the subjects. 
With the exception of the modifications necessary 
to accommodate the differences between the two 
games, the presentation of the second game followed 
the same pattern as the introduction of the first 
game, The questions asked of the subjects also 
paralleled those asked during the introduction of the 
first game. It was only with the completion of the 
subject’s judgments of the other’s goal choice that 
the procedure diverged from the procedural sequence 
which preceded the first game. At this point, the 
subjects were asked to respond to the following 
questions: “Which game situation do you think most 
closely parallels everyday social relationships?” 
“Why?” “What percentages of everyday social rela- 
tionships do you think are paralleled by the two 
games?” 
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TABLE 1 


RELATIONSHIP OF OWN GOAL T0 EXPECTATIONS OF THE 
TYPICAL Person’s GOAL CHOICE 


Typical other 


Goal choice: Self 


Cooperative Competitive 


Prisoner’s Dilemma Game* 


A: 60% 
C: 10% 


B: 40% 


Cooperative 
D: 90% 


Competitive 


Expanded Prisoner’s Dilemma Game” 


B: 35% 
D: 68% 


Cooperative 
Competitive 


Note. To determine the degree of ‘'triangularity" evidenced 
by these data, indices identical to those used by Kelley and 
Stahelski (1970b) were calculated. A triangularity index (TI) 
is computed by a single formula which combines the four 

rcentages in the following manner: TI =(D—C)—|A—B], 
The ee the index, the greater is the triangularity. 

0, 


ETES” 


After the responses to these questions had been 
collected, the subjects were informed that they 
would not have to play the second game, and the 
nature and purpose of the experiment were explained 
to them. 


RESULTS 
Preinteraction Expectations 


The subjects’ expectations of how the 
typical person would choose to play the two 
games are presented in Table 1.4 The triangu- 
larity index (TI) observed in the expectations 


4Due to the small number of subjects choosing 
the competitive goal option and because no effect 
of presentation order was observed, it was decided 
to include subjects’ expectations for the second game, 
as well as the first, in the analysis of expectations. 

5It should be emphasized at this time that it is 
not our contention that only those competitors who 
indicated that their choice reflected a general orienta- 
tion are, in fact, “real-life” competitors. First, the 
presence of a social desirability set is probably, at 
least partially, responsible for the relationship be- 
tween goal choice and rationale. Furthermore, it is 
plausible that for many competitors, their emphasis 
on the “rationality” of their orientation is the very 
means for which they are able to sustain and justify 
their general competitiveness. Some competitors may 
consistently adopt a competitive posture but avoid 
reflecting negatively upon themselves by maintaining 
that their competitiveness is situationally rather than 
dispositionally .determined. Neither the social desir- 
ability argument, nor the rationalization argument, 
however, can account for the differences between 
games reported here. 
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of PD players (TI = 60) was higher than 
that found in almost all of the studies re- 
ported by Kelley and Stahelski. Competitors 
showed a strong tendency to expect the 
typical person to choose the competitive goal 
(binomial test, p < .001), while the expecta- 
tions of PD cooperators did not differ signifi- 
cantly from a 50:50 split. In the EPD, how- 
ever, the degree of triangularity was greatly 
attenuated (TI = 6). Inspection of the EPD 
data reveals that the small TI value emerged 
as a consequence of a uniformly high tend- 
ency for both cooperators and competitors to 
expect the typical person to share their goal 
orientation (p< .05 and p< .08, respec- 
tively). The principal difference between the 
two games, as revealed by a test of propor- 
tions (Edwards, 1960), was the fact that 
more PD competitors than EPD competitors 
expected the TP to be competitive (p < .15). 

The present data, therefore, support Hy- 
pothesis 1. The triangle hypothesis was 
replicated with the expectations of PD play- 
ers but failed to hold for the expectations of 
EPD players. 

A possible explanation for the different 
relationships between goal choice and expec- 
tations of competition in the two games is 
suggested by an examination of the goal 
choice rationales competitors attribute to 
themselves. Ninety percent of the subjects 
who chose the competitive goal in the PD 
stated that they did so because “it is the 
most sensible thing to do in this particular 
game.” In the EPD, on the other hand, only 
53% of the competitors checked this alterna- 
tive. The finding that more PD than EPD 
competitors emphasized the rationality choice, 
x°(1) = 5.01, p < .025, can be construed as 
further evidence for the argument that the 
competitive goal choice in the PD, in many 
instances, reflects a situation-specific decision 
rather than a general orientation.’ 

Other data relevant to this question concern 
the rationales PD competitors attribute to 
the typical person. Eighty-three percent of 
the PD competitors who expected the typical 
person to be competitive indicated that the 
typical person would be rompendye for ra- 
tional reasons, with only 17% of the competi- 
tors stating that the peal person’s goal 
choice would reflect “a general approach 


Tue ROLE OF SITUATIONAL RESTRICTIVENESS 


taken to social relationships.” These data 
would seem to lend still further support to 
the contention that the expectations of PD 
competitors are mediated more by game struc- 
ture than by general beliefs about the orienta- 
tions of others, as suggested by Kelley and 
Stahelski. 


Game Behavior 


The analysis of game behavior was per- 
formed primarily to answer two questions: 
(a) Are players equally susceptible to oppo- 
nent (program) influence in the two games, 
and (b) does the interaction pattern of 
cooperator-competitor pairs differ in the two 
games; specifically, do cooperators evi- 
dence a differential degree of assimilation to 
competitive programs in the two games? 

Table 2 presents the percentages of com- 
petitive responses made by subjects in 
the different player-program combinations.° 
Separate 2 X 2 analyses of variance were per- 
formed upon the data of the two games. In 
both the PD and the EPD, a main effect 
of goal choice emerged, F(1,32) = 24.36, 
p < .001, and F(1, 30) = 13.82, p < .001, re- 
spectively. Subjects choosing the competitive 
goal made many more competitive moves 
than subjects choosing the cooperative goal. 
Experimental programs, however, did’ not 
have a consistent effect across games. By 
comparing the mean degree of program influ- 
ence in the PD (25%) with the mean degree 
of program influence in the EPD (4.5%), it 
is apparent that the program had a greater 
behavioral influence in the PD than in the 
EPD. (A 2X2X2 analysis of variance 
which included game, program, and goal 
choice as the independent variables provided 
further evidence for the greater behavioral 
influence of the opponent in the PD 
than in the EPD, F(1, 62) = 3.71, p < .06). 
Hypothesis 3, therefore, which predicted that 
players in the EPD would be able to maintain 
their orientations better than PD players, was 
also supported. 


6 Since competition and cooperation are not sym- 
metrically related in the EPD, separate analyses 
were carried out for competition and cooperation in 
this game. Similar patterns emerged for those dif- 
ferent response criteria, so that only the competitive 
index is discussed in this report. 
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TABLE 2 
MEAN PERCENTAGE OF COMPETITIVE RESPONSES 
EMITTED AS A FUNCTION OF GOAL 
CHOICE AND PROGRAM 


Competitive 
program 


Cooperative 


Goal choice program 


Prisoner’s Dilemma Game 


Cooperative 


17 41 
Competitive 58 


Expanded Prisoner’s Dilemma Game 


Cooperative 18 26 
Competitive 40 41 


Although the greater effect of program in 
the PD than in the EPD supports Hypothe- 
sis 3, the fact that the program did not have 
a significant effect on the behavior of EPD 
players (F(1,30) = 2.16, p < .16, compared 
with F(1,32) = 12.23, p < .001, in the PD) 
poses a potential problem of interpretation. 
The resolution of this problem requires an 
examination of the postgame opponent evalu- 
ations, the topic to which we now turn. 

As well as affecting the behavior of PD 
players, the programs also influenced the 
players’ perceptions of their opponent. PD 
players confronted with competitive programs 
judged their opponents to be more competi- 
tive, F(1,32) = 9.54, p < .004, more aggres- 
sive, F(1,32) = 8.95, p< .005, and less 
trustworthy, 7(1,32) = 11.18, p < .002, than 
players confronting cooperative programs. In 
the EPD, program did not have a significant 
effect on behavior but did have a pronounced 
effect on the perceptions of EPD players. Like 
the PD players, EPD players confronting 
competitive programs judged their opponents 
to be more competitive, F(1,30) = 7.87, 
p < .009, more aggressive, F(1,30) = 4.82, 
p< .04, and less trustworthy, F(1,30) = 
3.02, p < .09, than players confronting co- 
operative programs. In other words, while 
EPD players were better able to remain be- 
haviorally independent of their opponents than 
PD players, they drew similar conclusions 
about the nature of their partners. 

A second issue related to the effects of pro- 
gram concerns the failure to replicate Kelley 
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and Stahelski’s finding that PD cooperators 
are more influenced by PD competitors than 
vice versa. By comparing (see Table 2) the 
effect of the experimental programs on the 
behavior of individuals choosing the coopera- 
tive and competitive goal choice, it can be 
quickly grasped that there is no evidence for 
differential influence in the present PD re- 
sults, A comparison of the levels of competi- 
tion observed in the present study with those 
observed by Kelley and Stahelski, however, 
reveals a close correspondence between all 
cells but the cell in which competitors con- 
front cooperators. Kelley and Stahelski 
(1970b) reported a higher level of competi- 
tion in this condition than was found in the 
present study (70% vs. 58%). 

The discrepancy between the present re- 
sults and those reported by Kelley and 
Stahelski (1970b) becomes somewhat recon- 
cilable when one considers that the present 
design required players to interact with pro- 
grams rather than other players. By employ- 
ing a program that persists in a relatively 
cooperative orientation (30% cooperative re- 
sponses, regardless of how exploitative the 
behavior of the subject) and that maintains 
a relatively stable (error free) contingency 
pattern, the present study has created a rela- 
tionship where competitors receive clearer 
informational signals concerning the oppo- 
nent’s intentions than will normally occur. 
The results of the present experiment, there- 
fore, indicate that a competitor in the PD 
can be influenced under certain conditions. 
Apfelbaum (1967) has also reported data 
indicating that competitive players can be 
influenced in the PD. Of particular interest 
here is the fact that competitors in Apfel- 
baum’s experiment appeared to be most influ- 
enced by self-described “conditional” cooper- 
ators, who tended to pursue a strategy similar 
to that followed by our own tit-for-tat 
cooperative program. 

Of course, the results of Kelley and 
Stahelski (1970b) (and the results reported 
below for cooperative-competitive pairs) sug- 
gest that such influence will be less likely to 
occur between two typical players in the PD 
because the cooperative member of the co- 
operator—competitor pair will usually become 
assimilated to the competive member early in 
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the game and thus cease to exert any influ- 
ence on the competitor. In summary, it would 
seem that a competitor can generally be in- 
fluenced by a cooperator if the cooperator 
does not become assimilated to him first. 
Moreover, it appears that a cooperator need 
not pursue an unconditionally cooperative 
strategy to influence a competitor. In the 
present study it was possible for cooperative 
programs to emit as many as 70% competi- 
tive moves." 

Hypothesis 2 predicted that EPD cooper- 
ators would be less likely to become more 
competitive over trials when playing com- 
petitive programs than would their PD 
counterparts. As Figure 2 indicates, in the 
PD, cooperators do tend to become more 
competitive than when they began, though 
the increase occurs for the most part early in 
the game. In the EPD, however, no assimila- 
tion can be found. Comparisons between the 
percentage of competitive responses made 
from Trials 1-5 with the mean percentage of 
competitive responses made from Trials 6-30 
indicate a significant increase in the PD 
tare(17) = 4.63, p < .001, with no change in 
the EPD (¢ <1). Hypothesis 2 was, thus, 
also supported. 

But why do cooperators become assimilated 
to their competitive opponents in the PD and 


7 At this point, it may be worth disc in a 
more general way, the advantages and disadvantages 
of employing programs rather than real individuals 
as opponents. To begin with, there is little doubt 
that the interaction of real-life players ld be 
somewhat different than that observed in th sent 


study between players and programs and, 
of this, a degree of caution s d be exercised in 
interpreting the present results. The important issue, 
however, is whether or not the present situation 
was able to produce the proce that o in 
“real-life” interaction. From all avai tS, 
this objective appears to have been accomplished. 
First, there is a high degree of consistency between 
the present results and those obtained by Kelk 
Stahelski with real-life subjects. Moreover, th. 
consistencies between the present PD data and thi 
data of Kelley and Stahelski are easily, and interest 
ingly, explained by functional differences between 
players and programs. Finally, there is a high degree 
of internal consistency between the interaction 
patterns observed and preinteraction perceptual 
and expectational data. The latter, of course, are 


completely unaffected by the choice of opponent 
manipulation. 


eS. 
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not in the EPD? The likely explanation for 
this difference comes from a comparison of 
subject moves with expected moves on the 
part of the other player. After the 10th, 20th, 
and 30th trials, subjects were asked to make 
judgments concerning the other player’s goal 
choice, as well as to judge what the other 
player’s next move would be. It was possible, 
therefore, to examine the relationship be- 
tween ‘the response made by a subject and 
his expectation of the other player’s response 
oh the 11th and 21st trials (the game ended 
after the 30th trial). 

The results of this comparison for co- 
operative-competitive pairs are presented in 
Table 3. In the PD, 67% of the expectations 
were expectations of competition, and of these 
competitive expectations, 50% produced com- 
petitive moves on the part of the subject. In 
the EPD, on the other hand, only 11% of the 
expectations of competition produced com- 

` petive moves by the subject. Sixty-one per- 
cent of the expected competitive moves in 
the EPD were countered by defensive moves. 
The defensive move, therefore, seems to have 
permitted EPD cooperative players to avoid 
exploitation at the hands of a competitior. 


—- PD 


ee) 


40 


MEAN PERCENTAGE OF COMPETITIVE MOVES 


BLOCKS OF FIVE TRIALS 


Ficurr 2. The behavior of cooperators confronted 
with competitive programs in the Prisoner's Dilemma 
(PD) and Expanded Prisoner's Dilemma (EPD). 
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TABLE 3 
RELATIONSHIP BETWEEN A Supyect’s Move 
AND EXPECTED OPPONENT MOVE IN 
COOPERATOR-COMPETITOR PAIRS 
oN TRIALS 11 AND 21 


Subject move Expected move 


Prisoner’s Dilemma Game 


Cooperative Competitive 


Cooperative 12 
Competitive 0 


Re 


1 
1 


anded Prisoner’s Dilemma Game 


Cooperative | Competitive Defensive 
Cooperative 12 5 0 
Competitive 1 2 0 
Defensive 1 11 0 


Perhaps the most remarkable finding re- 
ported in Table 3 is the number of times 
cooperative players in the PD made coopera- 
tive moves when they expected the other 
player to make a competitive move (50% of 
the time). The channels for communication 
are so restrictive in the PD that a coopera- 
tive member in a cooperator-competitor pair 
is required to martyr himself in order to com- 
municate his desire for cooperative inter- 
action. Lastly, these data reveal that assimila- 
tion occurring in the PD is only “phenotypic,” 
an assimilation of behavior and not an assimi- 
lation of purpose. This is evident from the 
fact that there was not one instance of a 
PD cooperator making a competitive move 
when she expected her opponent to make a 
cooperative move. Further support for this 
interpretation comes from the finding that 
competitive programs, while affecting the 
behavior of PD cooperators, had no effect 
on the cooperators’ perceptions of their 
own “cooperativeness” during the interaction 
(F <1). 


Effect of Preinteraction Expectations on 
Behavior and Perceptions 


An argument was made earlier concerning 
the effect expectations may have on the selec- 
tion of preinteraction goal choices. It seems 
reasonable to expect a relationship between 
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preinteraction expectations and actual game 
behavior. To pursue this prospect as well as 
to determine if game structure had any effect 
on the role of self-fulfilling prophecies, analy- 
ses were performed on the basis of preinter- 
action expectations. 


Prisoners Dilemma 


Subjects who expected the typical person to 
be competitive emitted more competitive 
moves than subjects who expected the typical 
person to be cooperative, F (1,32) = 5.81, 
p< .025, The effect of expectations is notice- 
able on the first five trials, F (1,32) = 7.14, 
Ż < .01. The experimental programs, it should 
be mentioned, had no effect at this time (F < 
1). The effect of expectations in the PD could 
have been solely a function of the strength of 
the relationship between goal choice and ex- 
pectations. However, an analysis of the effects 
of expectations for subjects selecting the co- 
operative goal revealed similar effects. Sub- 
jects who expected the typical person to be 
competitive were also less satisfied with the 
game, F (1,32) = 5.56, É < .025, than sub- 
jects who expected the typical person to be 
cooperative, Interestingly, however, expecta- 
tions were not related to perceptions of self- 
cooperativeness, F (1,32) = 1.23, p< .28. 
Although preinteraction expectations clearly 
affected the behaviors and perceptions of PD 
subjects, they seemed unaware of the self- 
fulfilling nature of their expectations. 


Expanded Prisoners Dilemma 


In the EPD, preinteraction expectations 
had a less pronounced but somewhat more 
complex effect than they did in the PD. Sub- 
jects who expected the typical person to be 
competitive made more defensive moves in 
the first five trials than subjects with coop- 
erative expectancies, F (1,30) = 6.19, p< 
.02, but no significant difference occurred in 
the number of competitive moves made by 
players with different expectancies, F (1, 30) 
= 1.95, p< .18. It would appear that the 
caution or defensiveness resulting from com- 
petitive expectancies receives very different 
behavioral expression in the two games. In 
the PD, anticipatory defensiveness can only 
take the form of “pseudocompetition.” In 
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the EPD, however, caution need not be mani- 
fested in this form. Players who expect com- 
petition are able to retreat to the defensive 
move, a response which protects them against 
exploitation without the potential for trig- 
gering a sequence of spiraling competition. 
The long-range (30 trials) effect of expecta- 
tions in the EPD failed to reach significance 
for both the number of competitive and num- 
ber of defensive moves emitted, F < 1, and 
F (1,30) = 1.84, p< -20, respectively. Fur- 
thermore, the postinteraction perceptions of 
EPD players were not found to be strong] 
related to preinteraction expectancies. Sub 
jects who expected the typical person to be 
competitive indicated less satisfaction with 
the interaction than subjects with cooperative 
expectancies, F (1,30) = 4.09, p < .05, but 
none of the postinteraction questions concern- 
ing opponent assessment successfully | dis- 
criminated between players with different pre- 
interaction expectancies, 


Game Generality 


A great deal of evidence has been presented 
concerning the effects of game structure on 
behavior and perceptions, but these findings 
do not speak to the question of ecological va- 
lidity. Specifically, how representative of so- 
cial relationships are the two games? To esti- 
mate the relative representativeness of the 
two games, subjects, after they had played 
one game and had the second described to 
them, were asked: “Which game situation do 
you think most closely parallels everyday 
social relationships?” 

The most obvious inference to be drawn 
from the responses to this question is that 
subjects saw the EPD as more representative 
of “everyday social relationships” than the 
PD (71% vs. 29%). Interestingly, no rela- 
tionship emerged between goal choice and 
judgments of representativeness. A relation- 
ship was discovered, however, between pre- 
interaction self-ratings and perceived game 
representativeness, Subjects who considered 
the PD situation to be most representative of 
everyday social relationships classified them- 
selves as more competitive, F (1, 66) = 7.02, 
P< .025, more dominant, F (1, 66) = 6.15, 
$ <.025, and somewhat stronger, F (1, 66) 


E 


= 
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= 3.77, p < .10, than subjects who judged 
the EPD to be more representative." 


Discussion 


Generally, the results reported provide 
strong support for the three experimental 
hypotheses. First, the expectations of EPD 
players did not evidence the degree of triangu- 
larity evidenced by PD players (Hypothesis 
1). Second, EPD cooperators did not become 
as behaviorally assimilated to competitive pro- 
grams as did PD cooperators (Hypothesis 2). 
In the PD, cooperative players exhibited 
behavioral assimilation to a competitive op- 
ponent within the first 10 trials of interaction. 
In the EPD, on the other hand, cooperators 
managed to avoid assimilation by reacting to 
expected competition with a defensive move. 
Third, EPD players were generally less in- 
fluenced by programs than were PD players 
(Hypothesis 3). Finally, in addition to the 
empirical support for the hypotheses, the pres- 
ent data indicate that a self-fulfilling proph- 
ecy effect occurred more often in the PD than 
in the EPD. 

Several implications of the present results 
for a more general theory need to be discussed. 
A conflict model based on the expectational 
sets of the participants could incorporate the 
joint or common effects of dispositions and 
situational parameters. The evidence from 
the present study suggests that the triangle 
hypothesis, the behavioral assimilation of 


8 Preinteraction self-ratings were not found to be 
significantly related to preinteraction expectancies or 
goal choices. It is possible that the adjective pairs 
(from Kelley et al., 1970) are insensitive to the 
latent dimensions of importance here or, alterna- 
tively, that the goal choice measures do not reflect 
a general personality disposition. The patterns of 
behavior for self-rated cooperators and competitors 
paralleled those for the goal choice analyses, although 
the findings were less pronounced and only margin- 
ally significant. It may be of further interest to note 
that a separate experiment which examined the rela- 
tionship between the expectations of cooperative- 
ness-competitiveness in the PD and the authoritar- 
janism dimension revealed a pattern quite different 
from that posited by Kelley and Stahelski (1970b). 
Consistent with the speculations of Kelley and 
Stahelski, low-authoritarian subjects tended to expect 
heterogeneity in the goal choice of typical others; 
high-authoritarian subjects, however, tended to ex- 
pect uniform cooperativeness (not competitiveness) 
from others (TI =-— 68). 
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cooperators, and possibly the self-fulfilling 
prophecy are all more likely to occur in con- 
flict situations having structural similarities 
to the PD game. The particular properties of 
the PD facilitate these processes at several 
different points in the interaction. 

First, from a strategic (as pointed out in 
many game-theoretic solutions) as well as 
psychological perspective, subjects exposed to 
the PD at the preinteraction stage may form 
expectations of anticipated competition more 
frequently than in other conflict situations. 
Cooperation is less viable in the PD context 
than in other situations. The dispositions oi 
the participants may interact with these situ- 
ational perceptions to determine both the 
stated expectations and goal choices of the 
subject. (The present data fail to support 
Kelley and Stahelski’s (1970b) implication 
that both goal choices and expectations, and 
therefore the triangle hypothesis, largely re- 
flect the effects of the third variable, the dis- 
position of the players.) The relative weights 
of dispositions and situational perceptions 
may, therefore, be variable, contributing dif- 
ferentially across the range of conflict situa- 
tions. 

Second, the set of options or reactions that 
is available in the PD for the behavioral 
translation of the players’ expectations is 
extremely limited. As a consequence of this 
restrictiveness, the anticipatory responses se- 
lected by the players may be expressed in a 
way that masks their intentions. In the pres- 
ent study an attempt was made to demon- 
strate that when goal-choice cooperators either 
expect or experience a competitive opponent, 
their countercompetitive response tendency is 
generally defensive in the EPD but “competi- 
tive” in the PD. Since the latter players are 
unable to communicate their intentions 
through the “confounded” B response (see 
Figure 1), the spiral of conflict will tend to 
accelerate. The “cooperator’s” B response 
will help to confirm the expectations of the 
“competitive” opponent, as Kelley and Sta- 
helski (1970b) have shown, and will provide 
the (self-) justification for his actions. In 
addition, the PD does not provide a context 
in which the players can easily test their 
expectation and attributions. The costs and 
risks of varying one’s behavior to construct 
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an approximate F test (Kelley, 1971) are 
particularly high in the PD, and the complex 
patterns of feedback may often be uninter- 
pretable. However, there was some evidence 
that goal-choice cooperators were even willing 
to martyr themselves (see Table 3) in order 
to maintain their purposes or to solicit infor- 
mation, 

The obvious point to be made, then, is that 
the PD context is ripe for engendering self- 
fulfilling prophecies. The regulatory feedback 
mechanisms are confounded and ambiguous 
to the point where the expectations of the 
players have little chance of being discon- 
firmed. We might also expect that such con- 
texts would precipitate self-fulfilling prophe- 
cies in cases where the basis for defensive 
anxiety is more “psychological” than strategic 
(cf., Gamson & Modigliani, 1971). Obvious 
examples of such contexts are where power 
differences exist between the parties or where 
bilateral “mirror-image perceptions” exist. 
The argument is not that the PD is not eco- 
logically representative of many conflict situ- 
ations. (Is an Intercontinental Ballistic Mis- 
sile an offensive or defensive weapon?) The 
PD is, however, only one of a possible set of 
different conflict structures and one that may 
rank toward the more extreme end in its abil- 
ity to promote forced engagement and self- 
fulfilling prophecies, 

Finally, the competitive response in the 
PD seems to have the effect of dominating 
the cooperative response such that the latter 
orientation needs to be reciprocated in order 
to have any viability (Kelley & Stahelski, 
1970a). This conjunctive property may not 
be as pronounced in other conflict situations, 
especially those where the participant is not 
in an “involuntary” or forced interaction and 
can maintain a certain degree of indepen- 
dence from the other’s actions. The PD seems 
to be a relatively powerful agent for subvert- 
ing the purposes and intentions of its partici- 
pants. The dominant or forcing effect of the 
competitive response in the PD may, there- 
fore, put limitations on the situations or con- 
texts where the processes described by Kelley 
and Stahelski (1970b) will occur. In general, 
the theory of Kelley and Stahelski should act 
as a provocative stimulus for further develop- 
ment of a more general theory relating ex- 
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pectations and self-fulfilling prophecies to the 
properties of the conflict situation. 

From a different perspective, perhaps the 
most provocative finding in the current study 
is that people who classify themselyes as 
competitors on rating scales tend to perceive 
most social relationships as similar to the PD 
situation, whereas people who classify them- 
selves as cooperators tend to perceive most 
social relationships as similar to the EPD. It 
is unfortunate that the present data cannot 
shed light on the plausibility of the different 
possible explanations for this difference. 

One possible explanation, for instance, is 
that cooperators and competitors have differ- 
ential experience with the types of situations 
represented by the two games. Cooperators 
may have more experience with relatively 
unrestrictive situations, while competitors may 
have more experience with relatively restric- 
tive, forcing situations. Of course, even if such 
a relationship could be demonstrated, it would 
still be necessary to discover the direction of 
causation. That is, people may seek out situ- 
ations compatible with their orientations, or 
alternatively, people’s orientations may be 
shaped by the type of situation they generally 
find themselves in. 

Another possibility is that  self-classified 
cooperators and competitors are not drawn to 
different types of relationships in any sys- 
tematic manner, but, instead, tend to perceive 
situations in a direction consistent with their 
dispositions. This explanation would seem 
most applicable for competitors who, in situ- 
ations where the situational properties and 
rules for interaction are less explicit than in 
an experimental game like the EPD, may fail 
to perceive or ignore the withdrawal or the 
more neutral moves. That is, the self-classified 
competitor may perceive “interaction mat- 
tices” as being categorical and delimited in 
their options and emphasize the more polar- 
ized and extreme action alternatives. Coop- 
erators would seem less likely to be able to 
see defensive possibilities and opportunities 
that were not in fact there. 

Tt must be remembered, of course, that in 
many instances people do not have any con- 
straints or formal rules for interaction im- 
posed upon them. Often the participants them- 
selves define the structure of the relationship 
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and delimit the boundaries and values of the 
interaction. In these situations it does not 
seem at all unreasonable to expect people to 
create interpersonal matrices in their own 
images. 
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The Role of Expectations and Attributions in the Alleviation 


of Learned Helplessness 


Carol S. Dweck 
University of Illinois at Urbana-Champaign 


The purpose of the investigation was to determine whether altering attribu- 
tions for failure would enable learned helpless children to deal more effectively 
with failure in an experimental problem-solving situation. Twelve children 
with extreme reactions to failure were identified and were given intensive, 
relatively long-term experience with one of two training procedures. It was 
hypothesized that a procedure which taught the helpless children to take re- 
sponsibility for failure and to attribute it to lack of effort would result in 
unimpaired performance following failure in the criterion situation, but that a 
procedure which provided success experiences only (as in many programmed 
learning and behavior modification programs) would lead to changes of a 
lesser magnitude. The results revealed that following training, the subjects in 
the Success Only Treatment continued to evidence a severe deterioration in 
performance after failure, while subjects in the Attribution Retraining Treat- 
ment maintained or improved their performance. In addition, the subjects in 
the latter condition showed an increase in the degree to which they empha- 


sized insufficient motivation versus ability as a determinant of failure. 


The present study of the alleviation of 
learned helplessness in children has grown out 
of several relatively independent lines of in- 
vestigation: contingency learning in experi- 
mental animal psychology, attribution theory 
in personality-social psychology, and cogni- 
tive therapy in clinical psychology. 

The phenomenon of learned helplessness, 
conceptually related to the earlier notion of 
“hopelessness” advanced by Mowrer (1960), 
was first studied systematically by Seligman 
and Maier (1967). The latter found that 
animals pretreated with unavoidable and 
inescapable shock later failed to avoid and 
escape traumatic shock in another situation 
in which shock was avoidable and escapable 
by performing a simple response. In describ- 
ing the phenomenon, the investigators used 
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the term “learned helplessness” to refer to 
the learning or perception of independence 
between one’s behavior and the presentation 
and/or withdrawal of aversive events. 

In a study linking this phenomenon to re- 
inforcement responsibility (i.e., internal vs. 
external attributions for outcomes), Dweck 
and Reppucci (1973) demonstrated that fol- 
lowing failure, a certain group of children do 
not perform the response required to succeed, 
even though they are motivated to and are 
fully capable of doing so. An analysis of the 
reinforcement responsibility profiles of the 
subjects revealed that those children most 
likely to give up in the face of failure when 
compared to the more persevering subjects 
(a) took less personal responsibility for the 
successes and failures they met with and (b) 
to the extent that they did take responsibility, 
tended to attribute the outcomes of their be- 
havior to ability rather than to effort. Overall, 
the more helpless subjects placed significantly 
less emphasis on the amount of effort exerted 
as a determinant of success and failure than 
did the more persevering subjects. These 
results imply that since helpless children see 
themselves as less instrumental in determin- 
ing outcomes, they would be less likely to 
view adverse circumstances as surmountable; 
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since they tend to attribute failure to lack of 
ability, they would be less likely to respond to 
failure with increased effort or perseverance.* 
A number of studies investigating the relation- 
ship between reinforcement responsibility and 
achievement motivation (e.g., Rotter, 1966; 
Weiner & Kukla, 1970; Odell, Note 1) or 
facilitating anxiety (Butterfield, 1964) have 
yielded results consistent with these findings. 

In learned helplessness the important varia- 
ble is not the occurrence of the aversive event, 
but the perception of the relationship between 
cne’s behavior and the occurrence of that 
event. As shown in the Dweck and Reppucci 
study, two children may receive exactly the 
same number and sequence of success and 
failure trials yet react quite differently as a 
function of whether they interpret the failure 
to mean that the situation is beyond or within 
their control. Many recent studies have shown 
that an individual’s reaction to events is 
largely determined by his attribution of the 
causes of that event. Nisbett and Schachter 
(1966), Ross, Rodin, and Zimbardo (1969), 
Storms and Nisbett (1970), and Valins and 
Nisbett (1971) have discussed the possibility 
of “attribution therapy,” a procedure whereby 
the teaching of a new attribution for certain 
symptoms might lead to the lessening of the 
debilitating or undesirable effects of those 
symptoms. Moreover, several clinicians (Da- 
vison, 1966; Ellis, 1962; Meichenbaum, Note 
1) have reported favorable results with pa- 
tients treated by “cognitive restructuring,” 
“rational-emotive therapy,” or by modifica- 
tion of “what they say to themselves about 
events.” 


1Jt is interesting to note that these patterns of 
reinforcement responsibility need not correspond to 
differences in actual ability. In the Dweck and Rep- 
pucci (1973) study those children who gave up in 
the face of failure performed just as well on the 
task presented as those who persisted. In fact, for 
the female subjects, the helpless children showed 
superior initial performance. It should also be noted 
that an attribution of failure to lack of ability has 
different implications depending on whether that 
lack is perceived as a temporary deficit in obtainable 
skills or as a more general intellectual deficit (al- 
though both might indicate that persistence in the 
immediate situation would be fruitless). Helpless 
children tend to key on the latter as an explanation 
for failure, but it might prove beneficial to encour- 
age a distinction between the two. 
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.Given the fact that the performance of the 
helpless child tends to deteriorate in the face 
of failure, the purpose of the present study 
was to determine whether altering the help- 
less child’s perception of the relationship 
between his behavior and the occurrence of 
failure (i.e. his attributions for failure) 
would result in a change in his maladaptive 
response to failure in the experimental situ- 
ation. If the helpless child tends to perceive 
independence between his responses and fail- 
ure, then one step would involve demonstrat- 
ing the dependence of failure upon his re- 
sponses and another would involve explaining 
to which aspects of the situation failure 
should be attributed. 

It was therefore hypothesized that a long- 
term training procedure which taught the 
child (a) to take responsibility for failure and 
(b) to attribute it to insufficient effort would 
lead to increased persistence on the experi- 
mental task in the face of failure. For those 
children receiving this retraining of attribu- 
tions, it was expected that failure on the task 
would no longer serve as a cue for continued 
failure (but as a cue to do something different 
or something additional) and that it would 
therefore no longer have deleterious effects on 
the child’s performance. 

Since the literature suggests that higher 
expectations of success lead to greater persis- 
tence in the face of difficulty (Battle, 1965; 
Feather, 1966; Tombaugh & Tombaugh, 
1963; Tyler, 1958), it was hypothesized that 
a procedure which provided only success ex- 
periences would lead to some positive changes 
in the helpless child’s reactions to failure. 
However, to the extent that failure retained 
its meaning as a cue for continued failure, 
the subjects in this group were not expected 
to show as much improvement as those sub- 
jects who were given attribution retraining. 
A success only procedure is one recommended 
by many behavior modifiers (e.g., Bigelow, 
1972; Hart & Risley, 1968; Meacham & 
Wiesen, 1969) and advocates of programmed 
learning (eg., Holland, 1960; Skinner, 
1968), especially for children who have diffi- 
culty dealing with failure. These investigators 
recommend the elimination of errors in the 
belief that errors serve only to elicit negative 
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emotions on the part of the child and to make 
the materials and/or the situation aversive, 
while they do little or nothing to enhance 
learning. Although there is ample evidence 
hat errors per se do indeed have adverse 
ffects on the performance of these children 
nd that success works to motivate them, the 
uestion is whether the most effective way of 
vercoming such reactions to failure is to 
iminate it from the situation or to teach the 
hild how to deal with it. 

In summary, the purpose of the study was 
to determine whether a treatment that altered 
attributions for failure would alter responses 
to failure to a greater degree than a com- 
monly advocated procedure that did not alter 
attributions. The Success Only Treatment 
was expected to raise the child’s expectation 
of success, thereby enabling him to sustain 
his performance despite failure. It was ex- 
pected, however, that the Attribution Re- 
training Treatment would prove superior since 
it provided a new interpretation for failure by 
teaching the child to attribute it to insufficient 
effort. 
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METHOD 
Overview 


Twelve extremely helpless children were identified 
and compared to persistent children, of the same 
age and sex and in the same classroom, on several 
measures, Baselines for speed and accuracy of per- 
formance on math problems were obtained for each 
of the helpless subjects and the effects of failure on 
performance following failure were assessed. Follow- 
ing this initial baseline and interpolation of failure, 
the helpless subjects were given one of two treat- 
ments in a different situation with different problems 


2 While the simultaneous manipulation of these 
two variables is entirely compatible with the purpose 
of the study (i.e., to assess the effects of a pro- 
cedure that altered attributions vs. a commonly advo- 
cated procedure that did not), the differences be- 
tween the two treatments should be kept in mind in 
comparing their effects. 

3 Three other children, originally selected as help- 
less, were not used. Two of them did not perform 
consistently enough to establish stable baselines 
against which the effect of failure on their perform- 
ance could be assessed. One was eliminated because 
he did not meet the criteria for helplessness on the 
premeasures and interpolated failure; it appeared 
that he did not perform well in school because the 
rewards of the classroom meant little to him and 

not because he found the threat of failure, or failure 
itself, debilitating. 
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for 25 sessions. In the Success Only Treatment, trials 
were programmed such that the subject could suc- 
cessfully complete the number of problems required 
well within the time limit on every trial. In this 
treatment, while success was attributed to subject’s 
responses, failure was avoided or glossed over when 
it occurred. In the Attribution Retraining Treat- 
ment, the procedure differed in two respects: the 
programmed occurrence of failure and the attribution 
of failure.? The success trials were identical to those 
of the Success Only Treatment. However, on ap- 
proximately 20% of the trials, the requirement 
slightly exceeded the number the subject was cur- 
rently able to solve within the time limit. On these 
trials, failure was clearly attributed to insufficient 
effort (i.e., experimenter verbalized the attribution 
to subject). The effects of failure on performance 
in the original situation were again assessed at the 
middle and end of training. 


Subjects 


There were 12 subjects, 5 females and 7 males, 
between the ages of 8 and 13 years. The females 
ranged in age from 10 to 13 years (10, 11, 11, 11, 
and 13); 3 were white and 2 were black. The males 
ranged in age from 8 to 13 years (8, 10, 10, 11, 12, 
12, and 13); 4 were white and 3 were black. Two 
of the females and 6 of the males were in special 
classes for children performing below grade level, 
while the remaining children were in regular classes. 
The subjects attended two public elementary schools 
in a lower-middle-class town outside of New Haven, 
Connecticut. 

The subjects were identified as “helpless” (i.e., 
characterized by expectation of failure and deteri- 
oration of performance in the face of failure) by 
their school psychologist, their principal, and their 
classroom teacher, independently. They were chosen 
as the most extreme cases out of the 750 children in 
the two schools. To ensure that the teachers’ criteria 
matched those of the investigator, teachers were 
given a helplessness rating scale for each of the 
children designated. The scale described various 
reactions to a stressful academic situation in which 
failure was possible or present. The teacher was 
asked to rate, on a scale of 1 to 5, the degree to 
which each child evidenced these reactions in similar 
situations. 

r In an attempt to avoid any possible stigma asso- 
ciated with being singled out for participation in 
the study, subjects were told that the investigators 
were studying how children do math problems and 
that they had been specially selected to help out on 
this project. 

To determine how and how much the helpless 
children differed from persistent children, 10 per- 
sistent comparison subjects of the same age and 
sex, and from the same classrooms as the helpless 
subjects were selected. Teachers were asked to choose 
children they believed to be of equal ability to the 
helpless children but who were characterized by 
their persistence in the face of failure or difficulty. 
Teacher selection, rather than IQ tests, was used as 
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a means ot matching subjects on intelligence, since 
for various reasons (e.g., the fact that IQ tests 
inevitably involve failure at some point, the fact 
that the groups may differ in test anxiety, etc.) IQ 
tests do not accurately reflect the abilities of the 
helpless child, These comparison subjects were given 
three of the same measures that were administered 
to the helpless children (but not the training or 
testing procedures). Because of the small size of the 
classes from which the subjects were drawn, one of 
these persistent subjects served as the comparison 
for three helpless subjects. 


Experimenters 


The two experimenters were women, 24 years of 
age, who had no knowledge of the experimental 
hypothesis. Lest they infer that the subjects receiv- 
ing one treatment were “supposed” to perform 
better, or that one treatment was thought to be 
superior to the other, they were told that it was not 
a matter of “better—poorer” but that a different pat- 
tern of results might follow from each treatment. 
The two experimenters administered all the measures 
that were given prior to training as well as all sub- 
sequent training and testing, with each experimenter 
working with the same subjects every day. The ex- 
perimenters were given extensive training in the pro- 
cedure by the author and their performances were 
monitored daily. 

Questionnaire measures given after the completion 
of training were administered by two teacher's aides, 
employed by the schools, who were trained in the 
administration of these measures by the author. 
This was done to ensure that possible changes in 
these measures from before to after training were 
not due to familiarity with the experimenter. 


Comparison Measures 


‘Measures on which the helpless subjects were com- 
pared to their persistent classmates consisted of the 
Intellectual Achievement Responsibility Scale, two 
subscales of the Test Anxiety Scale for Children, 
and a Repetition Choice task. They were all indi- 
vidually and orally administered. 

The Intellectual Achievement Responsibility (JAR) 
Scale (Crandall, Katkoysky, & Crandall, 1965) was 
given to all the helpless and persistent subjects. The 
TAR is designed to determine the degree to which a 
child believes that the intellectual failures and suc- 
cesses he encounters are a result of his own be- 
havior versus the behavior of important ‘others in 
his environment (e.g., teachers, parents, friends). It 
consists of 34 forced-choice items, each depicting a 
positive or negative achievement situation and pre- 
senting two alternative attributions: (a) an internal 
attribution, in which responsibility for the outcome 
is assumed by subject, and (b) an external attribu- 


4The author wishes to thank Karen Gould and 
Suella Wallace for their extremely competent work 
as experimenters. 


tion, in which responsibility for the outcome is rele- 
gated to some property of the situation or the other 
person. The I+ score represents the number of 
positive items for which subject takes responsibility ; 
the I— score represents the number of negative items 
for which he takes responsibility; and the total I 
score represents the sum of the subscores. The in- 
ternal alternatives can be further categorized into 
those which attribute the outcome to the ability of 
the subject versus those which attribute the outcome 
to his motivation. Thus the I+ score can be sub- 
divided into I+s (effort) and I+, (ability); the I— 
score can be subdivided into I—» (effort) and I—a 
(ability). 

Two subscales of the Test Anxiety Scale for Chil- 
dren (TASC) (Sarason, Davidson, Lighthall, Waite, 
& Ruebush, 1960) were administered. The scale has 
been factor analyzed and divided into four subscales 
by Feld and Lewis (1969), of which the test anxiety 
subscale and the poor self-evaluation subscale were 
used in the present study. Since the TASC has been 
used as an indirect measure of the motive to avoid 
failure (e.g., Atkinson & Feather, 1966), it was 
thought to be a measure which might distinguish 
between the helpless and persistent subjects. 

Finally, a repetition-choice task, similar to that of 
Bialer and Cromwell (1960) was administered to all 
subjects to test for the relative strength of the 
tendency to avoid failure versus the tendency to 
strive for success. Two 24-piece jigsaw puzzles, 114 
in. X 144 in. (.29 X .37 m), were used. They were 
of approximately the same difficulty and interest 
value and were cut from the same die. All subjects 
were allowed to complete the first puzzle, but were 
stopped 9 pieces short of completion on the second. 
They were then given the choice of reconstructing 
either puzzle at their leisure. The decision to recon- 
struct the already successfully completed one was 
taken as an indication of the tendency to avoid 
failure, while the decision to reconstruct the failed 
one was taken as an indication of the tendency, to 
strive for success. 

In addition to these measures, an Effort versus 
Ability Failure Attribution Scale was developed and 
administered to the helpless subjects only. Since the 
IAR fails to give the child a choice between two 
internal alternatives, that is, one attributing failure to 
insufficient effort and the other attributing failure to 
insufficient ability, it is difficult to assess the relative 
emphasis which an individual child places upon these 
two factors. Therefore, the present investigator de- 
vised a simple 5-question scale which presents 5 
failure situations involving arithmetic and asks sub- 
ject to choose between the ability and effort al- 
ternatives. 

All of the measures above were readministered to 
the helpless subjects following the completion of 
training, largely for exploratory purposes. The Effort 
versus Ability Failure Attribution Scale was ex- 
pected to yield the greatest changes as a result of 
attribution retraining, since it was most closely 
related to what this procedure endeavored to teach 
these children. 
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Establishment of Baselines and Inter polation 
of Failure 


After the administration of the premeasures and 
prior to the experimental treatment, the experimental 
subjects were given sheets of arithmetic problems to 
do. They were given three or four sheets per session 
each containing 25 or 30 problems. The sheets were 
adapted from the children’s workbooks and were 
chosen for each subject to reflect the range of his 
arithmetic abilities from moderately easy to mod- 
erately difficult. The problems were done in groups 
of five and a star was placed after every fifth prob- 
lem so that the subject would know he had reached 
the end of the group. After every group of 5 prob- 
lems, the experimenter recorded the time and graded 
them. If the subject got at least 4 of the 5 correct 
he was given a token. At the end of the session he 
was permitted to trade in his tokens for a small 
prize or save them for a larger one. 

The subject was given the same sheets every day, 
but in a different order for 10 days. Then to test 
for the effects of failure on rate and accuracy of 
performance, special sheets were made. They were 
identical to the moderately difficult arithmetic sheets 
on which the subjects had been working, except that 
in the second and fourth group of five problems, 
two problems were replaced with ones that were 
beyond subject's abilities, Thus, he would fail on 
that group of problems and could not earn a token. 
The interpolated failure sheet was placed second or 
third among the other sheets. When subject came 
to a star on a failure set of five problems, experi- 
menter corrected them, marking the two (or more) 
wrong, and said, “You got too many wrong that 
time. You don’t get a token.” If subject simply 
sat there and stared at the problems without at- 
tempting to do them and without indicating that he 
was finished with that set, the experimenter waited 
3 minutes and said, “Time's up.” She then cor- 
rected them, marking the incorrect and unattempted 
problems wrong and said, “You got too many wrong 
that time. You don’t get a token.” If, during a 
failure set, subject asked how to do the problems, 
experimenter said, “Just do the best you can.” 


Division into Treatment Groups 


The subjects were matched as closely as possible 
on the degree to which the interpolation of failure 
disrupted their performance. This was done by cal- 
culating the Percentage of decrease in the number of 
correct problems per minute from the day before 
interpolated failure to the day of interpolated failure 
on the same problems (i.e, those which followed 
failure). The subjects were randomly assigned to the 
Attribution Retraining (AR) Treatment or the Suc- 
cess Only (SO) Treatment, the second member of 
the pair being automatically assigned to the other 
group, There were three females and three males in 
the AR Treatment and two females and four males 
in the SO Treatment. Exch experimenter worked 
with three subjects in each treatment group, with 
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approximately the same range of subjects from the 
most disrupted by failure to the least. 


Experimental Training 


Two pieces of equipment especially designed for 
the training were used, The casing was made of 
wood and measured 23 X 64 X 74 in. (.58 X 17 X 
18 m). The front face contained two horizontal 
parallel rows of 20 small lights, the top row consist- 
ing of green lights, the bottom row of red lights, 
There were numbers from 1 to 20 across the face 
to correspond to each of the lights. The back of the 
apparatus contained an on-off switch and a dial 
which could be set to any number between 1 and 20 
(according to the criterian number of problems on 
a given trial). The lights in the top row could be 
illuminated one at a time via a handswitch Operated 
by the experimenter, When the number of lights 
illuminated in the top row matched the criterion 
number, the same number of lights in the bottom 
row was illuminated automatically. However, the 
experimenter could illuminate the criterion number 
of lights in the bottom row by means of a hand- 
switch before that number was reached in the top 
row. 

Training was carried out for 25 daily sessions. 
During these sessions, the subject did problems one 
at a time from a stack of 3 X 2 in, (.08 X .05 m) 
Papers each containing one problem. For each sub- 
ject, 48 problems were taken from sheets of prob- 
lems that had been easy for him and were presented 
in random order. On each trial, the experimenter 
preset the criterion number of problems for that trial 
by means of the dial on the back of the apparatus. 
When experimenter told subject to begin, experi- 
menter started the stopwatch and the subject took 
one problem from the top of the stack and com- 
Pleted it. If he did it correctly, experimenter said, 
“Correct,” and illuminated the first light in the top 


experimenter said, “No, 80 on to the next one,” and 
did not illuminate a light. The subject continued 


whereupon the bottom row of lights illuminated to 
match the number of lights in the top row, or until 
experimenter stopped him and told him his time 


Each subject was given 15 such trials in every 
Session. For the Success Only Treatment, the cri- 
terion number of problems on each trial was set at 
or below the number he was able to complete within 
the time limit; thus the members of this treatment 
group could succeed on every trial. The procedure 
was the same for the Attribution Retraining Treat- 
ment on 12 or 13 of the 15 trials. However, on the 
remaining 2 or 3 trials the criterion number was set 
one above the number he was generally able to 


was stopped one or two problems short of criterion, 
was compared to the criterion 
number required, and experimenter verbally attrib- 
uted the failure to insufficient effort. Thus the 
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members of this treatment group received 2 or 3 
failure trials in the course of each session. 

On the first day of training, subjects were given 
the following instructions, Instructions in parentheses 
were given to subjects in the AR Treatment only 
and were omitted for subjects in the SO Treatment. 


Today, and from now on, we're going to do some- 
thing a little different. See the problems on these 
papers? You will do one at a time and you will 
work for 1 minute. [The experimenter showed 
subject how long it takes to time 1 minute on the 
stopwatch.] 

Every time you finish a problem and it is correct, 
one of these lights will light up [the experimenter 
indicated lights in the top row]. If you work hard, 
get enough problems right, and light up enough 
lights, you will get a token. 

Here’s how you'll know if you did enough for a 
token, If you finish in time, these lights will light 
up and tell you [the experimenter indicated lights 
in the bottom row]. If you do not finish in time, 
I will stop you (and these lights will tell you 
how hard you should have tried to earn a token 
that time [the experimenter indicated lights in 
the bottom row]). 

OK, let’s try one and see how it works. [The 
experimenter set criterion to 2.] Begin. [The sub- 
ject took the first paper and did the problem. ] 
Correct. [The experimenter illuminated the first 
light. The subject took the second paper and did 
the problem.] Correct. [The experimenter illumi- 
nated the second light and the two lights in the 
bottom row lit up.] That was very good. You 
finished in time. See, those lights mean you had 
to get 2 right [Experimenter pointed to the bot- 
tom row] and you did [the experimenter pointed 
to the top row]. Here’s your token. 

Now let’s see what happens when you don’t finish 
in time. Suppose you did 1 problem and it took 
the whole minute. You got it right so the light 
lit up [the experimenter illuminated the first 
light]. But then I said, “Stop, time is up! You 
didn’t finish in time.” (Then these lights will tell 
you [experimenter lit two lights of the bottom 
row] you had to do 2 problems to carn a token 
that time. You did only one. That means you 
should have tried harder.) 


The experimenter then began the actual trials. It 
was determined in advance that if any subject in 
either group took longer than 14 minutes to 
complete the criterion number of problems, he was 
to fail on that trial. In summary, experimenter re- 
sponded to the success or failure trials of the two 
groups in the following manner: 

1. On success trials (Treatments AR and SO): 
“You finished in time. You needed , and you 
got ____.. You get a token.” 

2, On programmed failure trials (Treatment AR 
only): “Stop, time’s up. You didn’t finish in time. 
Let’s see how many you needed to earn a token. 
[The experimenter illuminated the criterion number 
of lights in the bottom row by means of the hand- 


switch.] You needed ____, and you got only 
. That means you should have tried harder.” 

3. On unscheduled failure trials (Treatments AR 
and SO): “Stop, time’s up. You didn’t finish in 
time. Let’s go on to the next.” 

The criterion numbers for the success trials were 
specified in advance and consisted of the three num- 
bers at the upper limit of what a given subject 
was generally able to complete within 1 minute. 
The experimenter recorded the time and errors on 
each trial; every day the mean number of problems 
per minute was calculated for each subject and the 
criterion numbers for the following day were ad- 
justed accordingly. Thus, the subjects were always 
working to capacity. The criterion number on the 
programmed failure trials for Treatment AR was one 
number above the highest criterion number on the 
success trials. (The two or three failure trials for 
Treatment AR in every session were randomly dis- 
persed among the trials with the constraint that there 
be no more than two consecutive failures and that 
neither the first nor the last trial be a failure.) On 
the last trial of every session, the subjects in both 
treatments were allowed to reach that number (i.e., 
one above the highest criterion number on the other 
success trials). 


Midtraining and Posttraining Interpolated 
Failure Tests 


For both groups, training was interrupted after 
the thirteenth day for the midtraining interpolated 
failure test. Baselines on the sheets of math prob- 
lems were reestablished for 3 days, an interpolated 
failure sheet corresponding to the difficult problems 
was inserted on the fourth day and baselines were 
reestablished for the following 3 days. Training was 
then resumed for 12 additional sessions after which 
the interpolated failure procedure repeated. 


RESULTS 
Helpless versus Persistent Subjects 


The helpless subjects differed from the 
persistent subjects on all of the comparison 
measures, thus verifying the selection proce- 
dure. The results from the TAR are in agree- 
ment with those of Dweck and Reppucci 
(1973) in that the helpless children took less 
personal responsibility for the outcomes of 
their behavior and tended to place less em- 
phasis on the role of effort in determining 
success and failure than did the persistent 
children. The Total I scores of the helpless 
subjects were significantly lower than those 
of the persistent subjects, £ = 19.5 and 24.5, 
respectively, #(20) = 3.11, p < .005.° While 
they did not differ reliably in the degree to 
which they attributed outcomes to ability, 


5 All probability values are for one-tailed tests. 
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TABLE 1 
PRE- AND POSTFAILURE PERFORMANCE ON THE PRETRAINING, MIDTRAINING, AND POSTTRAINING 


TESTS FOR SUBJECTS IN 


THE ATTRIBUTION RETRAINING AND SUCCESS ONLY TREATMEN 


Attribution Retraining Treatment Success Only Treatment 
Test 
AN | HA | ID | LA | TE | TO & AL | CI | OD | OS | JO | RO z 
Pretraining | 
Prefailure | 
Total time (sec.) 180 | 285 | 110 | 125 | 170 | 208 | 179.67 | 273 | 117 | 231 | 225 | 210 | 161 
Number correct 10) 12) 15} 15) 12} 15) 13.17] 12] 10} 15] 10] 15] 15] 
Postfailure 
Total time (sec.) 255 | 515 | 136 | 172 | 346 | 363 | 297.83 | 477 | 281 | 356 | 357 | 283 | 227 | 330.1 
_ Number correct 9} 3) 15) 14] 3] 14] 9.67 9) O} 14} 10) 15] 12} 10.0 
Midtraining 
Prefailure | 
Total time (sec.) 73| 155| 74] 148) 95] 145] 115.00 | 248] 88 | 136] 249 | 150 | 124 
Number correct 10] 15) 15) 14] 13] 15] 13.67) 9] 10] 15] 10] 15] 15 
Postfailure 
Total time (sec.) 96 | 212) 81 |172| 111| 159| 138.50 | 340) 179 | 197 | 279 | 166 | 127 | 214.67 
Number correct 9} 14) 15] 15] 12] 15) 1333] 3} 5] 14] 10] 12] 9] ~ gigs 
Posttraining 
Prefailure 
Total time (sec.) 50} 148} 85] 107) 99) 178} 111.17 | 149] 77] 95} 151 | 142 | 128 
Number correct 10} 14; 15} 14] 13] 15] 13.50 | 14] 10] 15] 10| 15] 15 
Postfailure 
Total time (sec.) 48| 155| 81] 108] 91] 131] 102.33 | 254] 105 | 154 | 136 | 202 | 193 
Number correct 10] 14) 15) 15] 13} 15] 13.67] 9] oO] 15] 10] 11] 14 


Note. The number of 


(X = 14.17); AL = 15, CI = 10, OD = 15, OS = 10, JO = 15, RO = 15 ( = S different levels o 
difficulty received sheets containing different numbers of problems. Prefailure and postfailure problems were identical and ad- 


ministered on consecutive days. 


that is, (I+4) + (I-,), £ = 8.6 and 10.3, 
respectively, £(20) = 1.3, p < .10, the differ- 
ence in the degree to which they attributed 
outcomes to effort, that is, (I+) + (I—n), 
was highly reliable, X = 10.9 and 14.1, re- 
spectively, £(20) = 3.20, p < .005. 

On both of the TASC subscales adminis- 
tered (test anxiety and poor self-evaluation) ,* 
the helpless subjects proved to be more anx- 
ious than the comparison subjects. For the 
test anxiety subscale, the means for the help- 
less and persistent subjects were 5.3 and 4.2, 
respectively, #(18) = 2.80, p < .01, and for 
the poor self-evaluation subscale, the means 
were 3.9 and 3.1, respectively, ¢(18) = 2.47, 
p < .025. 

On the Repetition Choice task, 9 of the 
12 helpless subjects, when given the choice, 
chose to reconstruct the puzzle they had al- 
ready successfully completed rather than the 
one on which they had been interrupted, 
while only 1 of the persistent subjects chose 
to do so, x*(1) = 6.86, p < .01. Thus, on 


®These data were missing for two of the per- 
sistent comparison subjects. 


problems for each subject were as follows: AN = 10, HA = 15, ID = 15, LA = 15, 


13.33). Subjects working at d 


this measure, the helpless subjects evidenced 
a clear tendency to avoid failure, while the 
persistent children showed a tendency to 
strive for success. 


Effects of Training 


Table 1 contains the absolute levels of pre 
failure and postfailure performance (ti 
completion and number correct) for each 
subject on the pretraining, midtraining, and 
posttraining tests. To obtain a measure that 
would take account of the effects of failure 
on both speed and accuracy of performance, 
the number of correct problems per minute 
was calculated. The percentage decrease i 
this measure was used to compare prefailure 
(baseline) to postfailure performance on the 
identical problems. 

Figures 1 and 2 show the percentage de- 
crease in correct problems per minute follow- 
ing failure on the pretraining, midtraining, 
and posttraining interpolated failure tests for 
the individual subjects in the AR and SO 
treatments, respectively. Inspection of the 
individual graphs reveals that the perform- 
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ances of all subjects underwent marked 
deterioration on the interpolated failure day 
in the pretreatment period. That this decrease 
in rate is not a result of being more careful 
is reflected by the increase in the number of 
errors on the part of the subjects and by the 
observation of other behavior in the period 
following failure, such as deterioration of 
handwriting, staring into space, etc. By mid- 
training, all the subjects in the AR Treat- 
ment showed some improvement, although 
all still decreased their score over the previ- 
ous day. By the end of training, however, all 
of the subjects in this treatment showed either 
negligible impairment following failure or 
improvement in the number of correct prob- 
lems per minute compared with their per- 
formance on the day prior to interpolated 
failure. In the SO treatment, on the other 
hand, all but one of the subjects (OS) con- 
tinued to show substantial decrements in per- 
formance at both the midtraining and post- 
training tests, with two of the subjects 
showing increased impairment from pretrain- 
ing to posttraining. Thus, even though most 
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Ficure 1. Percentage decrease in correct problems 
per minute at pretraining, midtraining, and post- 
training for the subjects in the Attribution Retrain- 
ing Treatment. 
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Ficure 2. Percentage decrease in correct problems 
per minute at pretraining, midtraining, and post- 
training for the subjects in the Success Only Treat- 
ment. 


of the subjects in both treatments improved 
in their general performance on these prob- 
lems on nonfailure days, only the subjects in 
the AR Treatment showed consistent and 
substantial decreases in their maladaptive 
reaction to failure. 

Figure 3 shows the mean percentage de- 
crease in correct problems per minute for the 
two treatment groups at pretraining, mid- 
training, and posttraining. The ¢ tests on the 
changes from pretraining to midtraining and 
from midtraining to posttraining for the sub- 
jects in the AR Treatment showed a signifi- 
cant difference in percentage decrease in 
correct problems per minute from pretraining 
to midtraining, X = 51.4 and 18.2, ¢(5) = 
3.32, p < .025, and from midtraining to post- 
training, X= 18.2 and —9.2, ¢(5) = 5.51, 
p<.005. A ż test comparing the change 
scores from pretraining to posttraining of the 
subjects in the AR Treatment to those of the 
subjects in the SO Treatment revealed a sig- 
nificantly greater improvement for the AR 
Treatment, Xar = 51.4 and —9.2, Xgo = 50.4 
and 46.0, (10) = 3.64, p < .005. Thus, it is 
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Ficure 3. Mean percentage decrease in correct 
problems per minute at pretraining, midtraining, 
and posttraining for the Attribution Retraining 
Treatment and the Success Only Treatment. 


clear that while the subjects in the Attribu- 
tion Retraining Treatment were able to handle 
failure more adaptively in the test situation 
following treatment, some of the subjects in 
the Success Only Treatment seem to have 
become more sensitive to failure in the test 
situation following exclusive success during 
training. 


Readministered Measures 


On the Effort versus Ability Failure Attri- 
bution scale, all of the subjects in the AR 
Treatment showed an increase in the choice 
of the effort alternatives from pretraining to 
posttraining, X = 1.3 and 3.0, #(5) = 3.39, 
p < .01. The subjects in the SO Treatment 
showed no such increase (X = 1.3 and 1.5). 
A comparison of the posttraining scores of 
the subjects in the two treatments shows a 
reliably greater tendency for the subjects in 
the AR treatment to emphasize the role of 
effort in determining failure, #(10) = 3.0, 
p < .01. The change in this measure provides 
evidence that the subjects in the AR Treat- 
ment, in addition to changing their reactions 
to failure in the experimental situation, 
altered their attributions for failure in situa- 

tions involving mathematics in general. 
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There were no consistent changes in the 
scores of either treatment group on the TAR 
scale as of the time this scale was readmin- 
istered. For the subjects in the AR Treatment 
the means before and after training were 
19.0 and 19.5; for those in the SO Treatment 
the means were 20.0 and 20.8. 

On the Test Anxiety Subscale of the 
TASC, the subjects in the AR Treatment 
showed a decrease in test anxiety (prn = 
5.7 and prost = 3.7), although the change 
did not reach significance, (5) = 1.89, p < 
-10, while the subjects in the SO Treatment 
showed a slight increase (prg = 5.0 and 
Xposr = 5.5). The difference between the 
changes in test anxiety for the two treat- 
ments, however, was not significant, ¢(10) = 
1.58, p< .10. On the Poor Self-Evaluation 
Subscale of the TASC, the subjects in both 
treatments showed decreases, but these de- 
creases were not reliable. For the AR Treat- 
ment, Xpry = 4.5 and Xpogr = 2.2, #(5) = 
1.71, p< .10; for the SO Treatment, Xpry 
= 4.0 and X post = 3.0, (5) = 1.59, p < .10. 

Tnitially, on the Repetition Choice task, 
one subject in the AR Treatment and two 
subjects in the SO Treatment had chosen to 
reconstruct the failed puzzle rather than the 
one which they had successfully completed. 
After training, two additional subjects in the 
AR Treatment chose to do the puzzle on 
which they had previously failed, while none 
of the subjects in the SO Treatment altered 
their choices. However, this change was not 
statistically reliable. 

In summary, the subjects in the AR Treat- 
ment showed large changes in their recogni- 
tion of effort as a determinant of failure as 
reflected in their change scores on the Effort 
versus Ability Failure Attribution scale, the 
measure most closely related to the attribu- 
tion retraining treatment, but failed to show 
reliable changes on the other, more global 
measures. The subjects in the Success Only 
Treatment, on the other hand, did not show 
significant changes on any of the measures 
which were readministered after training. 


DISCUSSION 


The investigation began with the assump- . 


tion that the manner in which a child views 
an aversive event, such as failure, determines, 
in large part, the way in which he reacts to 
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that event. Specifically, if a child believes 
failure to be a result of his lack of ability or 
a result of external factors beyond his con- 
trol, he is unlikely to persist in his efforts. 
On the other hand, if a child believes failure 
to be a result of his lack of motivation, he 
is likely to escalate his effort in an attempt 
to obtain the goal. It was further assumed 
(see Dweck & Reppucci, 1973) that those 
children who are characterized as learned 
helpless tend to regard failure in the former 
way, while those who are characterized as 
persistent tend toward the latter view. 

It was then hypothesized that by changing 
the helpless child’s attribution of failure in 
the training situation (from fixed to variable 
factors; see Weiner, 1972) one could thereby 


change his reaction to failure in the test, 


situation (from surrender to persistence). 
Training and testing differed in a number 
of respects. During training, relatively easy 
problems were completed by the subject and 
corrected by the experimenter one at a time; 
during testing, more difficult problems were 
completed by the subject and corrected by 
the experimenter in groups of five. In train- 
ing, responses were monitored on the appa- 
ratus; in testing, no apparatus was used. In 
training, failure meant that the child did not 
complete enough problems within the allotted 
time: in testing, failure meant that the child 
got the problems wrong. Despite these dif- 
ferences, it was expected that the new attribu- 
tion would generalize to the test situation and 
enable the child to persist following failure. 
The results of the investigation lend strong 
support to the hypothesis. 

Prior to training, all subjects showed 
severely deteriorated performance following 
failure; problems they had been solving daily 
became sources of great difficulty as evidenced 
by a decrease in rate of problem solving, an 
increase in errors, and behavioral indicants of 
withdrawal. The children who were taught to 
attribute failure during training to insufficient 
effort were able to persist after failure in the 
test situation. That failure became a cue to 
escalate effort is supported by the finding 
that five of the six subjects receiving the 
Attribution Retraining Treatment, in fact, 
showed superior performance following fail- 
ure (as compared to the day preceding 
failure). Interestingly, three of the subjects 
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in the AR Treatment actually verbalized the 
attribution during testing. 

Contrary to initial expectation, however, 
the subjects in the Success Only Treatment 
did not show any consistent improvement in 
their response to failure, but rather continued 
to display a marked impairment of perform- 
ance following failure. This occurred despite 
the fact that their performance during train- 
ing and on nonfailure days during testing 
steadily improved. While the successes might 
well have resulted in higher expectations of 
success and might have kept the situation 
from becoming aversive, this was not suffi- 
cient to prevent the adverse reaction to fail- 
ure. Two of the subjects in this group even 
appeared to be slightly more sensitive to 
failure after prolonged experience with con- 
tinued success. If their successes led them to 
form a new image of themselves, that of a 
succeeder, failure would indeed have a serious 
impact. 

Investigators who advocate errorless learn- 
ing for children often cite Terrace’s (1963a) 
work on errorless discrimination in the 
pigeon. Terrace demonstrated that pigeons 
making unreinforced responses to the nega- 
tive stimulus (i.e., errors) during discrimina- 
tion training showed gross emotional re- 
sponses to that stimulus. In other words, it 
acquired aversive properties. When discrimi- 
nations were trained without responding to 
the negative stimulus (i.e., were errorless), it 
remained a neutral stimulus. Skinner (1968) 
says of the pigeon, “It is, so to speak, ‘not 
afraid of making a mistake.’” It is rarely 
pointed out, although it is explicitly stated 
by Terrace (1969), that in some cases “error- 
less discrimination is not an unmixed bless- 
ing.” For example, Terrace (1963b) found 
that when an errorless discrimination was 
followed by a different discrimination trained 
with errors, the original perfect discrimina- 
tion was disrupted: Similarly, Terrace (1969) 
reported that when a continuously reinforced 
errorless discrimination was followed by ex- 
tinction, the previously errorless performance 
was interfered with. It would seem that al- 
though errorless learning led to superior per- 
formance, it rendered the subjects less able 
to deal with subsequent errors. 

While a success only procedure for children 
is an effective approach for teaching a given 
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body of material, the present findings suggest 
that it might be a short-sighted approach. The 
implications for strategies of behavior change 
or behavior building are rather straight- 
forward. An instructional program for chil- 
dren who have difficulty dealing with failure 
would do well not to skirt the issue by trying 
to ensure success or by glossing over failure. 
Instead it should include procedures for deal- 
ing with this problem directly. This is not to 
suggest that failure should be included in 
great amounts or that failure per se is desir- 
able, but rather, that errors should be capi- 
talized upon as vehicles for teaching the child 
how to handle failure. 

The training procedure in the present study 
addressed itself to a relatively narrow aspect 
of the student’s achievement behavior—his 
performance in the experimental situation. 
The results for the children receiving attribu- 
tion retraining provide evidence for a change 
in behavior in that situation and for a greater 
emphasis on the role of motivation in deter- 
mining failure in arithmetic. It is not sur- 
prising that the less related, more global cog- 
nitive measures failed to show significant 
changes, since (a) so little time elapsed be- 
tween the completion of training and the 
readministration of these measures and (b) 
training was conducted in a limited context. 
It is possible that additional time would have 
allowed the child to “test out” the new at- 
tribution in other settings and to begin to use 
it more generally. Unfortunately, the present 
study was conducted at the end of the school 
year (March through June) and there was 
therefore no opportunity to test this possibil- 
ity. A safer assumption, however, is that 
explicit generalization training is required to 
assure changes in the global cognitive mea- 
sures. Such training would involve guided 
experience in applying the new attribution to 
a variety of situations. 

Although direct measures of transfer of the 
effects of training to the classroom were not 
taken, verbal reports from the teachers (who 
did not know which subject received each 
treatment) indicated that the subjects in the 

Attribution Retraining Treatment had begun 
to work harder and to develop a different atti- 
tude toward failure. This was evidenced by 
their increased persistence with new material 
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and their seeking help rather than withdraw- 
ing when unable to complete a task. 

Cognitive-personality variables, such as the 
manner in which a person perceives the rela- 
tionship between his behavior and the occur- 
rence of certain events, indeed appear to be 
important determinants of the way in which 
people react to events. Many clinicians (e.g., 
Rogers, 1951) have hoped for changes in 
these variables to come about as byproducts 
of therapy, for example, increased responsibil- 
ity for one’s behavior or increased self-esteem. 
This may be akin to the Success Only Treat- 
ment in the present experiment. Others (me 
Ellis, 1962; Meichenbaum, Note 1) have 
begun to try to modify directly “what clients 
say to themselves” about events. The present 
investigation emphasizes the usefulness of con- 
sidering such cognitive-personality variables 
in planning strategies of behavior change. 
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The Influence of a Communicators Body Position on Opinion fi 
Change in Others 


Hugh McGinley, Richard LeFevre, and Pat McGinley 
University of Wyoming 


The question was asked, “While discussing his or her views, does a communi- 
cator who exhibits limb-outward or open body positions effect greater opinion 
change in an addressee than a communicator who exhibits limb-inward or 
closed body positions?” In an attempt to answer this question, female sub- 
jects, whose attitudes were premeasured, perused an attitude questionnaire of a 
female student and then viewed pictures that were taken of her while she dis- 
cussed her beliefs. Some subjects viewed open body position pictures of the 
communicator, while others viewed closed body position pictures. Retesting 
of the subjects’ opinions showed a change toward the communicator’s view- 
point for subjects who had viewed “open” pictures compared with subjects 
who had viewed “closed” pictures of the communicator (p <.01). 


The common observation that communica- 
tors vary in the openness of their body posi- 
tions and movements while orally presenting 
their views led to the question, “While dis- 
cussing his other views, does a communicator 
who exhibits limb-outward or open body po- 
sitions effect greater opinion change in an 
adressee than a communicator who exhibits 
limb-inward or closed body positions?” Infor- 
mation that is pertinent to this question is 
reported by Mehrabian and Williams (1969). 
Mehrabian and Williams found that the degree 
of liking a communicator nonverbally com- 
municates to an address is a positive correlate 
of both the communicator’s intention to per- 
suade and the degree to which others judge 
the communicator to be persuasive. They 
found the following nonverbal behaviors to 
be directly associated with both the com- 
municator’s intent to persuade and the judges’ 
tendency to rate a communicator as persua- 
sive: facial activity, rate of gesticulation, and 
eye contact. These behaviors all loaded a non- 
verbal communication factor that Mehrabian 
has labeled “relaxation.” Other nonverbal be- 
haviors loading this factor are: degree of 
sideways lean, leg-position asymmetry, arm- 
position asymmetry, openness of arm posi- 
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tion, hand relaxation, and neck relaxation 
(Mehrabian, 1968a, 1968b, 1972; Mehrabian 
& Friar, 1969; Mehrabian & Williams, 1969). 

Mehrabian (1972) has delineated three 
referential dimensions of nonverbal behavior 
that seem to be important in noverbal com- 
munication. These dimensions are positive- 
ness, potency, and responsiveness. The posi- 
tiveness dimension relates to evaluation, 
liking, and the expression of affect. The po- 
tency dimension deals with social status and 
social control; more or less, it reflects inter- 
personal effectiveness. The responsiveness 
dimension reflects general voice and body 
activity as well as the communicator’s orien- 
tation toward his addressee. Mehrabian’s 
nonverbal communication factor of relaxa- 
tion is involved with each of three referential 
dimensions. 

In accord with Mehrabian’s work, the com- 
municator who uses open body positions as 
compared to the communicator who uses 
closed body positions should (a) be more 
persuasive, (b) be seen as more active 
(responsive), (c) be better liked or positively 
evaluated, and (d) be seen as more potent 
(effective). 

The question we posed concerns opinion 
change in a person who has knowledge of the 
attitude stance of another person. The previ- 
ously cited works generally pertain to in- 
tended and perceived persuasiveness and not 
to opinion change. However, intuitively, one 
expects a direct association between the 
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persuasiveness of a communicator and his 
opinion-change effect on an addressee. Follow- 
ing this line of thought, the main hypothesis 
of the present study is that addressees who 
are exposed to the opinions of a communi- 
cator who displays open body positioning of 
the arms and legs will demonstate more opin- 
ion change than addressees who are exposed 
to the opinions of a communicator who dis- 
plays closed body positioning of the arms 
and legs. 

The secondary hypotheses of the experi- 
ment are, essentially, tests of the internal va- 
lidity of the experimental manipulation, the 
open and closed body positions of the com- 
municator. A communicator who exhibits 
open body positions should be rated higher 
in activity, positive evaluation, and potency 
by addressees than a communicator who ex- 
hibits closed body positions. For subjects 
from whom opinion change data were ob- 
tained there was an attempt to control for 
variations in positive evaluation and potency. 
For positive evaluation, this was done by 
presenting the communicator as being highly 
similar in attitudes to the addressees which, 
according to Byrne (1971), results in attrac- 
tion toward the communicator (positive evalu- 
ation); for potency, this was achieved by 
introducing the communicator as a peer of 
the subjects—Mehrabian (1972) reported 
that his potency dimension was highly loaded 
by status. 


METHOD 


Subjects and Materials 


Subjects were 96 female students from introduc- 
tory psychology and introductory education classes 
at the University of Wyoming who volunteered to 
participate in the experiment. The materials used in 
the experiment were a 24-item attitude questionnaire, 
a modified version of the Interpersonal Judgment 
Scale, a 12-scale semantic differential, 30 color slides 
of a female model, and a Kodak Carousel 850 
projector timed by a Hunter timer. s 

The 24 items selected for the attitude question- 
naire were taken from Byrne (1971). Each topic 
item was followed by 10 alternative choices that 
ranged from an absolutely supporting to an abso- 
lutely nonsupporting position. The items included 
topics such as the legalization of marijuana, the 
custom of tipping, and the use of nuclear weapons. 

The Interpersonal Judgment Scale (Byrne, 1971) 
was modified so that each item had nine alternatives. 
The items required the subjects to make judgments 
about the intelligence, knowledge of current events, 
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likability, and creativity of the communicator. We 
were only interested in the subjects’ responses to 
the likability item; the other items were fillers. 

The semantic differential (Osgood, Suci, & Tannen- 
baum, 1957) consisted of 12 seven-interval bipolar 
scales. The scales were evenly grouped into scales 
loading the semantic factors activity, evaluation, and 
potency. 

Thirty color slides were selected from 144 taken 
of a seated, slack-suit clad, female student (com- 
municator) while she conversed with a stationary 
photographer. Two judges rated the closed-open 
quality of the 144 slides on a 5-point scale, The 
judges considered such things as the angle between 
torso and legs (reclining angle), suggestion of out- 
ward movement of arms and legs, and head posi- 
tions. Ten slides that were twice judged by both 
raters as demonstrating closed pody position were 
selected. These slides showed the communicator in 
some sort of combination of the following: elbows 
next to her body, arms crossed, hands folded in 
her lap, knees pressed together, feet together, and 
legs crossed at either knees or the ankles. Ten 
other slides were judged as representing open posi- 
tion. These slides showed the communicator in vari- 
ous combinations of the following: leaning back- 
ward, legs stretched out, knees apart, one ankle 
crossed over the other knee, elbows away from her 
body, hands held outward, and arms held outward 
from her body either directly at her side or elevated. 
The judges also selected 10 slides that were con- 
sistently rated in the middle of the open-closed body 
position scale. In these slides the communicator was 
typically leaning back, her legs were slightly stretched 
out and siightly apart, her hands were either resting 
on her lap or her thighs, and her arms were held 
relatively close to her sides. The neutral slides were, 
depending on the experimental condition, either 
mixed with the open or closed slides or were shown 
by themselves. During the experiment, the slides of 
the communicator were projected to a size of ap- 
proximately 120 X 180 cm onto a screen in a dark- 
ened room. Each slide was shown for 9,2 sec; the 
interslide interval was .75 sec. 


Procedure 


Approximately 500, 24-item attitude questionnaires 
were administered to students. Students were assured 
of confidentiality and were asked to indicate their 
name, sex, and telephone number on the question- 
naire. Some of the students were later invited to 
participate in an experiment having to do with 
making judgments about people. Two to three weeks 
after they had filled out the initial questionnaire, 
the students were assigned to one of six experimental 
conditions, each of which, in the final analysis, con- 
tained 16 subjects. The subjects met in small groups 
of from 2 to 8 persons. 


Experimental Conditions 


Four of the six experimental conditions allowed for 
a direct test of the main hypothesis concerning 
opinion change (similar-open, similar-neutral, similar- 
closed, and retest-neutral conditions). Also, two of 
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these conditions and two other conditions allowed 
for the testing of the secondary hypotheses (similar- 
open, similar-closed, open, and closed conditions). 
Similar-open condition. Subjects (hereafter referred 
to as addressees) perused a “copy” of the com- 
municator’s attitude questionnaire, a bogus question- 
naire that agreed with the addressee’s attitude ques- 
tionnaire on 22 of 24 items. The bogus questionnaire 
was prepared so that 8 items were marked the same 
as the addressee had marked them, 7 were marked 
one interval more centrally, and 7 were marked an 
interval more extreme. Furthermore, 2 items that the 
addressee had marked in the “strongly” interval, 
were marked in the opposite strongly category. The 
addressees were given the following instructions: 


I am sure you remember when you filled out an 
attitude questionnaire in your class. I am going to 
pass out copies that were made from another 
girl’s questionnaire. After you have had 5 minutes 
to study the copy of her questionnaire, I am going 
to show you some pictures that were taken of 
her as she talked about her responses to the ques- 
tionnaire. I would like you to get as clear a notion 
as you can of what this girl is like. Once I hand 
out the questionnaires please don’t talk or ask 
questions. 


Following the initial instructions, roll was called, 
and the appropriate bogus questionnaire was handed 
to each addressee. After the addressees had perused 
the questionnaires for 5 minutes, the instructor said, 
“Now I am going to turn off the lights and show 
you a series of pictures that were taken while the 
girl talked about her responses to the questionnaire 
items. Again, I’d like you to figure out what kind 
of a girl she is.” 


The addressees then viewed 20 slides of the com- 
municator (10 neutral and 10 open slides). After the 
slides were shown, the experimenter said, “Pd like 
you to spend another couple of minutes looking 
through the questionnaire trying to clear up your 
concept of this person now that you have seen the 
pictures,” 

After 3 minutes the experimenter gathered the 
bogus questionnaires and said, “Now I would like 
to get a measure of your impression of the girl in 
the pictures whose questionnaire you have been look- 
ing over. Please fill out these scales in the next 3 
minutes.” The Interpersonal Judgment Scale and the 
semantic differential scales were handed out. After 
about 3 minutes the experimenter suggested that the 
addressees finish completing the forms and shortly 
thereafter proceeded to collect the scales. After 
gathering the forms, he handed out blank attitude 
questionnaires and said, “I’m interested in how the 
attitude questionnaire that you filled out in class 
remains stable or changes over time. What I'm 


1For simplicity we used the terms addressee for 
the subject and communicator for the model. There 
was no direct, person-to-person communication. 
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wondering is how reliable it is. Will you please fill 
out the questionnaire again?” The experimenter 
picked up the questionnaires as they were completed. 

Similar-closed condtition. The Procedure for this 
group was the same as that for the similar-open 
group with the exception that the addressees viewed 
10 closed body position slides interspersed with the 
10 neutral body position slides. 

Similar-neutral condition. The procedure for this 
group was exactly the same as that for the similar- 
open and similar-closed groups except that the 
addressees viewed only the 10 neutral slides. 

Retest-neutral condition. The procedure for this 
group was the same as that for the similar-neutral 
group, only the addressees filled out their second 
attitude questionnaires before they were exposed to 
the bogus questionnaire and the neutral slides of the 
communicator, 

The two remaining conditions were labeled open 
and closed. In these conditions there was no attempt 
to control for the addressee’s initial attraction 
toward the communicator by making the communi- 
cator appear similar to her in attitude. Addressees in 
these two groups were simply asked to make judg- 
ments about the girl whose pictures they viewed, 
The addressees in the open condition viewed the 
open and neutral slides, while addressees in the closed 
condition viewed the closed and neutral slides, 
There was no option change measure for the open 
and closed groups, 

Attitude questionnaire retest data were obtained 
from 16 female students 24 weeks after the initial 
administration of the attitude questionnaire. The 
retesting was done in the classroom. The product- 
moment coefficient for the test-retest total scores 
was .79 (df=14, p < .001). 


Scoring the Dependent Measures 


The opinion change score was obtained by com- 
paring an addressee’s markings on the two items of 
the attitude questionnaire with which the com- 
municator radically disagreed. If an addressee’s re- 
sponse on one of the items was 2 points less extreme 
(toward the communicator’s viewpoint) than her 
response on the first completion of the attitude 
questionnaire, she was assigned a score of +2; if her 
second response was two points more extreme, her 
Score was —2. The total opinion change score was 
the algebraic total of the scale movement for the 
two critical items. 

The only item scored from the Interpersonal Judg- 
ment Scale was one concerning how much the ad- 
dressee thought she liked the communicator. A score 
of 1 indicated that she strongly disliked her, while 
a score of 9 indicated that she strongly liked her. 

The semantic differential was made up of 12 bi- 
polar scales, 4 scales each for the evaluative, po- 
tency, and activity dimensions, High scores, a 
maximum of 28 for each dimension, indicated posi- 
tive evaluation as opposed to negative evaluation, 
potent as opposed to impotent, and active as op- 
posed to passive. 


INFLUENCE oF A CommunicaTor’s Bopy POSITION 


Data Analyses 


The opinion change scores were analyzed by a 
one-way analysis of variance. The data for this 
analysis were from subjects in the similar-open, 
similar-neutral, similar-closed, and neutral-retest con- 
ditions, The ratings were analyzed by four, separate 
2X2 analyses of variance. The two factors were an 
attitude similarity—no attitude similarity manipula- 
tion and an open-closed body position. Data for 
these analyses were obtained only from subjects 
who had viewed 20 slides, subjects from the similar- 
open, similar-closed, open, and closed conditions. 


RESULTS 
Opinion Change 

The overall analysis showed a significant 
difference between the similar-open, similar- 
neutral, similar-closed, and retest-neutral 
groups, F (3,60) = 3.15, p < .05. The mean 
opinion change scores were 3.50, 1.19, 1.81, 
and 1.69, respectively. The similar-open 
group’s mean opinion change score was sig- 
nificantly greater than the mean opinion 
change scores of all other groups (p < .01 for 
the smallest difference between paired means). 
The mean opinion change scores for the three 
other groups did not significantly differ from 
each other. 

These data fully support the main hypothe- 
sis: Addressees who viewed pictures of a com- 
municator exhibiting open body position 
showed greater opinion change toward the 
communicator’s viewpoint than addressees 
who viewed pictures of the communicator ex- 
hibiting closed body position. The latter ad- 
dressees, however, did not differ in opinion 
change from addressees who saw neutral body 
position pictures of the communicator or from 
persons who simply rereported their opinions 
about various topics 24 weeks after they had 
originally reported them. 


Secondary Measures 


The activity ratings showed only a body 
position effect, F (1,60) = 5.80, p< .025. 
This result supports our prediction that ad- 
dressees who viewed the open body position 
slides would rate the communicator as more 
active than addressees who viewed closed 
body position slides of her. As seen from the 
means listed in Table 1, this was the case both 
when the communicator was presented as 
having attitudes similar to those of the ad- 
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TABLE 1 


Errects or Mopet’s Bopy Position AND ATTITUDE 
SIMILARITY MANIPULATION 


Attitude similarity | Noageitude 
Measure Sz y 
Open | Closed | Open | Closed 
body body body body 
position | position | position | position 
Opinion 
change 54] 3.50a}| 1.815 — — 
Activity «79 | 21.88, | 19.81b0 | 20.94ap | 19.19. 
Liking 32 | 7.31a | 7.0605} 7.06 | 6,316 
Evaluation | .67 | 22.63, | 21.25, | 21.00, | 19.50, 
Potency 94 | 16.81a | 13.88, | 15.06.» | 16.444 


Note. Values within measures 
differ at p < .05 (direct comparisons, two-tailed tests). 

+ The mean opinion change score for the retest-neutral group 
was 1.69; for the similar-neutral group it was 1.19. 


ith different su 


dressees and when no attitude similarity ma- 
nipulation was attempted. This result offers 
validity for the body position manipulation. 

The analysis of the evaluation data showed 
both an attitude similarity effect, F (1,60) = 
6.32, p < .025, and a body position effect, 
F (1,60) = 4.58, p < .05. Addressees whose 
attitudes were similar to the communicator 
rated her more positively than addressees who 
had no knowledge of the communicator’s at- 
titudes, and addressees who viewed open 
slides of the communicator rated her more 
positively than addressees who viewed closed 
slides of her. We wanted some control over 
the effect of the communicator’s body position 
upon the addressees’ evaluation of her. To do 
this, wẹ tried to create an addressees’ atti- 
tude toward the communicator that would not 
be differentially positive toward her because 
of the slides the addressees viewed. The pres- 
ent analysis shows that this attempt was not 
successful. Regardless of the attitude similar- 
ity condition, the communicator was seen as 
evaluatively more positive when addressees 
viewed open as compared to closed slides of 
her. 

The liking data showed no overall effects, 
although, as would be expected, the attitude 
similarity and body position manipulations 
led to trends that are consistent with the 
evaluation ratings; for both manipulations, 
F (1,60) = 2.38. 

The analysis of the potency data revealed 
a significant Attitude X Body Position inter- 
action, F (1,60) = 5.31, p < .025. Simple 
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effects analyses of the data showed that when 
there was no attitude similarity manipulation, 
there was no difference in the potency ratings 
given to the communicator that were related 
to her body position. When the communicator 
was presented as having attitudes similar to 
the addressees, she was rated as being less 
potent in the closed slide condition. 

As a post hoc analysis, the addressees’ lik- 
ing, evaluation, activity, and potency ratings 
of the communicator were regressed upon 
their opinion change scores in order to assess 
further the relationships involyed. None of 
the regression F values were significant (all 
were less than unity). The multiple correla- 
tion based on all four measures accounted for 
less than 5% of the opinion change variance. 


Discusston 


The results are perplexing. As hypothesized, 
the analyses of variance data showed that as 
opposed to the communicator with closed 
body position, the communicator with open 
body position: (a) effected more addressee 
opinion change and (b) was rated by the 
addressees as being more active, evaluatively 
positive, and, given attitude similarity be- 
tween the communicator and the addressees, 
more potent. The perplexing part of these 
data is that, as shown by the stepwise multi- 
ple regression analysis, addressees’ opinion 
change is seemingly independent of their ac- 
tivity, evaluative, and potency ratings of the 
communicator. In concert with Mehrabian’s 
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work, we had expected that these ratings 
would help to explain what it was about the 
communicator’s open body position that led 
the addressees to change their opinions 
toward the viewpoint of the communicator. 
Inasmuch as the ratings were not signifi- 
cantly related to the addressees’ opinion 
change scores, an explanation needs yet to 
be offered for the opinion change phenomenon 
observed in the study. The absence of this 
explanation bespeaks the need for further 
research. 
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Helping a Motorist in Distress: 
The Effects of Sex, Race, and Neighborhood 


Stephen G. West, Glayde Whitney, and Robert Schnedler 
Florida State University 


Two experiments investigated the effects of the sex and race of the victim and 
the racial composition of the neighborhood in which the incident was staged 
on helping a motorist whose car was apparently disabled. In Experiment 1, 
a significant sex of victim effect was obtained: The female victims were helped 
faster than male victims. A significant Race X Neighborhood interaction was 
also obtained, with the black victims being helped faster in the black neigh- 
borhoods, while the white victims were helped faster in the white neighbor- 
hoods. The helpers were predominantly male and of the same race as the 
victim. In Experiment 2, the effects of a fourth variable, proximity to a col- 
lege campus, were also investigated. While the effect of sex of victim and the 
helper-sex bias were replicated, the Race X Neighborhood interaction was 
modified by the proximity to college variable. In locations adjacent to a 
predominantly black and a predominantly white college campus, the victims 
of the opposite race were helped faster than victims of the same race. How- 
ever, in the noncollege neighborhoods, the victims of the same race were 
helped faster. Whites predominantly helped victims of the same race, while 
black helpers did not show a racial bias. The results were discussed in terms 


of the Piliavins’ (1972) two-stage model of the helping process. 


Since the brutal murder of Kitty Genovese 
in Queens in 1964, a growing body of research 
in social psychology has focused on helping 
behavior. Much of this research has focused 
on such determinants of helping behavior as 
modeling (Bryan, 1972; Hornstein, 1970), 


norms (Berkowitz, 1972; Schwartz, 1970), 


temporary states (Berkowitz, 1972; Freed- 
man, 1970; Isen, 1970), and the actual or 
implied presence of other onlookers (Clark & 
Word, 1972; Latané & Darley, 1969, 1970; 
Ross, 1971). Recently, researchers have begun 
to examine the effects of victim character- 
istics, such as race and sex, on the helping 
behavior of adults in naturalistic settings. 
The effects of race of the solicitor on dona- 
tions to the Salvation Army were investigated 
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in two experiments by Bryan and Test 
(1967). In their first experiment, a male 
model gave a donation which was observed 
by the subject, and no differences were found 
in the donations to black or white solicitors. 
In Experiment 2, no model was present and 
the white solicitor received signfiicantly more 
contributions than the black solicitor. In both 
experiments, the solicitors were female and 
the majority of the subjects were white fe- 
males. 

Piliavin, Rodin, and Piliavin (1969) inves- 
tigated the effects of race of the victim and 
the nature of the emergency on helping. The 
victim, who appeared to be either drunk or 
ill, collapsed on a New York subway train. 
One black and three white undergraduate 
males played the role of the victim. Males 
were more likely to help than females and 
there was a tendency for same-race helping to 
occur, but only when the victim was appar- 
ently drunk. 

Wispé and Freshley (1971) investigated 
the effects of the race of the person needing 
assistance and the race and sex of the observer 
on helping behavior. In this experiment, a 
black or white confederate broke a bag of 
groceries in front of a grocery store. The inci- 


691 


692 


dent was staged as a black male, a black 
female, a white male, or a white female sub- 
ject was approaching. With white subjects, 
the men and women did not differ signifi- 
cantly in the amount of help given, while with 
black subjects, men were more likely to help 
than women. A second result was that while 
no differences were obtained with male help- 
ers, female helpers were more likely to ignore 
a confederate of their own race. However, 
white female subjects were more likely to 
offer positive help (picking up the groceries) 
rather than perfunctory help or reacting with- 
out helping to the white confederate than to 
the black confederate. No difference was 
found in the positive help category for the 
black female subjects. 

A model of the helping process which may 
aid in clarifying some of the reasons for the 
often conflicting results cited above has re- 
cently been developed by the Piliavins and 
their associates (Piliavin & Piliavin, 1972; 
Piliavin et al., 1969; Walster & Piliavin, 
1972). Briefly, they have proposed that the 
sight of a victim elicits a state of physio- 
logical arousal or distress in the observer. The 
amount of distress experienced by the observer 
is a function of such variables as the severity 
of the emergency, the distance of the observer 
from the emergency, the personality of the 
observer, and the similarity and emotional 
attachment of the observer to the victim. As 
the amount of distress increases, the proba- 
bility of the observer making a response to 
the situation increases. The nature of the re- 
sponse is determined by the costs and rewards 
for helping and not helping. If the costs for 
helping are high, the observer is likely to 
leave the scene or give the victim indirect 
help such as calling the police rather than 
intervening directly. If the costs for not help- 
ing are high, the observer is likely to inter- 
vene directly in the situation. 

In applying this model to the race and 
helping area, one factor at each stage of the 
process appears to be particularly relevant. 
First, the similarity of the observer and the 
victim should be greater if both are of the 
same race. Thus, the observer should experi- 
ence greater distress if the victim is of the 
same race. However, the influence of this 
factor would probably be minimized if the 
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emergency was severe and the observer—vii 
distance was minimal. Relative to the 
level of distress which should occur u: 
these conditions, the increment in distr 
which would occur as a result of observer- 
victim similarity should be negligible. Second 
the race of the victim should particularly i 
fluence the costs of helping and not helpin: 
attributed to the situation by the observe: 
The observer probably feels that he is les 
likely to be hurt or embarrassed by a victim 
of the same race and thus should be more 
likely to help such a victim. However, if the 
costs of helping in the situation are very high 
or the costs of not helping are high this effect 
would be obscured. Under these conditions, 
direct helping should occur close to 0% and 
100% of the time, respectively, so that the 
influence of other attributed costs should be 
minimal. Thus, according to the model, same- 
race helping should occur in a situation where 
the emergency is not severe and the costs of 
both helping and not helping are low or mod- 
erate. 

The Piliavin model can also be used to 
analyze the relation between sex and helping. 
Consistent with the model and other ap- 
proaches (e.g., cultural norms), ‘females 
should help less often than males. It should 
be more costly in effort and potential danger 
for females to help than for males. Also, the 
costs of not helping are probably less for fe- 
males since they are unlikely to receive nega- 
tive sanctions for not helping. Conversely, 
female victims should generally elicit more 
helping than male victims. An emergency 
situation should be perceived by the observer 
as being more severe for a female victim than 
a male victim and the potential costs of help- 
ing a female victim are less. Thus, female 
victims should be more likely to receive help 
than male victims and female bystanders 
should be less likely to help than male by- 
standers. This prediction is consistent with 
much of the research literature (Bryan & 
Test, 1967; Clark, 1974; Darley & Latané, 
1970; Piliavin et al., 1969), although some 
exceptions have been reported (Emswiller, 
Deaux, & Willits, 1971; Wispé & Freshley, 
1971). As with race, sex effects should be 
most easily demonstrated when the severity 
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HELPING A MOTORIST IN DISTRESS 


of the emergency and the costs are not ex- 
treme. 


EXPERIMENT 1 


The present experiment attempted to test 
these predictions in a naturalistic situation. 
A situation similar to the flat-tire studies con- 
ducted by Bryan and Test (1967) was used 
since it adequately met the requirements 
above: a nonsevere emergency in which the 
cost (effort, potential danger) of helping was 
moderate and the cost of not helping was low. 
In addition, the situation easily allowed ma- 
nipulation of the race and sex of the victim 
as well as the racial composition of the pool 
of potential helpers. 


Method 
Overview 


Sixty-four incidents of a motorist needing as- 
sistance were staged at various locations in Talla- 
hassee, Florida. The independent variables in a 
2X2X2 factorial design were sex of the victim, 
the race of the victim (black or white), and the 
racial composition of the neighborhood in which the 
incident was staged. The primary dependent varia- 
bles were the elapsed time before another motorist 
helped and the race and sex of the helper. 


Procedure 


Three-person teams consisting of one victim-driver 
and two observers were used for each incident. After 
dropping off the two observers, the driver would 
circle a city block and coast to a stop on the 
shoulder of the road. The victim would then raise 
the hood of the car and stand behind the car, facing 
the approaching traffic. 

The session began when the hood of the car was 
raised and was terminated when a motorist stopped 
and offered assistance or after 15 minutes had 
elapsed. The two observers kept independent records 
of the elapsed time of the session, the race and sex 
of the motorists who passed by going in the same 
direction without offering help, and the race and 
sex of the person offering assistance. 

Four different victims were used in the experi- 
ment: a black male, a black female, a white male, 
and a white female. The victims were young adult 
college students chosen so as to avoid extremes in 
physical appearance. In each case, the victim was 
attired in slacks and a shirt or blouse. 

The sessions were conducted at four different loca- 
tions in Tallahassee, Florida. Two locations were in 
predominantly black residential areas and two loca- 
tions were in predominantly white residential areas. 
In all cases, the streets chosen were major two-lane 
arteries with a relatively heavy traffic volume of 
300-500 cars per hour. All sessions were conducted 
between 1:00 PM and 4:00 PM. The order in which 
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TABLE 1 


Mean Latency or HELPING 
(in seconds) 


Neighborhood 


Victim 
White Black 
White male 371 580, 
White female 101 217 
Black male 470 311 
Black female 281 176 
Note. n = 8 per cell. 


the sessions were conducted was counterbalanced to 
eliminate differences due to variations in traffic 
flow during the afternoon. The stimulus car for one- 
half of the sessions was a 1971 Ford, while for the 
other half of the sessions it was a 1968 Chevrolet. 


Results 
Checks on Neighborhood Manipulation 


While the locations chosen for the experi- 
ment were clearly in black or white residential 
areas, it is possible that the racial composi- 
tion of drivers would not differ between the 
black and white neighborhoods. In the two 
white neighborhoods 22% of the drivers were 
black, while in the two black neighborhoods, 
42% of the drivers were black, x? (1) = 
56.20, p < .001. These figures reflect the ra- 
cial composition of Tallahassee (23% black) 
and the fact that only high volume, through 
streets were used in the present experiment. 


Latency of Helping Scores 


The latency of helping was computed as 
the number of seconds from the time the vic- 
tim raised the hood of the car until a passing 
motorist stopped alongside the victim’s car. 
If no helping occurred, the session was termi- 
nated after 15 minutes and assigned a time of 
15 minutes. The latencies were subjected to a 
2 (Race of Victim) X 2 (Sex of Victim) x 
2 (White vs. Black Neighborhood) analysis 
of variance. The cell means are presented in 
Table 1. 

The analysis showed two significant effects. 
Female victims were offered help faster than 
male victims, F (1,56) = 17.21, p< .001, 
and a Race X Neighborhood interaction was 
obtained, F (1,56) = 6.50, p< .025, with 
black victims being helped faster in black 
neighborhoods, while the white victims were 
helped faster in the white neighborhoods. 
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Since a significant correlation, r (62) = 
—.34, p<.01, was obtained between the 
total number of cars passing per minute and 
the latency of helping scores, an analysis of 
covariance was also performed on the latency 
Scores using the total number of cars passing 
per minute as the covariate, Supporting the 
results of the analysis of variance, statistical 
significance was once again obtained for both 
sex, F (1,54) = 13.12, p< .001, and the 
Race X Neighborhood interaction, F (1, 54) 
= 7.00, $ < .025. 


Characteristics of H elpers 


In 60 of the 64 sessions, the victim was 
helped before the 15 minutes allotted for the 
session had elapsed. The data from these 
sessions were analyzed to examine the rela- 
tionship between the race and sex of the vic- 
tim and the race and sex of the helper. 

Two significant effects emerged. Males of- 
fered help more often than females. While 
59% of the drivers who passed the victim 
were male, 92% of the people who stopped to 
offer aid were male, x° (1) = 26.47, p < .001. 
Victims were also helped primarily by people 
of the same race. Combining across all condi- 
tions, 34% of the drivers who passed the vic- 
tim were black. For white victims, 94% of 
the people who offered assistance were white, 
while for black victims, 97% of the people 
who offered assistance were black, x? (1) = 
59.86, p < .001. 


EXPERIMENT 2 


Experiment 2 was undertaken to investigate 
the same variables in college and noncollege 
neighborhoods. It was predicted that the 
results in the noncollege areas would replicate 
those of Experiment 1. It was also predicted 
that in the Presumably more liberal white 
college area, significant cross-racial helping 
should occur and that the black victims should 
be helped at least as fast as the white victims 
(cf. Gaertner, 1973). While no firm predic- 
tion was made concerning helping in the pre- 
dominantly black college area, recent research 
by Braddock (1973, Note 1) indicates that 
these students might be expected to have 
traditional liberal, prointegration attitudes 
rather than pro-black nationalism, militant 
attitudes. Thus, a tentative prediction may 
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be made that significant cross-racial helping 
should occur in the black college area with 
the white victims being helped at least as fast 
as the black victims. 


Method 


Overview 


Using the same procedure as Experiment 1, 128 
incidents of a motorist needing assistance were staged 
at four locations in Tallahassee, Florida. The factors 
in the 2X2X2X2 design were sex of the victim, 
race of the victim (black or white), the racial 
composition of the neighborhood in which the inci- 
dent was staged, and the proximity to a college 
campus (near vs, far). Once again, the primary 
dependent variables were the elapsed time before 
another motorist offered assistance and the race and 
sex of the helper. 

As in Experiment 1, four different victims were 
used in the present experiment: a black male, a 
black female, a white male, and a white female. The 
victims for Experiment 2 did not participate in Ex- 
periment 1 in any manner, The victims again were 
young adult college students chosen to avoid ex- 
tremes in physical appearance and were attired in 
slacks and a shirt or blouse. 


Changes in Procedure 


The sessions were conducted at four new locations, 
each of which was at least 1.0 mile (1.61 km) from 
any location of Experiment 1, One location was 
adjacent to Florida State University which has a 
predominantly white student population (95% 
white). A second was adjacent to Florida Agricul- 
tural and Mechanical University which has a pre- 
dominantly black student population (96% black), 
The two other locations were in a white residential 
area and a black residential area, at least 1.5 miles 
(2.41 km) from the nearest university. In each case 
the street was a two-lane, local carrier so that the 
traffic flow was lighter (80-200 cars per hour), and 
the racial composition of the drivers varied from that 
of Experiment 1. were conducted be- 
tween 1:00 PM and 4:00 PM, and the order in which 


eliminate differences due to variations in traffic flow, 
The stimulus car for one-half of the sessions was a 
1972 Datsun; for the other half it was a 1971 Dat- 
sun. 


Results 
Checks on Reliability of Observation 


During 45 of the experimental sessions, two 
observers independently recorded the latency 
of helping and the race and sex of each pass- 
ing motorist. The records of the two observers 
were then correlated to get an estimate of the 
reliability of the observational record. For 
computing reliability only, scores of 15 min- 
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utes (no help) were discarded to avoid a 
spuriously high estimate of the reliability of 
the latency of helping scores. The mean inter- 
observer reliability for the number of cars 
passing with a given category of driver (e.g., 
black male) was r= .99. The interobserver 
reliability for the latency scores was also r = 
99. 


Checks on Neighborhood Manipulation 


A check was again made on the racial com- 
position of the passing motorists who formed 
the pool of potential helpers. In the white, 
noncollege, residential area, 10% of the driv- 
ers were black; in the white college residen- 
tial area, 18% of the drivers were black; in 
the black noncollege residential area, 74% of 
the drivers were black; and in the black, col- 
lege area, 83% of the drivers were black. The 
difference in the racial composition of the 
drivers between the black and white neigh- 
borhoods, collapsed across proximity to col- 
lege, was significant, x? (1) = 1,415, p< 
001. 


Latency of Helping Scores 


The latency of helping scores were sub- 
jected to a 2 (Race of Victim) X 2 (Sex of 
Victim) X 2 (White vs. Black Neighborhood) 
X 2 (Proximity to College) analysis of vari- 
ance. The cell means are presented in Table 2. 

As in Experiment 1, female victims were 
offered help faster than male victims, F (1, 
112) = 16.11, p < .001. However, the Race 
of Victim X Neighborhood interaction found 
in Experiment 1 was not obtained (F < 1). 
A significant three-way interaction of the 
Race of Victim X Neighborhood X Proximity 
to College occurred, F (1,112) = 4.12, p< 
.05. The noncollege location results were simi- 
lar to Experiment 1 in that black victims were 
helped faster in the black neighborhood and 
white victims were helped faster in the white 
neighborhoods. However, in the locations ad- 
jacent to the college campuses, black victims 
were helped faster near the white university, 
while white victims were helped faster near 
the black university. 

As a significant correlation, 7 (126) = 
—.26, p<.01, was obtained between the 
total number of cars passing per minute and 
the latency of helping scores, an analysis of 


TABLE 2 


MEAN LATENCY OF HELPING 
(in seconds) 


White neighborhood | Black neighborhood 


Victim 


College | Nor, | College | Nige 
White male 728 432 621 711 
White female 548 313 362 401 
Black male 717 563 742 675 


Black female 376 505 564 392 


Nole. n = 8 per cell. 


covariance was again performed on the latency 
scores using the total number of cars passing 
per minute as the covariate. Supporting the 
results of the analysis of variance, significance 
was obtained for sex of victim, F(1, 111) = 
14.09, p < .001, and for the Race of Victim 
X Neighborhood (black vs. white) X Proxim- 
ity to College interaction, F (1,111) = 4.89, 
p < .05. In addition, when the differences in 
traffic flow were removed, the victims received 
help faster in a residential neighborhood 
than when adjacent to a college campus, F (1, 
111) = 5.50, p < .025. No other effects ap- 
proached significance (p > .10). 


Characteristics of Helpers 


In 85 of the 128 sessions, the victim was 
helped before the 15 minutes allotted for the 
session had elapsed. The data from these ses- 
sions were analyzed to examine the relation 
between the race and sex of the victim and 
the race and sex of the helper. 

Two significant effects emerged. As in 
Experiment 1, males offered help more often 
than females. While 68% of the drivers who 
passed the victim were male, 82% of the 
people who stopped to offer assistance were 
male, x* (1) = 8.05, p < .01. Contrary to the 
results of Experiment 1, only black victims 
were helped more often by people of the same 
race. Combining across conditions, 46% of 
the drivers who passed the victim were black. 
For the white victims, 51% of the helpers 
were black, x? (1) = .49, ms, while for the 
black victims 86% of the helpers were black, 
x? (1) = 26.67, p < .001. Finally, the prox- 
imity to a college campus did not significantly 
affect cross-racial helping. In the black col- 
lege location, 63% of the helpers were of the 
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same race as the victim; while in the black 
noncollege location, 45% of the helpers were 
of the same race as the victim, x? (1) = .81, 
ns. In the white college location, 90% of the 
helpers were of the same race as the victim; 
while in the white noncollege location, 70% 
of the helpers were of the same race as the 
victim, x? (1) = 1.61, ns. This latter result, 
while not significant, is in a direction oppo- 
site that which was predicted. 


GENERAL DISCUSSION 


While the effects of the sex of the victim 
found in Experiment 1 were replicated in 
Experiment 2, the effects of the race of the 
victim were modifed by the addition of the 
proximity to college variable. Since no inter- 
actions were obtained between the race and 
sex of the victim, the effects of each of the 
variables are discussed separately. 

The effect of the sex of the victim on help- 
ing was clear: Female victims were consis- 
tently helped faster than male victims across 
a variety of types of neighborhoods. The 
source of this effect is the presumably greater 
distress which the female victim elicits in 
the observer. Females are less likely to be 
perceived as being able to cope with the 
emergency and the potential danger (rape, 
robbery, etc.) is higher. Also, the potential 
danger (cost) of helping the female victim 
should be perceived as being lower since the 
female victim is unlikely to injure the helper 
(Clark, 1974). By the same reasoning, the 
costs to a potential female helper should also 
be perceived as being higher. Consistent with 
this analysis, in both experiments females 
helped less than males. However, the present 
situation was one in which most females are 
not well-trained and therefore less competent 
to act than males. At the same time, both the 
effect of sex of victim and the helper-sex bias 
are consistent with sociobiological considera- 
tions of the evolutionary development of so- 
cial behavior. For a wide variety of situations 
a similar behavioral sex effect would be pre- 
dicted on purely evolutionary grounds for 
virtually any species that displayed morpho- 
logical sexual dimorphisms similar to those of 
humans (Campbell, 1972; Trivers & Willard, 
1973). The present experiment demonstrates 
only that males help more often than females 


under the present set of conditions. In other 
situations where the costs are lower for fe- 
males than males, or the norm dictates that 
females should help (e.g., a child in distress) , 
females may be as likely to help as males. 

The effects of the race of the victim and the 
neighborhood are complex.’ In Experiment 1, 
the predictions that predominantly same-race 
helping would occur and that the black vic- 
tims would be helped faster in the black 
neighborhoods were supported. However, in 
Experiment 2, the predictions were only 
partially supported. The obtained Race of 
Victim X Neighborhood X Proximity to Col- 
lege interaction in the latency of helping was 
consistent with the predictions. However, the 
subsequent analysis of the characteristics of 
the helpers showed that the source of this 
effect was quite different from that which 
was expected. Contrary to prediction, signifi- 
cant (51%) cross-racial helping occurred in 
both the college and the noncollege black 
neighborhoods, while little (14%) cross- 
racial helping occurred in either of the white 
neighborhood locations. Somewhat surpris- 
ingly, it was not the white, presumably liberal 
students who were responsible for the black 
victims being helped faster than the white 
victims in the white college area; rather it 
was primarily black students who were help- 
ing the black victims. This result is consistent 
with Braddock’s (1973, Note 1) finding that 
black students at white, Southern universities 
showed a high level of racial identification, 
pride, and solidarity. When these black stu- 
dents refer to other blacks as brothers and 
sisters, it is apparently more than mere 
rhetoric. 

However, in the black college area, unlike 
the white college area, it was both black 


1 Every attempt was made in the present investi- 
gations to minimize the possibility of a confounding 
of the race and neighborhood variables with social 
status. In both experiments, the black and white 
victims were dressed in a similar manner and were 
associated with the same stimulus cars. Additionally, 
in Experiment 1, sessions were conducted in lower- 
middle-class and upper-middle-class white neighbor- 
hoods, but no differences in helping were obtained 
in the two neighborhoods as a function of the experi- 
mental manipulations. The absence of a comparable 
higher income black neighborhood in Tallahassee, 
however, precluded the examination of the effects of 
social status on helping in black neighborhoods. 
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students and white passersby who were help- 
ing the white victims faster than the black 
victims. A significant amount of cross-racial 
helping also occurred in the black noncollege 
areas in Experiment 2, while this did not 
occur in the black residential areas used in 
Experiment 1 or in the white residential areas 
used in either experiment. The primary dif- 
ferences between the black locations used in 
the two experiments were the volume of traf- 
fic flow, the type of traffic (local or through), 
and the percentage of white drivers. In the 
present investigations, when traffic flow was 
light so that diffusion of responsibility (Dar- 
ley & Latané, 1968) may have been low and 
few same-race helpers were available, blacks 
frequently helped whites, while whites did 
not often help blacks. This may reflect a 
persistence of the traditional white and Ne- 
gro roles in the South and/or more favorable 
attitudes of blacks toward whites than the 
reverse. 

The finding that helping tended to be 
slower in the college than the noncollege areas 
was surprising. The college student observers 
should attribute greater similarity to the col- 
lege student victims than should the generally 
older observers in the residential areas. Sev- 
eral studies (Emswiller et al., 1971; Regan, 
1971; Suedfeld, Bochner, & Matas, 1971) 
have shown that attributed similarity in- 
creases helping. However, in the present ex- 
periment, help may have been more readily 
available in the college than the noncollege 
areas since student housing was nearby. Also, 
many of the student potential helpers were 
probably on their way to classes and Darley 
and Batson (1973) have shown that people 
in a hurry are less likely to help than people 
in less of a hurry. 

Finally, contrary to the conclusion some 
other researchers have reached (e.g., Darley 
& Batson, 1973), dispositional variables are 
useful in predicting helping behavior. As Ger- 
gen, Gergen, and Meter (1972) and Walster 
and Piliavin (1972) have pointed out, using 
dispositional variables to predict helping be- 
havior requires a careful analysis of the pay- 
off matrix of the costs of helping and not 
helping to the potential helpers in the situ- 
ation in question. There is no reason to expect 
that dispositional variables alone will predict 
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across situations having a wide variety of 
different payoff matrices. The present experi- 
ment, unlike other studies which have investi- 
gated race of the victim and helping (Piliavin 
et al., 1969; Wispé & Freshley, 1971) used 
a situation where there is no necessary con- 
frontation or contact with the victim or other 
observers. Each driver is relatively anony- 
mous. It is likely that, under these conditions, 
norms which facilitate helping would be weak 
and the costs of not helping would be quite 
low. In addition, the costs of helping in the 
present experiment would be best described 
as moderate since there is some potential 
danger as well as time and effort involved in 
stopping to help a stranded motorist. It is 
under this specified payoff matrix that the 
effects of the race and sex of the victim 
obtained in the present experiment should 
emerge. 
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This report contains three (conceptual) replications of an experiment designed 
to test the relation between the magnitude of task intrinsic rewards and the 
degree of intrinsic and extrinsic motivation toward the task. Based on an 
attributional analysis of task motivation, it was hypothesized that (a) the 
degree of intrinsic motivation will vary as a positive function and (b) the 
degree of extrinsic motivation, as a negative function of task-intrinsic rewards. 
Both predictions were supported within each of the separate replications. The 
discussion considered the theoretical and the practical significance of the 


findings. 


The distinction between intrinsic and ex- 
trinsic motivation has been of long-standing 
interest within psychology, However, recent 
attributional analyses (e.g., Bem, 1967; Kel- 
ley, 1967, 1971; Jones, Kanause, Kelley, Nis- 
bett, Valins, & Weiner, 1971) have brought 
about a recrudescence in the research activity 
on this topic (e.g., Deci, 1971, 1972a, 1972b; 
Kruglanski, Alon, & Lewis, 1972; Kruglanski, 
Friedman, & Zeevi, 1971; Lepper, Greene, & 
Nisbett, 1973). 

Within the attribution theory, motivation 
toward a task or a behavior is assumed to 
follow from an inference (attribution) made 
by the actor regarding the locus of causality 
for his own behavior. Cast in these terms, in- 
trinsic motivation is said to obtain whenever 
the actor’s self-attributed cause for his activ- 
ity inheres in that activity, and extrinsic 
motivation obtains whenever the cause is 
exogenous to the activity. Important implica- 
tions for a theory of task motivation follow 
from the attributional principle of discount- 
ing (Kelley, 1971), whereby the person’s 
magnitude of an attribution to any given 
cause varies inversely with the number of 
apparent causes. Thus, in an instance where 


‘This research was supported by Grant 7 from the 
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both an extrinsic circumstance and an in- 
trinsic aspect of the task appear as plausible 
causes for one’s behavior, attribution will be 
weaker than in the case where no extrinsic 
cause was evident. By now there has accumu- 
lated in the literature substantial evidence in 
support of the foregoing proposition. The 
present reference is to research which demon- 
strates that the absence (vs, presence) of ex- 
trinsic rewards enhances the actor's intrinsic 
motivation for an activity. For example, 
workers in the area of forced compliance (e.g., 
Carlsmith, Collins, & Helmreich, 1966; Fes- 
tinger & Carlsmith, 1959; Linder, Cooper, & 
Jones, 1967; Nel, Helmreich, & Aronson, 
1969) reported repeatedly a negative relation 
between the magnitude of task-extrinsic in- 
ducement offered for the performance of a 
(counterattitudinal) task and subsequent lik- 
ing for the task (assumed to constitute one 
index of intrinsic motivation). That the nega- 
tive relation between extrinsic rewards and 
intrinsic motivation extends to tasks which 
are not counterattitudinal is suggested by sev- 
eral further experiments, For example, Weick 
(1964) found that the cancellation of credits 
for participation in an attractive concept- 
attainment experiment increased the rated 
interest in the task. Kruglanski et al. (1971) 
found that under extrinsic incentives (a prom- 
ised visit to the university), the performance 
of high school students on (attractive) tasks 
of verbal skill was characterized by weaker 
intrinsic motivation (indexed by the Zeigarnik 
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ratios) as compared with a no-extrinsic-in- 
centive condition. Similarly, Kruglanski et 
al. (1972) found that subjects led to believe 
that their participation in various team com- 
petitions had been prompted by attractive 
rewards reported a lower interest in the 
games as compared to subjects who had not 
been given or made aware of such rewards. 
Deci (1971, 1972a, 1972b) found that sub- 
jects who had been paid to solve geometric 
puzzles spent less time on these activities 
during a subsequent free-choice period as 
compared to subjects who had not been pre- 
viously paid. Also, Lepper (Greene & Lepper, 
in press; Lepper, Greene, & Nisbett, 1973) 
showed that nursery school children who en- 
gaged in a novel activity for a prize (vs. no 
prize) spent less time on this activity during 
a test session conducted 1 to 2 weeks follow- 
ing the experimental treatments. 

The research just reviewed demonstrates 
the applicability of attribution theory to the 
phenomena of task motivation with respect to 
a single relation, notably the negative one 
between the magnitude (or the presence) of 
extrinsic rewards and intrinsic motivation, As 
noted earlier, this tendency has been ac- 
counted for by the attributional principle of 
discounting, which Suggests in this instance 
that the presence of an extrinsic cause for an 
activity lowers the actor’s magnitude of at- 
tribution to an intrinsic cause. But the dis- 
counting principle has a symmetrical implica- 
tion, namely, that the Presence (vs. absence) 
of a plausible intrinsic cause for an activity 
should lower one’s attribution to an extrinsic 
cause. In other words, this suggests the first 
hypothesis, a negative relation between the 
magnitude of intrinsic rewards and the degree 
of extrinsic motivation, 

The lowering of attribution to an extrinsic 
cause under conditions in which a plausible 
intrinsic one is apparent should be accompa- 
nied by a concomitant enhancement in in- 
trinsic attributions, Therefore, a second hy- 
pothesis is suggested: Intrinsic motivation 
should vary as a Positive function of the in- 
trinsic rewards contained in an activity. The 
foregoing hypotheses were tested in three sep- 
arate experiments described below. 
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GENERAL Parapicm 


The three studies to be reported were based ` 


on the same conceptual paradigm and dif- 
fered in theoretically irrelevant detail, Spe- 
cifically, in all cases the subjects were solic- 
ited to perform an unspecified task for mone- 
tary pay. In one experimental condition (high 
intrinsic reward) the task turned out to be 
interesting and challenging. In the other con- 
dition (low intrinsic reward), it was dull. 
The dependent variables included (a) mea- 
sures of intrinsic motivation (e.g., the degree 
of willingness to volunteer for similar tasks in 
the future) and (b) measures of extrinsic 
motivation (e.g., the importance attached to 
the monetary pay received). The description 
of the specific experiments follows. 


EXPERIMENT 1 


Method 
Subjects 


Forty boys and girls from Grades 6 and 7 of 
elementary school1 served as subjects. They had 
volunteered for an experiment alleged to study the 
Psychology of language in return for a 1 IE fee.? 


Procedure 


The experiment took place during regular class 
time at the subjects’ school. Each subject in his 
(her) turn departed from his (her) class and was 
referred to a separate room which served as the 
experimental laboratory. Upon arrival, the subject 
was greeted by the experimenter (the sixth author), 
who reminded him (her) that the experimental task 
had to do with the psychology of language. The 
experimenter also stated that he had no interest in 
testing each subject’s intelligence or personality and 
that only group averages would be considered. In 
addition, the experimenter reminded the subject that 
he (she) would be paid a fee of 1 IŁ upon the 
completion of the experimental task, 

The high intrinsic reward condition. The task in 
this condition consisted of a verbal puzzle. The sub- 
ject was presented with a list of five Hebrew words 
and was asked to construct the maximal number of 
meaningful words out of the letters of each word 
on the list. A pretest had indicated that children of 
the subjects? age and background found this task 
highly attractive and challenging. 

The low intrinsic reward condition. Subjects in 
this condition were required to erase two Hebrew 
letters whenever encountered in a typed text. A 
pretest had indicated that the subjects’ counterparts 
found this task quite dull. 


1 Appreciation is expressed to the staff of the 
Mevooth Eiron school for their gracious cooperation. 
24.2 Israeli IŁ ~$1 U.S. 
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In both experimental conditions a clock was 
placed in front of the subject and he (she) was 
instructed to continue with the task for 10 minutes. 
The experimenter measured the time from the start- 
ing signal and until the subject stopped working of 
his (her) own accord. Subjects who continued past 
14 minutes from their starting time were stopped 
by the experimenter. The amount of time the sub- 
ject spent on the task beyond the designated 10 
minutes was taken as an index of intrinsic motiva- 
tion. 

At this time the subject was asked to fill out a 
questionnaire ostensibly about his (her) linguistic 
background and habits. Included in this question- 
naire was an item about the task’s perceived degree 
of interest. A second questionnaire, allegedly about 
administrative details, contained a question about 
the perceived adequacy of the 1 IL monetary fee. 
After the subject had completed both question- 
naires, he was given the promised 1 IŁ fee in four 
_ coins of .25 IŁ each. Just before leaving the room, 
a second experimenter (the seventh author) ap- 
proached the subject and informed him (her) that 
a charity organization had asked for assistance in 
collecting donations for children in need. If the 
subject was willing, he (she) could place a donation 
in a box outside the door. This concluded the 
experiment. The subject was admonished not to 
discuss the research with the other students and 
was asked to return to his (her) class. After all the 
subjects had been tested, the experiment was fully 
explained to them and their donations were returned. 


Results 
A Check on the Experimental Manipulation 


Subjects answered the questions: “Differ- 
ent people show different degrees of interest 
in language tasks. To what extent did you 
find the task you have just performed inter- 
esting?” The answers were registered on a 
5-point Likert-type scale ranging from very 
interesting to not at all interesting. The 
relevant means are displayed in Table 1. As 


TABLE 1 


Task INTEREST, INTRINSIC AND EXTRINSIC 
MOTIVATION, EXPERIMENT 1 


High Low 
Measure intrinsic | intrinsic 
reward | reward 
Mean interest ratings | 4.20an | 2.04 
Mean extra time (in minutes) 
spent on the task 4 3.758 1.58 
Mean pay (in Israeli IL judged 
adequate f 1.450 4.00 
Mean donation to charity 3.708 2.954 


Note, Mi with the same uppercase subscript differ at the 
Loot level earn nificance, and those with the same lowercase 
subscript, at the .05 level, Higher figures denote a greater 


interest. 
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expected, subjects in the high intrinsic reward 
condition rated their task as more interesting 
than did those in the low intrinsic reward 
condition, #(38) = 6.41, p < .001.° 


Intrinsic Motivation 


The subjects’ degree of intrinsic motivation 
was inferred from the amount of time he 
(she) spent on the task beyond the required 
10 minutes. Accordingly, the index of intrin- 
sic motivation was constructed as a 5-step 
scale in which the lowest score was 0 addi- 
tional minutes and the highest one 4 addi- 
tional minutes* (beyond which the subject 
was stopped). The mean measures of intrinsic 
motivation for the two experimental groups 
are included in Table 1. Consistent with the 
previously stated hypothesis subjects in the 
high intrinsic reward condition spent more 
time on the activity, thus exhibiting a higher 
degree of intrinsic motivation than their 
counterparts in the low intrinsic reward. 
condition, (38) = 5.89, p < .001. 


Extrinsic Motivation 


Subjects’ degree of extrinsic motivation was 
inferred from two separate indices. One was 
the amount of pay the subject deemed ade- 
quate for the task. Specifically, subjects an- 
swered the following question: “The manage- 
ment of the present research should like to 
remunerate the subjects for their participa- 
tion. However, there exist differences of 
opinion as to what might be equitable pay. 
In your opinion, what would constitute ade- 
quate payment for the task you have just 
completed?” The answers were recorded on a 
5-point forced-choice scale ranging in .5 Ii 
steps from 0 Ih to 2.00 IÙ. The means for 
these results are also included in Table 1. 
Consistent with the hypothesis under test, 
subjects in the high intrinsic reward condi- 
tion recommended lower pay than did sub- 
jects in the low intrinsic reward condition, 
t(38) = 7.25, p < .001. 

A second operational definition of extrinsic 
motivation has been the magnitude of the 
subjects’ tendency to keep their fee to them- 


8 Unless indicated otheywise, all p values reported 
are two-tailed. 

4 Intermediate times were rounded off to the near- 
est minute. 
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selves versus to donate it to charity. A 5-point 
scale was constructed ranging from 0 donated 
quarters to 4 donated quarters (or 1.00 IL). 
The pertinent means are included in Table 
1. In line with the hypothesis, subjects 
in the high intrinsic reward condition donated 
more (thus displaying a lower degree of ex- 
trinsic motivation) than did subjects in the 
low intrinsic reward condition, ¢(38) = 2.13, 
É < 05. 


EXPERIMENT 2 
Method 


Subjects in this experiment were 40 female under- 
graduates who had offered to assist in a research 
on teenage girls in return for a fee of 3 IE per 
hour. Initially, the experimenter (the third autho) 
had ascertained the subjects’ reasons for participating 
in the project. It turned out that only three subjects 
gave an intrinsic reason (interest in the issue investi- 
gated) and the rest an extrinsic one (the promised 
fee). To keep constant the subjects’ initial motiva- 
tion, the three intrinsic subjects were a priori ex- 
cluded frem the analyses, 

Subjects in the high intrinsic reward condit’on 
sorted authentic letters sent by teenage girls to a 
psychologist of a women’s magazine into various 
problem categories (conflicts with the parents, love 
difficulties, etc.). Subjects in the low intrinsic reward 
condition checked IBM cards, allegedly containing 
data about teenage girls for (actually nonexistent) 
punching errors, 

At the time of recruitment, it had been ascer- 
tained that each subject. had at least 3 free hours 
at her disposal. However, before the commencement 
of work, the experimenter explained that because 
of budgetary problems and the need to employ as 
many girls as Possible, any one subject could be 
paid for only 2 hours of work but could contribute 
an additional hour without pay. After the two 
designated hours had elapsed, the experimenter 
recorded whether the subject did or did not wish 
to volunteer extra time and asked her to fill out a 
set of rating scales regarding various aspects of the 
subject was asked to state 
whether the pay was adequate for the job and, in 
case it was not, what might be an equitable amount 
of increase. This concluded the experiment. The 
purpose of the deception involved was fully ex- 
plained to the subject and she was enjoined to 
Secrecy. 


Results 
Checks on the Manipulation 


The several job ratings made by subjects 
in the two experimental conditions are sum- 
marized in Table 2, As expected, subjects in 
the high (vs. the low) intrinsic reward condi- 
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TABLE 2 


Mean RATINGS OF THE TASKS ON VARIOUS 


Dime: NS, EXPERIMENT 1 

High Low 
Rating dimension intrinsic intrinsic 

reward reward 
Enjoyable 1.83334 
Challenging 1.5556p 
Stimulating 1.83330 
Tiresome 3.6667p 


Requiring intelligence 
Requiring concentration 
Requiring responsibility 
Requiring personal judgment 


1 7-point bipolar. scales 
for example, from 1 
for the enjoyability 
rating on the 
subscript differ at 


Note. All ratings were 
anchored appropriately 
{not enjoyable) to 7 (v y 
measure, etc, Higher figures denote h 
specific dimensions, Means with the sar 
the .002 level of significance, 


tion rated their job as more enjoyable, stimu- 
lating, and challenging (p < .002 in all in- 
stances). In addition, subjects in the high 
(vs. the low) intrinsic reward condition rated 
their job as requiring more intelligence and 
personal judgment (p< 001) but did not, 
differ in the rated degrees of responsibility or 
concentration required for the job. Finally, 
subjects in the high intrinsic reward condi- 
tion rated their job as more tiresome 
(P< .001) than did subjects in the low 
intrinsic reward condition, 


Intrinsic Motivation 


Table 3 presents the proportions of sub- 
jects who volunteered for additional work at 
the experimental task without pay. Consistent 
with the hypothesis, subjects in the high in- 
trinsic reward condition tended to volunteer 
more than did subjects in the low intrinsic 
reward condition (z = 3.48, p < .001). 


TABLE 3 


DEGREES oF INTRINSIC AND EXTRINSIC 
MOTIVATION, EXPERIMENT 2 


High | Low 
Measure intrinsic | intrinsic 
reward | reward 


Proportion of Volunteering without 
pay for additional work# -736 -166 
Mean pay increase (in Israeli IL) 


recommended? -6842 | 1.7778 
Proportion of subjects recommend- 
ing any pay increase® -2105 


-6111 


å Proportions differ at the -001 level of significance, 
b Means differ at the .05 level of significance, 
* Proportions differ at the .02 level of significance, 


EFFECTS OF TAsk-INTRINSIC 


Extrinsic Motivation 


The pay increases that the subjects recom- 
mended as appropriate for their jobs are sum- 
marized in Table 3. Subjects recommend- 
ing no increase were assigned the score of 
0, and the remaining subjects, the amount 
of increase (in IŁ) that they had recom- 
mended. As predicted, subjects in the high 
intrinsic reward condition recommended on 
the average a lower pay increase than did 
subjects in the low intrinsic reward condition, 
t(35) = 1.82, p < .05, one-tailed. 

The extent of extrinsic motivation present 
in each experimental condition may be in- 
ferred alternatively from the proportions of 
subjects who recommended any (vs. no) in- 
crease in pay. The relevant figures are in- 
cluded in Table 3. Consistent with the previ- 
ously stated hypothesis, the proportions of 
subjects recommending an increase in pay 
was higher in the low versus the high intrinsic 
reward condition (z = 2.48, p < .02). 


EXPERIMENT 3 
Method 


Subjects in this experiment were 40 male and fe- 
male undergraduates from the social science division 
at the Tel-Aviv University. They had volunteered 
to participate in an experiment in return for a 
4 IL fee. 

In the present study the magnitude of task- 
intrinsic rewards was operationally defined as the 
subject’s degree of involvement (ie, a sense of 
participation and a stake) in the research. Accord- 
ingly, subjects in the high intrinsic reward condi- 
tion were given a meticulous explanation of the 
scientific background and the significance of the 
(fictitious) research on memory processes in which 
they were about to participate. By contrast, subjects 
in the low intrinsic reward condition were given no 
introductory explanation regarding the nature of this 
research. Subjects in both conditions then read a 
prief passage from a history text and answered 10 
questions about dates and events contained in the 
passage. Then subjects in the high intrinsic reward 
condition were given feedback regarding the number 
of questions they had answered correctly, while no 
such feedback was provided their low intrinsic 
reward counterparts. z 

In the adjoining room an experimenter (the 
fourth author), unaware of the subject’s experi- 
mental condition, administered to him (her) a ques- 
tionnaire which inquired, among other things, about 
the task’s perceived degree of interest and whether 


the subjects would be willing to volunteer without 
h in the future. At this point, 


pay for similar researc é is po 
the subject was paid the promised 4 TŁ fee in eight 
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TABLE 4 


Task INTEREST, ĪNTRINSIC AND EXTRINSIC 
MOTIVATION, EXPERIMENT 3 


High Low 
Condition intrinsic | intrinsic 
reward | reward 
Mean interest ratings* 4.97 4.76 


Proportions of volunteering without 
pay for future research” 85 50 
Mean amounts of money (in 
Israeli IL) donated to charity? 1.50 19 


a Higher figures denote a greater interest. 
ò Proportions differ at the .02 level of significance, 
e Means differ at the .001 level of significance. 


.5 IŁ coins and was seemingly discharged from the 
experiment. Outside the laboratory he (she) was met 
by an accomplice posing as a student allegedly un- 
connected with the memory research, This person 
(also unaware of the subject’s experimental condi- 
tion) introduced himself as a representative of a 
student group which collected donations allegedly 
needed, “in order to send a lecturer’s child for a 
medical operation abroad,” After the subject made 
(or did not make) the requested donation, he (she) 
was reinvited into the laboratory and was given a 
full explanation regarding the purpose of the experi- 
ment. The subject’s donation (if any) was returned, 
and he (she) was enjoined to secrecy. 


Results 
A Check on the Manipulation 


The subjects’ ratings of the task’s perceived 
interest are summarized in Table 4, As 
shown, subjects in the high intrinsic reward 
condition rated the task as slightly more in- 
teresting than did subjects in the low intrin- 
sic reward condition. However, this difference 
was not statistically significant. The foregoing 
result raises the possibility that the manipula- 
tion of task rewards was not fully successful 
in this experiment. Note, however, that 
inasfar as the treatments were designed to 
produce differential levels of involvement in 
the research, a question about an interest in 
the specific task may not have been appropri- 
ate as a check on the efficacy of these 
treatments. 


Intrinsic Motivation 


The proportions of subjects volunteering 
without pay for future research are included 
in Table 4. Consistent with expectation, sub- 
jects in the high intrinsic reward condition 
yolunteered more, thus exhibiting a higher 
degree of intrinsic motivation as compared 
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with subjects in the low intrinsic reward 
condition (z = 2.33, p < .02). 


Extrinsic Motivation 


The subjects’? donations of their pay are 
also summarized in Table 4. Consistent with 
the hypothesis, subjects in the high intrinsic 
reward condition donated more on the aver- 
age, thus manifesting a lower degree of ex- 
trinsic motivation, than did subjects in the 
low intrinsic reward condition, (38) = 5.03, 
p < 001. 


GENERAL DISCUSSION 


The results above provide strong support 
for the hypothesis that (a) the extent of in- 
trinsic motivation varies positively and (b) 
the extent of extrinsic motivation varies nega- 
tively with the magnitude of task-intrinsic 
rewards.* Especially the second proposition 
is not a priori obvious, as it is possible to 
advance a plausible opposite prediction, that 
is, that the degree of extrinsic motivation will 
vary positively (rather than negatively) with 
task rewards. The latter proposition follows, 
for example, from the prevalent cultural norm 
whereby the more challenging and interesting 
jobs are accorded higher pay as well as 
prestige than the less exciting ones. 

The confidence in the validity of the pres- 
ent findings is abetted by the fact that they 
held for three separate experiments varying 
in a multitude of procedural details and in 
the operational definitions of the theoretical 
constructs investigated. In addition, the pro- 
cedural variations of this research provide a 
basis for deciding among alternative interpre- 
tations of the results which could be advanced 
with respect to some (but not other) experi- 
ments in the present series. For instance, one 
could argue that the observed negative rela- 
tion between task intrinsic rewards and ex- 
trinsic motivation may be derived from the 
equity principle (Adams, 1965) insofar as the 
investments of labor, the tedium endured, 
etc., are greater in the case of a dull versus an 
interesting task. Such an argument is weak- 


5 Following the completion of this research, we 
became aware of a recently completed survey with 
kibbutz workers in which the negative relation be- 
tween intrinsic and extrinsic motivations toward 
work has been consistently replicated. 


ened by the results of Experiment 2, where 
the intrinsically rewarding task was rated as 
requiring the greater investment of effort (it 
was perceived as more tiresome) as well as 
intellectual acumen and personal judgment. 

Alternatively, from the standpoint of the 
dissonance theory (Festinger, 1957), it could 
be reasoned that a person who expects an 
exciting experience suffers dissonance wher 
required to perform a dull, unrewarding task 
(as in the low intrinsic reward condition) 
Such dissonance could possibly be abate 
by attaching a greater importance to the 
monetary pay. But the dissonance argument 
seems less cogent when viewed against the 
circumstances of Experiment 2, where all 
subjects reported to have volunteered for the 
task because of the promised pay rather than 
the expectation of a pleasurable experience. 
Thus, while no direct measurement of causal 
attributions was attempted in this research, 
the attributional analysis advanced earlier 
still seems to fit the data better than do pos- 
sible alternative explanations. To reiterate, 
according to this analysis, the attribution that 
the extrinsic pay caused one’s performance 
of the task was lower when a plausible alter- 
native cause, notably task intrinsic rewards, 
was present versus absent. In other words, 
task intrinsic rewards may have pulled causal 
attributions away from the monetary reward 
and, hence, the observed lower level of extrin- 
sic motivation and a higher one of intrinsic 
motivation in the high versus the low task- 
reward condition. 

But besides offering theoretical support for 
an attributional analysis of task motivation, 
the present results seem to have a significant 
practical implication: Thus far, arguments 
in favor of intrinsic (vs, extrinsic) modes of 
inducing work motivation have rested oi 
tions regarding (a) the qualitative support of 
performance under intrinsic (vs. extrins! 
motivation (e.g., see Kruglanski et al., 1 
Lawler, 1969) and (b) the economic inefii- 
ciency of extrinsic motivation, based on the 
idea (e.g., discussed by Herzberg, 1966) that 
since workers quickly adapt to extrinsic re- 
wards, their constant elevation is required to 
sustain work motivation, Insofar as it demon- 
strates that a rise in intrinsic motivation is 
accompanied by a decline in an extrinsic one, 
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the present research adds an economic reason 
for preferring the former mode of motivating 
people in their jobs. 
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Gain and Loss in Attraction: 
Attributions from Nonverbal Behavior 


Gerald L. Clore, Nancy Hirschberg Wiggins, and Stuart Itkin 
University of Illinois at Urbana-Champaign 


In a series of videotaped conversations with a male, an actress performed 
nonverbal behaviors previously scaled for warmth. Taped segments were 
spliced together to form cold-warm, warm-warm, warm-cold, and cold-cold 
scenarios. Subjects made ratings of (a) the male recipient’s attraction to the 
female, (b) her attraction to the male, and (c) the male’s attraction to her. 
Clear gain (cold~warm > warm-warm) and loss (warm-cold < cold—cold) 
effects were found in ratings of the male recipient’s attraction to the female 
and not in the other two sets of ratings. These gain-loss effects, based on 
attributions from nonverbal behavior, did not depend upon a stimulus contrast 
or on perceptions of the female as discerning. Rather they depended on beliefs 
that the male recipient would respond affectively to the gain and loss of 


warmth from the female. 


There’s a language in her eye, her cheek, her lip; 
Nay, her foot speaks. Her wanton spirits look out 
At every joint and motive of her body. (Shakes- 
peare, Troilus and Cressida, Act IV, Scene 5, Line 55) 


Attributions from nonverbal behavior were 
explored in a recent study by Itkin, Wiggins, 
Clore, and Vercelloti (Note 1). In that 
experiment, observers saw a videotaped con- 
versation between a boy and a girl. Al- 
though seeming to meet for the first time 
as computer dates, the girl was in fact an 
actress whose behavior varied systematically 
in warmth. The experiment showed that de- 
scriptions of nonverbal behavior could be 
scaled along an evaluative dimension, en- 
acted, and then reliably decoded by naive 
observers for the appropriate affect portrayed. 

Attributions in the experiment were influ- 
enced, however, not only by warmth and 
coldness but also by the order in which the 
girl displayed the behaviors. When she be- 
gan with cold behaviors and then became 
warm (cold-warm), subjects thought the 
male would be more attracted to her than 
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when she was consistently warm (warm-— 
warm). These results were similar to the gain 
portion of what Aronson and Linder (1965) 
called gain and loss effects in attraction. They 
differ, however, in that they involved infer- 
ences about others rather than responses to 
direct experience. 

By an ingenious subterfuge, Aronson and 
Linder’s (1965) subjects overheard what 
appeared to be candid remarks about them 
selves from another person. The overheard 
remarks were either consistently positive, 
negative changing to positive (gain), con- 
sistently negative, or positive changing to 
negative (loss). The experimenters predicted ` 
more attraction to the speaker in the gain 
than in the consistently positive condition 
and less attraction in the loss than in the 
consistently negative condition. These predic- 
tions were counterintuitive, since everyone 
likes warm, loving people and dislikes cold, 
hateful people; nevertheless, the gain predic- 
tion was confirmed, even though the loss 
effect did not reach significance (p < 07). 

The gain—loss idea subsequently generated 
other studies using an amazing variety of 
procedures, many of which found no evidence 
for either effect when attraction was the 
dependent variable (Byrne & London, 1966; 
Chaikin, 1971; Mettee, 1971la; Sigall & 
Aronson, 1967; Taylor, Altman, & Sorrentino, 
1969; Taylor & Altman, Note 2). While a 
complete replication attempt has never been 
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published, Tognoli and Kiesner (1972) looked 
for the effects without success in a study very 
similar to the original Aronson-Linder study. 
In fact, only one investigator has ever re- 
ported both gain and loss effects. Griffitt 
(1971) found both effects in a series of 
studies, two using attitudinal stimuli and one 
using trait adjectives. His was a unique proce- 
dure in which items of information were al- 
ternately presented about first one and then 
the other of two stimulus persons. Although 
loss effects have been hard to find, gain effects 
have emerged more consistently (Aronson & 
Linder, 1965; Gerard & Greenbaum, 1962; 
Mettee, 1971b; Mettee, Taylor, & Friedman, 
1973; Stapert & Clore, 1969; Worchel & 
Schuster, 1966; Itkin et al., Note 1). 

The present study extends Itkin et al.’s 
(Note 1) experimental paradigm in order to 
study the illusive gain and loss effects in an 
attribution context. It was predicted that gain 
and loss would occur in attributions of the 
male’s attraction to the female, effects that 
would parallel those originally sought by 
Aronson and Linder (1965). In addition, a 
second goal of the study was to assess the 
generalizability of gain and loss explanations 
from the usual paradigm to the present 
attribution context. 

In the usual study, three major elements 
are involved: (a) the manipulated feedback 
(e.g, kind and cruel remarks, agreeing and 
disagreeing attitudes, positive and negative 
adjectives); (b) the person giving the feed- 
back; and (c) the person receiving the feed- 
back. Explanations for gain—loss effects can 
be grouped into these three major classes 
depending on the locus of emphasis. The 
present attribution paradigm used the video- 
tapes made by Itkin et al. (Note 1) showing 
a conversation between a male and a female. 
Hence the manipulated feedback consisted of 
nonverbal behaviors precalibrated for warmth, 
the person giving the feedback was the female 
actress, and the person receiving the feedback 
was a partially visible male actor. J 

The unique feature of the present attribu- 
tion paradigm is that the subject and the 
recipient of the feedback are not the same 
person. The virtue of this departure from the 
usual approach is that it allows a distinction 
to be made among the three classes of expla- 
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nations. Explanations that focus on the ma- 
nipulated feedback include expectancy (Berko- 
witz, 1960) and contrast (Sherif & Hovland, 
1961). They stress the role of expectations or 
levels of adaptation set by stimuli early in 
the sequence. Thus, for example, subjects may 
attribute greater attraction to the male recip- 
ient because the female seems warmer at the 
end of the cold-warm than the warm—warm 
sequence (and colder at the end of the warm- 
cold than the cold-cold sequence). 

Explanations concerned with the person 
giving the feedback involve that person’s 
attributes. For example, the gain and loss 
conditions might be more potent than the con- 
sistent conditions because a person giving 
both negative and positive feedback might 
be seen as more discerning (Landy & Aron- 
son, 1968). This explanation predicts that 
subjects would rate her as more discerning in 
the gain and loss conditions, and since dis- 
cerningness is a positive characteristic, sub- 
jects should like her more in both conditions. 

The third class of explanations reside with 
the person receiving the feedback. If the re- 
cipient feels that he changed the other's feel- 
ings toward him, he feels competent in the 
gain condition and should therefore like the 
other or incompetent in the loss condition 
and should dislike the other. A related expla- 
nation is that the recipient feels initially 
rebuked in the gain condition so that later 
positive feedback is especially rewarding, and 
conversely, the rejection after initial rein- 
forcement in the loss condition is especially 
punishing. Recipient-oriented (affective) ex- 
planations would predict gain and loss effects 
only for ratings of the recipient’s attraction 
because only he is affectively involved. 

The present study added three manipula- 
tions to the design used by Itkin et al. 
(Note 1). Since it is conceivable that the 
previous pattern of results was due to the 
order in which ratings were made (whether 
from the male or female perspective), the 
order was counterbalanced. Also, subjects in 
the previous study made judgments not only 
at the end but also after seeing the first half 
of the conversation. In some studies such 
intermediate judgments appear to be an essen- 
tial precondition for the gain effect (e.g., 
Byrne, Lamberth, Palmer, & London, 1969) ; 
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therefore, the present experiment provided a 
test of the effects of intermediate judgments. 
Finally, from the pool of tape segments avail- 
able, additional tapes were made, and the 
design was expanded to test for loss effects. 


METHOD 
Subjects 


The subjects were 338 students (180 females and 
158 males) at the University of Illinois who were 
randomly drawn by computer from an introductory 
psychology pool. They were scheduled in mixed-sex 
groups of 8 and randomly assigned to one cell of a 
4X2X2X2 design that varied conditions, sex, 
question order, and rating in the middle versus no 
rating in the middle. 


Stimuli 


Tn a previous study (Itkin et al., Note 1), a pool 
of nonverbal behaviors were selected based on scale 
values and factor loadings of their capacity to convey 
attraction. A series of videotapes were then made in 
which an actress performed selected warm and cold 
behaviors in an interaction with a male. Some 
sample warm behaviors were the following: sitting 
directly facing him, smiling, and nodding her head 
affirmatively ; sample cold behaviors were as follows: 
looking around the room, frowning, and playing 
with the ends of her hair, Two warm and two cold 
versions were made, each 24 minutes in length and 
each consisting of 18 warm or cold behaviors and 
3 neutral ones. The two versions of each type of 
Scenario differed only in the order in which warm or 
cold behaviors were presented. Pretest subjects’ 
ratings confirmed that the warm and cold segments 
conveyed significantly different amounts of attrac- 


In the earlier study, two 53-minute tapes were 
made from these Segments to form cold-warm and 
warm—warm scenarios. These Were used again in the 
Present study along with additional warm-cold and 


Procedure 


Subjects were told that they would see a video- 
tape showing the beginning and end of a 30-minute 
get-acquainted Conversation of a couple from a pre- 
vious computer dating study, A one-way mirror 
with a television camera behind it was demonstrated 


actress appeared seated facing a male who was only 
partially visible from the back. The tape was silent, 


but to control for apparent activity level, the same 
words were said in each tape segment, 

For half of the subjects the experimenter stopped 
the tape after the first warm or cold segment and 
asked them to answer some questions about what 
they had seen. The questions consisted of 10 seven- 
place rating scales, 4 of whicłĦ asked about the male’s 
reaction to the female, 4 of which asked about the 
female’s reaction to the male, and 2 of which asked 
about the subjects’ own reaction to the female. 
Thus, attraction ratings were made from three dif- 
ferent perspecitves: male’s, female’s and subject's. 
All subjects, whether they had made ratings in the 
middle or not, then rated the entire 5-minute con- 
versation on the same 10 scales, The order of the 
questions was counterbalanced so that half of the 
subjects made their male ratings first and half made 
their female ratings first. 

The rating scales asked for judgments of the male’s 
and female’s liking of their Partner, interest in the 
conversation, warmth, and probability of wanting to 
see the partner again. In each case, the sum of the 
“liking” and “see again” items was used as the 
definition of “attraction” (the possible range was 
from 2 to 14 points, the interitem correlation was 
-68). For their own attraction to the actress, subjects 
were asked about their liking for and desire to meet 
her. Finally, they rated the male and female actors 
on 10 seven-place trait attribution items that in- 
cluded the following: attractiveness, sociability, 
nervousness, warmth, dominance, confidence, social 
competence, sincerity, discernment (defined for the 
subject as “perceptive”), and efficacy (defined as 
“power to produce effects or intended results”), 


RESULTS 


The data analysis consisted primarily of a 
series of 4X 2X 2x2 analyses of variance 
supplemented by a series of orthogonal com- 


parisons. The variables in the analyses were ` 


treatment condition, sex, rating in the middle 
versus no rating in the middle, and question 
order. The treatment effect was significant in 
each of the three principal attraction analy- 
ses, male’s attraction to female, female’s at- 
traction to male, and subjects’ attraction to 
female (see Table 1). In each analysis or- 
thogonal comparisons among means showed 
that attraction ratings were significantly 
higher for the scenarios that ended in warm 
behaviors than for those that ended in cold 
behaviors (Ż < .001). The means and or- 
thogonal comparisons are shown in Table 2. 
The ratings of the male’s attraction to the 
female confirmed the primary predictions, 
showing both gain and loss effects. Consistent 
with expectations from the previously re- 


ported experiment, no gain or loss effects 


p 


£ 
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emerged in either ratings of the female’s at- 
traction to the male or in the subjects’? own 
attraction to her. In fact, the subjects’ own 
attraction showed a significant trend in the 
opposite` direction. 

No interaction emerged between treatment 
and whether or not ratings were made in the 
middle in any of the primary analyses; there- 
fore such ratings are not necessary to produce 
gain-loss effects. The question order variable 
was included to make sure that the tendency 
found in the previous study for the gain effect 
to occur only when subjects took the male’s 
perspective was not merely due to the order 
in which the attraction questions appeared. 
The proper test of that possibility is the Treat- 
ment X Question Order interaction, which 
was significant here for ratings of the male’s 
attraction, F (3,311) = 7.84, p < .001, and of 
the female’s attraction, F(3,311) < 14.05, 
p < .001. Internal analyses indicated that the 
interaction was due to a tendency to give 
more tentative ratings from the first perspec- 
tive and more extreme ratings from the second 
perspective, male F(1,311) = 20.55, p< 
.001; female F(1,311) = 42.71, p< .001. 
The other orthogonal comparisons were be- 
tween gain and question order and loss and 
question order. They were all nonsignificant 
except for one. Apparently because of a floor 
effect on the attraction measure, a significant 
interaction between loss and order within the 
male perspective, F(1,311) = 3.99, p < .05, 
showed no loss effect when questions about 
the male were asked second. 

At the conclusion of the experiment sub- 
jects rated the actor and actress on 10 trait 
items. These were analyzed in the same 


TABLE 1 


ANALYSIS OF VARIANCE RESULTS FOR SUBJECTS” 
JUDGMENTS OF ATTRACTION 


Dependent variable df MSerror F 
gery iste: x 3,311 4.11 58.33* 
Bs mi cee 3,311 4.26 41.77* 
Ra Scale ae 3,311 5.57 24,74* 


*p <.001. 
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TABLE 2 


MEANS AND ORTHOGONAL COMPARISONS FOR SUBJECTS’ 
JUDGMENTS OF ATTRACTION 


Treatment M 
Dependent 
Variable Warm- | Cold- | Cold- | Warm- 
warm | warm | cold cold 
(n = 102) | (n = 89) | (a = 75) | @ = 72) 
Male attraction 
to female 9.36 9.91 7,05 6,28 
D <.04 p <.02 
Female attrac- 
tion to male 9.10 9.58 6,80 6,74 
ns ns 
Subjects’ attrac- 
tion to female 8.21 6.61 5.18 5.77] 
p <.001 ns 


manner as the earlier data. The partially visi- 
ble male was believed to be significantly (all 
ps < 03) less nervous, more discerning, and 
more efficacious in the gain condition com- 
pared with the consistently warm condition. 
The stimulus female in the gain condition was 
seen as less discerning, sociable, warm, and 
socially competent and more nervous and 
efficacious compared with the consistently 
warm condition. When attributions in the loss 
condition were compared with those in the 
consistently cold condition, there were no sig- 
nificant effects for the male. The female in 
the loss condition was seen as more discern- 
ing, attractive, sociable, warm, and socially 
competent and as less neryous compared with 
the consistently cold condition. 


Discussion 


Observers saw videotaped conversations be- 
tween a male and a female in which the fe- 
male performed 42 nonverbal behaviors pre- 
scaled for warmth. The judgment scales asked 
about three different attitudes: (a) the male’s 
attraction to the female, (b) the female’s 
attraction to the male, and (c) the subject’s 
own attraction to the female. As predicted by 
the recipient-oriented explanations, significant 
gain and loss effects were shown only in at- 
tributions of the male’s attraction to the fe- 
male. Neither gain nor loss was present in 
judgments of the female’s attraction to the 
male nor in the subject’s report of his or her 
own feelings. 

The results of the trait attributions (made 
after the attraction ratings) hint that the 
gain effect may have depended on (a) sub- 
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jects reinterpreting the cold-warm woman’s 
coldness as nervousness and lack of social 
competence and (b) the assumption that after 
she warmed up, the male felt efficacious and 
no longer nervous. Similarly, the loss effect 
might have been based on subjects’ inferences 
that the male would feel punished after a 
woman, who was seen as attractive, discern- 
ing, sociable, warm, and socially competent, 
changed from warm to cold. 

Other results showed that gain-loss effects 
were found (a) regardless of the order in 
which questions were asked and (b) whether 
or not subjects made intermediate judgments 
after the first conversational segment. With 
respect to the latter finding, it seems likely 
that intermediate judgments will produce 
gain-loss effects when they serve to break up 
an otherwise continuous sequence of stimuli 
into two parts (Byrne et al., 1969). However, 
in the present study, the knowledge that 25 
minutes of conversation intervened between 
the first and last segments apparently served 
the same purpose, so that the effects occurred 
whether or not intermdeiate judgments were 
made, 

It should be pointed out that the gain and 
loss effects found here are not directly compa- 
rable to those sought by Aronson and Linder 
(1965) because they were attributions to 
others rather than personal reactions to direct 
experience, 


Recipient-Oriented Explanations 


Anxiety hypothesis. The same pattern of 
results were found in this and a previous 
study using the same methodology (Itkin et 
al., Note 1). Gain-loss effects occurred only 
when subjects made judgments about the in- 
volved male’s attraction to the stimulus fe- 
male. No such effects occurred when subjects 
made judgments about her feelings or when 
they reported their own reactions. The effects 
therefore depend on something believed to be 
distinctive about the male’s perspective such 
as his affective involvement. An affective 
interpretation of the present attribution re- 
sults is consistent with Aronson and Linder’s 
(1965) original logic as well as that of subse- 


* Personal communication from Norman Anderson, 
December 7, 1971. 
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quent gain-loss investigators (e.g., Griffitt, 
1971), Anxiety and its reduction and lack of 
anxiety and its onset are the main affects 
involved in this explanation. 

Competence hypothesis. As initially sug- 
gested by Aronson and Linder, the recipient’s 
feelings of involvement and competence may 
be a necessary condition for the gain effect. 
The present results indicated that in fact the 
male (recipient) was judged as more effica- 
cious in the gain condition. However, he 
was not judged as less effective in the loss 
condition, as might be predicted. 

Discounting. Discounting is a necessary, 
though not sufficient, condition for gain—loss 
effects (Mettee et al., 1973). In this study, 
subjects in the gain condition must ignore the 
initial segment of the cold-warm sequence. 
Otherwise, they would average the two seg- 
ments, and no amount of extra rewards from 
gaining warmth would make the combination 
more potent than the continuously warm 
condition. The belief that 25 minutes of con- 
versation separated the two segments may 
have encouraged subjects to discount the 
initial segments when inconsistent with later 
ones, 

In combination with the affective involve- 
ment hypothesis, the discounting notion ex- 
plains why loss effects have been harder for 
investigators to find than gain effects. In this 
regard, Anderson* suggests that gain effects 
occur readily because the person involved is 
anxious to discount the cold message in a 
cold-warm (gain) sequence, while loss effects 
are difficult to find because a person is loath 
to discount the initial warmth received in 
the warm-cold (loss) sequence. This explana- 
tion may also account for why strong loss 
effects occurred in the present study. Presum- 
ably, relatively uninvolved observers (in this 
attribution paradigm) were more willing to 
discount the initial warmth in the warm-cold 
condition than are more involved subjects 
(in studies of direct experience, e.g., Aronson 
& Linder, 1965). 


Manipulated-Feedback Explanations 


Stimulus contrast. According to this hy- 
Pgtiesis, early information creates anchoring 
expectancies in terms of which later experi- 
ences are judged. As applied to the present 
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research, the initially cold or warm behav- 
iors could make the opposite behaviors ap- 
pear more extreme by contrast. Being a con- 
sequence of the stimulus pattern itself (rather 
than of one’s affective involvement), the 
process should affect the perception of ob- 
servers as well as of participants. However, 
observers did not see the female’s behavior 
in the gain and loss conditions as more ex- 
treme in warmth and coldness. They did not 
believe that the female actually liked or dis- 
liked the male more in the gain or loss condi- 
tions than in the consistent ones, nor did they 
show gain or loss effects in their own atti- 
tudes toward the female. A related possibility 
is that subjects did not experience contrast 
effects but nevertheless attributed such effects 
to the male recipient. This attribution was 
not measured in the present study. 


Feedback Sender Explanations 


Discernment. Another hypothesis is that 
the stimulus person appears more discerning 
(Mettee, 1971b) in the gain and loss condi- 
tions than when she is consistently warm or 
consistently cold. It has been demonstrated 
that persons seen as discerning are more 
attractive than less discerning persons (Landy 
& Aronson, 1968). However, the present re- 
sults indicated that subjects liked the female 
significantly less in the purportedly discern- 
ing persons (Landy & Aronson, 1968). How- 
ever, the present results indicated that sub- 
jects liked the female significantly less in the 
purportedly discerning conditions and most 
when she was indiscriminately warm. More- 
over, the trait attribution data showed that 
subjects did not see the actress as more dis- 
cerning in the gain condition. These results 
do not vitiate the discernment hypothesis in 
the direct experience paradigm because they 
are based on the subject’s perspective; sub- 
jects did see the female as more discerning in 
the loss condition than in the consistently 
cold condition, so that discernment may have 
influenced the loss effect. 


Effort Hypothesis 

The present design also allows another hy- 
pothesis to be considered, which is very dif- 
ferent from those already discussed. Since the 
male is only partially visible and subjects are 


told that they are seeing only the first and 
last few minutes of a 30-minute conversation, 
what the male did in the middle is open to the 
subject’s imagination. It is thus conceivable 
that subjects turned everything around and 
viewed changes in the female’s behavior not 
as a cause of the male’s feelings but as a clue 
to how hard the male was trying to win or 
lose her. Thus, changes indicating he was try- 
ing hard (gain condition) would lead to 
inferences that he liked her, while changes 
indicating he was not trying at all (loss 
condition) would lead to conclusions that he 
found her uninteresting. This reverse infer- 
ence hypothesis is very different from the 
other hypotheses considered. It locates a new 
class of explanation focusing on the subject’s 
causal attributions. Nothing in the present 
data rules out this provocative explanation.* 


CONCLUSIONS 


Three questions were posed in this experi- 
ment: (a) Can gain-loss effects be found in 
an attribution context using precalibrated 
nonverbal stimuli? (b) How vulnerable are 
the effects to certain procedural variations? 
(c) What hypotheses about directly experi- 
enced gain-loss effects can explain results 
from this attribution data? As predicted, 
subjects’ judgments showed strong gain and 
loss effects that did not depend either on vari- 
ations in question order or on whether ratings 
were made in the middle of the sequences. 
Since gain and loss were found for ratings of 
the involved male’s attraction to the female 
but not for other ratings, they appear to be 
based on the attribution of affective responses 
to the male. In view of this pattern of re- 
sults, certain hypotheses were not helpful in 
accounting for this attribution data including 
contrast and discernment. 

Almost every gain-loss study done in the 
last 10 years has used a different procedure, 
and the present study is no exception. The 
fact that Aronson and Linder (1965) ob- 
tained only weak support for a loss effect 
and that other investigators have reported 
similar difficulties hints at the potency of the 


2The effort hypothesis was a possibility pointed 
out by Bibb Latané, personal communication, March 
1973. 


712 


present paradigm. This approach produced 
both gain and loss effects, used well- 
calibrated feedback, and was replicable (Itkin 


et al., Note 1). 
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Heider’s Structural Balance Principle as a Conceptual Rule 


Nickolas B. Cottrell 
University of Iowa 


Two experiments examined the hypothesis that Heider’s balance principle is a 
prominent conceptual rule when individuals classify social situations. Indi- 
viduals classified descriptions of triadic social situations in the standard 
reception paradigm for studying conceptual behavior. Supporting the hypothe- 
sis, the results showed that individuals required to use the balance rule per- 
formed better (p < .01) than a rote-learning control group. The balance rule 
was more available than other rules that involve three relevant dimensions 
(p < 00001). The balance rule was less available than rules based on friend- 
ship (p< .001) or agreement in the situation (p< .0001). These findings were 
discussed in terms of number of relevant dimensions as a determinant of 
problem difficulty, positive intraproblem transfer resulting from previous rule 
use, and variability of the mediating response as a determinant of rule avail- 


ability. 


The present studies examined two hypothe- 
ses concerning conceptual behavior that stem 
from Heider’s (1958) theory of structural 
balance. The hypotheses seem to be basic to 
Heider’s thinking, but to date they have not 
been subjected to empirical test. The theory 
concerns the perception of interpersonal rela- 
tions and of relations between persons and 
inanimate objects. The theory assumes that 
these perceptions change and interact accord- 
ing to the balance principle. 

Tn discussing the phenomenal properties of 
balance and imbalance, Heider notes that 
many different social situations meet the 
condition of being in an imbalanced state. 


That naive psychology has little difficulty in detect- 
ing situations of imbalance can be seen from the 
following examples [the letters stand for persons per- 
ceived in the individual’s life space]: 
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P hates O because he is so similar to O. 

He always imitates people he dislikes. 

He always hates people with whom he has to work, 
He hates Q because Q is similar to his friend O. 


He avoids people he likes... in some way we 
sense that the factors in the situation “do not add 
up”; they seem to pull in different directions. They 
leave us with a feeling of disturbance [Heider, 1958, 
p. 180]. 


Heider also states that many discriminably 
different social situations qualify as balanced 
states. 


By a balanced state (or situation) is meant a harmo- 
nious state, one in which the entities comprising the 
situation and the feelings about them fit together 
without stress . . . (for example) a person likes the 
things he makes, is uneasy if he has to live with 
people he dislikes, likes the child of his best friend 
. . . [Heider, 1958, pp. 180-181]. 


Typical investigations to verify these as- 
pects of Heider’s theory (see Zajonc, 1968, 
for a review of representative research) have 
required subjects to rate social situations in 
terms of perceived tension or perceived pleas- 
antness. However, the quotations given above 
seem to refer to conceptual behavior on the 
part of the perceiving individual. A variety 
of discriminably different social situations 
meet Heider’s definition of the imbalanced 
state. All of these situations are presumed to 
have the effect upon thesindividual of making 
him sense that the facto s in the situation do 
not add up and create a feeling of disturb- 
ance and tension, Similarly, Heider has stated 
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that a variety of discriminably different social 
situations meet his definition of a balanced 
state. All of these situations have the effects 
upon the individual of not producing stress 
and of producing a feeling of harmony and 
obviousness. 

The process of grouping distinctive stimuli 
together and responding to them in the same 
way has been termed “conceptual behavior” 
(e.g., Bourne, 1966). Certain attributes of the 
stimuli are relevant to define the concept or 
class. A classification principle or conceptual 
rule specifies how the relevant attributes are 
combined for use in determining the class 
membership of particular stimuli. Heider’s 
thinking, then, suggests that the balance 
principle is a conceptual rule which individu- 
als frequently use to classify social situations 
as either balanced or imbalanced. 

The experiments reported below used stan- 
dardized methods of studying concept behay- 
ior to test two hypotheses concerning the use 
of the balanced-imbalanced distinction as a 
conceptual rule. Briefly stated, the hypotheses 
are the following: 

1. Individuals are capable of classifying 
perceived social situations as either balanced 
or imbalanced. 

2. Heider’s theory implies that individuals 
frequently classify perceived social situations 
as either balanced or imbalanced. Studies of 
intraproblem transfer in the solution of clas- 
sification problems (e.g., Bourne, 1966) have 
shown that repeated use makes a conceptual 
rule increasingly available for later use. Thus, 
the hypotheses follow that the balance prin- 
ciple is a prominent conceptual rule for 
classifying social situations. 


EXPERIMENT 1 


This experiment tested Hypothesis 1. If 
this hypothesis is correct, then individuals 
required to classify social situations accord- 
ing to the structural balance rule should per- 
form better in a classification task than indi- 
viduals who must place the stimuli into one 
of two groups arbitrarily, that is, with no 
rule or principle to guide them in their 
classification. 

A second purpose of this experiment was 
to compare the relative availability of the 
structural balance rule to the rule based on 
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friendship and also to the rule based on agres- 
ment. Recently, Zajonc (1968) and Re: 
rigues (1968) have suggested that the per 
ceived friendship in a social situation and also 
the perceived agreement are more impor- 
tant determinants of the individual’s reaction 
to social situations than whether the situation 
is balanced or imbalanced. 

This study employed the reception para- 
digm for studying conceptual behavior, see 
Bourne (1966) for a description. In the 
present study, the stimuli were short descrip- 
tions of triadic social structures which Heider 
(1958) has classified as either balanced or 
imbalanced. 


Method 
Subjects 


The subjects were 40 female students at the Uni- 
versity of Iowa who chose to participate to fulfill 
a requirement in the introductory psychology course. 
Their ages ranged from 18 to 23 years. 


Apparatus and Materials 


The stimuli consisted of 16 cards. Each card de- 
scribed a triadic social structure involving two 
persons and the attitude object “integration.” There 
were four sentences on each card. The first sentence 
described a person’s liking for another person, for 
example, “Ellen dislikes Marcie,” or “Rita likes 
Kathy.” The second sentence stated the attitude of 
the person who was the subject of the first sen- 
tence, for example, “Ellen is in favor of integration,” 
“Rita does not favor integration.” The third sentence 
stated the attitude of the person who was the object 
of the first sentence. The fourth sentence was “DOK 
or ROZ?” 

The 16 cards consisted of two instances each of 
the four triadic systems that Heider has classified 
as balanced: +++, +——, —+—, ——+, and two 
instances each of the four triadic systems that Hei- 
der has classified as imbalanced: +—+, ++—, —++, 
———. In this notation scheme, the first sign desig- 
nates the sign of the sentiment between the two 
people, O (the first-named person) and Q (the 
second-named person). The second and third signs 
designate the respective attitudes of O and Q toward 
the impersonal attitude object. 

Each of the 16 cards had a different pair of first 
names, The attitude object on each card was always 
“integration.” The names of the persons in the social 
situations were female names three to six letters 
long. Chosen from a standard dictionary, they are 
relatively common, with their initial letters quite 
evenly distributed throughout the alphabet. The 32 
names were combined into pairs at random, with the 
restrictions that in no pair did the names begin with 
a same letter or with alphabetically adjacent let- 
ers. 


zt 
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The name pairs were assigned to situations at ran- 
dom (Version 1). For generality, a second set of 
stimuli was prepared with a different random as- 
signment of name pairs to situations (Version 2). 


Procedure 


Within each version there were four independent 
random orderings of the 16 stimuli. These orders 
had two restrictions: The first 8 cards exhausted the 
eight basic Heiderian situations; there were never 
more than 2 stimuli of the same class in succession. 
For each subject there were 64 cards contained in a 
ring binder. The experimenter timed the various 
stages of the experiment with a GRA Lab Micro- 
timer (Meylan Stopwatch Company). 

The subject was instructed that her task, requiring 
social sensitivity and insight, was to classify a set 
of social situations involving two people who know 
each other well. She was shown a typical stimulus 
card and the notebook containing the stimuli and 
told that the word “integration” denoted the issue 
of racial integration. 

Then the subject received the following instruc- 
tions: 


Your task. is to classify these social situations. 
The situations can be sorted into two equal-sized 
groups according to some principle or rule. I want 
you to discover the rule which will enable you to 
classify all of the cards in the notebook. 

Here is how we will proceed. When I tell you to 
open the notebook, study the first card. Then say 
either “DOK” or “ROZ” . . . to show which 
cards belong together in the same group. You are 
to say “DOK” to all those cards which belong in 
one group and “ROZ” to all those cards which be- 
long together in the other group. 

After you have said “DOK” or “ROZ,” I will tell 
you whether or not your response was correct to 
that card, After I have given the correct answer, 
you may study the card a bit longer. When I 
start the clock, get ready to turn to the next card. 
When I stop this clock (demonstrate), turn to 
the next card, and again study it and classify it 
as a “DOK” or “ROZ” situation. Then I will tell 
you the correct answer for that card, allow you to 
study it, and signal you to turn to the next card. 


Additional instructions stressed the need for saying 
DOK or ROZ for each item. The subjects were also 
instructed that the same cards were repeated four 
times in the notebook, so that every time through 
the notebook counted as four times through the list. 

On each item the experimenter would urge the 
subject to respond, if she did not make a classifica- 
tion response within 20 seconds. When the subject 
responded, the experimenter said the correct response 
aloud, that is, gave informative feedback. There was 
a 5-second postfeedback interval during which the 
subject studied the card she had just responded to. 
Then the experimenter turned to the next card and 
the sequence began again. This procedure continued 
until the subject achieved the learning criterion of 
16 consecutive correct responses, OF until she had 
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been through the list of 16 cards eight times, a total 
of 128 items. A starting place was picked at random 
from the four list beginnings for each subject. 

Ten subjects were assigned at random to each of 
the four experimental conditions. These pertained to 
the principle that the subject had to follow to 
classify correctly the 16 stimuli into two groups of 
8. 
Friendship. In this condition, all cards on which 
O liked Q were DOK and all cards on which O dis- 
liked Q were called ROZ. 

Agreement. In this condition, the subject had to 
say DOK to all cards on which O and Q agreed in 
their sentiment toward integration and ROZ to all 
cards on which O and Q did not agree in their opin- 
ion. 

Balance. In this condition, the subject had to say 
DOK to all cards which described balanced situa- 
tions and ROZ to all cards which described imbal- 
anced situations. 

These three rules were mutually orthogonal to each 
other. For instance, if a subject in the balance condi- 
tion had the hypothesis that friendship was the 
correct classification, she would be correct only half 
the time, that is, chance level. 

Random. The purpose of the random condition 
was to provide a rote-learning control group. If 
individuals are not capable of using the balance 
rule to classify social situations, then it should be 
just as easy to learn an arbitrary division of the 
cards into two groups of 8. To give some generality 
to the random condition, for each version there 
were two random divisions of the 16 cards into 8 
that were DOK and 8 that were ROZ. Two or 3 
subjects were assigned to each of the four sub-ver- 
sions of the random condition, a total of 10 sub- 
jects. The three experimenters who ran the subjects 
ran an approximately equal number of subjects in 
each condition. 


Results 


The first measure of performance was the 
number of persons who achieved the learning 
criterion. Table 1 shows that by this criterion, 
the balance rule appears to be more difficult 
to learn than both the rules of friendship and 
agreement and less difficult than the rote 
learning required of the random group, How- 
ever, none of the Fisher exact tests comparing 
balance and each of the other three groups 
was significant. 

The second measure of performance was 
the number of errors each individual made in 
reaching the solution criterion, or if they failed 
to reach it, in 128 items. These data violated 
all of the assumptions that are the basis of 
hypothesis testing by the F statistic. Exami- 
nation of the frequency distributions revealed 
that the parametric assumption of normality 
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TABLE 1 


NuMBER OF SUBJECTS ACHIEVING CRITERION AND 
MEDIAN AND MFAN CLASSIFICATION ERRORS 


Condition 
Measure 
Friendship} Agreement|} Balance | Random 

Number 

achieving 

criterion® 9 10 6 2- 
Median 5.50 4.50 27.50 47.50 
M 12.70 7.00 27.20 43.40 


an = 10 in each condition. 


was severely violated. The distributions were 
not symmetrical, nor even of the same non- 
normal shape. A significant Fmax test, Fmax 
(4,9) = 7.67, p < .05, revealed that the data 
failed to meet the parametric assumption of 
homogeneity of variance. 

Table 1 lists the median as the primary 
descriptive statistic in each condition, since 
its location is less affected by a skew than is 
the mean’s. The mean for each group is listed 
in parentheses; comparison of the two gives 
an indication of the type and degree of skew- 
ness in the several distributions. 

Randomization tests (e.g., Edginton, 1969, 
p. 95) were chosen as techniques for testing 
statistical hypotheses because their applica- 
tion requires only that subjects be randomly 
assigned to experimental treatments. 

For computational convenience of deter- 
mining probability values, the raw error scores 
were converted to ranks appropriate for the 
Kruskal-Wallis test and the Mann-Whitney 
U test. The Kruskal-Wallis test revealed that 
one can accept with high confidence the hy- 
pothesis that the treatments produced error 
scores of different ranks, H (3) = 18.43, p< 
-001. The balance rule produced in general 
higher ranked error scores than the agreement 
tule, U = 22, p < .05, two-tailed. The other 
two comparisons approached, but did not at- 
tain the conventional significance levels, The 
balance rule in general produced error scores 
of lower rank than the random condition, U 
= 24, p < .07, two-tailed, and of higher rank 
than the friendship condition U = 28, p < 

.15, two-tailed. 


Discussion 


The interpretation of the results must re- 
main tentative since two of the important 
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comparisons only approached the conven- 
tional significance level. Perhaps more con- 
clusive results would have been obtained if 
the individuals in the present study had been 
allowed a larger number of trials to work on 
the problem. The subjects in the random con- 
dition then might have persisted in their 
failure to learn the two stimulus groups by 
rote, thereby accumulating more errors and 
thus show a clear inferiority to the individu- 
als in the balance condition. 

That the balance condition was significantly 
inferior to the agreement condition and nearly 
significantly inferior to the friendship condi- 
tion may be interpreted as indicating that in 
their social experience, individuals seldom 
classify social situations according to the bal- 
ance principle. Another way of interpreting 
these findings is in terms of the number of 
relevant dimensions involved. For the balance 
rule, all three of the dimensions or sentences 
are relevant. To classify a stimulus as either 
balanced or imbalanced, an individual must 
take account of the sentiment that O holds 
for Q, and O’s attitude toward X, and also 
Q’s attitude toward X. For the classification 
into agreement and disagreement only two 
sentences are relevant: those stating O’s and 
Q’s respective attitudes toward X. Correct 
use of the friendship rule requires attention to 
only one dimension, the sentence describing 
the sentiment O holds for Q. Numerous stud- 
ies (see Bourne’s, 1966, review) have shown 
that the difficulty of classification problems 
increases directly as a function of the number 
of relevant dimensions. To compare the bal- 
ance principle with other rules that also in- 
volve three relevant dimensions would be a 
good method of determining whether balance 
has a special status as a principle for classi- 
fying social situations, 


EXPERIMENT 2 


Several procedural changes were introduced 
to increase the sensitivity of the experimental 
technique and thereby stabilize and clarify 
the findings of Experiment 1. 

The second purpose of this experiment was 
to examine the ease of learning the balance 
tule relative to other rules that also have 
three relevant dimensions. Shepard, Hovland, 
and Jenkins (1961) have analyzed the types 
of conceptual rules that divide into two 
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groups of four, a set of eight stimuli, Their 
analysis applies to a set of stimuli in a uni- 
verse of three orthogonal binary dimensions. 
These conditions are met by the stimuli of the 
present study. There are two values for each 
dimension (sentence); they are the affirma- 
tion and negation of the verb. The three 
dimensions are mutually orthogonal over the 
set of stimuli, since the eight social situations 
exhaust all possible combinations of verb 
values in sentences, Shepard et al. (1961) 
have shown that there are four types of con- 
ceptual rules that involve three relevant di- 
mensions. They have labeled them Types i, 
IV, V, and VI. It can be shown that the bal- 
ance rule as used in the present study cor- 
responds to Shepard et al.’s (1961) Type VI 
rule. 

The present study compared the balance 
rule to the other types of rules that also 
involve three relevant dimensions, Types TII, 
IV, and V. There are conflicting expectations 
concerning the ease of using the balance rule 
relative to the other three-dimensional rules. 
Heider’s (1958) balance theory implicitly as- 
sumes that individuals frequently classify 
social situations as either balanced or unbal- 
anced. Research (e.g., Haygood & Bourne, 
1965) has shown large positive transfer for 
individuals performing successive classifica- 
tion problems involving the same rule. This 
line of reasoning indicates that the balance 
rule would be easier to use in classifying social 
situations than rules of Types III, IV, and V. 
On the other hand, research on the classifica- 
tion of nonsocial stimuli (Davis & Bourne, 
1965; Shepard et al., 1961) leads to the op- 
posite prediction. In these studies, the Type 
VI rule was harder to use than rules of Types 
III, IV, or V. 


Method 
Subjects 


The subjects, mostly university students, received 
$2 for participating jn the hour-long session. Their 
ages ranged from 17 to 25 years. 


ere two instances of Heider’s 
eae The attitude object in this study was “draft 
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protests.” To avoid uncertainty concerning recipro- 
cation, the interpersonal sentiment stated in the first 
sentence was reciprocated, for example, “Doris and 
Jenny like each other.” 

One type of deck showed a different pair of com-. 
mon female names on each card and a second type 
of deck showed a different pair of common male 
names on each card. The name pairs had been ran- 
domly composed and assigned to situations, 

A second version of each deck was prepared by a 
second random assignment of the 16 name pairs to 
the social situations. Thus there were four decks in 
all, two with female names and two with male 
names. For each deck, five different random orders 
of the cards were entered in a ring binder. 


Procedure 


Each subject received classification instructions 
essentially similar to those used in Experiment 1. 
Male subjects learned about social situations re ti 
ing boys and females, about situations invoiv- 
ing girls. The postfeedback interval was 5 seconds, 
and 15 seconds were allowed before the experimenter 
prompted the subject for a response. The classifica- 
tion responses were “one” and “two.” The learning 
criterion was 16 consecutive correct responses. Each 
subject was allowed a maximum of 240 trials, three 
times through the five random orders assigned to 
them, to reach the solution criterion. 

There were 22 subjects, 14 females and 8 males, 
randomly assigned to each of seven experimental 
conditions, Within each sex, half of the subjects were 
given Version 1 of their stimulus materials and half 
were given Version 2. The two experimenters each 
ran 7 females and 4 males in each condition. 

As in Experiment 1, the experimental conditions 
pertained to the principle the subject had to follow 
in order to classify correctly the cards into two 
groups of eight. The Friendship, Agreement, and 
Balance conditions were retained from Experiment 1. 

Random condition. Each of the four sets of 16 
cards (two versions for each of two sexes) was 
divided into two groups of 8 by a different random 
partition, For each set one group was arbitrarily 
labeled “one,” the other “two.” 

The other three conditions were exemplars of 
Shepard ct al.’s (1961) rules of Types III, IV, and V. 

Type II. In this condition, class one included the 
4+—+, +m, —+—, and the ——— situations, and 
class two included the +++, ++—, —++, and the 
——-+ situations. This classification is orthogonal to 
the friendship, agreement, and balance classifications. 

Type IV. In this condition, class one included the 
4+——, —+—, ——+, and the ——— situations and 
class two included the +++, ++—, +—+, and the 
—+-+- situations. This classification overlaps 75% 
with both the classification produced by the friend- 
ship rule and that produced by the balance rule, but 
is orthogonal to the agreement rule, 

Type V. In this condition, class one included the 
+++, +, —+—, and the ——— situations, and 
class two included the +—+, ++—, —++, and 
——+ situations. This classification is orthogonal to 
both the classifications produced by use of the bal- 
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TABLE 2 
NUMBER OF SUBJECTS ACHIEVING CRITERION AND MEDIAN AND MEAN CLASSIFICATION ERRORS 


Condition 
Measure H 
Friendship Agreement Type III Type IV Type V (te anes 5 
X 
b hievii 

aes poche 22 22 17 21 20 19 
Median 4.00 3.50 45.50 29.00 42.00 16.00 
M 6.64 6.68 54.59 35.63 42.32 30.22 


Nolte, In the Random condition 4 of 22 individuals achieved criterion. 


an = 22 in each condition. 


ance rule and the friendship rule, but overlaps 75% 
with the classification produced by the agreement 
rule. 


Results 


Table 2 shows for each of the six rule con- 
ditions the number of individuals who achieved 
the criterion of 16 consecutive correct classi- 
fications. The number of individuals solving 
the classification was significantly greater in 
the balance condition (19) than in the random 
condition (4) xè (1) = 17.85, p < .001. This 
finding clearly indicates that individuals are 
capable of using the balance rule to classify 
social situations. For this reason the results 
from the random condition were excluded 
from further analyses. 

As in Experiment 1, the distributions of 
total errors per person in each condition se- 
verely violated the assumptions required for 
parametric techniques of analysis. There was 
severe heterogeneity of variance, Fmax (6, 21) 
= 15.92, p < .01. The distributions of errors 
were not of the same shape from condition to 
condition. Table 2 presents the median and 
mean number of errors for each of the six rule 
conditions. 

The balance rule produced in general more 
error scores of higher rank than the friendship 
rule, U= 82.5, z= 3.75, p<.001 (one- 
tailed), and the agreement rule, U = 80, z = 
3.81, p < .0001 (one-tailed). The results of 
this experiment confirm that the balance rule 
is harder to use than either the friendship or 
agreement rule. 

The comparisons of balance with Types III, 
IV, and V provide a control on the number 
of relevant dimensions, since each rule in- 
volves three relevant dimensions. The Krus- 
kal-Wallis test showed that these four treat- 


ments produced error scores of different ranks, 
H (3) = 13.61, p< .01. A U test revealed 
that the balance condition produced lower 
ranked error scores than all of the other three 
conditions (Types III, IV, & V) combined, 
U = 327, z = —3.85, p < .00001 (two- 
tailed). Additional U tests also revealed that 
the balance condition produced lower ranked 
error scores than each of the other three con- 
ditions considered separately. The difference 
between balance and Types III and V are 
highly significant, U = 105.5, 2 = —3.22, p 
<.002 (two-tailed), and U=138, z= 
—2.44, p < .02 (two-tailed), respectively. The 
difference between balance and Type IV just 
failed to reach the conventional significance 
level U=162, z= —1.88, p< .06 (two- 
tailed). 


GENERAL Discussion 


The results confirm the hypothesis that 
individuals are capable of classifying social 
situations according to the balance principle. 
Both studies showed that the balance rule was 
superior to classifying situations according to 
the rote learning required in the random con- 
dition. 

The results of Experiment 2 show that the 
balance rule is an easier rule for individuals 
to use than other rules that also involve three 
relevant dimensions, Types III, IV, and V2 
These results contrast sharply with results 
from studies of the classification of nonsocial 
stimuli. These studies (Davis & Bourne, 1965; 


1 The statistically borderline difference observed in 
the present study between balance and Type IV may 
be due to the fact that the Type IV classification 
has a 75% overlap with both the classifications in 
terms of balance and in terms of friendship. 
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Shepard et al., 1961) found that the Type VI 
classification (formally similar to the balance 
rule) was more difficult to learn than classi- 
fications of Types III, IV, or V. One inter- 
pretation of this pattern of results is that as 
far as nonsocial stimuli are concerned, indi- 
viduals seldom, if ever, classify them by the 
Type VI rule. In the case of social stimuli, 
however, individuals often classify them ac- 
cording to the balance Type VI principle. 
Therefore, this principle gains in availability 
due to positive intraproblem transfer. This 
interpretation is consistent with Heider’s 
contention that balance of situations is a sali- 
ent aspect of social situations. 

Two other interpretations are possible. One 
is that for social situations of the sort used 
in the present study, the balance rule is effec- 
tively a two-dimensional rule. The two dimen- 
sions are friendship and agreement. Individu- 
als may encode the two dimensions that give 
the opinion sentiments of the two persons into 
one dimension on which the values are “agree” 
and “disagree.” In support of this interpreta- 
tion, the results of both experiments showed 
that individuals were nearly equal in their 
proficiency at learning the friendship and 
agreement rules (see Tables 1 & 2), However, 
this finding may itself be the result of intra- 
problem transfer. Individuals may have fre- 
quently in their social experience classified 
situations in terms of whether the persons 
agree or disagree and thus the agreement rule 
has reached a point of ceiling availability due 
to positive intraproblem transfer. 

The materials of the present study and 
those of the Davis and Bourne ( 1965) and 
Shepard et al. (1961) studies differed in what 
may be an important respect. The materials of 
the present study presented stimuli in the 
form of three-sentence descriptions of social 
situations. In the earlier studies, the stimuli 
were patterns of figures and geometric forms. 
Perhaps the sentence format for classification 
stimuli gives the Type VI balance rule an 
advantage. With such stimuli the Type VI 
balance rule can be reduced to an odd-even 
rule: Put all stimuli with an odd number of 
positive verbs in one class, and all stimuli 
with an even number (including zero) of posi- 
tive verbs in another class. Perhaps if non- 


social stimuli were described in the sentence 
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format of the present study, Type VI would 
be easier than Types III, IV, and V. How- 
ever, there is some evidence that indicates 
that this is not the case. In their second ex- 
periment, Shepard et al. (1961) used stimuli 
that are formally similar to the sentence for- 
mat stimuli of the present study. In their 
experiment, the stimuli consisted of cards on 
each of which there was (from left to right) 
a triangle, a circle, and a square. These three 
forms and positions represented dimensions on 
which binary variation was achieved by mak- 
ing each form either large or small. The odd- 
even counting rule is possible with these stim- 
uli, for example, put the stimulus in one class 
if there is an odd number of large figures, and 
in the other class if there is an even number 
of large figures. However, the results also 
showed that Type VI was a more difficult 
classification to learn than either Types III, 
IV, or V. 

Ts the balance rule a highly salient rule for 
classifying social situations? The results indi- 
cate that it is not as available as the friend- 
ship and agreement rules. These findings may 
be a reflection of the greater dimensional 
complexity of the balance rule. Balance in- 
volves three relevant dimensions, while friend- 
ship involves only one and agreement just two 
relevant dimensions. 

However, there is reason to believe that the 
number of relevant dimensions is not the sole 
determinant of rule availability in the classifi- 
cation of social situations, Subjects in both 
the present studies used the friendship rule 
and the agreement rule with equal ease, de- 
spite the fact that friendship involves only 
one, but agreement involves two, relevant di- 
mensions. 

Some recent findings by Gutman and Knox 
(1972) suggest another interpretation of the 
inferiority of balance to friendship and agree- 
ment. They had subjects rate the eight triadic 
P O X situations considered by Heider (1958) 
for pleasantness, tension, and consistency. 
They found that balance in the situation is a 
good predictor of rated consistency, but a 
poor predictor of rated tension and rated 
pleasantness. Both friendship and agreement 
are good predictors of rated tension and rated 
pleasantness, but are poor predictors of rated 
consistency. Especially pertinent to the pres- 
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ent study they found (Gutman & Knox, 1972, 
Table 4, p. 355) interesting differences in the 
dispersions of the three kinds of ratings. The 
variance of the mean ratings of the eight tri- 
adic social situations was 4.12 for rated pleas- 
antness, 4.25 for rated tension, and only 1.85 
for rated consistency. These data suggest that 
the eight triadic social situations are much 
more readily discriminable and differentiable 
in terms of perceived pleasantness and tension 
than in terms of perceived consistency. Ac- 
cordingly, the conceptual rules of agreement 
and friendship should be more available than 
the balance rule since these rules are strongly 
associated with salient properties of the situ- 
ations, that is, perceived pleasantness and ten- 
sion, while the balance rule is not. 

The findings of these experiments support 
the hypotheses of this investigator: (a) Indi- 
viduals are capable of using the balance rule 
to classify triadic social structures; (b) The 
balance rule is more available than all other 
rules of comparable dimensional complexity. 
These findings corroborate Heider’s belief 
that individuals frequently classify social 
situations as either balanced or imbalanced. 

Perhaps the methodology of conceptual be- 
havior will be as useful in the further study 
of social cognition as paired-associates learn- 
ing methods, for example, De Soto (1960), 
have been. In support of their use, we may 
note that the present study has suggested two 
reasons why balance, though available, is not 
the most salient aspect of perceived social 
situations. Other rules, that is, friendship and 
agreement, have fewer relevant dimensions 


and may have more positive transfer due to 
more frequent prior use than balance. These 
possibilities were not uncovered by years of 
research on the phenomenal properties of bal- 
ance and imbalance using rating methods, 
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The Facilitation of Aggression by Aggression: 
Evidence Against the Catharsis Hypothesis 


Russell G. Geen, David Stonner, and Gary L. Shope 
University of Missouri—Columbia 


Ninety male subjects were either attacked or treated in a more neutral manner 
by a male confederate. On a subsequent maze-learning task, one third of the 
subjects shocked the confederate, one third observed as the experimenter 
shocked the confederate, and one third waited for a period of time during 
which the confederate was not shocked, Finally, all subjects shocked the 
confederate as part of a code-learning task. Subjects who had been attacked 
and had shocked the confederate during the maze task delivered shocks of 
greater intensity on the code task than did subjects in the other two condi- 
tions, and the former subjects also experienced a greater reduction in diastolic 
blood pressure than did the latter. The results contradict the hypothesis of 
aggression catharsis and are discussed in terms of feelings of restraint against 


aggressing that a subject experiences after committing an aggressive act. 


The idea of aggression catharsis is popular 
both in conventional wisdom and in psycho- 
logical writing. Certainly it seems to make 
sense that attacking an enemy allows the 
attacker to purge himself of hostility and 
anger toward his victim. Furthermore, in both 
psychoanalytic theory (e.g., Schafer, 1970) 
and the frustration-aggression hypothesis 
(Dollard, Doob, Miller, Mowrer, & Sears, 
1939), the notion is expressed that aggression 
leads to a release of aggressive drive or insti- 
gation, so that further acts of violence are 
rendered less probable. This point of view 
stipulates a fairly simple cause-effect model: 
instigation produces aggressive arousal, which 
motivates an aggressive response, which in 
turn reduces arousal. The end result is a de- 
clining level of aggression. 

Largely because of this cause-effect assump- 
tion, however, the catharsis hypothesis has 
encountered troubles on empirical, methodo- 
logical, and theoretical levels. Although some 
investigators have reported results that sup- 
port the notion of catharsis (e.g., Doob, 1970; 
Feshbach, 1955; Thibaut & Coules, 1952), 
many others have made findings that contra- 
dict it (e.g, deCharms & Wilkins, 1963; 
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Freeman, 1962; Kahn, 1966; Mallick & Mc- 
Candless, 1966). Still other studies not: di- 
rectly concerned with catharsis have reported 
a tendency for subjects to increase progres- 
sively the intensity of electric shocks admin- 
istered to an antagonist over a series of trials 
(Buss, 1966b; Geen, 1968). One reason for 
the lack of conclusiveness that characterizes 
data on aggression catharsis is that reductions 
in aggression may be caused by more than 
just lowered aggressive instigation. If the 
experimental situation is structured in such a 
way, for example, that aggressive behavior 
produces feelings of guilt or anxiety in the 
subject, intensity of aggression may decline 
not because of reduced instigation but be- 
cause of increased restraints (e.g, Baron, 
1971; Geen, 1970). Perhaps subtle differences 
in experimental conditions in some of the 
studies cited above led to varying degrees of 
inhibitions against aggressing, thus producing 
the conflicting results. Finally, the catharsis 
hypothesis can be questioned on theoretical 
grounds: Because reduction in aggressive 
arousal may actually represent the termina- 
tion of an aversive state, it is not at all obvi- 
ous why such a reduction should reduce the 
probability of occurrence of the aggressive 
response that precedes it. On the contrary, it 
can be argued that lowered aggressive arousal 
may actually reinforce aggression (Berko- 
witz, 1970). 
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Some of the matters discussed here may 
have been involved in a study by Doob and 
Wood (1972), which was reported while the 
present experiment was in progress. In their 
experiment, Doob and Wood found apparent 
evidence for aggression catharsis: Angry sub- 
jects who aggressed against an antagonist 
twice were less punitive in their second at- 
tack than similarly angered subjects who 
aggressed only once. A slight tendency toward 
reduced aggression was also observed among 
subjects who, prior to their own aggressing, 
had witnessed the experimenter attacking their 
adversary. Although Doob and Wood did not 
present their findings as clear-cut evidence 
for catharsis and suggested that the results 
could have been due to a process of restora- 
tion of equity, they did not mention as a 
possible explanation increased restraints 
against aggressing in the groups showing 
reduced aggression. Their procedures con- 
tained two features that could have led to 
strong inhibitions against aggressing following 
aggressive responses, however. First, the con- 
federate who sustained the subject’s attacks 
was always a female, and the subjects were 
both male and female. Buss (1966a) has 
shown that both male and female subjects, 
and especially females, tend to reduce the 
intensity of attacks on a female victim when 
they suspect that they have hurt her. Doob 
and Wood’s subjects may have believed that 
they were causing some pain to the confed- 
erate and thus held back in later attacks on 
her. Second, the confederate aroused the sub- 
ject to attack through verbal badgering and 
insult, whereas the subject’s mode of attack- 
ing the confederate was at all times electric 
shock. It is not difficult to imagine the subject 
concluding that the confederate had more than 
paid for her insults by being shocked one 
time and thus holding back on hurting her 
further. The subject may have still harbored 
considerable resentment toward the antag- 
onist but may have been restrained from 
aggressing by a fear of guilt over further 
unwarranted aggression. The experimental 
situation was therefore structured in such a 
way that aggression could have produced 
stimuli that inhibited subsequent aggression 
by the subject. Finally, the Doob and Wood 
study did not include any measure of arousal, 
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which could have shed some light on the state 
of aggressive drive characteristic of the sub- 
ject before and after aggression. 

The present experiment, which was de 
signed and begun before the appearance of 
the Doob and Wood (1972) article, adventi- 
tiously avoided some of the potential source 
of inhibition implicit in that study. Subjeci: 
were all males, as was the confederate. Bus 
(1966a) has shown that males do not show 
marked decreases in aggression against other 
males over a series of shocks, even when they 
have evidence that the shocks are causing 
suffering. In addition, subjects were aroused 
to aggression by electric shocks and later 
retaliated with shocks. Finally, systolic and 
diastolic blood pressure were measured at 
several points during the session. Otherwise 
the study was similar to the Doob-Wood 
experiment. The subject was either attacked 
or not attacked by a confederate; he then 
either attacked the confederate himself, ob- 
served the confederate being attacked by the 
experimenter, or waited for a period of time 
during which the confederate was not at- 
tacked. Finally subjects in every condition 
attacked the confederate. 


METHOD 
Subjects 


Subjects were 90 males recruited from an intro- 
ductory psychology class, with each student receiving 
a point toward his course grade for participating. 
Subjects were randomly assigned to the six treatment 
conditions, f 


Procedure 


The experimenter described the experiment to the 
subject, and to a confederate who posed as a sub- 
ject, as one on the effects of punishment on learn- 
ing. It was explained that one of the men, to be 
selected by chance, would perform on two types of 
learning problem, one involving maze learning and 
the other deciphering of a code, with threat of pun- 
ishment as a motive in each case. By means of a 
rigged lottery, the confederate was designated as the 
one to work on the learning problem under threat 
of punishment. The two men were thereupon iso- 
lated from each other in separate booths which 
allowed auditory, but not visual, communication. 
The subject’s booth contained the panel of an aggres- 
sion machine, with 10 shock buttons. The experi- 
menter then said that before the learning problems 
could begin, it was necessary to discover the degree 
of similarity that existed between the “learner” and 
the other person (the subject), who would at some 
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soints in the experiment play the role of a teacher; 
this was justified by the assertion that similarity 
between teacher and learner is an important vari- 
able in the learning process. The first measure- 
ment of blood pressure was made just after these 
instructions had been given. 

The subject was then given a set of 12 cards, each 
bearing an attitude statement on some controversial 
issue. The subject was instructed to read the state- 
ment aloud and state whether he agreed or disagreed 
with it. The confederate was instructed to express 
agreement with the subject by saying “I agree” and 
to register disagreement by giving the subject an 
electric shock. The rationale for this treatment was 
that the effects of disagreement between people are 
most acute when the situation is a highly stressful 
one, such as threat of shock would produce. The 
confederate then proceeded to give the subject either 
(a) two mild shocks following 2 of the statements 
and no shocks after the other 10 statements (no 
attack condition) or (b) fairly strong shocks follow- 
ing 8 statements, two mild ones after 2 other state- 
ments, and no shocks after the remaining 2 state- 
ments (attack condition). Previous studies (e.g., 
Geen & Stonner, 1973) have shown that these treat- 
ments arouse reliably different levels of anger in 
subjects. Blood pressure was again measured just 
after the shocks had been given. 

The experimenter then announced that the first 
task, maze learning, would begin and called the 
attention of the subject and the confederate to a 
console in the room visible from both cubicles. 
Among a panel of lights on this console were a red 
one, a green one, and a white one. The experimenter 
explained that the onset of the white light signaled 
the beginning of a trial. Each trial was said to 
consist of a run through an electronically simulated 
maze by the learner designate (the confederate), 
who would express his decision at each of the choice 
points by pressing one of two switches. Jf the 
learner made a correct choice, the green light on 
the console would go on, but if he made an error 
the red one would go on. Two thirds of the subjects 
were told that the red light signaled that the learner 
was to receive a shock, after which he would go on 
to the next choice point. The remaining third were 
told nothing about shocks (and thus comprised the 
no shocks condition). Of those subjects who were 
informed that a shock was to be given for each 
choice-point error, half were told that the experi- 
menter would give the shock (experimenter shocks 
condition), whereas the other half were instructed 
to deliver @ constant-intensity shock themselves 
(subject shocks condition). The subject was told to 
give shocks by pressing the number 5 button on 
his panel. Ten trials were run, in which the consed. 
ate made 20 errors according to a prearrange 
schedule. A third measurement of blood pressure 


followed. 


The experimenter then explained that the second 


and final learning task would begin. Again, the con- 


federate was given @ learning task, this one consist- 


ing of translating an alphabetical code into numerals 


123 


according to an abstract principle which the learner 
was required to discover. The subject was instructed 
to present trigrams from a list by pressing combina- 
tions of switches from among five on his panel 
labeled A through E. The subject was informed that 
he would learn only whether the learner had cor- 
rectly encoded the information or whether he had 
made an error. A correct response was said to govern 
the onset of a green light on the subject’s panel 
and an error to control a red light on the same 
panel. The subject was instructed to give the con- 
federate a shock of any intensity from 1 to 10 by 
pressing the appropriate button on his panel; he 
was told that the intensity of the shock increased 
across the series of buttons. The confederate then 
proceeded to make 10 errors over a series of 16 trials, 
following a standard schedule. The intensity and 
duration of each shock supposedly given to the con- 
federate were measured electronically. The fourth 
and final measurement of blood pressure was made 
just after the shock series had ended. 

After completing the fourth measurement of blood 
pressure, the experimenter asked the subject and the 
confederate to fill out a short questionnaire on which 
they could express their overall impressions of the 
experiment. The questionnaire consisted of several 
ratings scales on which were described (a) the sub- 
ject’s mood—how happy, angry, excited, and anxious 
he felt; (b) how much he liked the confederate; 
(c) how painful he thought the shocks were that he 
had received; and (d) how much he had held back 
and restrained himself in shocking the confederate on 
the code-learning task. Each scale consisted of a 
continuous 100-mm line which the subject checked. 
Finally the experimenter interviewed each subject 
in order to discover whether the deceptions had 
been successful. A total of 11 subjects were able 
to at least approximately verbalize the true nature 
of the study. These subjects were eliminated from 
analysis and replaced by other subjects randomly 
drawn from the same population, The true nature 
of the experiment was fully discussed by the 
experimenter and the subject at the end of each 
session. 


RESULTS 
Effectiveness of Attack 


Ratings of anger made by the subject just 
after being shocked by the confederate were 
used as the basis for inferring the effective- 
ness of the attack manipulation. Subjects who 
had received 10 shocks reported a mean anger 
rating of 55.1 (on a 100-mm scale on which 
O represented maximum anger and 100 repre- 
sented minimum anger), whereas subjects 
who had received 2 shocks reported a mean 
rating of 78.3. This difference was significant, 
F(1, 88) = 13.88, p< 01. The treatment 
used to induce different levels of instigation 
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TABLE 1 


MEAN INTENSITIES OF SHOCKS GIVEN BY SUBJECTS 
ON THE CopE-LEARNING TASK 


Treatment of subject by 


Treatment of confederate on confederate 
code-learning task 
Attack No attack 
Subject shocks 6.655 3.92be 
Experimenter shocks 4.136. 3.62. 
No shocks 5.20, 3.20, 


Note. Cells having common subscripts are not significantly 
different at the .05 level by a Duncan multiple-range test. 


to aggression was therefore judged to be 
successful. 


Shock Intensity 


' The mean intensities of shocks given by 
subjects to the confederate during the code- 
learning task are reported in Table 1. An 
analysis of variance showed main effects for 
the attack variable, F(1,84) = 26.10, p< 
,001, and the variable regarding treatment of 
the confederate during the maze-learning task, 
F (2,84) = 5.93, p< .01. The latter of the 
two main effects was due primarily to an in- 
teraction between the two variables, F(2, 84) 
= 3.56, p < .05. Comparisons among means 
further reveal that subjects in the attack—sub- 
ject shocks condition gave stronger shocks 
than did those in both the attack-no shocks 
and the attack-experimenter shocks conditions. 


Blood Pressure 


Data regarding systolic and diastolic blood 
pressure were analyzed separately. Although 
systolic pressure failed to manifest any dif- 
ferences across conditions, treatment effects 
on diastolic pressure were found. The data 
were analyzed as a set of three change scores 
per subject: (a) from baseline to just after 
the attack-no attack treatment, (b) from 
that treatment to just after the first attack 
on the confederate by the subject or experi- 
menter (or the absence of such an attack) , 
and (c) from that treatment to just after the 
subject himself attacked. These are labeled, 
respectively, diz, dəs, and dg, in Table 2, 
which shows the relevant change scores. 

No differences were found across the five 
conditions in baseline diastolic pressure, 
F(5, 84) < 1.00. Analysis of the die scores 
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indicated that subjects in the attack condition 
experienced a greater elevation in diastolic 
pressure (in millimeters of mercury, Hg) than 
those in the no attack condition (5.92 mm 
Hg vs. —1.67 mm Hg; F(1, 84) = 29.12, 
< .01. The analysis of the dəs scores re- 
vealed that subjects in the attack~subject 
shocks condition experienced a decrease in 
diastolic pressure to slightly below baseline 
level, while those in the attack—experimenter 
shocks condition manifested a slight increase, 
and those in the attack-no shocks condition 
revealed a slight decrease. Subjects in the no 
attack condition showed a moderate decrease 
in blood pressure. These data produced a sig- 


nificant interaction between the two variables, 
F(2,84) = 6.82, p < .01. Analysis of the 
dg4 scores showed main effects for | the 


attack, F(1, 84) = 10.33, p < .01, an its 
ment of confederate, F (2, 84) = 3. 05; 
variables. Both effects are due primarily to 
the fact that subjects in the attack experi- 
menter shocks condition experienced a greater 
drop in diastolic pressure than subject ny 
other condition. 


Questionnaire Data 

Two of the items on the final question- 
naire yielded significant differences across 
TABLE 2 


Mean DIASTOLIC BLOOD PRESSURE CHANGES FOR THE 
Six TREATMENT CONDITIONS 


Mean diastolic press 
Treatment condition 


dia dab | 
af 
Attack-subject shocks | +6.66, | —8.80, | 204, 
Attack-experimenter | 
shocks | 
Attack-no shocks —1.30 | =í 
No attack-subject | 
shocks —3.27, | —1.87b. | OF» 
No attack- | 
experimenter shocks | — 1.83 | — 1.80s | —3.53 
No attack-no shocks +.10s | —6.60a» —.07, 
Note, Cells having common subscripts within time periods 


are not significantly different at the .05 level by a Duncan 
multiple-range test. 

Refers to the change in diastolic pressure during the time 
period from baseline to just after the attack-no attack 
treatment, 

» Refers to the change in diastolic pressure during the time 
period from the attack-no attack treatment to just after the 

rst attack on the confederate by the subject or experimenter 
(or the absence of such an attack). 

¢ Refers to the change in diastolic pressure during the time 
period from just after the first attack on the confederate by 
the subject or experimenter (or the absence of such an attack) 
to just after the subject himself attacked. 
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conditions.t One asked the subject the follow- 
ing question: “On the basis of what you 
know about the other subject in the experi- 
ment, what are your general feelings about 
him?” The subject replied to this item by 
marking a 100-mm «ne anchored at the low 
end by the statement “I liked him very 
much” and at the high end by the statement 
“T didn’t like him at all.” The results showed 
that subjects in the attack-subject shocks 
condition expressed significantly greater dis- 
like for the confederate than did those in the 
attack-experimenter shocks condition, 45.8 vs. 
32.3; #(18) = 2.73, p< .05, and slightly 
more dislike than those in the attack—no 
shocks condition, 45.8 vs. 36.1; #(18) = 1.98, 
p < 10. Thus, in verbally expressed hostility, 
as in physical aggression, subjects were more 
intense after aggressing than after not ag- 
gressing. In response to the questionnaire item 
“How much did you hold back in shocking 
the other subject during the code-learning 
task,” subjects again marked a 100-mm line 


“extending from the statement “I held back a 


lot” (at the low end) to “I didn’t hold back 
at all” (at the high end). The results indi- 
cated that subjects in the attack-subject 
shocks condition reported feeling less re- 
strained than those in both the attack-experi- 
menter shocks, 70.13 vs. 29.20; #(18) = 4.93, 
p<.01, and attack-no shocks, 70.13 vs. 
48.21; £(18) = 3.22, p<.01, conditions. 
This indicates that the attack-subject shocks 


“condition, relative to the other two attack 


conditions, lowered inhibitions against ag- 
gression in the subjects. 


Discussion 


The findings of the present experiment 
provide no support for the hypothesis of ag- 
gression catharsis and suggest instead that 
when an experimental situation is arranged to 
minimize restraints against aggression, the 
opposite of catharsis occurs. The study also 
shows that aggressive behavior was followed 
by a reduction in blood pressure for all three 
groups of previously attacked subjects: Men 
in the subject shocks condition experienced 


1 None of the other scales from the final question- 
naire yielded significant differences across conditions. 
These scales are not discussed further. 


725 


decreased pressure after the maze task in 
which they aggressed, whereas men in the 
experimenter shocks and no shocks conditions 
experienced decreased pressure following the 
learning task, in which they aggressed for the 
first time. The finding that reduced arousal in 
the subjects of the attack-subject shocks 
condition was in turn followed by a high level 
of aggression rather than a low level casts 
doubt on the simple cause-effect assumption 
mentioned earlier in this article. Activity that 
we may call “physiological catharsis” does 
follow aggression, as previous reports have 
shown (e.g., Hokanson & Burgess, 1962), but 
this cardiovascular change is associated with 
relatively strong aggressiveness. The results 
reported here are consistent with those of 
Kahn (1966), who showed that verbal ag- 
gression was followed by both reduced arterial 
pressure and a high degree of verbal hostility. 
Our findings, like Kahn’s, suggest that future 
studies of aggression catharsis should include 
measures of both behavioral aggression and 
physiological arousal, since we cannot simply 
assume that decreased arousal brings about a 
lowered instigation to aggress. 

Two explanations of the findings of the 
present study seem plausible. One is that 
aggression, by lowering arousal, causes a per- 
son to feel relatively at ease. The person may 
interpret his relaxed condition as being due 
to the aggression he has just committed and 
may therefore experience a weakening of 
socialized restraints against further violence. 
Our data support this interpretation by show- 
ing that aggression is followed by a lowering 
of motivation to hold back in shocking. The 
other plausible explanation of our data is that 
subjects in the attack-subject shocks condi- 
tion, having aggressed once against the con- 
federate on the maze-learning task, felt com- 
pelled to behave consistently by continuing to 
attack him on the subsequent code-learning 
phase of the experiment. Doob and Wood 
(1972) have proposed that such consistency- 
seeking behavior is enhanced by a high degree 
of similarity between the first occasion for 
aggression and the second, such as similarity 
in the means of aggression used by the sub- 
ject. Thus, the fact that the subject used 
electric shock to attack the confederate on 
both occasions in the present study could 
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have facilitated the subject’s need to be 
equally violent in both. The data from this 
experiment also showed that men in the 
attack-subject shocks condition tended to 
express greater verbal hostility on the final 
questionnaire than did those in the other two 
attack groups. This, too, may indicate that 
subjects acted out of a need to maintain con- 
sistency. Perhaps committing violence against 
another person creates a feeling of need to 
Justify the violent act, and one means of 
justification is derogation of the victim (cf. 
Glass, 1964). The data from this experiment 
do not allow us to test directly whether the 
subject’s need for consistency underlies the 
failure to find evidence of catharsis, however, 
and this possibility remains an intriguing 
subject for future research, 
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Information Properties of Descriptive Words 


Robert K. Gifford 
Yale University 


Two experiments examined some of the dimensions of information conveyed 
by descriptive words. In the first experiment, subjects rated descriptive terms 
on each of 15 scales. The terms rated were either personality traits or life 
conditions. For each of these types of descriptive terms, the intercorrelations 
among scale ratings were factor analyzed. In each case, a five-factor solution 
yielded an interpretable set of dimensions which represented information 
properties of the rated terms. In the second experiment, subjects made judg- 


ments about stimulus persons describ: 


ed by terms that had been rated in 


Experiment 1, These terms’ factor scores varied along one of the trait dimen- 
sions from Experiment 1 or (for life conditions) along two of the life condi- 
tion dimensions. The results demonstrated that the kinds of judgments made 
about a person were related to the factor scores of the terms used to describe 


him. 


Verbal descriptions of a person are often an 
important source of information for people 
making judgments about that person. For 
example, people form impressions of public 
figures entirely on the basis of descriptions 
found in the mass media, and selection agen- 
cies such as college admissions committees 
frequently make important judgments using 
only verbal descriptions. Even when a person 
is available for observation, descriptive lan- 
guage can influence analysis of that observa- 
tion. Both Kelley (1950) and Bond (1970) 
have found that prior labeling influenced 
subjects’ reactions to a stimulus person. 

The words used to describe a person have 
a variety of information properties in addition 
to (though not independent of) their denota- 
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tive meanings. Schneider (Note 1) has used 
a factor analytic approach to the study of 
information properties of English trait words. 
Schneider’s subjects rated 44 trait words on 
several bipolar scales, such as “deep-seated— 
peripheral,” and “ability-not ability.” The 
ratings of the trait words were averaged over 
subjects, and the intercorrelations among 
scales were factor analyzed. Schneider re- 
ported four interpretable factors: a disposi- 
tional factor, a nonvolitional factor, a behav- 
ioral quality factor, and an ability factor. It 
seems possible that these dimensions represent 
a pattern of information organization that is 
relevant to how people use and interpret trait 
information. The present study examined this 
question in two ways: first, by attempting to 
replicate Schneider’s results using a larger 
selection of trait words, and second, by show- 
ing that a word’s position on an information 
dimension can be used to predict judgments 
made about a person described by the word, 

In addition, the present study examined 
information properties of words that might be 
used to describe a person, but that are 
not what would generally be thought of as 
personality traits. For example, the words 
“unemployed,” “wealthy,” and “overweight” 
all convey information about a person, but 
are not personality traits in the classical 
sense. Such terms describe a person’s life con- 
ditions rather than his personality. It seems 
certain that such nonpersonality aspects ofa 
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person are important in determining how 
others see him (Rodin, 1972; Schneider, 
1973). The present study compared the in- 
formation properties of life conditions to 
those of traits. 

The present research was conducted in two 
stages. In the first stage, subjects rated de- 
scriptive terms—either traits or life condi- 
tions—on a variety of information properties. 
The interscale correlation matrices for both 
traits and life conditions were then factor 
analyzed. The purpose of this analysis was to 
determine whether Schneider’s (Note 1) re- 
sults could be replicated and also whether the 
factor structure for life conditions resembled 
that for personality traits. The second stage 
of the research was an attempt to demon- 
strate the meaningfulness of a limited subset 
of the dimensions derived from the factor 
analytic study by showing that information 
properties of the words are relevant in an 
interpersonal judgment situation, The dimen- 
sions used in the judgment task were selected 
from those found in the first study on the 
basis of their interpretability and probable 
relevance to interpersonal judgment. 


EXPERIMENT 1 


Method 
Subjects 


Fifty-five female students at a state college in 
the New Haven, Connecticut, area were paid for 
participation in a 2-hour experiment. 


Materials and Procedure 


Each subject received one of two lists of descrip- 
tive terms (both are in Gifford, 1973). Thirty sub- 
jects were given a list which contained 100 words 
denoting personal traits, The trait words used in- 
cluded the 44 words used by Schneider (Note 1) and 
56 used in studies by Anderson (1968) and 
Rosenberg, Nelson, and Vivekananthan (1968). The 
other 25 subjects were given a list of 89 words or 
phrases that describe life conditions, including the 
examples given above and a variety of other terms 
such as “unmarried,” “has high medical bills,” 
“happy in his job” and “new in town.” 

Subjects were asked to rate each of the words or 
phrases on each of 15 rating scales, including the 10 
scales used by Schneider (Note 1). The rating scales 
included how ability-like the characteristic is, how 
much it is affected by trying (motivation), how 
pervasive it is, how much it tells one about the 
person, how stable it is over time, how stable over 

situations, how deep-seated it is, how behavioral, 
objective, socially desirable, personally desirable to 
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the subject, how much it influences one’s actio: 
toward someone who has the trait, how common 
it is among people in general, how much it de- 
scribes the subject’s friends, and how much it de- 
scribes the subject herself. Ratings were on a scale 
from 1 to 9. 

The data from the ratings of trait words and 
from ratings of life conditions were analyzed sepa- 
rately, but in an identical manner. The rat o 
each word or phrase on each scale were a ed 
over subjects. These mean ratings were th 1 
to calculate the matrix of intercorrelations 
the 15 rating scales. The correlation matrix 
factor analyzed and rotated orthogonally accord 
to the varimax method. The procedure of {actor 


analyzing intercorrelations among rating ales 
treated each rating scale as a variable in the or 
analysis and the mean rating of each word an 
observation on the variable. Consequently, the ilt- 
ing factor structure represented the average s 


of the groups rather than the ratings of an) 
individual. 


Results 


Given an unrotated factor matrix wil: 15 
factors, it is possible to rotate any nuniber 
of factors from 1 to 15. The choice of h 
solution to accept must be somewha‘ ji- 
trary. In the present study, the 5-facto u- 
tions for both traits and life conditions wire 
chosen because these solutions accounted 
large portions of the variance, with 
being relatively little increment in t 
amount of variance accounted for when more 
than 5 factors were rotated. The 5-factor 
solution for trait ratings accounted for 89.4% 
of the variance, and the 5-factor solut 
life condition ratings accounted for 86. 
the variance. The factor loadings of the 
rating scales on the rotated factors : 


Table 1. The rotated factor loadings fo ihe 
life conditions are in Table 2. 
The first factor for trait ratings, accou“ ting 


for 29.1% of the variance, was an eval. itive 
factor. Both the “social desirability” and 
“personal desirability” scales loaded highly 
on this factor, as did the scales for how much 
the term described the subjects’ friends 
and the subject herself. In addition, two 
scales describing socially desirable properties, 
“ability-like” and “affected by effort,’ both 
loaded highly on this factor. This evaluative 
factor appears to be equivalent to Schneider’s 
ability factor. The second factor, which ac- 
counted for 27.4% of the variance, was one 
of how stable or dispositional the informa- 
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tion conveyed by the word was. The scales TABLE 2 

loading highly on this factor were “stability Lire CONDITION RATINGS 
over time,” “stability over situations,” ‘“in- 
formative about a person,” “deep-seated,” and 


Factor loadings of scales 
Scale sè a 


“pervasive.” This factor is equivalent to ; 
Schneider’s dispositionality factor. The third pS a TS X 
factor, accounting for 13.1% of the variance, Pervasive 91) 11] .20 09 
can be called a familiarity factor, since it Behavioral 90| 16) 10 16 
Pe Influences 
appeared to be one of how familiar the char- behavior 87| .19| -06 220. 
acteristic described was. The scales with the omava g AS g S 
. . . $ eep-seater fe te ` le 
highest loadings on this factor were “describes ‘Abikty-like "341 '86| 00 20 
people in general,” “describes the subject,” Social 
and “describes subjects’ friends.” The fourth aaa 09) 84) 24 — 34 
factor, accounting for 10.5% of the variance, desirability 42) 84) 29| —.09 | —.32 
can be called a behavior influence factor be- Influenced by oe os) orl 2a 
x) . We PEN £9 


cause it had the “influences actions” scale Hens e S 48 


loading ‘most highly on it, with the “behav- Describessubject) -12 
ioral” scale also loading fairly stronaly Lhe Sete people | .00 
Stable over time 241 


fifth factor, accounting for 9.4% of the vari- Stable over 


ance, had the scales for how subjective or i tenan 
i H H s £ ubjective— 
objective the term 1s and how much it is objective Larl 05} —.10| —.11 | —.89 


manifested in behavior loading highly on it. Percentage 
This objectivity factor appeared to have to of yanen e \235 lisa |1225 90 
do with how one gets information about the TE EEEa Aea R ive tli Lee 
presence or absence of the given trait, and 


is similar to Schneider’s behavioral quality 


The factor structure for the ratings of in- 


factor. 
T formation properties of life conditions had a 
TABLE 1 somewhat different pattern of high loadings. 
‘Trait RATINGS The first factor, accounting for 26.5% of the 


et RE! actor loadings of écales variance, appeared to be a centrality factor, 
with the scales for “informativeness,” “deep- 


Scale ES 
I u m | Iv v seatedness,” “influencing actions toward an- 
other,” «pehaviorality,” and “pervasiveness” 
LOU ai Beha (ane een ou toate highly on it. This factor was called 
Tafuenced by ; ` a centrality factor rather than a dispositional- 
effort _24| 3| 13/16 ity factor because the scales for “stability 


= 49 | —103 | —.08 | —.07 ; sy trent ; 
a over time” and “stability over situations” did 


Apility-like 
not have high loadings on this factor, unlike 


oclal 
desirability —AL 

eS man = ay the case for the dispositionality factor in the 
esariben an on —'87 trait-ratings structure. The second factor, 


Stable over time 
Stable over 
situations 
Æ Informative 
Deep-seated 
Pervasive 


accounting for 23.57 of the variance, was an 
evaluative factor. The scales with high load- 
ings on this factor were “social desirability,” 
“personal desirability,” “ability-like” and 


Deserts Per mee at «influenced by effort.” The third factor ac- 
behavior 09|—07| .02| .94|—15 counted for 15.49% of the variance and was 
Subjective —y9|—a1| 06) 12) —-79 similar to the familiarity factor (IIT) of the 
E EA S 15) 38 T68 trait-rating structure. The scales with the 
Percentage highest loadings were “describes the subject’s 


94 friends,” “describes the subject,’ and “de- 
, 


of variance 29.1 |274 13.1 | 10.5 : 
scribes people in genera ” The fourth factor 


accounted for 
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was a stability factor, with both “stable over 
time” and “stable over situations” having 
high loadings. This factor accounted for 
12.2% of the variance. Only one scale, 
“subjective-objective,” had a particularly 
high loading on the fifth factor, which 
accounted for 9.0% of the variance. 

In order to examine the degree of corre- 
spondence between the factor structures for 
trait ratings and life condition ratings, the 
two sets of factor loadings were converted to 
Fisher’s Z scores and correlated. Each trait- 
rating factor correlated significantly with one 
life condition factor. The two evaluative 
factors, trait Factor I and life condition Fac- 
tor II, had a correlation of —.50 (p < .05). 
The trait dispositionality factor (Factor II) 
and the life condition stability factor (Factor 
IV), both of which had high loadings for the 
stability scales, had a correlation of 73 
(Ż < .01). The correlation between the two 
familiarity factors was .63 (p < 025). The 
behavior influence factor for traits (Factor 
IIT) had a correlation of .66 ($ < .01) with 
the life condition centrality factor (Factor I). 
The two fifth factors were both objectivity 
factors and had a correlation of .58 (p < .05). 


Discussion 


The factor structure for ratings of informa- 
tion properties of traits resembled the one 
obtained by Schneider (1971). Since there 
were more rating scales in the present study, 
and a five-factor solution was used rather 
than a four-factor solution, the two structures 
were not identical. However, the pattern of 
loadings of the scales used in both studies 
was the same. 

The structures for ratings of traits and life 
conditions in the present study had much in 
common, but there was one major difference. 
In the trait Structure, the “stability over 
time” and “stability over situations” scales 
loaded highly on the dispositionality factor 
(Factor II) along with the “informative,” 
“deep-seated,” and “pervasive” scales. In the 
life condition Structure, on the other hand, 
the latter three scales loaded on the centrality 
factor (Factor 1) along with “influences 
actions” and “behavioral,” while the two 
stability scales loaded most highly on a sta- 
bility factor (Factor IV). Apparently the 
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concept of stability was related to the concept 
of dispositionality for traits, while stability 
and dispositionality were distinct concepts for 
life conditions. 

Some examples of terms in the high or low 
ranges of factor scores on the trait disposi- 
tionality, life condition centrality, and life 
condition stability factors may make this 
distinction more clear.’ Traits with high 
dispositionality factor scores were properties 
that subjects saw as relatively unvarying over 
time and situations, and also as being major 
aspects of his personality, for example, “in- 
telligent,” “artistic,” “dishonest,” and “domi- 
nating.” Traits with low factor scores were 
more transient, situationally determined, 
properties such as “rash,” “dissatisfied,” or 
“happy.” The life condition stability factor 
score of a term seemed to be a straightforward 
function of the relative permanence of the 
characteristic: terms like “employed,” “mid- 
dle-class,” and “parent” had high scores 
while the terms “new in town” and “unde: 
pressure” had low scores. 

It appears that life conditions having high 
centrality factor scores provide information 
about someone’s personal characteristics o: 
behaviors in which he is likely to engage, fo 
example, “educated,” “experienced,” “unpop 
ular,” and “well-read.” In contrast, life con 
ditions with low centrality scores, given with 
out a context, offer little basis for deducin 
how the personality of a person so described 
differs from anyone else’s, “Lucky,” “ren 
an apartment,” “owns a new car,” and “live 
near a park” all had low scores on this factor 

The high correlation (.73) between load 
ings on the trait dispositionality and life con- 
dition stability factors shows that these fac- 
tors had similar overall patterns of loadings 
Although there was no such correspondence 
between the trait dispositionality and life con- 
dition centrality factors (r = —,.07), the fact 
that both had high loadings for the informa- 
tiveness, deep-seatedness, and pervasiveness 
scales suggests that these two information 


*In the discussion that follows, the signs of the 
factor Scores have been reversed for those factors for 
which the high loadings were negative. This was 
done for Convenience, so that the terms with high 
factor scores are those that are highest in the 
Property for which the factor was named. 
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dimensions may have some similarity of 
_ Meaning. 

Tf the factors found in Experiment 1 repre- 
sent significant aspects of the connotative 
meanings of descriptive words, then words 
having high factor scores on a given factor 
should provide information that differs from 
the information provided by words with low 
scores on that factor. Thus, different judg- 
ments might be made about a person depend- 
ing on whether he has been described by 
words that are low or high on a given dimen- 
sion. Experiment 2 tested this notion, using 


~ information varying along the trait disposi- 


e 


» 


. 
fe 


= 


tionality dimension and the life condition 
stability and centrality dimensions. The trait 
dispositionality dimension was selected for 
Experiment 2 because of the importance of 
the concept of dispositionality in person per- 
ception (Heider, 1958; Jones & Davis, 1965), 
while the life condition dimensions were 
chosen to allow investigation of their apparent 
similarity to the trait dispositionality dimen- 
= sion. 

Differences in judgment due to information 
type were measured in terms of the strength 
of implication of a description. The strength 
of implication measure was defined as follows: 
When the subjects’ task is to rate the likeli- 
hood that a stimulus person does or does not 
have a certain characteristic, the strength of 
implication of the stimulus description is the 
absolute difference between the rating actually 

-*made and a neutral rating. This measure does 
not take account of the direction of the in- 
ference (i.e., whether the subject thinks the 
characteristic is likely or unlikely): Only 
magnitude is considered. 

Schneider’s (Note 1) suggestion that high 
dispositional traits provide more information 
about a target person than do low disposi- 
tional traits leads to one prediction concern- 

«ing the effect of trait dispositionality on 
judgment: (Hypothesis 1) In a judgment 
task, there is a general tendency for high 
dispositional traits to have a greater strength 
of implication than low dispositional traits 
have. This hypothesis suggests a general 
trend, however, there may be types of judg- 
ments for which this greater informativeness 
of high dispositional trait information would 
not apply. For instance, when the task is 
prediction of whether a person will engage in 
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a specific behavior, knowledge of variable 
characteristics, such as the person’s mood, 
may be as important for the prediction as 
knowledge of stable properties. Thus (Hy- 
pothesis 2), there would be no tendency for 
stable, dispositional trait information to have 
greater strength of implication than variable 
trait information when the judgment task is 
predicting whether the stimulus person would 
engage in certain behaviors. Similarly, when 
the task involves inference of traits which 
themselves were low on the dispositionality 
factor, the relative usefulness of dispositional 
trait information may be reduced. If this is 
the case, then (Hypothesis 3) even though 
dispositional trait information leads to 
stronger inferences when the task is inferring 
a stable trait, there is no difference between 
the two when the task involves inference of 
a variable trait. 

The predictions above all apply to the 
effects of variation of the dispositionality of 
trait information. The use of life condition 
information varying along the stability and 
centrality dimensions allowed exploration of 
the extent to which these two dimensions have 
similar effects on judgment. 


EXPERIMENT 2 


Method 
Subjects 


Forty female students at a state college in the 
New Haven, Connecticut, area were paid for 1 
hour’s participation in the experiment, 


Materials and Procedure 


Each subject received a booklet describing eight 
stimulus persons, Each stimulus person was described 
by four of the terms used in Experiment 1: two 
traits and two life conditions. The two traits were 
both either stable or variable (ie., high or low on 
the dispositionality factor) and the two life condi- 
tions were both either stable or variable and high 
or low on the centrality dimension. Thus, these cight 
stimulus persons formed a 2 X 2 X 2 factorial design, 
the factors being trait stability, life condition sta- 
bility, and life condition centrality, For example, 
the stimulus person including the terms “intelli- 
gent,” “warm,” “financially secure,” and “has many 
friends” was in the stable traits, stable high central 
life conditions cell, while the one using the terms 
“restless,” “bold,” “new in town,” and “lives near 
a park” was in the variable traits, variable low cen- 
tral life conditions cell. 

A word or phrase’s classification on an informa- 
tion dimension was based on its factor score in 
Experiment 1. A trait or life condition was classified 
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as stable or variable if its factor score was in the 
upper or lower third of the scores on the relevant 
factor. A life condition was classified as high or low 
central if its factor score was in the upper or lower 
half of the scores on the centrality factor. The less 
stringent criterion for centrality was needed to 
provide enough words for the various combinations 
of stimulus information required by the design. 

Selection of the words to describe stimulus persons 
was random within categories, except for these 
constraints: (a) no negatively evaluated terms were 
used, to reduce the chance of confounding with the 
evaluative dimension; ? (b) no contradictory terms 
were paired; (c) no two terms were paired in more 
than one stimulus person. 

Two different sets of stimulus persons were con- 
structed. Each subject received only one set, pro- 
viding a replication using the same information 
dimensions but different specific content. Control 
for order effects was provided by reversal of the 
order of the terms in each stimulus person in half 
of the sets. 

Subjects rated each stimulus person on 26 char- 
acteristics (see display). They rated the likelihood 
that the stimulus person would be described by the 
characteristic. The rating scale was from —5 (very 
unlikely) to 5 (very likely). In scoring, only the 
absolute value of a rating was considered. The rated 
characteristics included, in order, 4 stable traits, 4 
variable traits, 4 stable life conditions, 4 variable 
life conditions, and 10 hypothetical behaviors in 
which the stimulus person might engage. The traits 
and life conditions were chosen randomly from the 
lists used in Experiment 1. When a term describing 
the stimulus person appeared as a response item, 
another word in the same category was substituted 
on the response sheet. The rating scale included the 
following items: 

1, artistic 

2. intelligent 

3. dominating 
4, dishonest 

5. dissatisfied 
6, relaxed 

7. happy 

8. rash 

9. employed 

10, middle-class 

11. parent 

12. city-dweller 

13. new in town 


14. plans to move to a different neighborhood 
15. new in his job 


2 Although the factors found in Experiment 1 were 
orthogonal, it is possible that a randomly selected 
set of terms varying on a chosen dimension might 
also differ on some other dimension. The evaluative 
dimension poses a particular danger in this regard, 
since evaluation is likely to affect strength of impli- 
cation (cf. Jones & Davis, 1965), Therefore, the 
evaluative factor scores of the terms selected were 
compared, and no confounding was detected. 

3 Individual cell means and complete tables of the 
analyses of variance can be found in Gifford (1973). 
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16. under pressure 

17. would watch the evening news regularly to 
keep up on current events ‘ 

18. would shoplift an inexpensive item from a 
store 

19. would know how to solve problems his co- 
workers have trouble with 

20. would smoke on a crowded bus 

21. would volunteer for a service organization 
such as VISTA 

22. would litter by throwing a candy bar wrapper 
on the sidewalk 

23. would almost always be on time for work 

24. would dent the fender of a parked car and 
not leave his name 

25. would risk his own life to rescue someone else J. 

26. would canvass his neighborhood soliciting for *\ T + 
charity. 


Results 


Preliminary analyses showed no significant 
effects due to either the stimulus person con- 
tent replication or order of presentation of 4 
phrases. Consequently, the analyses presented 
below dealt with only the within-subjects 
factors, which were the main focus of the 
present study.* pe ee 

The data supported the main hypothesis of 
the present study: Stable, dispositional trait 
information had a greater strength of impli- 
cation than variable trait information for all 
three categories of dependent measures. In 
ratings of traits, the mean strengths of im- 
plication for high and low dispositional trait 
information were 2.35 and 2.18, respectively, 

F (1,39) = 4.45, p < .05. In ratings of life { 
conditions, the high and low dispositional ~ j 
means were 2.49 and 2.24 (F = 10.89, p < 
.01), while in ratings of behaviors the means 
were 2.34 and 2.05 (F = 17.73, p< .01). 7 
Thus, Schneider’s proposal that high disposi- 
tional information is more informative was 
supported by these results. 

For life condition stimulus information, 

the pattern was different. The strength of 3 l 
l 


implication of high centrality life conditions » 

(X= 2.41) was significantly greater than 

that of low centrality information (X = 2.12) 

in the implication of traits (F = 7.85, p < 

01), but not in the implication of either life l 

conditions or behaviors. Stable life condition 

information had greater strength of implica- 

tion than variable life condition information 

in the judgment of likelihood of behaviors, i 

the means being 2.33 and 2.06 (F = 11.67, 

$ < .01) but not in judgments of traits or life + a 
“= 
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conditions. Thus, while both of these life con- 
dition dimensions had effects on judgment, 
their effects were not the same as those of the 
trait dispositionality factor. 

The significant effect for trait stability in 
prediction of behaviors was contrary to Hy- 
pothesis 2. However, a post hoc analysis 
yielded some support for this hypothesis. The 
10 behavior rating items can be broken down 


` into three subcategories based on three types 


of described behavior: regular patterns of 
behavior (numbers 17, 19, and 23 in display), 
all positively evaluated; single acts that are 
positively ‘evaluated (numbers 21, 25, 26); 


and single acts that are negatively evaluated 


(numbers 18, 20, 22, 24). Examination of 
these subcategories showed that the difference 
between stable and variable trait information 
was due mainly to differences in the predic- 
tion of regular patterns of behavior, with the 
mean strengths of implication being 2.65 and 
1.95, F (1,39) = 60.05, p < .01. For single 
behaviors, there was no effect when the be- 
haviors judged were positively evaluated (the 
mean ‘strengths of implication were 2.11 and 
2.21, F = 1.12). When the ratings were of 
negatively evaluated behaviors, the effect, 
although significant, was small in comparison 
to that obtained for regular behaviors. The 
mean strengths of implication were 2.28 and 
2.01 (F = 6.45, p < 05). 

It appears that the hypothesis of reduced 
difference in implicative strength might be 
valid when the behaviors being predicted refer 
to a single performance of an act, but defi- 
nitely is not valid when the prediction is of a 
regular, repeated behavior. It is not surprising 
that judgment of regular behaviors was simi- 
lar to the judgment of traits (with respect to 
this hypothesis) since repeated behaviors are 
a key source of information for attributing 
traits (Kelley, 1967). However, the finding 
of an effect for trait dispositionality when the 
predicted behavior is negatively evaluated 
limits the extent of support for the hypothesis, 
since there was no a priori basis for expecting 
the hypothesis to hold only for positive 
behaviors. 

The third hypothesis was strongly sup- 
ported. When stable traits were being judged, 
stable trait information had a mean strength 
of implication of 2.35, while the mean for 
variable trait information was only 1.92 (F = 
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24,31, p< .01). When variable traits were 
being judged, there was no such tendency. 
The mean strengths of implication were 2.35 
for stable trait information and 2.45 for varia- 
ble (F = 1.0). This pattern suggests that 
stable trait information was equally useful 
for drawing inferences about dispositional and 
nondispositional aspects of a person, while, in 
contrast, variable trait information was much 
less useful for drawing inferences about dis- 
positional traits than for inferring variable 
traits. 


GENERAL Discussion 


Experiment 2 was designed to demonstrate 
that the trait dispositionality dimension and 
the life condition centrality and stability di- 
mensions are relevant to interpersonal judg- 
ment. Since significant differences in strength 
of implication were found for all three fac- 
tors, it is reasonable to conclude that a word’s 
score on a given factor is in fact related to 
the type of information supplied by that 
word. This suggests that these three factors 
represent a significant aspect of people’s or- 
ganization of descriptive information, as op- 
posed to being merely a summary of subjects’ 
responses to scales that have no particular 
relevance to the actual use and understanding 
of the words. These results also provide a 
start toward understanding the specific rela- 
tionship between the information conveyed 
by a word and the word’s being high or low 
on a given dimension. 

Certain cautions are needed in interpreting 
the present results. First, the factor structures 
found in Experiment 1 do not provide an ex- 
haustive view of the dimensions of meaning of 
either traits or life conditions. The possibilities 
for dimensions to be found were limited by 
the choice of scales. Other researchers have 
found different dimensions of connotative 
meaning using different scales (e.g., Osgood, 
Suci, & Tannenbaum, 1957), or different 
methods, such as factor analyzing trait rat- 
ings (e.g., Norman, 1963; Passini & Norman, 
1966). Also, the number of words used as 
examples of each dimension in Experiment 2 
was very small in comparison to the number 
of words or phrases that might be members 
of each class. This means that differences 
might have been obtained due to the idio- 
syncratic meanings of some of the words used 
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rather than to information properties of the 
dimensions. However, the latter problem was 
reduced by the use of two different sets of 
stimulus content and by random selection of 
terms within each category for the stimulus 
and response materials. Furthermore, results 
consistent with the present ones have been 
found using two other sets of stimulus per- 
sons, similar to the ones in Experiment 2, but 
not mixing traits and life conditions in the 
same description (Gifford, 1973). 
` Another reason for caution in generalizing 
from the present results is that the effects 
reported in Experiment 2 were all weak rela- 
tive to individual differences. In all the analy- 
ses of variance of the present data (Gifford, 
1973), the subjects factor accounted for by 
far the greatest portion of the variance. This 
means that, while many of the effects were 
highly significant, individual differences in 
judgment style accounted for more variance 
than did variations in stimulus information. 

The results of the present study can be 
considered in terms of the concept of implicit 
personality theories. Implicit personality 
theories are usually operationally defined or 
measured as trait implications (e.g., Secord & 
Berscheid, 1963) or trait-rating intercorre- 
lations (e.g., Koltuv, 1962). The present find- 
ings provide some insight into the structure of 
such trait-trait relationships by showing that 
how dispositional a trait is seen as being is 
related to the sorts of traits that are seen as 
co-occurring with it. However, since the pres- 
ent analyses were of group averages, specific 
predictions about any given individual’s im- 
plicit personality theory could be made only 
to the extent that the group findings are rep- 
resentative of that person’s beliefs about in- 
formation properties of traits. 

One recent review of the literature on im- 
plicit personality theories has noted the need 
for research on their nontrait aspects 
(Schneider, 1973). While the research on the 
subject has been concerned primarily with 
co-occurrence of traits, the concept of implicit 
personality theories can include a much 
broader selection of types of information, for 
example, facial features, or behaviors. The 
present study has shown that life conditions 
can be reasonably strong impliers of traits 
and that life condition information properties 
differ from those of traits. Thus, the relation- 
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ship between life conditions and traits ap- 
pears to be an aspect of implicit personality 
theories worthy of further study. 
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Effect of Cognitive Appraisal on d’ and 
Response Bias to Experimental Stress 


Richard W. J. Neufeld 
University of Western Ontario 
London, Canada 


The effects of an intellectualization-denial passage designed to modify cogni- 
tive appraisal of stressor stimuli (gory scenes) were evaluated in terms of 
sensory-decision theory analysis. The passage was found to reduce “felt 
stress” (d’) for certain pairs of the stimuli addressed by the passage content 
but to be less effective in reducing d’ for stimuli not included in the passage. 
The criterion for reporting stress (Lz) was reduced rather than increased with 
presentation of the passage for the included stimuli only. Differences in d' 
between subjects classified within treatment groups as repressors versus sensi- 
tizers were obtained for certain stimulus pairs generally in the direction of 
lower sensitivity for repressors. Results are discussed in terms of the effects of 


Pa 


denial.” 


The concept of cognitive appraisal (cf. 
Lazarus, 1966) has been postulated as a 
critical intervening variable in accounting for 
variation in stress response. Its potential as 
an agent for increasing or decreasing reac- 


altered cognitive appraisal on the “prevention of stress” and on “defensive 


environment has been associated with reduced 
verbal acknowledgement of subjective stress 
without other indications, usually physio- 
logical, of an absence of stress (cf. Lazarus & 
Alfert, 1964; Pagano, 1973; Weinstein, 


i tions to ongoing stress has been experiment- Averill, Opton, & Lazarus, 1968). Lazarus 
) ally demonstrated by Speisman, Lazarus, (1967), for one, has noted that the coping 
Mordkoff, and Davison (1964) and later strategy of denial may be an important source 
extended to the “short circuiting’ of stress of disagreement among stress indexes, 
(e.g., Lazarus & Alfert, 1964) through pro- The presence of physiological responsivity 
gramming cognitive appraisal in advance of to threat in contradiction to subjective report 
p> stressor exposure. has raised the question of whether the indi- 


More recently, the construct of “cognitive 
appraisal” has assumed an important theo- 
retical role in accounting for variation in 
anxiety-related behavior (Spielberger, 1972) 
and also in distinguishing among anxiety and 
other fear-related emotions (cf. Lazarus & 


$ Averill, 1972). 


One type of cognitive appraisal, the ten- 
dency to avoid aversive aspects of emotional 
stimuli, has been the focus of considerable 
research on decrement in stress response (e.g., 
Byrne, 1961; Houston, 1973; Lazarus & 
Alfert, 1964; Pagano, 1973). An individual’s 
inclination to deny threatening aspects of the 


This research was supported by Canada Council 
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W. J. Neufeld, Department of Psychology, Univer- 
sity of Western Ontario, London 72, Canada. 


vidual is unaware of the bodily reactions or 
is reporting less stress despite awareness of 
such factors as disturbed cognitive or motor 
performance or an aroused physiological state 
(cf. Maher, 1966, pp. 192-199). This prob- 
lem seems to relate to two components of the 
subjective report of stress. One is the “felt 
threat” of the stimulus or the “sensory” as- 
pect of the stimulus with respect to stress. 
The other is the propensity of the subject to 
verbally identify a given stimulus occurrence 
as a member of the class of occurrences he is 
inclined to designate as stressful. 

One approach to the analysis of verbal re- 
ports separating sensory aspects of stimuli 
from “response bias” has been the applica- 
tion of the theory of signal detection (TSD) 
to subjective judgments (cf. Price, 1966). A 
full description of the method is available 
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elsewhere (Green & Swets, 1966; Egan & 
Clarke, 1966). However, the importance of 
TSD with respect to denial of threat would 
appear to be the possibility of assessing sub- 
jective responses to aversive stimuli in terms 
of sensitivity to disturbing properties (d’) as 
distinct from the tendency to verbally label a 
given stimulus event as stressful (Lz). 

In a previous study (Van Egeren, 1968), 
responses to anxiety and neutral stimuli of 
subjects separated on the basis of tendency 
toward denial in ‘the form of “repression ver- 
sus sensitization” (cf. Byrne, 1961) were 
assessed in terms of sensitivity and recogni- 
tion criteria. Word stimuli were categorized as 
“anxiety” or “neutral” for each subject ac- 
cording to prior responses in a word associ- 
ation task. Surprisingly, repressors were not 
found to differ from sensitizers with respect 
to either d’ or L,. The present study, rather 
than focusing primarily on psychometrically 
defined denial tendency, was designed to 
evaluate the effects of an experimentally pro- 
grammed denial orientation, on d’ and L+. In 
addition, generalization of treatment effects 
to stimuli, other than those directly addressed 
by the programmed denial orientation, was 
assessed in terms of these two parameters. 
Recent evidence (Averill, Malmstrom, Koriat, 
& Lazarus, 1972) has indicated generalization 
across stressors of experimentally reduced 
physiological and self-report ratings of dis- 
turbance may be somewhat limited. The pres- 
ent procedures sought to appraise whether 
such limits in generalization might extend to 
treatment effects on the separate d' and L, 
components of subjective responses. 

The agreement between physiological 
arousal and d’ and L, was also explored. It 
may be suspected that past findings of equivo- 
cal covariation between physiological and 
self-report measures of disturbance (cf. Math- 
ews, 1971; McGrath, 1970) might be partly 
attributable to the necessary confounding of 
response criterion with subjective effects of 
stress in most self-report measures. 

Finally, subjects, divided on the Repres- 
sion-Sensitization Scale, were compared in 
terms of d’ and L, in each experimental con- 
dition to assess whether previous absence of 
relationships would persist with the present 
sample and experimental context. 
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In summary, then, an attempt was made to _ 
relate the “sensory” and response-criterion ¢ 
components of subjective stress to experi- 
mentally programmed denial and “trait” ` 
denial and to probe the agreement in varia- \ 
tion between the sensory and criterion com-, | 
ponents and physiological reactivity. 


METHOD 
Subjects 


Subjects were 60 students taking undergraduate | 
psychology and sociology courses at the University } 
of Western Ontario. Equal numbers were randomly! Jis 
assigned to each of four experimental treatment TE Ti 
groups (n=15 per group). As participation re- i ~*~ 
quired two experimental sessions, subjects were paid” 
for their participation at the end of the second 
session. WW 

3 $ 
Stressor Stimuli j: 


Stimuli consisted of eight slides of homicide vif 
tims, eight slides of live skin-diseased patients, and‘ 
a neutral slide—a bare-chested male undergraduate. | 
The slides of homicide victims, obtained from the 
police department of a large Canadian city, were 
untouched color photographs of the victims at Lode 
the scene of the crime or “on the slab” at the w 
morgue. Blood and other evidence of trauma typi- | 9 
fied the scenes. Such slides of mutilated bodies have ‘ 
previously been allocated by judges toward the“very 
unpleasant” end of a “very unpleasant” to “very 
pleasant” rating continuum (Campos & Johnson, 
1967; Hare, Wood, Britain, & Shadman, 1971) and 
have been unidimensionally and multidimensionally 
scaled as “very stressful” (Neufeld & Davidson, Note 
1). The slides of skin-diseased patients, obtained 
from a medical textbook, were unretouched color 
photographs of patients in advanced stages of the >i 
diseases. These slides were similar to those of the 
homicide victims in that bodily disfiguration was 
extensive. 


Procedure 


Each subject, run individually, participated in two 
experimental sessions separated by 1 day. The experi- 
ment was introduced as “a study of people’s im- 
pressions of provocative photographs.” 

The first session consisted of three phases: (a) € 
completing the Repression-Sensitization Scale (Byrne, | 
1961); (b) ranking two sets of nine stimuli each 
from most to least stressful; and (c) listening to one 
of two experimental tapes. Because of the provoca- 
tive nature of the slides, subjects were informed as 
to their content and permission was given to with- 
draw prior to the ranking task. All subjects re- 
mained to participate. 

One set of slides to be ranked was made up of 
the eight homicide victims plus the male under- 
graduate neutral slide and the second set consisted 
of the eight skin-diseased patients and the neutral 
slide. Order of presentation of each set for ranking 


was randomized independently for each subject. 
Although both sets of slides were initially rank 
ordered in an effort to avoid differential orientation 
toward the taped passages to follow, only one set 
was retained for subsequent use in the second session. 
The subjects were randomly assigned to the homicide 
victim slides (HV) group or the skin-diseased patient 
slides (SD) group according to the set of slides 
presented in the second session. 

As the purpose of the study was not to relate 
variation in subjective experience to variation in a 
physical stimulus, precise measurement of the stimu- 
lus was not a requisite (cf. Chapman & Feather, 
1971). Rather, the stimuli had only to be ranked 
along a dimension of meaning designated by the 
experimenter and understood by the subject (“stress- 
fulness”), Hence, each subject separately ranked each 
}, of the two sets of stimuli from 9 to 1 corresponding 

to from most to least stressful. To facilitate this pro- 

cedure, a Pana-Vue 2 X 2 view finder was provided 
long with the sets of slides. Every other slide 

(ranks 1, 3, 5, 7, and 9) was selected from that set 

assigned for presentation in the second session (cf. 

Chapman & Feather, 1971). 

Generally, the rank allocated to a slide varied 
according to the extent of mutilation evident in the 
photograph. However, for the SD set, facial dis- 
figuration around the eyes also tended to produce a 
higher rank for some subjects. As expected, in both 
sets, the male undergraduate neutral slide was 

- “invariably ranked as least stressful (1). 

After completing the ranking task, the subject 
E. iistenéd to a taped passage. Half of the subjects in 
the HV group (n= 15) and half of the subjects in 
J the SD group (m= 15) were randomly assigned to 

“an “intellectualization-denial” tape and the other half 

in each group to a control tape on study habits. 

The intellectualization-denial (ID) tape was intro- 
duced as a description of some of the slides previ- 
ously ranked. Content was directed toward the 
homicide victim slides and was thus considered more 
relevant for subjects subsequently viewing the homi- 
cide victims in Session 2, than for subjects assigned 
the skin-diseased patients. The tape* was intro- 
duced as a passage on the informational value of 
photographs. It was fashioned according to similar 
passages relating to the stressor film “Subincision” 

4 by Speisman et al. (1964), de-emphasizing the aver- 
sive aspects of the stimuli, No mention was made 
of the severity of trauma or of feelings associated 
with bodily damage. Rather, a detached orientation 
was encouraged by focusing on the usefulness of 
photographs in crime detection, in educating the 
public about violence, and in assessing the “make-up” 
of people breaking society’s laws. Emphasis was 
also placed on the usefulness of clues in such photo- 
graphs for preventing similar crimes and for the 
“present state of control over these problems by 
local police forces.” 

The study habits tape (SH) was presented as a 
tape routinely given by the investigator as a service 


1 Readers interested in these passages can obtain 
the scripts from the author. 
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to subjects participating in his research projects. 
Content centered on the question-read-recite-review 
method of studying. 

In the second session, the subject was told more 
slides would be presented, and skin-conductance 
electrodes were then attached to the palm and upper 
surface of the nondominant hand and their use 
explained. The subject was informed that the elec- 
trodes were not capable of delivering shock and 
was then instructed in the use of a 7-point rating 
scale for recording subjective reactions to the slides. 

The rating scale was anchored at one end (1) as 
“pleasant” and at the other as “very stressful.” In- 
structions were then given that some of the slides 
previously seen were to be presented on a screen 
and each one was to be rated immediately after it 
was shown using one of the 7 categories. The 
subject was explicitly permitted to employ a cross- 
section of the categories during the rating task. After 
the experimenter ascertained that the subject under- 
stood the use of the rating scale, the assigned tape 
was again presented to the subject as a review of 
what was previously heard. Following the tape, the 
subject was requested to “simply remain seated for 
5 minutes during calibration of the polygraph.” 

When approximately 5 minutes had elapsed, the 
set of 5 slides were each presented 16 times for a 
total of 80 exposures (cf, Chapman & Feather, 1971; 
Clark, 1969; Clark & Mehl, 1971). Order of the 
80 presentations was randomized independently for 
each subject. Duration of the slides was 10 seconds 
with the intertrial interval randomly varied between 
17 and 23 seconds with a mean of 20 seconds, Skin 
conductance was monitored throughout the presenta- 
tion employing a Gross Model 7 polygraph with 
a 7P1 preamplifier and Beckman electrodes covered 
with a K-Y sterile lubricant, as mentioned above, 
conductance being taken from the palmar and dorsal 
surfaces of the nonpreferred hand. Each session 
lasted approximately 1 hour. 


Analytical Procedures 


The rationale underlying the present analyses de- 
serves additional comment. The initial ranking pro- 
cedure was analogous to, but obviously not identical 
to, the designation of “signal” versus “no-signal” 
stimuli or, equivalently, “noise-plus-signal” versus 
“noise only” stimuli in conventional signal detection 
experiments. In research such as that on auditory 
mechanisms, determining whether a given stimulus 
carries the defined signal or a certain magnitude of 
signal, is usually straightforward and can be physi- 
cally identified. However, the degree of stressor 
properties of a complex visual stimulus for a given 
subject based solely on some measurable physical 
characteristic is not readily forthcoming. Rather, 
estimating the magnitude of stressor properties asso- 
ciated with the stimulus continues to require the 
participation of the subject (cf. Neufeld & Davidson, 
1974). In this case, the initial ranking task repre- 
sented the subject’s disposition to the stimuli with 
respect to properties of stress before the appraisal- 
modifying or control-passage manipulations were 
presented. 
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TABLE 1 


Proportions OF RESPONSES IN EAcH RATING 
CATEGORY FOR THE FouR 
TREATMENT GROUPS 
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Stimu- Rating category 
Group lus = 
rank 
1 2 3 4 5 6 7 

HYV-I-D 9 33 
7 222 
5 38 
3 08 
1 «55 | .27 | 

HV-SH 9 AS 
7 -09 
5 -09 
3 .07 | .13 
1 «17 | 10 

SD-I-D 9 24 
7 02 
5 -03 
3 lo 05 
1 54] .29 

SD-SH 9 40 
7 15 
5 06 12 
3 05 04. 
1 79) .11 


The postmanipulation ratings provided estimates 
of the subsequent disposition toward the same 
stimuli. Analysis of these ratings within a signal- 
detection framework provided information about 
this latter disposition with respect to: (a) subjective 
sensitivity to variation in these stimuli (d’) as well 
as (b) the criterion (Zz) for allocating a given 
magnitude (rating category) of stress to the 
“subjective effect” of a given stimulus presentation. 

The effects of exposure to the variously ranked 
stimuli were assumed to be normally distributed and 
overlapping. The value of d’ for a pair of stimuli 
was considered to be the distance between the mid- 
points of their corresponding distributions. The value 
of Ls for a given pair of stimuli was considered as 
the ratio of the likelihood that a given subjective 
event had emanated from a stressor of higher rank 
to the likelihood that the subjective occurrence had 
emanated from a lower ranking stressor. The value 
of d’ was calculated in unit normal deviate values 
using the proportion of lower ranked stimulus 
presentations allocated to a given rating category 
(“false-alarm rate”) in conjunction with the propor- 
tion of higher ranked stimulus presentations allocated 
to that rating category (“hit rate”). The ZL. value 
was calculated as the ratio consisting of the ordinate 
of the normal curve corresponding to the hit rate 
proportion divided by the normal-curve ordinate 
corresponding to the false-alarm rate proportion. 


RESULTS 


In order to assess the effects of the experi- 
mental manipulations group-wise d’ and L, 


- erties for the rank 5 stimuli (as Suggested by 


values (cf. Chapman & Feather, 1971; Clark. 
1969) were computed by calculating the phy i 
portion of responses in each of the rating. 
categories. For each stimulus, there was a 
total of 240 responses per group—16 for each 
of the 15 subjects. These proportions are 
presented in Table 1. Four values of d’ were 
computed for each group—one d’ for each 
pair of adjacent stimuli. The scale was di- 
chotomized at Category 5 for stimulus pairs 
9-7, 7-5, and 5-3. Thus the cumulative pro- 
portion of ratings of 5 or above for the higher 
ranked stimulus was analogous to the group’s 
hit rate and the cumulative proportion al- 
lotted 5 or above for the lower ranked stimu- 
lus was considered the group’s false-alarn 
rate. Category 3 replaced Category 5 as thi 
point of dichotomy for stimulus pair 1-3, 
Pair-wise comparisons? of corresponding d 
values were carried out between the following 
groups: HV-I-D and HV-SH; SD-I-D and 
SR-SH; HV-I-D and SD-I-D; and HV-SH and 
SD-SH. Statistical significance of the differ- 
ences in the d’ values was assessed using 
G tests (Gourevitch & Galanter, 1967). Type 
I error rate was set at .05 per family of the 
four comparisons between a pair of groups 
employing Dunn’s procedure (cf. Kirk, 1968). 
The d’ values and G values are presented in 
Table 2. 

For groups HV-I-D versus HV-SH, the 
hypothesis of no differences in any of the four 
pairs of d’ values was rejected at the .05 level 
of significance. The peice i dn 


passage was evidently effective in reducing 
d’ for stimulus pairs 3-5 and 7-9. The signifi- 
cantly higher d’ for pair 5-7 relative to that 
of the group presented with the passage on 
study habits could have been attributable to 
an elevation in the “stressor properties” of the 
rank 7 stimuli or a decrease in stressor prop- 


the HV-ID group’s lowered pair 3-5 d’ value). 
To assess the former possibility, the rating 
scale was dichotomized at Category 5 and the 
computed d’ values for pair 3-7 were com- 
pared. The difference was found to be non- 
significant (G = .06, p > .10) indicating that 


2 Although the experimental design suggested fac- 
torial analyses of variance, this method would have, 
meant computing d' and Ls for each subject employ-" y 
ing 16 responses per stimulus and sometimes imguffi- 
cient overlap between stimuli for reliable esti 
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Phe significant increase in d’ of pair 5-7 was 
attributable to lowered stressor properties of 
the rank 5 stimuli for group HV-I-D. Group 
SD-I-D was compared to group SD-SH in 
order to assess generalization of the effects of 
the intellectualization-denial passage (as com- 
pared to the neutral control passage) to 
aversive stimuli not included in the passage 
content. Again, the hypothesis of no differ- 
ences in the four pairs of d’ values was re- 
jected at the .05 level of significance. How- 
ever, the affected stimulus rank pair was 1-3 
with no significant differences for any of the 
other stimulus pairs. 

Comparisons between groups HV-SH and 

* SD-SH indicated that d’ varied for all stimu- 
lus rank pairs other than 7-9. Thus, the 
“initial” d’ values, according to the responses 
of these control groups, differed between the 
two sets of stimuli. Despite a significantly 
larger initial d’ for the HV control group than 
for the SD control group for stimulus rank 
pair 3-5, the I-D passage significantly re- 
duced d’ relative to the corresponding d’ of 
the SD-I-D group. With no initial significant 
difference in d’ for rank pair 7-9, the denial 
passage again significantly reduced d’ for 
group HV-I-D relative to that of group 
SD-I-D. While the I-D passage significantly 
reduced the d’ of rank pair 1-3 for the SD 
group relative to its own SH control, it failed 
to do so relative to d’ of the HV-I-D group. 

` However, the “initial” d’ value—as reflected 
in the comparison between the two SH 
groups—was significantly larger for the SD 
group. With respect to the significantly higher 
d’ of rank pair 5-7 for group HV-I-D as com- 
pared to group SD-I-D, it appeared that re- 


TABLE 2 


VALUES or d’ AND G Tests ror GROUP DIFFERENCES 
IN d’ FOR RESPECTIVE STIMULUS RANK PAIRS 


Stimulus-rank pairs 
Group pairs as 
1-3 3-5 


5-7 7-9 

A 2.29 38 1.19 —.08 
vst 1.97 1.68 —,.14 -63 
G (HV-I-D)-(HV-SH)| 1.43 | —7.37* 7.029* | —3.25* 
-T- 1.98 1S 66 66 
Spsit 2,68 AT 50 1.00 
G (SD-I-D)-(SD-SH) =3.17* 947 70 —1.06 
G (HV-I-D)-(SD-I-D) 1.53 | —2.28* 2.75* | —3.09* 
—3.25* 3.94* | —3.69* 1.85 


G (HV-SH)-(SD-SH) 


Note. Alpha per family of four comparisons between each 


group pair = .05. 5 
*p < .0125, two-tailed. 
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duced stressor properties of stimulus 5 of 
group HV-I-D were again responsible rather 
than increased stressor properties of stimu- 
lus 7. The d’ values of rank pair 3-7 were com- 
puted dichotomizing the rating scale at Cate- 
gory 5 and found to be nonsignificant between 
groups HV-I-D and SD-I-D (p > .10). The 
hypothetical probability distributions reflect- 
ing variation in stressor properties of the 
stimuli are presented in Figure 1. 

The effects of the experimental manipula- 
tions on the response criterion, Zz, adopted by 
each group were assessed by again cumulating 
the proportions in Table 1. Values of Ly were 
computed at rating Categories 5, 6, and 7 for 
stimulus rank pair 9-7; Categories 5 and 6 
for rank pair 7-5; Categories 3, 4, and 5 for. ` 
pair 5-3; and Category 3 for pair 1-3.. Thus, 
there were 9 Ly values per group. A 9 X 4 
Friedman two-way analysis of variance by 
ranks was performed on these values and 
indicated that the groups were not identical 
in the criteria adopted, x*(3) = 10.46, p< 
.02. The same groups were contrasted with 
respect to Lẹ as compared with respect to d’ 
using Wilcoxon matched-pairs signed-ranks 
tests. Each between-group contrast was tested 
at the same significance level as the overall 
level employed for d’ (alpha = .05). Two sig- 
nificant comparisons were obtained—that be- 
tween group HV-SH versus SD-SH (Z = 2.55, 
p< .01) in the direction of a stricter cri- 
terion for group HV-SH (X = 1.56 vs. 1.15) 
and that between groups HV-I-D versus 
HV-SH (Z= 2.19, p < .028) again in the 
direction of a stricter criterion for group 
HV-SH (£ = .95 vs. 1.55). 

The effects of repression—sensitization 
(R-S) as assessed by the R-S scale on d’ and” 
L, were evaluated in each of the four treat- 
ment groups. A Stimulus Content x Tape 
Content analysis of variance on subjects’ R-S 
scores revealed no significant main effect or 
interaction (all ps > .10) suggesting that these 
scores were more or less evenly distributed 
among the treatment groups. In each group, 

1 subject was randomly excluded and the re- 
maining 14 subjects were divided into two 
groups—the 7 highest (sensitizers) and 7 
lowest (repressors) R-S scorers. There were 
112° responses to each stimulus in each of 
these groups. The d’ values for each adjacent 
stimulus pair were contrasted yielding four 
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comparisons between repressors and sensitiz- 
ers (as classified above) in each treatment 
group with the Type I error rate (.05) dis- 
tributed across the four comparisons. Again, 
the rating scale was dichotomized—at Cate- 
gory 6 for the top 3 rank pairs and at Cate- 
gory 3 for rank pair 1-3. These d’ values are 
presented in Table 3. In both groups pre- 
sented with the I-D tape, d’ for repressors 
was significantly lower than that for sensi- 
tizers for stimulus rank pairs 1-3 and 7-9 (all 
ps < .012). For the groups given the SH tape, 
d' of repressors exceeded that of sensitizers 
for stimulus rank pair 9-7 ( < .012) of the 
HV stimuli while d’ of sensitizers exceeded 
that of repressors for rank pair 5-7 of the SD 
stimuli. To assess whether the R-S differences 
in the groups exposed to the I-D tape were 
attributable to an increase in d’ on the part 


of the sensitizers or a decrease in d’ on the 
part of the repressors, additional comparisons 
were made. As opposed to their counterparts 
presented with the S-H tape, repressors in the 
HV-I-D group had a lower d’ for rank pair 
7-9 (G=3.04, p< .012). The d’ value of 
repressors in the SD-I-D group was signifi- 
cantly lower than that of repressors in the 
SD-SH group for rank pair 1-3 as well 
(G= 5.09, p<.012). For the sensitizers, 
subjects exposed to the I-D tape had a higher 
d' for rank pair 9-7 than those given the S-H 
tape for both the HV stimuli (G = 3.86, 
p< .012) and the SD stimuli, (G = 2.67, 
p< .012). 

Likelihood ratios were calculated and ana- 
lyzed in the same manner employed to test 
experimental treatment effects on L,. None 
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of the four repressor-sensitizer comparisons 
reached the .05 level of significance. 

Assessment of whether effects on d’ and L, 
were paralleled by variation in skin conduct- 
ance (SC) required the scoring of reactivity 
to each stimulus. Six of the 16 exposures were 
evaluated. Three were randomly selected from 
the first 8 exposures and 3 from the last 8. 
Reactivity was scored as the difference be- 
tween conductance (micromhos X .10) just 
prior to stimulus onset and maximum con- 
ductance during stimulus exposure. Two scores 
per stimulus were thus obtained—the mean of 
the 3 exposures from the first 8 and the mean 
of the 3 exposures from the last 8. Estimation 
of reactivity during the course of the ratings 
was obtained by performing an analysis of 
variance on these SC reactivity scores em- 
ploying the following design: 2 (levels of the 
Taped Passage) X 2 (levels of Stimulus Con- 
tent) X 2 (levels of Repression—Sensitization, 
as Classified above, “nested within tape type 
and stimulus type)? X 5 (Stimulus Ranks) x 
2 (first versus second halves of the Exposures 
by subjects nested within tape type, stimulus 
type, and R-S classification). 

Significant effects were obtained for the 
first versus second halves of the exposures, 
X= .154 and .085, respectively, (1,52) 
= 6.47, p < .05, and for the Stimulus Rank 
X Stimulus Content X Tape Content second- 
order interaction, F (4,208) = 2.42, p < .05. 
Analysis of simple first-order interactions 
indicated that the “source” of the second- 
order interaction was a Stimulus Content X 
"Tape Content effect specific to stimulus 
rank 3, F(1, 130) = 4.32, p < .05. Further 
analysis of “simple simple main effects” (cf. 
Kirk, 1968) revealed a significant effect of 
stimulus content—in the direction of greater 
reactivity to the HV stimuli (X = .2366 vs. 


8The R-S classification was considered “nested 
within treatment groups (tape type and stimulus 
type)” as the classification was based on the median 
of each group of randomly assigned subjects. Pre- 
liminary tests on R-S nested within treatment 
groups; R-SX Order, nested within treatment 
groups; R4% Stimulus Rank, nested within treat- 
ment grou „fand R-S X Order X Rank, nested 
within treatment groups invariably exceeded values 
of .10. These terms were pooled with the appropriate 
error terms (Kirk, 1968) involving subjects nested 
within tape type, stimulus type, and RS (nested 
within treatment groups) to test the remaining 
effects, 
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TABLE 3 
REPRESSORS’ AND SENSITIZERS’ d’ VALUES IN 
EACH TREATMENT GROUP 


Stimulus rank pairs 
Group 

1-3 3-5 5-7 7-9 

HV-I-D 

Repressors 1.59 —.02 AS 0 

Sensitizers 3.67 4i -92 1.27 

HV-SH 
Repressors 2.19 1.24 AL 76 
Sensitizers 2.66 1.19 -64 05 

SD-I-D 
Repressors 1.46 OL 3.77 45 
Sensitizers 3.05 73 3.17 1.28 

SD-SH 
Repressors 3.29 39 39 28 
Sensitizers 2.69 =S 1.44 62 


.0839)—specific to the SH tape and the 
stimulus rank 3, F(1,130) = 3.93, p < .05. 
Thus, there appeared to be generally greater 
electrodermal reactivity during the first half 
of the exposures. The other findings did not 
indicate any apparent concordance between 
SC and d’ or L,. Rather, the only significant 
group difference was in the opposite direction 
of the corresponding d’ difference—d’ of group 
HV-SH being significantly lower than that of 
group SD-SH for rank pair 1-3. 


Discussion 

Results from the present study suggested 
that an experimentally programmed orienta- 
tion, aimed at modifying the cognitive ap- 
praisal of stressor stimuli in the direction of 
reduced threat, lowered the subjective sensi- 
tivity to several of these stimuli. However, 
the criterion for reporting stress was not 
raised. Rather, the elevated criterion associ- 
ated with exposure to some of the stimuli 
(slides of the homicide victims) under the 
neutral passage disappeared with exposure to 
the threat-reducing passage. 

The generalization of the effects of the 
threat-reducing passage, to aversive stimuli 
not directly implicated by the passage con- 
tent, was limited. Both fewer and different 
stimuli were affected. Moreover, the group 
given the threat-reducing passage, “irrelevant” 
to those stimuli subsequently encountered, 
differed from the group given the stimuli ad- 
dressed by the passage in a similar manner 
as did the neutral-passage control group 
(HV-SH). 
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Previous research (Averill et al., 1972) has 
indicated that generalization of habituation to 
complex emotional stimuli (segments of a 
stressor movie) not included in prior habitu- 
ation trials was limited. Similarly, the present 
results suggested limits in generalization of 
the effects of modified cognitive appraisal on 
the “sensory component of subjective stress” 
to stimuli not included in the programmed 
modification. 

Significant repressor-sensitizer differences 
were found in each group with respect to d’ 
but not L,. Five significant comparisons were 
in the direction of greater sensitivity on the 
part of those classified as sensitizers and one 
significant difference was in the opposite di- 
rection. These differences in d’ values differed 
from those of Van Egeren (1968) who did 
not find significant repressor-sensitizer dif- 
ferences. However, the present stimuli may 
have been “stronger” than Van Egeren’s anx- 
iety stimuli (words) psychometrically classi- 
fied for each subject. In addition, Van 
Egeren’s task involved a tachistoscopic identi- 
fication of the preclassified words as opposed 
to the rating of the subjective stress of aver- 
sive visual scenes. Finally, Van Egeren calcu- 
lated individual d’ values on the basis of a 
large number of trials per subject and as- 
sessed the statistical significance of variation 
among group means of the d’ values. The 
present study computed d’ values for each 
group combining responses within and be- 
tween subjects and assessed group differences 
through planned pair-wise comparisons of 
these group-wise values. Any one or several of 
these variations in experimental procedure, or 
the different scale (Ullmann, 1962) employed 
by Van Egeren to assess repression-sensitiza- 
tion, may have been at the root of this 
divergence in results. 

The attempt, in this study, to estimate 
agreement on the effects of the experimental 
treatments between autonomic reactivity and 
d! and L, must be considered exploratory. No 
apparent correspondence of the electrodermal 
reaction to the two components of subjective 
stress was observed. However, responses ob- 
tained within and across subjects were em- 
ployed to compute d’ and Ly values leading to 
a different statistical analysis of variation in 
these measures as compared to the analysis of 
skin conductance. In addition, detection of 
possible agreement between skin conductance 
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and ZL, may have been somewhat obscured 
due to the collapsing across stimulus pairs in 
carrying out comparisons on L,. Moreover, 
the sensitivity of skin conductance to varia- 
tion in stress has been seen to be somewhat 
elusive depending, in part, on the nature of 
the stressful situation (cf. Craig & Wood, 
1971; Davidson & Neufeld, 1974; Geer & 
Klein, 1969; Hare, Wood, Britain, & Frazelle, 
1971; Neufeld & Davidson, 1974). A future 
study permitting more conclusive results 
might assess the relationship of several 
physiological measures to d’ and Ls employing 
correlational procedures. 

Inasmuch as the value of d’ in this study 
can be construed as a criterion-free index of 
subjective stress, the effect of the programmed 
alteration of appraisal seemed to be in the 
direction of “prevention of disturbance” to at 
least some of the stimuli relative to other 
stimuli. This interpretation of the effect 
agrees with recent formulation by Lazarus 
and Averill (1972) suggesting that cognitive 
appraisal altered prior to stressor presentation 
can “short circuit” threat so that less or no 
disturbance is actually aroused. 

The effect of repression—se 
“dispositional” variable probably reflecting 
“cognitive style’ in Lazarus and Averill 
(1972, p. 243) framework, was on the d’ com- 
ponent of certain stimuli as well. Thus, the 
tendency on the part of repressors to report 
less stress under certain conditions (cf. 
Lazarus & Alfert, 1964; Pagano, 1973) would 
seem to be related to the amount of stress 
“experienced” rather than distortion of the 
actual amount of stress in the verbal report. 

The propensity to report stress (L+) was 
not reduced by the intellectualization—denial 
passage or the classification as a repressor. To 
the extent that ZL, reflected what might re- 
semble an intrapsychic process of “psychic 
avoidance” (cf. Lazarus & Averill, 1972), the 
only effect of the appraisal-modifying passage 
was in the direction of an increase in the 
disposition to acknowledge stress for the 
homicide victim stimuli. 

The present attempt at a signal detection 
approach to stress reactions involved the re- 
peated presentation of “static” stimuli. A 
drawback of signal detection analyses in 
stress research is the need for a large number 
of responses under relatively constant condi- 
tions. Unfortunately, signal detection proce- 
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dures may not be feasible in laboratory ana- 
logues such as viewing threatening films, 
simulation of directly stressful situations, as 
well as in field studies. However, where 
applicable, these procedures may be of future 
value in assessing variation in subjective 
stress and the tendency to acknowledge or 
deny stress. 


REFERENCE NOTE 


1, Neufeld, R. W. J., & Davidson, P. O. Multi- 
dimensional subjective responses to frequently 
used laboratory stressers. Paper presented at the 
meeting of the Canadian Psychological Associa- 
tion, Montreal, June 1972. 
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A Test of the Content-Consequence Hypothesis _ 
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This article contains two conceptual replications of an experiment designed 
to test the hypothesis that when monetary payments are inherent to the 
task’s content their presence (vs. absence) increases intrinsic motivation, 
whereas when they constitute the task’s exogenous consequence their presence 
(vs. absence) decreases intrinsic motivation toward the task. The data of both 
experiments strongly supported the research hypothesis. The findings were 
interpreted as consistent with the assumption that intrinsic motivation ensues 
whenever the actor causally attributes his performance of the task to the 
task’s content and inconsistent with the proposal that intrinsic motivation be 
identified with internal (or self-) attributions and extrinsic motivation with 


attributions to the external environment. 


Attributional analysis has been recently 
brought to bear on the distinction between 
intrinsic and extrinsic motivation (e.g., Deci, 
1971, 1972a, 1972b; Kruglanski, Alon, & 
Lewis, 1972; Kruglanski, Friedman, & Zeevi, 
1971; Kruglanski, Riter, Arazi, Agassi, 
Monteqio, Peri, & Peretz, in press; Lepper, 
Greene, & Nisbett, 1973). According to Deci 
(1971, 1972a, 1972b), intrinsic motivation 
obtains whenever the actor locates the cau- 
sality for his activity within himself and ex- 
trinsic motivation whenever he locates it in 
the external environment. For instance, mone- 
tary rewards may constitute a highly plausi- 
ble external cause for an activity, hence the 
actor would discount his personal motivation 
to the task as a possible cause. By now there 
exist experimental data to wit that the intro- 
duction of external rewards (money, prizes, 
experimental credits, etc.) leads to a decline 
in intrinsic motivation measured, for example, 
by reported interest (Weick, 1964), persist- 
ence at or resumption of an activity (Deci, 
1971, 1972a, 1972b; Lepper et aly ad Moe 
Greene & Lepper, Note 1), or the quality of 
task performance (Kruglanski et al., 1971; 
Weick, 1964). 
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the first author. 

Requests for reprints should be sent to Arie W. 
Kruglanski, Department of Psychology, Tel-Aviv 
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But Deci’s distinction between internal 
versus external causes leads to conceptual 
difficulties with the criteria for internality or 
externality. Thus, (a) any activity seems to 
have an internal cause, the actor’s motive for 
engaging in it; and (b) many activities seem 
to have an external cause as well as an inter- 
nal one, namely, the anticipated change in the 
environment (reception of the reward, solu- 
tion of the problem) satisfying the actor’s 
motive for engaging in the activity. For in- 
stance, money constitutes an external object, 
however, the desire for it may be acutely 
internal. Similarly, interest in a task may 
be internal, yet the task eliciting such inter- 
est is certainly external to the person. All of 
which suggests that the classification of ac- 
tivities into those with internal as opposed 
to external causes may be problematic on 
logical grounds. 

In addition, the exclusive identification of 
intrinsic motivation with internal causes and 
extrinsic motivation with external causes con- 
fronts a difficulty when viewed against the 
finding (e.g., Deci, 1971) that positive verbal 
feedback about performance delivered by an 
external agent consistently leads to an in- 
crease in intrinsic motivation. Possibly this 
datum has led Deci to modify somewhat his 
theoretical focus by proposing that intrinsic 
motivation be identified with the need of self- 
determination (or competence) presumably 
satisfiable by a positive feedback about per- 
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~ formance. The latter formulation has several 


implications: (a) its identification of intrinsic 
motivation with a specific need (reminiscent 
of the posttilation of intrinsic and extrinsic 
needs by some industrial psychologists)* re- 


: moves the analysis from the original attribu- 


tional level impartial to the type of motive; 
(b) it seems to misidentify internal attribu- 
tions of causality with the need of making 
internal attributions (or the need of. self. 
determination); (c) it seems to suggest that 
motives outside that of self-determination 
could not motivate activities in an intrinsic 
way. . 


Content—Consequence Distinction 


An attributional conception of intrinsic 
motivation which stays clear of the problems 
just outlined rests on a distinction between 
the content of an activity and its incidental 
consequence (Kruglanski, Note 2; akin to 
the distinction between stimulus and circum- 
stante made by Nisbett & Valins, 1971; and 


_ Valins & Nisbett, 1971). On this view, in- 


trinsic motivation is defined by the case in 
which the person’s self-attributed cause for 
an activity inheres in its content. By contrast, 
extrinsic motivation is defined by the case in 
which the person’s self-attributed cause for an 
activity lies in the consequences thereof, 
arbitrarily contingent on its performance. The 
foregoing conceptualization of intrinsic and 
extrinsic motivation does not share the clas- 
sification problem of the internal-external 
distinction: it should be relatively easy to 
discern whether a given cause inheres in the 
task’s content or constitutes its exogenous 
consequence. Furthermore, the present defini- 
tions are stated on a strictly attributional 
level, that is, they are impartial to the spe- 
cific motives satisfiable within the. content 
or the consequence of an activity. 

This report contains two conceptual repli- 
cations designed to pit against one another 
the two attributional analyses of task 
motivation discussed earlier. Specifically, a 
situation is created in which monetary rewards 
are in the first case intrinsic to the task’s 
content and in the second case extrinsic (in 
the sense of being arbitrarily attached) to it. 
According to the internal-external distinction 
above, monetary rewards constitute a salient 
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situational feature external to the person, so 
their presence (vs. absence) should suppress 
intrinsic motivation in both conditions. By 
contrast, the content-consequence distinction 
suggests that whenever money is inherent to 
the task its presence should enhance intrinsic 
motivation as it provides a salient intrinsic 
cause for task performance, whereas whenever 
it is extrinsic to the task its presence should 
lower intrinsic motivation. The research de- 
signed to test the foregoing hypothesis is 
described below. 


EXPERIMENT 1 


Subjects and Procedure 


Subjects in this experiment were 48 boys, 14 to 
15 years old, and from a low socioeconomic back- 
ground. They had volunteered for research about 
boys’ games conducted by investigators from the 
Tel-Aviv University. The experiment was conducted 
at several municipal youth centers 2 during the time 
of regular activity at these clubs. Each subject 
played a game under conditions determined by a 
2X 2 crossing of the task variable (money intrinsic 
versus extrinsic) and payment variable (payment 
present vs. absent). 

Manipulating the task variable. One half of the 
subjects, randomly assigned to the money intrinsic 
condition, played with the experimenter (Asher 
Amitai) a coin-toss guessing game (heads or tails). 
As this game is always played for actual money, 
it was assumed that (by past association) money 
would be apprehended as a game-intrinsic feature. 

The remaining half of the subjects, within the 
money extrinsic condition, used wooden blocks for 
the construction of models according to pictures. In 
the subjects’ experience, games of this sort are hardly 
ever associated with payment, so money should be 
perceived as a game extrinsic feature. 

Manipulating the payment variable. Half of the 
Subjects, in the payment present condition, were 
given 10 Agoroth (.10 Israeli IL)? for each correct 
guess (in the money intrinsic case) or construction 
(in the money extrinsic one) until each subject had 
accumulated 20 10 Ag. coins. Subjects in the pay- 
ment absent condition received 1 point for each 
Correct guess or construction until each subject had 
gathered 20 points, 


1 For instance, Herzberg (1966) identified intrinsic 
motivation with the need of psychological growth 
and extrinsic motivation with the need of pain- 
avoidance; identified intrinsic motivation with the 
need of self-enhancement, ete. 

2 Devoted to the provision of cultural and educa- 
tional activities for youths of lower socioeconomic 
background. 

842 IL=$1 U.S. 
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At that time the subject was referred to an 
adjoining room and responded to a questionnaire 
administered by a second experimenter (Leorah 
Shabtai), blind as to the subject’s experimental con- 
dition. This questionnaire included checks on the 
task manipulation and items about the subject’s 
interest in the game. The subject was asked not to 
discuss the study with his friends and to rejoin the 
ongoing club activity. At the end, all subjects were 
interviewed, and given a full explanation regarding 
the purpose of the research, Subjects in the Payment 
absent condition received the two IL rewards as 
had their payment Present counterparts, 


RESULTS 
Checks on the Task Manipulation 


Subjects answered the question: “In your 
opinion, to what extent do monetary rewards 
belong to the (coin-toss; model-construction) 
game, that is, constitute an inseparable part 
of this game?” As all the subjects were 
familiar with both games, they responded to 
the item above separately in reference to each 
game (rather than just to the specific game 
they played). The answers were recorded on 
7-point bipolar scales marked at the ends 
“very much belongs” and “does not belong 
at all.” As expected, the subjects perceived 
the monetary payment as belonging more 
(thus more intrinsic) to the coin-toss versus 
the model-construction game, F(1,46) = 
211.66, p < .001 for main effect of the game 
rated.* 

A second check on the task manipulation 
was phrased: “Tf, and when you will play the 
(coin-toss; model-construction) game, to 
what extent are you likely to play it for 
money?” Again, each subject responded to 
this item with Tespect to both games. The 
answers were recorded on 7-point bipolar 
scales marked at the ends “very likely” and 
“very unlikely,” Consistent with expectation, 
the subjects judged themselves more likely 
to play for money the coin-toss versus the 
model-construction game, F(1,46) = 11.52, 
Ż < .01 for main effect of the game rated. 


Intrinsic Motivation 


The subject’s degree of intrinsic motivation 
toward the game he had played was inferred 
from three separate questions: “To what 
extent did you find the game you have just 


+All p values reported are two-tailed. 


played interesting?”; “To what extent do you 
feel that, outside of an experimental situa- 
tion, boys your age might find the game inter- 
esting?”; “To what extent do you feel that 
in the future you may want to play the game 
with friends during leisure times?” The first 
and second items above inquire into per- 
ceived interest and the third item into per- 
ceived preference for the activity over pos- 
sible alternatives, hence they constitute mea- 
sures of intrinsic motivation comparable to 
those employed by other investigators in this 
area (e.g., interest and preference were mea- 
sured by Deci, 1971: preference by Lepper 
et al., 1973; interest by Kruglanski et al., 
1972, and by Kruglanski et al., 1971). The 
answers to these questions were recorded on 
7-point bipolar scales anchored appropriately 
at the ends, for example, with “very interest- 
ing” and “not at all interesting” in the case 
of the first item above, and so on. The pat- 
terns of mean differences in the responses to 
all three questions just described turned out 
to be highly similar, and the within-cell corre- 
lations between them significantly positive 
(the 7s ranging from .37 to .84), so they were 
combined algebraically into a single index of 
motivation. The means for this measure are 
shown in Table 1. As hypothesized, the inter- 
action effect of the task and the payment 
variables was highly significant, F(1, 44) = 
43.32, p < .001. Specifically, in the money 
intrinsic condition the subjects manifested a 
higher degree of intrinsic motivation when 
the payment was present versus absent, ¢(22) 
= 7.84, p< .001, whereas in the money 
extrinsic condition the subjects manifested a 
lower degree of intrinsic motivation with than 
without payment, ¢(22) = 221) p< 05, 


a 


EXPERIMENT 2 
Method 


Subjects and Procedure 


Subjects in this experiment were 80 boys, 15 and 
16 years old and students at two separate high 
schools. They had volunteered for a research on 
gifted children conducted at the Tel-Aviv Univer- 
sity. Twenty subjects were allocated on a random 
basis to each of the four cells in the familiar 2 X 2 
design, with the task variable (money intrinsic vs. 
extrinsic) and the payment variable (payment 
Present vs. payment absent) crossed orthogonally. 

Upon arrival at the laboratory, each subject was 
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seated beside a desk opposite an experimenter 
(Daliah Zaksh or Bath-Shevah Margolin). The 
second experimenter was situated in an adjoining 
cubicle with a communication slot. The first experi- 
menter informed the subject that he was about to 
play a novel arithmetic game for gifted children, in 
which guessing was required. 

Manipulating the task variable. Subjects in the 
money intrinsic condition engaged in a “Stock 
Market” game whose essence revolved about mone- 
tary profits. The subject received a chart whose rows 
contained the names of six familiar companies oper- 
ating in Israel, The three columns in the chart 
contained (a) the value (in Israeli IL) of each 
company’s stock prior to a “market transaction,” 
(b) the value following this transaction, and (c) the 
gain or loss in the value of each stock (determined 
by subtraction of the value in column 2 from that 
in column 1). The subject had 200 Agoroth (or 
2.00 IL) at his disposal and could use this capital 
in whole or in part to invest in the stocks of the 
various firms with the purpose of maximizing his 
profits. Following each set of investments the chart 
was transmitted to the second experimenter who per- 
formed the “market transaction” and returned to 
the subject the next chart in which column 1 now 
contained the old stock values (i.e., those in col- 
umn 2 of the previous chart) and column 2 the new 
values, The subject calculated his net gains (there 
were no net losses) and these were handed him 
in cash by the first experimenter. Subsequently, the 
next transaction was performed and so on for the 
total of five transactions. The experimenter who 
performed the “transactions” made the appropriate 
calculations so that each subject ended up with 
2.00 IL in profit.® 

Subjects in the money extrinsic condition engaged 
+ in activities similar to those above, now presented as 

the “Athletics Game” (i.e, one whose essence was 

unrelated to money). The subject acted as a mana- 
ger of an athletics team with 200 points at his dis- 
posal and his objective was to maximize the num- 
ber of points in five subsequent competitions (cor- 
responding to the five market transactions above). 

In parallel to the six companies of the “Stock 

Market” game there were six athletic events which 
. current “values” constituted the average numbers 

of points gained by the team’s athletes in these 

events on the last competition. The subject invested 
points by allocating different numbers of athletes to 
the events listed such that each assignment of an 

athlete to an event constituted the investment of a 

single “value” of that event. Based on the pro- 

cedure employed with the “Stock Market” game, 
each subject earned 200 additional points in the end. 


5 This was accomplished by raising the stock 
values of three companies (selected randomly for 
each market run) by 30%, lowering the stocks of 
the remaining companies by 10%, and providing the 
necessary balance (to the desired 2 IL profit) by 
the manipulation of stock values on the last 
“market run.” 
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TABLE 1 


AVERAGE DEGREES OF INTRINSIC MOTIVATION 
(Experiment 1) 


mt Money Money 

Condition intrinsic | extrinsic 
Payment present 5.44 4.66 
Payment absent 2.25 5.60 


Note. Higher figures denote a greater degree of intrinsic 
motivation on a 7-point scale. 

Manipulating the payment variable, Subjects in 
the money intrinsic-payment present condition were 
advised at the outset that they could keep to them- 
selves any profits beyond the initial 2.00 IL. Within 
the money extrinsic condition the payment present 
subjects were advised at the outset that the research 
management decided to remunerate the participants, 
by paying them 1 Ag. for each point earned beyond 
the initial 200, As a consequence, each subject in the 
payment present conditions ended up with a 2.00 IL 
remuneration. 

Subjects in the money intrinsic-payment absent 
condition were advised that in the end they would 
be requested to return their cash profits to the 
experimenter “for use with subsequent subjects.” 
Within the money extrinsic-payment absent condi- 
tion there was no mention of a remuneration 
throughout the experiment. 

Dependent measures. Upon completion of the 
game, the subject was met by a third experimenter, 
unaware of the subject’s task condition and of the 
experimental hypothesis but cognizant of the sub- 
ject’s payment condition. The subject was presented 
with the choice between (a) continuing with the 
same game, (b) playing another one (“from the 
same general content area”) whose exact nature was 
purposely left unspecified. Within the payment pres- 
ent condition it was especially emphasized that the 
new game offered the same chances of earning 
money as did the old one. Consistent with previous 
operational definitions of intrinsic motivation (e.g., 
in Deci’s works), the subject’s preference for the 
same versus another game was interpreted as an 
index of intrinsic motivation with respect to the 
former, 

In addition, the subject responded to a question- 
naire which included a check on the task manipu- 
lation and items about his degree of interest in and 
enjoyment of the game he had played. The subject 
was then interviewed and given a full explanation of 
the experiment’s purpose. Subjects in the payment 
absent conditions received the 2.00 IL rewards as 
had their payment present counterparts and all sub- 
jects were enjoined to secrecy. 


RESULTS 
Check on Task Manipulation 
Subjects answered the question: 


In the planning of the various games we have been 
deliberating whether they should be played for mere 


748 


TABLE 2 


GAME PREFERENCES AND RATED INTEREST 
(Experiment 2) 


Money Money 
Measure intrinsic | extrinsic 
Proportions of subjects 
preferring the same 
(vs. another) game £ 
Payment present 65 35 
Payment absent 25 -10 
Average rated interest 
Payment present 5.15 4.43 
Payment absent 4.31 5.30 


Note. Higher figures denote a higher rated interest on a 
7-point scale, 


points versus real money. We are aware that it is 
not usual for children’s games to be played for 
money. Still we are asking your honest opinion, 
unaffected by convention, as to the degree to which 
monetary rewards seem to belong with the game 
you just played. 


The answers were registered on a 7-point bi- 
polar scale anchored at the ends with “does 
not belong at all” and “belongs very much.” 
As expected, subjects in the money intrinsic 
conditions rated monetary payments as be- 
longing more with their game than did sub- 
jects in the money extrinsic conditions, F (1, 
76) = 179.92, p < .001 for main effect of the 
game played. 


Intrinsic Motivation 


Game preferences. The proportions of sub- 
jects who preferred to continue with the same 
game versus trying another one are presented 
in Table 2. Consistent with the experimental 
hypothesis, in the money intrinsic condition 
a higher proportion of subjects preferred to 
continue with the same game in the payment 
present versus absent case (z for difference 
between proportions = 2.66, p < .01), where- 
as in the money extrinsic condition a higher 
proportion of subjects preferred to continue 
with the same game in the payment absent 
versus present case (z = 2.33, p < .02). 

Interest ratings. Subjects answered four 
questions inquiring into the perceived attrac- 
tiveness and interest of the game they had 
played: “To what extent did you find the 
game as such (i.e. apart from the experi- 
mental situation) interesting?”; “To what 
extent would you advise to modify the game 
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in some way?”; “To what extent did you 
enjoy the guessing aspect of the gamer”; 
and “To what extent would you recommend 
to include the game in its present form within 
a game manual for gifted children?” In all 
cases the answers were registered on 7-point 
bipolar scales marked appropriately at the 
ends, for example, with “very interesting” 
and “not at all interesting” for the first item 
above, and so on. The patterns of mean dif- 
ferences in the responses to all four items 
were highly similar and the within-cell cor- 
relations between them positive (the rs rang- 
ing from .18 to .62), so they were combined 
algebraically into a single index of intrinsic 
motivation. The means for this measure are 
presented in Table 2. 

An analysis of variance performed on these 
data yielded, as expected, a significant inter- 
action between the task and the payment 
variables, F (1, 76) = 25.18, p < .001. With- 
in the money intrinsic condition the payment 
present subjects exhibited a higher degree of 
intrinsic motivation than did their payment 
absent counterparts, £ (38) = 4.82, p< 01, 
while in the money extrinsic condition the 
payment absent (vs. payment present) sub- 
jects exhibited the higher degree of intrinsic 
motivation, £ (38) = 2.37, p < .05. 


GENERAL Discussion 


The two studies contained in this article 
support the hypothesis that when money is 
intrinsic to a task its presence (vs, absence) 
enhances intrinsic motivation, whereas when 
it is extrinsic to the task its presence lowers 
intrinsic motivation. These results are con- 
sistent with the earlier analysis whereby in- 
trinsic motivation toward a task ensues when- 
ever the task performance may be causally 
attributed to the task’s content and extrinsic 
motivation whenever performance appears to 
have been caused by the task’s exogenous 
consequence. By contrast, the findings of this 
research do not easily jibe with the internal- 
external distinction (between intrinsic and 
extrinsic motivations) proposed by Deci 
(1971, 1972a, 1972b). Specifically, salient 
external rewards (monetary payments) in one 
case (money intrinsic) heightened intrinsic 
motivation toward an activity and lowered it 
in another case (money extrinsic), so they 
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cannot be viewed as a sufficient condition of 
extrinsic motivation. Furthermore, in the two 
task conditions (in both studies of this arti- 
cle) the monetary payments were contingent 
upon the degree of competence displayed by 
the subject with respect to the task. There- 
fore, the finding that in the one case they 
lowered intrinsic motivation, while enhancing 
it in another case argues against the identifi- 
cation of intrinsic motivation with the salient 
satisfaction (through monetary feedback) of 
the subjects’ competence (or self-determina- 
tion) need. 

But besides an attributional rendition of 
the present results they are also open to an 
alternative interpretation in terms of an ex- 
pectancy hypothesis. Specifically, the money 
intrinsic tasks may be associated with the 
expectancy of monetary“ payment while the 
money extrinsic tasks with that of no pay- 
ment. A disconfirmation of expectancies would 
result in an aversive state (cf. Aronson & 
Carlsmith, 1962) generalizable to low degree 
of task enjoyment, hence one would expect a 
pattern of findings exactly as obtained: lower 
intrinsic motivation in the money intrinsic— 
payment absent (vs. present) and the money 
extrinsic-payment present (vs. absent) condi- 
tions, that is, where expectancies were dis- 
confirmed. In the money extrinsic-payment 
absent case, the disconfirmed expectancy 
could further lead to a frustration due to non- 
reward and thus exacerbate the aversiveness 
experienced. 

A funnel-type postexperimental interview 
yielded no signs of discontent or frustration 
by any of the subjects; however, neither ex- 
pectancies nor attributions were measured 
directly in this research, so the expectancy 
hypothesis remains a viable alternative to an 
attributional account of the present results. 

Note that the expectancy interpretation 
does not vitiate the basic finding of this re- 
search that the reception of payments within 
the money intrinsic game enhances intrinsic 
motivation, while in the money extrinsic game 
it suppresses intrinsic motivation, Also, the 
attributional explanation of these results is 
consistent with additional data from the do- 
main of intrinsic motivation not readily 
understood in terms of the expectancy hy- 
pothesis. The present reference is to a recent 
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experimental series by Kruglanski et al. (in 
press) where subjects received monetary 
payments for the performance of dull versus 
interesting tasks not plausibly differentiable 
in terms of prior payment expectancy. As 
expected, the degree of extrinsic motivation 
(e.g., measured by the subjects’ degree of re- 
luctance to donate their pay to charity) was 
reliably lower for the interesting versus the 
dull tasks. In terms of the present analysis, 
an interesting task enhances content attribu- 
tions; therefore extrinsic consequences are 
discounted as possible causes. By contrast, a 
dull task precludes content attributions, 
therefore consequence attributions should fol- 
low more easily. 

Finally, the equation of intrinsic motiva- 
tion with content attributions, consistent with 
the present results, offers a broader concep- 
tion of this motivation than inherent in its 
identification with specific needs (e.g., of 
self-determination, self-enhancement, psycho- 
logical growth, etc.). Specifically, it implies 
that any activity, not just one associated with 
the satisfaction of a given need, may be in- 
trinsically motivated (with the consequences 
of being more enjoyed, persisted at, etc.). 
According to this view, it should be possible, 
for example, to instill intrinsic motivation 
with respect to activities as diverse as eating, 
resting, as well as performance of achieve- 
ment tasks by creating a situation wherein 
the person’s participation in the activity is 
causally attributed by him to the activity’s 
content. 
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Conjugate Lateral Eye 


Stanford 


only one direction, 
indicate that the 
are also preferentially activated within 


A body of neuroanatomical and behavioral 
evidence indicates that the two cerebral hem- 
ispheres of the human brain serve different 
functions. The left hemisphere, in right- 
handed subjects, seems to serve language 
functions, particularly those that require 
logical or analytical thought processes (Bogen, 
1969; Gazzaniga, 1970). The right hemi- 
sphere, in right-handed subjects, seems to 
serve visual-spatial and temporal functions 
(Benton, 1972; Milner, 1968). Left-handers 
seem to be less well lateralized for hemi- 
spheric functioning than right-handers (Hé- 
caen & deAjuriaguerra, 1964). The various 
means of identifying the functions associated 
with each hemisphere include the sodium 
amytal technique (Wada & Rasmussen, 1960), 
studies of brain-injured patients (€.., Ben- 
ton, 1962), and tachistoscopic and dichotic 
listening experiments (e.g., Kimura, 1967). 

Recently Kinsbourne (1972) has described 
still another and seemingly even simpler tech- 
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Movements as an Index 


of Hemispheric Activation 


Raquel E. Gur 


University 


Right handers moved their eyes leftward when solving spatial problems and 
rightward for verbal problems when the questioner sat behind them. When 
facing the questioner, the same subjects moved their eyes predominantly in 
either right or left, 
cerebral hemispheres, 
the same individuals. 


regardless of problem type. The results 
though specialized for problem type, 


nique for studying cerebral lateralization. The 
technique presupposes that the direction of 
orientation of the eyes and head indicates 
innervation or activation of the cerebral hemi- 
sphere contralateral to the direction of move- 
ment. The reasoning is as follows: The frontal 
areas of the brain control the turning of the 
head and eyes. When both centers are equally 
activated, attention is directed straight ahead. 
Because the cerebrum is a highly linked sys- 
tem, an activation of the cognitive function- 
ing of one hemisphere will “overflow” to the 
orientation center of that hemisphere and 
cause eye movement contralateral to that 
hemisphere, or, as Kinsbourne (1972) ex- 
pressed it, it is as though “driving attentional 
balance off center . - 3 (p. 539). 

In support of this formulation, Kinsbourne 
(1972) recorded eye and head movements of 
undergraduate subjects as they responded to 
questions asked by an experimenter seated 
behind them. The recordings were made by 
a hidden videotape camera. Tt was found that 
direction of movement was related to problem 
type: Subjects showed a significant tendency 
to move their eyes to the right for verbal 
problems and either upward or to the left for 
spatial problems. The differences in laterali- 
zation appeared strongest between verbal and 
spatial problems but negligible for numerical 
problems. These differences were found in 
right-handed subjects. In Jeft-handers, eye 
movement lateralization was fairly evenly 
distributed regardless of problem type- 

These findings, however, seem inconsistent 
with results obtained from an independent line 
of research in which the experimenter faces 
the subject during testing for eye direction- 
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ality. In this situation, when the subject 
reflects upon a question asked by the experi- 
menter, the subject usually breaks eye con- 
tact and moves his eyes either to the right or 
to the left. This observation was first re- 
ported by Teitelbaum (1954) and later was 
investigated more thoroughly by Day (1964), 
who found that the direction in which the 
eyes move is fairly consistent for a given indi- 
vidual. Duke (1968) offered further experi- 
mental support for these observations and 
suggested a new typology, “left movers” and 
“right movers.” 

Other empirical studies (¢.g., Bakan, 1969, 
1971; Bakan & Shotland, 1969; Bakan & 
Svorad, 1969; Day, 1967a, 1967b, 1968; Gur 
& Reyher, 1973) suggest that left movers 
are more spontaneous, emotional, and local- 
ize anxiety and conflict internally. Right 
movers, on the other hand, seem to be more 
verbal, intellectual, and see the source of 
their anxiety and conflicts as external to them- 
selves. 

Bakan (1971) tried to explain the differ- 
ences between left movers and right movers 
by viewing eye directionality as an indication 
of the relative dominance of one cerebral 
hemisphere over the other. Thus lateral eye 
movements are symptomatic of easier trigger- 
ing of activity in the hemisphere contralateral 
to the direction of eye movement regardless 
of the content of the problem put to the sub- 
ject. 

Differences in procedure may account for 
the apparent inconsistency between these 
results and those obtained by Kinsbourne 
(1972). The most obvious difference between 
the procedures is that in Kinsbourne’s study 
the experimenter was seated behind the sub- 
ject, and the eye movements were recorded by 
a hidden videotap recorder camera. Kins- 
bourne (1973) suggested that the face-to- 
face findings may be attributable to errors 
caused by the experimenter’s own subcon- 
scious lateral gaze behavior, as well as from 
lack of control of the appearance of the back- 
ground, However, in at least some of the 
experimenter-facing-subject studies (Bakan & 
Shotland, 1969; Gur & Reyher, 1973), both 
such factors have been rigorously controlled. 
Furthermore, it is hard to reconcile the notion 
that these results are artifacts with the vari- 


Raguet E. Gur 


ous personality and behavioral differences 
found to be associated with each movement 
type. It appears that an experimenter’s pres- 
ence before the subject affects the lateraliza- 
tion of underlying cerebral activities in law- 
ful and meaningful ways and that some 
factor in the relationship between the experi- 
menter and the subject may be responsible. 
Thus the difference between the two lines of 
results may indeed be tied to aspects of the 
experimental situation, but contrary to the 
implication in Kinsbourne’s criticism, in a 
lawful and theoretically significant way. Per- 
haps the experimenter-facing-subject situa- 
tion, being obviously interpersonal, is more 
threatening and anxiety provoking, so that 
an anxious subject then falls back on char- 
acteristic modes of response. Thus, when 
questioned, the subject tends to rely on that 
hemisphere which is the more characteristi- 
cally used, even though it might be the 
“wrong” hemisphere for a particular kind of 
problem. When the testing situation is “im- 
personal” (experimenter behind subject), this 
anxiety would tend to be elicited in a smaller 
degree, permitting a hemispheric differentia- 
tion of response related to problem type. 

This hypothesis was tested by directly 
comparing the two procedures and determin- 
ing whether, within individual subjects, di- 
rectionality of eye movements is influenced 
by experimenter location, Specifically, two 
hypotheses were tested in this phase: (a) 
Hypothesis 1; The direction of an individual's 
conjugate lateral eye movements is a func- 
tion of the type of question asked of him 
_when the questioner is behind him (i.e., left 
to spatial, right to verbal). (b) Hypothesis 
2; The direction of an individual’s conjugate 
lateral eye movements is consistently in one 
direction, irrespective of type of question, 
when the questioner is facing him. 


METHOD 
Subjects 
Thirty-two right-handed male undergraduates from 


an introductory psychology class served as subjects 
in this experiment. Their ages ranged from 17 to 25 


(=19.3). 
Materials 


Following Kinsbourne 
questions were used: 


(1972), 
“verbal” 


three types of 


(explanation of 
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proverbs), “numerical” (solution of arithmetical 
problems), and “spatial” (visualization and identifi- 
cation of spatial relationships of familiar places and 
visual arrangements). There were 20 pairs of parallel 
questions for each type, constituting a pool of 120 
questions. From this pool of questions four test 
forms of 30 questions each were constructed. Each 


test form contained 10 verbal, 10 numerical, and 10, 


spatial questions. To assure a randomized, counter- 
balanced order of presentation, each pair of equiva- 
lent questions, within a given type, was randomly 
assigned to one of two subsets. This produced six 
subsets: two verbal, two numerical, and two spatial, 
each containing 20 questions. Each subset was then 
divided randomly into two sections of 10 questions 
each, These sections were combined into the four 
test forms in a random order with the restriction 
that each form contained one verbal, one numerical, 
and one spatial section. Finally, the three types of 
questions within each form were arranged in triads 
—verbal, numerical, and spatial—and the order of 
presentation within each triad was randomized. Each 
subject received 60 questions in each session (two 
different forms). The forms were administered in a 
counterbalanced design. 

A videotape recorder was hidden behind a dark 
cloth which had a small hole in it to enable record- 
ing of the subject’s eye movements. 

Humphrey’s (1951) 20-item questionnaire, which 
assesses hand usage for a variety of tasks, was used 
to determine handedness. (The questionnaire was 
slightly modified from the version used in England 
in order to adapt it to the American culture.) 


Procedure 


Each subject first answered the Humphrey (1951) 
handedness questionnaire, and only subjects who 
reported using their right hand for at least 15 of 
20 tasks were retained. Four of 36 subjects initially 
* tested did not meet this criterion, leaving a final n 
of 32. 

The subjects were tested individually in a 2.5 X 2 
m portable experimental room placed inside the 
laboratory. The walls of the room were 2 m high 
and were completely covered with brown cloth to 
provide a perfectly homogeneous surrounding. 

Two experimental sessions, 1 week apart, were pre- 
sented in a counterbalanced order, as follows. 


1. Experimenter-behind-subject (Kinsbourne, 1972) ; 
The subject was seated at a small table 1.2 m 
from the wall, and the experimenter sat approxi- 
mately .8 m behind him. The subject was told to 
concentrate on answering a scries of questions within 
30 sec (following Kinsbourne, 1972), to remain 
- seated, and to not look behind him. Two test forms 
were read to him by the experimenter, who also 
recorded his answers. Each trial was concluded either 
when the subject answered or after 30 sec. The 
experimenter made sure that the subject was looking 
straight ahead at the start of each trial. Eye direc- 
tionality was recorded by the videotape recorder 
hidden behind the wall and scored from a monitor 
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by two independent judges, seated in a separate 
room. The judges were instructed and trained to 
score the first lateral eye movement following the 
presentation of each question. 

2. Experimenter facing subject (Bakan, 1971) ; The 
subject sat in the same position as in the experi- 
menter-behind-subject situation, but now the ex- 
perimenter sat directly across from him at a dis- 
tance of .8 m. The instructions were the same as in 
the experimenter-behind-subject situation. The ex- 
perimenter presented the two alternate test forms 
and recorded both the answers and eye direction- 
ality. Special care was taken in training the experi- 
menters to be efficient and accurate in recording the 
eye movements. 


The experimenters in both testing situations were 
20-year-old women. Testing time in each session was 
approximately 45 minutes. 


RESULTS 


The two independent judges who scored 
the eye directionality in the experimenter- 
behind-subject situation agreed in 89% of 
the movements. In 8% of the cases, one judge 
saw a movement when the other did not. In 
the remaining 3%, the judges opposed each 
other in scoring the direction of movements. 
A third judge decided in both cases of dis- 
agreement. 

Since each subject went through both situ- 
ations with the order of the situation counter- 
balanced across subjects, the data were first 
examined for a possible order effect on eye 
directionality. No such effect was found: 
54.8% of the movements in the first session 
were to the left compared with 52.1% move- 
ments to the left in the second session (¢ < 
1). 

In order to test the experimental hypothe- 
ses, arc sine transformations were applied to 
individual proportions of eye movements to 
the left for each subject in each experimental 
condition on the verbal, spatial, and numeri- 
cal problems. The actual proportions and the 
results of the analysis of variance performed 
on the are sine transformations are presented 
in Table 1. 

As seen in Table 1, verbal, spatial, and 
numerical problems produced significantly 
different proportions of eye movements to 
the left in the experimenter-behind-subject 
situation, with verbal problems producing 
more eye movements to the right, spatial 
problems producing more eye movements to 
the left, and numerical problems producing 
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TABLE 1 


PERCENTAGE OF EYE MOVEMENTS TO THE LEFT 
FOR THE THREE TYPES OF QUESTIONS UNDER 
THE Two EXPERIMENTAL CONDITIONS 


Type of question 


Experimental 
pane ie 
Verbal | Spatial | Nts) 
Experimenter f 
behind subject | 34.84 | 67.81 54.25 | 29.48* 


Experimenter a 
facing subject | 55.72 | 58.91 | 53.53 |<1 


adf = 2, 62. 
*p <.01, All differences among means are significant at 


the .01 level. 


about equal proportions of movements in each 
direction. These findings are consistent with 
those of Kinsbourne (1972). In the experi- 
menter-facing-subject condition, by contrast, 
directionality of eye movements did not de- 
pend on problem type. This finding is con- 
sistent with Bakan’s (1971) line of results. 
Both experimental hypotheses were thus sup- 
ported. 

The results are more clearly illustrated 
in the analysis of the performance of individ- 
ual subjects in the verbal and spatial prob- 
lems. The numerical problems were not in- 
cluded in this analysis, since there were no 
bases, from Kinsbourne’s study (1972), for 
predicting whether the solution of numerical 
problems would be subserved more by one 
hemisphere than by another. In other words, 
given a subject who consistently moved his 
eyes in one direction for each of the 20 nu- 
merical questions, there are no theoretical 
grounds for deciding whether the subject’s 
movements reflected characteristic usage of 
one hemisphere, in Bakan’s sense, or were 
problem based, in Kinsbourne’s sense. The 
subjects were thus classified into two cate- 
gories in the following way. 


1. Unidirectional subjects: A significant 
majority (70%) of a subject’s eye move- 
ments were to one direction. The 70% level, 
also used by Bakan (e.g., 1969, 1971), is 
high enough to be expected by chances at only 
the .01 level, where # equals 40 (20 verbal 
questions and 20 spatial questions). The uni- 
directional subjects were subclassified into 
(a) left movers, when the majority of eye 
movements were to the left, and (b) right 


movers, when the majorit, 
movements were to the right 

2. Bidirectional subjects: | 
of the eye movements were to irec 
The bidirectional subjects were fu: er i 
fied in the following manner: Foc the 20 
verbal questions and the 20 spatial q) ‘ions, 
if the larger proportion of movements ne 
direction in the verbal was the smaller | 
portion of movements to that direction in i 
spatial, the difference between the proportion 
of movements to the right and to the left in 
the verbal set was added to that difference in 
the spatial set. If the sum was at least 70, th 
subject was judged to be a “discriminator’ 
(with a confidence level of p< .01), Sub- 
jects whose summed differences in propor- 
tions were less than 70 were called “nondis- 
criminators.” The bidirectional subjects were 
thus subclassified into two categories. (a) 
Discriminators were subjects who moved their 
eye differentially according to the problem 
type, that is, either left to spatial and right 
to verbal or the reverse (against the theo- 
retically expected direction), right to spatial 
and left to verbal problems. (b) Nondiscrimi- 
nators were subjects whose eye movements 
were inconsistently related to problem type 
and appeared haphazard. 

The results of these classifications are sum- 
marized in Table 2. The difference between 
the two experimental conditions was signifi- 
cant by a chi-square test (McNemar, 1969) 
comparing changes in expected versus no 
change or changes in expected direction, ver- 
sus no change (or changes) in an opposite 
direction, x? (1) = 17.19, p < .001. Note that 
with the experimenter-behind-subject pro- 
cedure, in line with Kinsbourne’s (1972) find- 
ings, the great majority of subjects (88%) 
were bidirectional, most of them discrimi- 
nators. With the experimenter-facing-subject 
procedure, in contrast, the majority of sub- 
jects (72%) were now classified as unidirec- 
tional, either left movers or right movers. 
These results are consistent with other find- 
ings using this procedure (Bakan, 1969, 
1971; Bakan & Shotland, 1969; Bakan & 
Svorad, 1969; Gur & Reyher, 1973). 

Table 2 also shows that a small number of 
subjects remained either unidirectional or bi- 
directional in both conditions. The consist- 
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TABLE 2 


NUMBERS AND PERCENTAGES OF UNiDIRECTIONALS (Lert Movers AND Ricut Movers) 
AND BipIRECTIONALS (DISCRIMINATORS AND NONDISCRIMINATORS) 
IN THE Two EXPERIMENTAL CONDITIONS 


Experimenter behind 


Experimenter facing subject 


subject ks % = 
Left movers Right movers Discriminators | Nondiscriminators 
Left movers 2 6.3 2 
Right movers 2 6.3 1 1 
Discriminators 18» 56.3 8 6 2 2 
Nondiscriminators 10 31.3 4 2 4 
% 100 43.8 28.1 6.3 21.9 


® One reversed discriminator, 


ently unidirectional subjects did not change 
direction (i.e., left movers remained left 
movers; right movers remained right movers). 
They therefore appear to have shown an ex- 
tremely strong preference for use of one hemi- 
sphere, whether the left or the right, across 
experimental conditions. The consistently bi- 
directional subjects, most of whom were non- 
discriminators, appear to show little evidence 
of lateralization. Both groups of subjects rep- 
resent potentially interesting populations for 
further study. 

When these same analyses were repeated 
with the numerical questions included, the 
same distributions of unidirectional and bi- 
directional subjects emerged. In other words, 
in both experimental situations subjects who 
were either left or right movers on the verbal 
and spatial questions showed the same pref- 
erence, and in approximately the same pro- 
portion, in the numerical questions. Of the 
subjects classified as bidirectionals, whether 
discriminators or nondiscriminators, eye move- 
ment direction in the numerical questions was 
haphazard. These results, like those of Kins- 
bourne (1972), therefore suggest that neither 
cerebral hemisphere predominantly subserves 
the solution of numerical problems. 

In light of these findings and data pertain- 
ing to the functional asymmetry of the brain, 
does direction of eye movements in response 
to a certain question type affect the correct- 
ness or quality of response? Since a move- 
ment of the eyes to the left presumably 
indicates activation of the right cerebral hemi- 
sphere, does a movement to the left in re- 
sponse to a spatial question result in a more 
accurate answer than a movement to the 


right for that question? Conversely, in view 
of left hemisphere specialization for verbal 
material, does a movement to the right, in 
response to a verbal question, result in a bet- 
ter answer than a movement to the left? 

A comparison was made, within individual 
subjects, between accuracy of response for 
those verbal and spatial questions to which 
the direction of eye movements was appro- 
priate and accuracy of response to those ques- 
tions to which the direction was inappropri- 
ate. Only those subjects were included who 
made at least two movements in each direc- 
tion within a set of 20 questions. The results 
indicate a tendency for movements to the 
right to be associated with better verbal re- 
sponses and for movements to the left to be 
associated with better spatial responses. This 
tendency, however, was significant only for 
the left movements in response to spatial 
questions in the experimenter-facing-subject 
situation, £ (18) = 2.67, p < .01, one-tailed, 
and marginally significant for movements to 
the right in the experimenter-behind-subject 
situation, £ (22) = 1.68, .05 < p < .06, one- 
tailed. 


Discussion 


The present investigation corroborated, us- 
ing a within-subjects design, both Kins- 
bourne’s (1972) and Bakan’s (1969, 1971) 
findings. Thus, the difference in procedure 
seems to account for the discrepancy in the 
results obtained by these two lines of research. 
The direction of eye movements in response 
to questions appears, therefore, to be deter- 
mined by at least two factors: (a) problem 
type and (b) characteristic use of a certain 
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hemisphere by a given individual. The influ- 
ence of the first factor seems to be maximized 
when the experimenter is seated behind the 
subject, whereas the influence of the second 
factor is maximized when the experimenter 
faces the subject. 

At present one can only speculate as to the 
mechanisms involved in producing the rather 
drastic change, from one situation to another, 
in the influence of each of these factors on 
the subject’s response. Perhaps in the experi- 
menter-behind-subject situation the subject is 
faced only with the problems presented to 
him. The task of answering the problems is 
therefore more salient than in the experi- 
menter-facing-subject situation, and the dif- 
ferent problem types may have a stronger 
“pull” on the subject’s response. The experi- 
menter-facing-subject situation confronts the 
subject with a new set of stimuli to respond 
to and thereby decreases the saliency of the 
problem type. By being more interpersonal, 
it also may make the subject more anxious in 
terms of physiological level of arousal. Anx- 
iety, according to Sullivan (1953), is closely 
tied-up with interpersonal relationships and 
tends to stand in the way of an appropri- 
ate problem-solving or need-related behavior 
- (Sullivan, 1953, p. 44). 

This anxiety, along with the decrease in 
problem-type saliency, may cause the subject 
to use his preferred hemisphere, either left or 
right, possibly depending on personality varia- 
bles, even when this is not the most “efficient” 
hemisphere for dealing with a particular prob- 
lem. 

The present results also have implications 
for the study of gaze aversions as a form of 
nonverbal communication (e.g., Efran, 1968; 
Exline, Gray, & Schvette, 1965; Libby, 1970). 
Recently, Libby and Yaklevich (1973) used 
an experimenter-facing-subject procedure and 
found that most of their subjects looked to the 
left and that this tendency correlated with 
high “need abasement.” They explained this 
finding by the presence of the door on the 
subjects’ left and concluded that the direction 
of gaze aversion reflected the nonverbal com- 
munication of the subjects’ wish to leave the 
room. They further concluded that their re- 
sults “obviously favor a situationally deter- 
mined interpretation of the data over the 


hypothesis that looking direction is deter- 
mined by cerebral dominance.” (p. 203) This 
conclusion is unwarranted in light of the 
present investigation, since a high consistency 
in eye directionality was obtained even 
though the experimental room was completely 
symmetrical. What might be suggested, in- 
stead, is that situational variables interact 
with variables related to cerebral activity in 
producing gaze aversions as well as in deter- 
mining their direction, The relative impor- 
tance of each of these factors can only be 
established by future research that should 
extend sampling to include females, left hand- 
ers, and subjects from a larger variety of age 
and socioeconomic backgrounds. 
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Subjects read stories containing pairwise sentiment relationships among four 
people. The stories were either complete (all six relationships stated) or incom- 
plete (only four relationships stated). When complete they were either bal- 
anced or imbalanced; when incomplete they were either balanceable or non- 
balanceable. After reading two stories, subjects attempted to recall whether 
each relationship had been presented, and if so which sentiment was involved. 
Generally, subjects recalled balanced stories better. However, subjects some- 
times recalled pairwise sentiments not stated in the (incomplete) stories. For 
balanceable stories those misrecalls balanced triads containing them; for non- 
balanceable stories they imbalanced triads containing them. The data supported 
an encoding model for the storage of social information rather than a model 


positing a generalized drift toward balance. 


This article intends to investigate the man- 
ner in which social information is stored in 
and retrieved from memory. In particular, we 
examine memory for social information de- 
fined by the “balance” principle. Constructive 
encoding models are compared with models 
positing a general drift (in memory) toward 
balance. 

Heider (1946, 1958) formulated the prin- 
ciple of structural balance. According to bal- 
ance theory, a particular system of relations 
may be in one of two states. A balanced state 
is one in which the relations “fit together 
harmoniously; there is no stress toward 
change” (Heider, 1958, p. 201). An imbal- 
anced state, on the other hand, is assumed 
to be disharmonious and unstable. For exam- 
ple, the sentiments among three people (a 
triad) may be defined by three interpersonal 
relations. Triads containing three positive 
relations, or one positive and two negative 
relations, are defined as balanced, Triads con- 
taining one negative and two positive rela- 
tions, or three negative relations, are defined 
as imbalanced (although the last case is 
considered to be somewhat ambiguous). For 
groups larger than three, a totally balanced 
structure results when all the triads are bal- 


anced. 
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Heider posited a “force” or tendency to 
change imbalanced structures into balanced 
ones. Therefore, people should tend to mis- 
recall certain relations so as to convert a pre- 
sented imbalanced structure into a recalled 
balanced structure. Consequently, balanced 
situations should be more accurately recalled 
than imbalanced situations. Note that the 
prediction of a generalized change in the di- 
rection of balance does not specify where in 
the total memory process the alterations might 
be occurring. 

Recently, there has been a reorientation of 
views about balance theory (cf. Bear, Note 1; 
Cottrell, Note 2; Crockett, Note 3). Rather 
than a force leading to change, it has been 
depicted as a principle which deals with the 
cognitive organization of social structures. In 
this context, the balance principle is viewed 
as a rule used in processing, storing, retriev- 
ing, and decision making about social infor- 
mation. Several models of this type may be 
specified. For example, a constructive encod- 
ing notion would propose that individuals 
build up a cognitive structure while storing 
the information. Social relationships would be 
incorporated into the existing structure as 
they were encountered. Other views might 
hold that better recall of balanced structures 
could be due to a change in the memory trace 
between the storage and retrieval of the in- 
formation, or to recall errors selectively in- 
troduced during the course of retrieval. There 
might also be biases introduced after re- 
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trieval and during decision making. The pres- 
ent study was designed to discover some basic 
facts about recall of social structures and to 
take some initial steps in distinguishing among 
these various possibilities. 

A second proposition by Heider is most 
important for the present research. It asserts 
the principle of “induction of relations and 
equifinality,” which states that given an in- 
complete structure (i.e. one in which only 
some of the possible relations exist), there is 
a tendency to induce new relations so as to 
produce balance. The present study tests this 
proposition directly. 

The three methods most commonly used to 
examine balance theory are the following: 
pleasantness-tension, prediction, and paired- 
associate learning. In the first procedure, 
subjects are presented balanced and imbal- 
anced structures and are asked to rate each 
for its pleasantness or unpleasantness (e.g., 
Jordan, 1953; Morrissette, 1958; Price, Har- 
burg, & Newcomb, 1966). This method is 
derived from Heider’s implication that im- 
balanced states produce tension and conse- 
quently should be rated less pleasant than 
balanced structures. While the results gen- 
erally support these predictions, discrepancies 
have been observed. Jordan (1966) has noted 
that the pleasantness-tension method may be 
irrelevant for studying balance theory. He 
argued that balance theory should be con- 
cerned with what is structurally required and 
appropriate rather than with what is posi- 
tively valent and pleasant. Crockett (Note 3) 
has made a similar point by indicating that 
even though a balanced state is assumed to 
be one in which the relations fit together har- 
moniously, “harmoniousness” does not neces- 
sarily translate into “pleasantness.” 

In the prediction method, subjects are pre- 
sented with incomplete structures and they 
are required to predict the unspecified rela- 
tion (eg. De Soto & Kuethe, 1959; Mor- 
rissette, 1958). According to Heider’s principle 
of induction of relations and equifinality, sub- 
jects should tend to predict relations so as to 
complete the structure in a balanced way. 
This prediction is generally supported by the 
data. However, these two methods are obvi- 
ously inadequate for studying the storage and 
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structure of social information since no coding 
or memory is required. 

The third method stems from Heider’s 
(1958) statement that balanced situations 
should be better remembered. Most studies 
(e.g., Rubin & Zajonc, 1969; Zajonc & Burn- 
stein, 1965a, 1965b; Zajonc & Sherman, 
1967) have used the anticipation method of 
the paired-associates learning procedure. 
Structures are broken down into pairs of 
people (the stimuli) and the relationships 
between them (the responses). Usually this 
method has not supported the predictions of 
balance theory. In any event, the method 
suffers from several shortcomings. First, sub- 
jects usually do not receive all the pairs from 
each structure separately and thus may not 
be able to organize the relationships into their 
discrete structures (cf. Press, Crockett, & 
Rosenkrantz, 1969; Crockett, Note 3). Sec- 
ond, subjects in studies using the paired- 
associate procedure are frequently asked to 
learn a great deal of information and may be 
under extreme cognitive overload. In such a 
case, principles such as balance may not be 
used to structure the relationships. 

Other procedures have also been used to 
investigate balance. Cottrell (Note 2), for 
example, studied balance as a conceptual unit 
by employing a “reception paradigm.” On 
each learning trial a total of 16 different so- 
cial structures were individually presented. 
Subjects attempted to classify each structure 
into one of two available categories according 
to a rule known to the experimenter but not 
told to subjects. After each classification re- 
sponse, the subject was given immediate feed- 
back and the trials were continued until the 
subject reached a criterion of consecutive 
correct responding. Cottrell tested several 
different classification rules, one of which 
was the balance rule. He hypothesized that 
since, according to Heider, people frequently 
classify social situations according to the 
balance principle, it is a conceptual rule that 
should be highly available for use. In gen- 
eral, the results indicated that rules involving 
a smaller number of relevant dimensions 
were learned faster. Further, when compared 
to other rules involving the same number of 
dimensions the balance rule was found easier 
to use. This latter relationship seems to be 
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reversed for nonsocial stimuli. Shepard, Hoy- 
land, and Jenkins (1961), for example, found 
that a three-dimensional rule corresponding 
to the balance type was more difficult to use 
when the stimuli were nonsocial than were 
other rules involving the same number of 
relevant dimensions, 

The memory approach used in the present 
study was based on other research which has 
provided valuable insights into schemas in 
memory. Bransford and Franks (1971) pre- 
sented sets of sentences to subjects with each 
set expressing some single complex idea. A 
memory procedure was used in which test 
sentences were presented and subjects were to 
indicate whether each sentence was identical 
to one of those given during acquisition. 
Bransford and Franks found that as long as 
the test sentence presented for recognition did 
not contradict the semantic relations of the 
original sentences, subjects said the test sen- 
tence was identical to one of those previously 
presented. They interpreted these results to 
indicate that subjects tend to integrate re- 
lated semantic components and store the 
resulting complex idea (although they do not 
provide an adequate description of exactly 
what it is that is stored). This evidence along 
with the findings of Bransford, Barclay, and 
Franks (1972) suggested that subjects impose 
schemas representing some “cognitive knowl- 
edge of the world” upon the input and then 
store the result. 

In an attempt to specify more adequately 
what is stored in this type of situation, Potts 
(1972) investigated the cognitive organization 
of logical relationships by employing relations 
such as “older than” and “better liked than.” 
Subjects were given information sufficient to 
imply a linear ordering of the form A > B > 
C > D. Such an ordering can be broken down 
into six pairs of terms, three of which are 
necessary to establish the ordering (the “ad- 
jacent” pairs: A > B, B > C, C > D) while 
the other three are redundant in that they 
can be deduced from the adjacent pairs. Sub- 
jects were given either the three adjacent 
pairs or all six pairs and then given a recog- 
nition test for all six pairs. Potts found that 
the relations were learned by arranging the 
terms serially and then storing this arrange- 
ment, regardless of whether only the three 


J. Picex, S. SHERMAN, AND R. SHIFFRIN 


adjacent pairs, or all six pairs, had been pre- 
sented. For example, subjects best remem- 
bered A > D, even when A > D had not been 
presented, but had to be inferred from the 
pairs A > B, B >C, and C > D. In other 
words, subjects seemed to build up in mem- 
ory a conceptual structure independent of the 
manner in which information was presented. 
This conclusion is consistent with those of 
Bransford and Franks (1971) and Bransford 
et al. (1972). Potts has thus shown that logi- 
cal linear orderings can serve as an organiz- 
ing principle behind schema in human mem- 
ory. In this article we attempt to see whether 
the balance principle can act similarly as a 
fundamental principle underlying the storage 
of social structures in memory. 

The present study used a memory pro- 
cedure similar to the one used by Potts. Sub- 
jects were presented with two four-person 
social structures made up of sentiment rela- 
tions. Each structure was presented in story 
context. A complete structure consisted of the 
presentation of all six possible sentiment rela- 
tions among the four people. The complete 
structure was either balanced or imbalanced. 
Incomplete structures were created by elimi- 
nating two of the relationships from one of the 
corresponding complete structures and were 
either potentially balanceable or nonbalance- 
able. Following the study periods, subjects 
were given a recall task. 

This procedure allows certain predictions 
to be made. First, if balanced situations are 
easier to recall, as Heider suggested, subjects 
should make fewer errors during the recall of 
those relationships than ones from the imbal- 
anced situation. On the other hand, Heider’s 
view seems to suggest that every relationship 
in an imbalanced structure should be affected 
and therefore remembered more poorly. A 
constructive encoding model could predict 
that recall of the first few relationships pre- 
sented (i.e., before any imbalancing informa- 
tion is received) is at a different level than 
the relationships presented after the imbalanc- 
ing information. 

Second, consider the cases when subjects 
mistakenly complete (in recall) the incom- 
plete structures. If we assume Heider’s princi- 
ple of induction of relations and equifinality 
and his statement that all systems tend 
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toward balance no matter what their starting 
state, then subjects should tend to complete 
both the balanceable and the nonbalanceable 
structures in a balanced way. For the non- 
balanceable structure this should take the 
form of balancing the incomplete triads when 
possible. A different prediction arises from 
memory models for balance. Cottrell (Note 
2) has shown that individuals can classify 
social situations as being balanced or imbal- 
anced. Since the only complete triad in the 
nonbalanceable structure is already imbal- 
anced, subjects may encode this structure as 
being a type of “imbalanced” situation or one 
which is not socially logical. At the time of 
retrieval they may recall this fact and use it 
to bias their responses (i.e., they may try to 
generate triads which are imbalanced). Thus, 
in opposition to the notion of a generalized 
tendency toward balance in storage and re- 
trieval which predicts subjects will always 
tend to balance incomplete triads, this view 
of balance theory suggests that subjects tend 
to imbalance the incomplete triads from non- 
balanceable structures. 


METHOD 
Subjects 


A total of 224 introductory psychology students 
participated in the experiment to satisfy one of the 
3 hours of suggested experimental participation. 
Eight subjects were eliminated from the data analy- 
sis for failure to follow instructions correctly, leav- 
ing 216 subjects. One hundred forty-four subjects 
took part in Experiment 1 and 72 subjects in Ex- 
periment 2 about a month later. 


Materials 


Four different four-person social structures were 
used. Their general form appears in Figure 1. Each 
is based on reciprocated sentiment relations (i.e., 
“like each other” or “dislike each other”). Two of 
the structures are complete with all six of the rela- 
tionships present. One of these structures is balanced 
and the other imbalanced. Both contain three like 
and three dislike relationships. The remaining two 
structures are incomplete. These were created by 
eliminating one like and one dislike relationship 
from each complete structure, with the same two 
relationships eliminated in both cases. The balance- 
able incomplete structure can therefore be completed 
in a balanced way by supplying the appropriate 
missing relationships. The nonbalanceable structure, 
on the other hand, cannot be balanced by completion 
because the B-C-D triad is already imbalanced (see 


Figure 1). 
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Ficure 1. Four person social structures used in 
the recognition memory task. (Solid lines indicate 
like relations; dashed lines indicate dislike relations.) 


Two interesting and somewhat humorous stories 
of approximately 350 words in length were con- 
structed so as to provide a context in which to 
present the structures. Each story described a situ- 
ation which required some type of interaction 
among the four people. One of the stories involved 
male astronauts and the other female jurors. Short, 
distinctive descriptions of each person were pre- 
sented within the story setting. These descriptions 
were arranged in the following order of presentation: 
D, C, B, A (see Figure 1). In order to make the 
characters distinguishable, the name of each person 
reflected his or her description in some way (e.g., 
“Mrs. Tart” was a baker), Each person’s relation- 
ship to the other people previously described in the 
story was presented following that person’s descrip- 
tion (except in the case of incomplete structures 
where two of the six relationships were left un- 
stated). The relationships were presented so as to 
alternate between like and dislike relations for all 
structures. Alternation of sign (to guard against 
biases such as positivity) necessitated slightly differ- 
ent orders of presentation for the relationships from 
different structures. For the balanced and balance- 
able structures the order of presentation was the 
following: C-D, B-D, B-C, (A-B), (A-C), and A-D. 
For the imbalanced and nonbalanceable structures 
the order was the following: C-D, B-C, B-D, (A-C), 
(A-B), A-D. 


Procedure 


Subjects participated in the experiment in six 
large groups of approximately 40 each. Booklets 
were constructed and distributed to subjects. Each 
booklet included both types of story formats. 


1The stories can be obtained from the authors 
upon request. 
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TABLE 1 


PROPORTION OF CORRECT RESPONSES FOR TYPE 
or CoMPLETE STRUCTURE, ORDER OF 
PRESENTATION, AND THE 
TYPE OF RELATION 
(Experiment 1) 


Balanced 
Order of presentation 
Relation 
First Second M 
(n = 36) n = 36) 2 
Like -750 -843 -196 
Dislike -898 -852 875 
M 824 847 -836 
Imbalanced 
Like -796 -676 -736 
Dislike «7187 -685 -736 
M -792 681 -736 
Combined 
Like 773 759 -766 
Dislike 843 -769 -806 
M -808 -764 -786 


Experiment 1. One story contained a complete 
structure, while the other story contained an incom- 
plete structure. One-half of the subjects received one 
balanced and one nonbalanceable structure, while the 
remaining one-half received one imbalanced and one 
balanceable structure (n = 72 in each). The order of 
presentation of the structures and the story format in 
which each appeared were both counterbalanced. 

The initial instructions to subjects stated that they 
would be given 8 minutes per story in order to read 
and try to remember as much as possible about 
them. They were not told to focus especially upon 
the social information. After the presentation of each 
story, subjects solved arithmetic problems for 2 min- 
utes to eliminate the possibility of retrieval from 
short-term memory. After the second set of problems, 
the final instructions were presented and a set of test 
questions was given. Pairs of names were presented, 
and each subject was to indicate either that the 
story stated the two people liked each other, disliked 
cach other, or that nothing was stated about their 
relationship, Relationships from the first of the two 
stories that each subject read were always presented 
first in the recognition task. All the pairs were printed 
on one page, and the subject could answer in any 
order he saw fit.? 

Experiment 2. The procedure for the second ex- 
periment was identical to Experiment 1 in all re- 
spects except that each booklet contained two incom- 


2 The actual order of listing of the test pairs was: 
C-D, B-D, B-C, A-D, A-C, A-B (see Figure 1). 
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plete stories, one balanceable, and one nonbalance- 
able. Presentation order, story format, and other de- 
tails were counterbalanced as in Experiment 1. 


RESULTS 
Proportion Correct 


Experiment 1. Analyses of variance showed 
story format to have no main effects and no 
interactions; hence all subsequent data and 
analyses are lumped over this factor. Table 1 
gives the probability of correct recall for the 
complete structures. A correct response was 
defined as responding like for a like relation or 
dislike for a dislike relation. Table 2 gives 
similar data for the incomplete structures. In 
this case, a correct response was defined as 
responding like for a like relation, dislike for 
a dislike relation, or nothing stated for a rela- 
tion not presented. 

Since all subjects received one type of com- 
plete and one type of incomplete structure, 
interactions which involved the type of struc- 
ture would be difficult to interpret. Conse- 
quently, the data from the complete and in- 
complete structures were analyzed separately, 


TABLE 2 


Proportion OF CORRECT RESPONSES FOR TYPE 
OF ĪNCOMPLETE STRUCTURE, ORDER OF 
PRESENTATOIN, AND THE 
TYPE or RELATION 
(Experiment 1) 


Balanceable 


Order of presentation 


Relation 
First Second M 
(n = 36) (n = 36) 
Like 694 -806 «750 
Dislike 778 847 +813 
Nothing stated ALT 833 625 
M -630 -828 -129 
Nonbalanceable 
Like -136 625 681 
Dislike -136 -806 771 
Nothing stated 639 819 729 
M -104 -750 227 
Combined 
Like 715 715 715 
Dislike 757 826 792 
Nothing stated 528 826 677 
M -667 «789 -128 
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However, a ¢ test was performed to determine, 
in general, whether complete or incomplete 
structures were remembered better. This test 
indicated that subjects were more accurate in 
recalling the complete than the incomplete 
structures—proportions correct were .786 and 
.728, respectively; £ (143) = 2.61, p < .05.° 

Table 1 shows that relationships from the 
balanced structure were recalled better than 
those from the imbalanced structure, This dif- 
ference was significant as revealed by a 2 X 2 
Xx 2 analysis of variance—Type of Structure 
X Order of Structure Presentation X Type of 
Sentiment Relation; F(1, 140) = 6.75, p < 
.05. None of the other main effects nor any 
of the interactions were significant. 

A 2 X 2 X 3 analysis of variance (Type of 
Structure X Order of Structure Presentation 
Type of Relation) was performed on the 
data from incomplete structures (Table 2). 
Two of the main effects were significant. First, 
the type of relationship to be remembered 
affected recall accuracy. Dislike relationships 
were most accurately remembered, while un- 
stated relationships were least accurately re- 
called, F (2, 280) = 6.54, p < .01. Second, re- 
lationships which were presented in the second 
story were remembered better than those pre- 
sented in the first story, F(1, 140) = 10.41, 
p < .01. There was no overall effect of struc- 
ture type (F <1). 

In Table 2, the main effects were largely 
due to two factors: (a) poorer performance 
on relationships not presented for the balance- 
able than for the nonbalanceable structure 
when presented first in order; and (b) poorer 
performance on the like relation for the non- 
balanceable than for the balanceable structure 
when presented second in order. These factors 
also contribute to the interactions. There was 
a significant Structure X Relation interaction, 
F(2, 280) = 4.17, p < .05. When the struc- 
ture was balanceable, the like and dislike re- 
lations were remembered correctly more often 
than when the structure was nonbalanceable. 


3Given that a subject correctly recalled a struc- 
ture as either being complete or incomplete, the 
probability of guessing the correct relationship is 4 
for the complete structure and } for the incomplete 
structure. Therefore, better recall of the complete 
structures could reflect better memory for the rela- 
tionships or simply a higher guessing probability. 
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TABLE 3 


PROPORTION OF CORRECT RESPONSES FOR TYPE 
OF STRUCTURE AND TYPE OF RELATION 
(Experiment 2) 


Structure 
Relation ~ 
Balanceable | Nonbalanceable M 
Like -806 «143 174 
Dislike 819 -646 «733 
Nothing stated -729 688 -708 
M 785 692 «138 


On the other hand, more errors. were made 
for the balanceable structure on the relation- 
ships which were not presented. However, the 
effect was fairly small: Three planned com- 
parisons showed that the differences between 
the balanceable and nonbalanceable structures 
for the like, F(1, 142) = 1.74, dislike (F < 
1), or nothing stated, F(1, 142) = 2.72, p< 
.11, relations failed to reach significance when 
considered separately. 

Two other interactions, both involving or- 
der, were also significant. First, the significant 
Order X Relation interaction, F(2, 280) = 
11.72, p < .01, reflects the fact that the like 
relation was remembered equally well in the 
first and second structure, the dislike relation 
was remembered somewhat better when it ap- 
peared in the second structure; while the re- 
lationships not presented were remembered 
much better when they appeard in the second 
structure presented. Second, the Order x 
Structure interaction was significant, F(1, 
140) = 4.03, p < .05. Subjects made many 
more errors in recalling the balanceable struc- 
ture when it was presented first rather than 
second (due largely to the nothing stated 
category). On the other hand, there was only 
a slight decrement for the nonbalanceable 
structure when it appeared first in order. The 
three-way interaction was not significant, F (2, 
280) = 2.11, p > .10. 

Experiment 2. Story format and order had 
little effect (£ < 1) and these factors were 
not considered further in this analysis. Table 3 
gives the probability of correct recall for the 
two incomplete structures and the three types 
of relations. A 2 X 3 analysis of variance 
(Type of Structure X Type of Relation) was 
performed on the data presented in Table 3, 
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Ficure 2. Accuracy of recall as a function of rela- 
tionship presentation order. (Arrows indicate the 
point at which imbalancing information was first 
presented, for the imbalanced and nonbalanceable 
structures.) 


Subjects, in general, made fewer errors on the 
balanceable structure, F(1, 71) = 8.58, p< 
.01. In addition, there was a significant Struc- 
ture X Relation interaction, F(2, 142) = 
3.40, p < .05. A ż test indicated that actually 
presented relationships were remembered 
more accurately when the structure was bal- 
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anceable, ¢ (71) = 3.14, p < .01, while the 
difference for the nonpresented relationships 
was not significant, ¢ (71) = 1.14, p > .10. 
The advantage was very large for the dislike 
relationship (.18) and smaller for the like 
relationship (.06). 


Relationship Presentation Order within Story 


Experiment 1. For the complete structures, 
the mean proportions correct for structure 
type and order of relationships presentation 
are displayed in the upper panel of Figure 2, 
A 2X26 analysis of variance (type of 
Structure X Order of Structure Presentation 
X Order of Relationship Presentation) on the 
complete structures indicated that subjects, in 
general, made fewer errors on the first two 
relationships presented than on the remaining 
four relationships, F(5, 700) = 5.07, p < .01. 
In fact, the effect was solely due to the im- 
balanced structure, reflected in the significant 
Structure Type X Relationship Presentation 
Order interaction, F(5, 700) = 3.23, p < .01. 

For the incomplete structures, the mean 
proportions correct for structure type and 
order of relationship presentation are dis- 
played in the middle panel of Figure 2, A 2 x 
2 X 4 analysis of variance (Type of Struc- 
ture X Order of Structure Presentation X 
Order of Relationship Presentation) was per- 
formed on the data from the incomplete struc- 
tures. (Relationships not presented were, of 
course, not included in this analysis but will 
be dealt with below.) This analysis indicated 
a significant main effect for relationship pre- 
sentation order, F(3, 420) = 10.19, p < .01. 
This was due to subjects making excessive 
errors in the third presentation position for 
the nonbalanceable story. This led to a sig- 
nificant interaction between structure type 
and order of relationship presentation, F'(3, 
420) = 5.95, p< .01. 

Experiment 2. The mean proportion correct 
for structure type and order of relationship 
presentation are displayed in the lower panel 
of Figure 2, A 2 X 4 analysis of variance 
(Type of Structure X Order of Relationship 
Presentation) was performed. There was a 
significant main effect for order of relation- 
ship presentation, F(3, 213) = 5.08, p < .01. 
This result, however, must be interpreted in 
terms of the significant Structure Type X Re- 
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lationship interaction, F(3, 213) = 2.70, p < 
.05. The lower panel of Figure 2 shows that 
there was little difference between balanceable 
and nonbalanceable structures on the first two 
relationships presented, while the nonbalance- 
able structure was much inferior for the third 
and fourth relationships presented. 


Responses Made to Relationships Not 
Presented (Incomplete Structures) 


Experiments 1 and 2. For relationships 
which were left unstated, subjects could mis- 
takenly think they were actually presented; 
furthermore, these errors could be of two 
types: liking or disliking. The first analysis 
considers the structures as presented, For the 
balanceable structure, the number of times 
these errors balanced the structure were com- 
pared with the number of times these errors 
led to an imbalanced structure. 

Referring to the balanceable structure in 
Figure 1, an error in the direction of balance 
was said to occur in the following cases (re- 
member A-C and A-B were not actually pre- 
sented): 


1. A-B misrecalled as like, A-C recalled as 
not having been presented. 

2. A-B recalled as not having been pre- 
sented, A-C misrecalled as dislike. 

3. A-B misrecalled as like, A-C misrecalled 
as dislike. 


In a similar fashion, an error in the direction 
of imbalance was said to occur in the follow- 
ing cases: 


1. A-B misrecalled as dislike, A-C recalled 
as not having been presented. 

2. A-B recalled as not having been pre- 
sented, A-C misrecalled as like. 

3. A-B misrecalled as dislike, A-C misre- 
called as like. 


There were 36 instances of these two types of 
errors for the balanceable incomplete struc- 
ture in Experiment 1. In 29 instances subjects 
misremembered the relationship (s) in the di- 
rection of balance and in 7 instances in the 
direction of imbalance, x? (1) = 13.44, p < 
.01. As a control for response bias, the errors 
in the nonbalanceable structure for the non- 
presented relations were analyzed in exactly 
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the same way, although of course, none of 
the responses could actually balance the struc- 
ture. Of the 27 such instances, 14 were of 
the first type, while 13 were of the second 
type, x?(1) < 1. 

The results of Experiment 2 were analyzed 
in exactly the same way. Of the 25 cases of 
errors on the relationships not presented, 17 
were in the direction of balance and 8 were in 
the direction of imbalance for the balanceable 
structure, x? (1) = 3.24, p < .10. Again as a 
check for response bias, errors for the non- 
balanceable structure were analyzed in the 
same way. Of the 28 such instances, 13 were 
of the first type and 15 were of the second 
type, x? (1) < 14 

Given that a bias toward the responses 
which balance the balanceable stories does not 
appear for the nonbalanceable stories, we next 
attempted to see whether a bias toward im- 
balance exists, In similar fashion to that used 
for the balanceable stories, we defined re- 
sponses for the nonbalanceable stories which 
balanced or imbalanced triads containing an 
unstated link, The whole story of course 
could not be balanced, so balanced and im- 
balanced responses were defined solely in 
terms of the triads involving the links not 
stated. The results for Experiment 1 showed 
14 errors in the direction of imbalance and 7 
in the direction of balance. The results for 
Experiment 2 showed 15 errors in the direc- 
tion of imbalance and 14 in the direction of 
balance. These results for the nonbalanceable 
stories appear to be, if anything, opposite to 
those for the balanceable stories. For balance- 
able stories, subjects made responses to non- 
stated links in the direction of balance; for 
nonbalanceable stories they made responses to 
nonstated links in the direction of imbalance. 

To verify further these findings, the next 
analysis considered triads which the subject 
actually produced (without regard to the links 


‘Instances in the data where the nonpresented 
pairs were both remembered as like, or both as 
dislike (i.e., one balanced and one imbalanced), were 
excluded from the analysis. For Experiment 1, there 
were three such instances for the balanceable struc- 
ture and two instances for the nonbalanceable struc- 
ture. For Experiment 2, there were three instances 
for the balanceable and six instances for the non- 
balanceable structure. 
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TABLE 4 


FREQUENCY or TYPES or RECALLED TRIADS FOR 
BALANCEABLE AND NONBALANCEABLE 
STRUCTURES 


Experiment 1 


Recalled triads 
Structure presented — m 
Balanced Imbalanced 
Balanceable 26 6 
Nonbalanceable 9 17 
Experiment 2 
Balanceable 15 8 
Nonbalanceable 9 21 


originally presented). Analysis is restricted to 
triads containing just one nonpresented rela- 
tionship. Each incomplete structure contained 
two such triads: the A-B-D triad and the 
A-C-D triad (see Figure 1). Since both of 
these triads are incomplete, they are neither 
balanced nor imbalanced. In this analysis, 
however, balance was defined by the actual 
responses subjects gave during retrieval, re- 
gardless of what had been presented origin- 
ally. We examined instances in which subjects 
recalled all three relationships making up one 
of these triads without regard to the correct- 
ness of these responses. The results from Ex- 
periment 1 are shown in Table 4. There was 
a significant tendency to generate balanced 
triads when the originally presented structure 
had been balanceable, and imbalanced triads 
when the original structure had been non- 
balanceable, x? (1) = 13.04, p < .01. 

The results from Experiment 2 are also 
shown in Table 4. However, since each subject 
received both types of incomplete structures, 
these data were analyzed differently than in 
Experiment 1, An index was computed for 
each subject in the following way: The num- 
ber of balanced triads generated when the 
structure had been balanceable and the num- 
ber of imbalanced triads generated when the 
structure had been nonbalanceable was first 
summed, from this was subtracted the number 
of imbalanced triads generated for the bal- 
anceable structure and the number of bal- 
anced triads generated for the nonbalanceable 
structure. This index was then tested against 
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zero, t (26) = 2.30, p < .05. The results rep- 
licate those from Experiment 1: Subjects - 
tended to generate triads which were balanced 
when the original structure was balanceable 
and imbalanced when the original structure 
was nonbalanceable, 


GENERAL DISCUSSION 


All models predict a general advantage for 
balanced or balanceable structures, if we con- 
sider only the relations actually presented. 
The data from both studies support this pre- 
diction, although the magnitude of the effect 
is not extremely large. 

The data on relationship presentation order 
should be more enlightening with regard to 
possible models of balance. According to 
“tension reducing” or similar models of the 
balance effect, the advantage for actually seen 
relations should be the same for all relations 
from the balance(d) (able) structures, Some 
coding models, on the other hand, suppose 
that subjects attempt to construct a social 
code from the relationships as they occur 
serially in the story. Tt is not until the third 
relationship that the balance(d) (able) stories 
differ from the nonbalance(d) (able) stories. 
Thus we might expect that the nonbal- 
anced(d) (able) stories would show a marked 
inferiority from the third relationship on, but 
at most a small inferiority for the first two 
relationships presented. 

The data from Figure 2 bear out this pre- 
diction. The difference between balance(d)- 
(able) and nonbalance(d) (able) structures is 
negligible until presentation position three 
(the first relationship indicating imbalance), 
at which time performance drops for the non- 
balance(d) (able) structures. This finding is 
explicable in terms of a serial encoding model, 
but seems difficult to understand by more 
traditional treatments of balance. 

Perhaps the most crucial data for distin- 
guishing among possible models for the recall 
of social structures arise from subjects’ er- 
rors when they recall a relationship (as “like” 
or “dislike”) which was not actually pre- 
sented. Let us call these errors “intrusions.” 
Do such intrusions indicate a bias toward 
either balance or imbalance? Any model pre- 
dicting a general trend toward balance would 
predict all intrusions to appear in the direc- 
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tion of balance, whether the original structure 
was balanceable or nonbalanceable. The data 
clearly indicate that if the story was originally 
balanceable, intrusions were biased toward 
balance, If the story was originally nonbal- 
anceable, however, the intrusions showed just 
as strong a tendency toward imbalance. 

All theories proposing generalized drifts to- 
ward balance for any incomplete structure 
appear to be in conflict with these intrusion 
data. We propose instead an encoding model. 
During presentation, the stories are encoded 
in a way that indicates them to be either 
“socially balanced and logical” or “socially 
imbalanced and illogical.” At testing, this code 
is recalled. Then in instances where a subject 
is uncertain of the direction of a relationship, 
he makes a guess consistent with his recalled 
code. Thus, intrusions tend toward balance for 
balanceable structures and imbalance for non- 
balanceable structures. These arguments are 
supported by the studies by Cottrell (Note 2) 
and Cottrell, Ingraham, and Monfort (1971). 

There are undoubtedly many models cap- 
able of explicating the present data. Even 
among the coding models, a number of plau- 
sible models can be constructed. Awaiting 
further data, we lean toward a coding model 
making the following assumptions: 


1. The concept of social balance and im- 
balance is real and significant to subject. He 
has verbal labels or images to discriminate in- 
stances of balance from those of imbalance. 

2. The codes for balanced situations are 
easier, shorter, and more available than those 
for imbalanced situations. 

3. Subject constructs a social code serially, 
in the order of presentation of the relation- 
ships in a story; he attempts to add new so- 
cial information to the structure that has been 
constructed to that point. 

4, Subject finds it easier to retrieve from 
memory social structures encoded in shorter 
or simpler fashion. 

5. When uncertain of the sentiment for a 
particular relation during recall, subject will 
bias his response, or guess, in accord with any 
simple, general description of the structure as 
a whole that can be recalled. 


i model therefore suggests that subjects 


a mo 
£ some type of general code to distinguish 
(3 
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balanced (d) (able) from imbalance(d) (able) 
structures as a group, in addition to the as- 
sumption that specific balanced codes are sim- 
pler and more available. This model has some 
parallels with that of Potts (1972), who pre- 
sented incomplete linear orderings. Subjects 
in his experiments tended to order items 
linearly, filling in the missing relationships 
and storing the resulting complete structure. 
In his studies, subjects had a logical rule by 
which to form the structure; in our studies, 
we suggest subjects used social rules. 

In conclusion, our view of balance theory 
as a cognitive organizing principle and our 
theory of serial encoding offer an alternative 
to the traditional models of balance. However, 
our results do provide strong support for the 
reality of balance as a cognitive principle. 
Thus we have been able, on the one hand, to 
support the notions of balance as a cognitive 
structure and as a simplifying code, and on 
the other hand, to reject the hypothesis that 
there exists a “mindless drift” toward balance. 
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In an experiment to study the effects of judges’ attitudes, a superimposed classi- 
fication, and connotations of the response language on judgments of attitude 
statements, 108 boys and 108 girls, aged from 12 to 13, rated the favorability 
of 10 statements toward adult authority along eight scales. Anti subjects gave 
more polarized ratings (narrower unit) along four scales where the anti term 
was evaluatively positive and the pro term negative than on four scales where 
the pro term was the more positive; this difference was reversed for pro 
subjects. Also, on four (“marked”) scales, the anti term was seen as more 
masculine in connotation than the pro term, whereas on four (“unmarked”) 
scales the terms carried no connotation of masculinity or femininity. When 
subjects were told that the 5 anti statements had been made by boys and the 
5 pro statements by girls, subjects showed an accentuation of the judged 
differences between the two groups of statements on both marked and un- 
marked scales compared with subjects who were given no such information 
about the source of the statements. A similar accentuation effect was shown 
on unmarked scales only by subjects who were told that the pro statements 
had been made by boys and the anti statements by girls. It is concluded that 
the effect of a peripheral attribute on judgment depends on its congruity with 


connotations of the response language. 


As Hovland and Sherif (1952) were the 
first to demonstrate, a person who is asked to 
judge a series of attitude statements in terms 
of their favorability or unfavorability toward 
a given issue (as in the construction of atti- 
tude scales according to Thurstone’s method 
of equal-appearing intervals) will typically be 
influenced by his own opinion on the issue. 
Many aspects of this relationship still remain 
problematic, in particular, the finding that a 
judge’s attitude can influence the extent to 
which he will polarize his ratings toward the 
two extremes of the rating scale, in Upshaw’s 
(1965, 1969) terms, the extent to which he 
will use a narrow or wide reference scale unit. 
This should be distinguished from the original 
emphasis of the Hovland and Sherif study 
which stressed the tendency for certain judges 
to concentrate their ratings at one extreme of 
the scale, “Polarization,” as we shall use the 
term, refers to the bilateral spread of a per- 
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son’s judgments, that is, the extent to which 
he accentuates the differences between the op- 
posite halves of the item series, 

The polarization or accentuation of a per- 
son’s judgments has often been taken as an 
index of his confidence (Cantril, 1946) or the 
salience to him of the judgments he is making 
(Bonarius, 1965; Eiser, 1971a; Tajfel & 
Wilkes, 1964). In addition, Eiser and Stroebe 
(1972) have attempted to relate polarization 
of attitudinal judgment to the finding in psy- 
chophysical judgment that subjects’ ratings 
can be influenced by attributes of the stimuli 
other than those they are required to judge. 
Specifically, when a series of stimuli varies 
simultaneously in terms of a “focal” attribute 
which is to be judged and an unjudged 
“peripheral” attribute, in such a way that dif- 
ferences along the focal dimension correlate 
with differences along the peripheral dimen- 
sion, then subjects tend to give more polarized 
ratings than when such peripheral cues are 
absent. For example, when Tajfel and Wilkes 
(1963) superimposed a classification in terms 
of two alphabetic labels on a series of visually 
presented lines, so that the shorter lines fell 
into one class and the longer lines into the 
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other, subjects’ estimates of the lengths of the 
lines showed an “accentuation of interclass 
differences” as compared with estimates of the 
lines without any superimposed cues. Com- 
parable effects have been observed by Eiser 
(1971b) when the superimposed cues were the 
sources to which the statements were attrib- 
uted, Applying this principle to the effects of 
judges’ attitudes, Eiser and Stroebe argued 
that a person’s own levels of agreement and 
disagreement with the statements presented 
may function as a peripheral dimension super- 
imposed on the focal dimension of favorability 
toward the issue. One might therefore expect 
judges with more extreme attitudes (for whom 
these dimensions would be more likely to co- 
vary) to accentuate the differences between 
the favorable and unfavorable items, and 
hence give more polarized ratings than neu- 
tral judges. 

The relevant data suggest, however, that 
such accentuation effects may occur only for 
judges whose own positions lie at one extreme 
of the attitude continuum. On the issue of 
attitudes toward Negroes, pro-Negro judges 
give more polarized ratings than neutral 
judges, but neutral judges still give more 
polarized ratings than anti-Negro judges 
(Selltiz, Edrich, & Cook, 1965; Upshaw, 
1965). It may be possible to account for this 
asymmetry, however, by attending to semantic 
features of the response scales employed. Since 
the terms used to define the extremes of a 
response scale might differ from each other not 
only denotatively but also in evaluative con- 
notation, Eiser and Mower White (1974) 
adopted a procedure analogous to that sug- 
gested by Peabody (1967, 1970) for uncon- 
founding the effects of evaluative and descrip- 
tive similarity in trait inferences, For an issue 
“X,” one presents subjects with some scales 
where the term denoting the “pro-X” extreme 
is evaluatively positive and the “anti-X” term 
is evaluatively negative and other scales where 
the pro-X term is negative and the anti-X 
term positive. Eiser and Mower White found 
that the former type of scales produced a di- 
rect relationship between the favorability of 
judges’ attitudes and the polarization of their 
ratings, whereas the latter type produced an 
inverse relationship. In other words, judges 
gave more polarized ratings when their “own 
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end” of the scale was positively labeled. This 
might account for the asymmetry in studies 
on attitudes toward Negroes if one can as- 
sume that terms like pro-Negro carried more 
positive connotations than terms like anti- 
Negro for most of the subjects involved. The 
implication, then, is that more extreme judges 
will not tend to accentuate the differences be- 
tween favorable and unfavorable items if their 
own positions lie toward the negatively labeled 
extreme of the scale in question, that is, when 
their own evaluations of the items are incon- 
gruent with the evaluations implied by the 
judgment scale labels. 

We intend to show that this is not an iso- 
lated effect, restricted to the case of evalu- 
ative consistency between the response lan- 
guage and the judge’s own evaluations of the 
items, but implies a more general principle, 
that polarization of judgment depends both on 
the correlation between the focal and periph- 
eral continua and on the degree of congruity 
between the peripheral cues and the response 
scale labels. With the possible exception of 
work on the semantics of high-low continua 
(e.g., Harvey, 1973), psychophysical research 
provides little direct evidence on this point, 
since in most cases the peripheral cues em- 
ployed have been quite arbitrary. Since in 
these cases, the question of congruity between 
the response language and the peripheral cues 
does not arise, the direction of the correlation 
between the peripheral and focal continua 
typically has no effect on the accentuation of 
judged differences along the focal continuum 
(e.g., Tajfel & Wilkes, 1963). On the other 
hand, in a situation where this question of 
congruity does arise, the direction of the cor- 
relation may be very important. It was our 
prediction that in such a situation, a correla- 
tion between the focal and peripheral attri- 
butes would lead to increased accentuation of 
differences if the peripheral attribute were 
congruent with the response scale but not if 
it were incongruent. 

This study also permits a replication of the 
major aspects of the Eiser and Mower White 
(1974) study, with a revised set of scales and 
a younger age group as subjects but with the 
same series of statements concerning chil- 
dren’s attitudes toward adult authority. Spe- 
cifically, we were concerned with replicating 
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(a) the finding that judges give more polar- 
ized judgments (as reflected by a greater 
mean difference between their ratings of the 
pro and anti halves of the series) on scales 
where their own positions lie toward the more 
positively labeled extreme and (b) the effect 
of the value connotations of the scale labels 
on reference-scale origin, whereby judges’ 
overall mean ratings of the item series tend 
to be closer to the positive extreme of each 
scale, A possible interpretation of this effect 
is discussed by Eiser and Stroebe (1972, p. 
163f.). 


METHOD 


A total of 252 subjects aged 12-13 years were 
administered a three-part questionnaire at school dur- 
ing regular classroom periods. Of these, 27 were dis- 
carded for giving incomplete responses, and a further 
9 were randomly excluded to yield equal groups. 
This left 216 subjects (108 boys and 108 girls) for 
use in the analysis. 

Part 1 of the questionnaire contained the 10 state- 
ments supposedly made by young people when asked 
about adult authority over children, listed in Eiser 
and Stroebe (1972, pp. 159-160). Of these, 5 repre- 
sent “pro-authority” and 5 “anti-authority” posi- 
tions, Subjects indicated their agreement with each 
statement in terms of a 100-mm linear scale from 0 
(agree very much) to 100 (disagree very much). A 
measure of each subject’s attitude toward authority 
was derived by summing his scores for the 5 anti 
items and subtracting his scores for the 5 pro items; 
the more positive the resulting difference score, the 
more pro was the subject’s attitude. 

In Part 2, subjects rated the two concepts “The 
kind of person I like least” and “The kind of person 
I like most” in terms of eight 100-mm linear scales. 
In Part 3, subjects were reshown the items used in 
Part 1 and had to rate each of them along the same 
eight scales used in Part 2. Controls for possible 
effects of item order and scale order were incorpor- 
ated. The eight scales were labeled as follows: (a) 
bold-timid, (b) adventurous-unadventurous, (c) dis- 
obedient-obedient, (d) rude-polite, (e) with-it-old- 
fashioned, (f) creative-uncreative, (g) unhelpful- 
helpful, and (h) uncooperative-cooperative. 


Selection of Scales 


These scales were selected from an initial sample 
of 60 pairs of bipolar adjectives shown to two pilot 
groups of 25 subjects of the same age and from the 
same school as those used in the main study. To find 
scales which would discriminate reasonably between 
pro and anti positions on the issue of adult authority, 
the first group rated the pro and anti item groups, 
both presented en bloc, in terms of the 60 pairs of 
adjectives in a two-category, forced-choiced format 
(e.g., bold or timid). The second group responded in 
terms of the same 60 adjective pairs to the questions 
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“Are boys most likely to be... ?” “Are girls most 
likely to be... ?” and “Is it better to be [e.g.] bold 
or timid?” Controls for possible order effects were 
incorporated in both pilot studies, 

The final eight scales were chosen according to the 
following criteria. All of the scales discriminated rea- 
sonably between the pro and anti item groups and 
were therefore assumed to be appropriate to denote 
the continuum of favorability toward adult authority. 
In each case, the left-hand term (scored as 0) repre- 
sented the anti extreme and the right-hand term 
(scored as 100) the pro extreme. Scales a to d, 
referred to as Marked scales, were chosen, since 
one of the terms in each pair was seen as more 
applicable to boys, and the other term to girls. 
Specifically, boys were seen as more likely to be bold, 
adventurous, disobedient, and rude by 22, 24, 16, 
and 13 subjects, respectively, out of 25; whereas 
girls were seen as more likely to be timid, unadven- 
turous, polite, and obedient by 21, 15, 20, and 17 
subjects, respectively. In each case, the anti term 
was seen as more applicable to boys than the pro 
term. (There were no scales in the initial sample 
where the pro term was seen as more applicable to 
boys and the anti term to girls). Scales e to h, re- 
ferred to as unmarked scales, used terms which were 
not seen as differentially applicable to boys or girls, 
On scales a, b, e, and f, referred to as A scales, 
the anti term was seen as the more evaluatively 
positive: Specifically, 20, 25, 24, and 25 subjects felt 
it was better to be bold, adventurous, with-it, and 
creative, respectively. Scales c, d, g, and h, referred 
to as P scales, were ones where the pro term was 
the more positive: 25, 24, 25, and 25 subjects felt 
it was better to be obedient, polite, helpful, and co- 
operative, respectively. The eight scales thus sub- 
divided into four pairs, constituting a 2 X 2 factorial, 
the variables being marking (marked vs. unmarked) 
and value (A vs. P). 


Classification Conditions 


There were three different versions of Part 3, each 
presented to 36 male and 36 female subjects. In the 
control condition, subjects were simply told to read 
each statement and imagine “What the person who 
made it was like” and then rate this imaginary person 
on the eight scales. The remaining 144 subjects re- 
ceived the following instructions after completing 
Part 2, 


On the next pages you will read the same state- 
ments as were in Part 1, and below each statement 
the name of the boy or girl who made it. This 
time we want you to read the statement and 
imagine what the boy or girl who made it was 
like. Then describe this boy or girl using the same 
eight pairs of words as you used in the last part. 


Each item was then presented in inverted commas 
with one of five boys’ or five’ girls’ first names 
printed underneath. For half of these subjects, (di- 
rect condition), the five boys’ names were attached 
to the five anti items, and the five girls’ names to the 
five pro items. For the remaining subjects (reverse 
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condition), the five boys’ names were attached to the 
five pro items and the five girls’ names to the five 
anti items, Thus in the direct and reverse conditions, 
subjects rated the statements in the presence of a 
superimposed classification, namely, the sex of the 
children who had supposedly made them, which was 
systematically related to their favorability. Since the 
terms used to label the unmarked scales carried no 
reliable connotations of masculinity or femininity, the 
question of the congruity between these scales and 
the superimposed classification does not arise; on 
these scales, therefore, both the direct and reverse 
subjects should give more polarized ratings than the 
control subjects. This is comparable to the situation 
in the Tajfel and Wilkes (1963) study, where ac- 
centuation of interclass differences occurred, irre- 
spective of whether the short lines were labeled A 
and the long lines B or vice versa. The terms used 
to label the marked scales, on the other hand, would 
be congruent with the superimposed classification in 
the direct condition but incongruent for the reverse 
subjects; on these scales, therefore, only the direct 
subjects should show greater polarization than the 
control subjects. 


Judges’ Attitudes 


To check that subjects’ attitudes were randomly 
distributed over the six sex-by-classification groups, 
subjects’ attitude scores derived from their ratings 
of agreement with the statements were submitted to 
a 2X3 (Sex X Classification) analysis of variance. 
This yielded no Fs greater than 1. The mean atti- 
tude score for the total sample was 33.4. The 36 
subjects in each of the six groups considered sepa- 
rately were then split into three equal groups of 12 
subjects designated pro, neutral, and anti. This meant 
that the border lines between different attitude 
groups varied slightly between conditions but yielded 
a 2X3X3 factorial design with 12 subjects per 
group. 


RESULTS 


Each subject’s ratings of the items in Part 
3 were averaged over the five anti and five 
pro items and over each of the four pairs of 
scales. The design then permitted a six-way 
analysis of variance (Sex X Classification x 
Attitude X Marking X Value x Item Group), 
but for simplicity of exposition we shall report 
instead the results of three separate two-way 
analyses of variance which in fact contain 
most of the effects of any magnitude and 
interest revealed by the more complex analy- 
sis. First, to examine the effects of the classi- 
fification conditions on polarization, mean 
item group difference scores were calculated 
by subtracting the scores from the anti items 
from those for the pro items. Since more pro 
ratings were associated with higher numerical 
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scores, one would expect these difference 
scores to be positive. A 3 X 2 (Classification 
X Marking) analysis of variance was then 
conducted on these scores, collapsing over 
Sex, attitude, and value. As can be seen from 
Table 1, the marked scales showed more 
polarization (larger item group differences) 
than the unmarked scales across all three 
conditions, marking main effect, F (1, 213) = 
109.74, p < .001. The differences between the 
three conditions, disregarding the type of 
scale, are not reliable, classification main ef- 
fect F = 1.42, ns. However, our prediction 
that the reverse group should be closer to the 
control group on marked scales, but closer to 
the direct group on unmarked scales, is sup- 
ported by a significant Classification X Mark- 
ing interaction, F (2,213) = 4.10, p< 025, 
and more specifically by the contrast corre- 
sponding to the (Reverse versus Direct plus 
Control) X Marking interaction, F = (1, 213) 
= 5.32, p<.025. The effectiveness of a 
superimposed classification in producing an 
accentuation of judged differences thus seems 
to depend, as predicted, upon its congruity 
with the response language, even when the 
kind of congruity in question is nonevaluative, 
On the other hand, when the question of con- 
gruity does not arise, as on the unmarked 
scales, the direction of the correlation be- 
tween the superimposed classification and the 
focal attribute is immaterial. It should be 
noted that this was so in spite of the fact that 
subjects may well have had prior expectations 
as to the sex of the persons making the state- 
ments, which would have been confirmed in 
the direct but not in the reverse condition. 
When asked to guess the sex of the person by 
whom each statement had been made, sub- 
jects in the first of our pilot groups indicated 
that they thought the pro items had been 


TABLE 1 


MEAN Item Group DIFFERENCES As A FUNCTION 
OF CLASSIFICATION AND MARKING 


Classification 


Scale 
Direct Reyerse Control g 
4 
Marked 29.6 24.1 23.8 ¢ 
Unmarked 19.9 19.2 16.9 N 
M 24.7 21.6 20.3 
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made by girls (average agreement over the 
five items was 82.4%) and that the anti items 
had been made by boys (average agreement 
was 76.8%). 

Second, a 3 X 2 (Attitude X Value) analy- 
sis of variance was conducted on the mean 
item group differences, collapsing over sex, 
classification, and marking, to examine the 
relationship between judges’ attitudes and 
polarization as a function of the value conno- 
tations of the response language. The major 
finding of the Eiser and Mower White (1974) 
study, that judges will give more polarized 
ratings on scales where their own positions 
lie toward the more positive extreme, is sup- 
ported by a significant Attitude X Value in- 
teraction, F (2,213) = 21.62, p < .001, and 
more specifically by the contrast correspond- 
ing to the (Pro versus Anti) X Value interac- 
tion, F (1,213) = 43.24, p < .001. There is a 
significant attitude main effect, F (2,213) = 
4.14, p < .025, mostly due to the neutral 
versus pro plus anti contrast, F (1, 213) = 
7.86, p < .01, indicating that over all scales 
combined, the most polarized ratings were 
given by subjects at both extremes of the at- 
titude continuum. In addition, an unexpected 
value main effect, F (1,213) = 15.06, p< 
.001, indicates that ratings were more polar- 
ized on the P than A scales (see Table 2). 

„Another 3 X 2 (Attitude X Value) analy- 
sis was then conducted, this time on the means 
for both item groups combined, to determine 
the effects of attitude and value connotations 
on overall mean judgment (reference scale 
origin). As shown in Table 3, ratings tended 
to be far higher, that is, more pro, on the P 
than the A scales, value main effect, F (1, 
213) = 635.44, p<.001, confirming the 
Eiser and Mower White (1974) finding that 
mean judgments tend to be closer to the posi- 


TABLE 2 


MEAN ITEM GROUP DIFFERENCES AS A FUNCTION 
OF ATTITUDE AND VALUE 


Attitude 
Scale 
Pro Neutral Anti 
F 36.0 21.5 19.8 
A 14.5 14.4 27.3 
M 25.2 17.9 23.5 
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TABLE 3 
OVERALL MEAN RATINGS AS A FUNCTION 
OF ÅTTITUDE AND VALUE 
Attitude 
Scale — 
Pro Neutral Anti 
P. 61.6 65.0 63.8 
A 37.0 36.5 40.6 
M 49.3 50.8 52.2 


tive extreme of each scale. The Attitude x 
Value interaction, F = 2.44, ns; (Neutral ver- 
sus Pro plus Anti) X Value contrast, F (1, 
213) = 4.55, p <.05, indicates that this “‘pos- 
itivity” effect is strongest for the neutral sub- 
jects, while the attitude main effect, F (2, 
213) = 4.32, p < .05; pro versus anti con- 
trast, F (1, 213) = 8.33, p < .01, reflects an 
overall contrast effect, that is, an inverse rela- 
tionship between attitude and mean judg- 
ment. Neither of these latter two effects were 
present in the Eiser and Mower White (1974) 
results. In addition, subjects’ ratings of the 
concepts “The kind of person I like least” and 
“The kind of person I like most” were analyzed 
to determine how subjects viewed the value 
connotations of the scale labels. This strongly 
validated the initial distinction between the 
A and P scales and confirmed the Eiser and 
Mower White (1974) finding that judges 
evaluate labels denoting positions they accept 
more positively than those denoting positions 
they reject. Finally, the mean ratings given 
to each of the 10 items by all 216 subjects on 
each scale were calculated, and the sets of 
mean ratings for the different scales were 
intercorrelated. Of the 28 comparisons, all 
yielded values of r over .65, and 15 were over 
.90. The major differences between the scales, 
as applied to the specific items used in this 
study, would thus seem to be connotative 
rather than denotative. 


DISCUSSION 


This study has attempted to synthesize two 
relatively distinct notions concerning the rela- 
tionship between judges’ attitudes and the 
polarization of their ratings. On the one hand, 
the principle of categorization views polariza- 
tion as the outcome of judges accentuating 


774 J. Ricuarp EISER AND CAMILLA J. Mower Wuite 


the differences between stimuli when an un- 
judged peripheral attribute is correlated with 
the focal attribute, and stresses the links be- 
tween social and psychophysical judgment (cf. 
Eiser, 1971b; Eiser & Stroebe, 1972). On the 
other hand, stressing the links between atti- 
tudinal judgment and attitude organization, is 
the principle of congruity, which implies that 
judges prefer to describe statements they ac- 
cept in positive terms and statements they 
reject in negative terms; that is, they seek to 
“balance” their evaluations and their ratings 
(cf. Eiser, 1971a; Eiser & Mower White, 
1974). 

Our reasoning has been guided by the con- 
viction that these two principles are in fact 
alternative ways of conceptualizing what may 
be fundamentally the same process, namely, a 
tendency for individuals to structure their cog- 
nitions in terms of simpler, rather than more 
complex, frames of reference. The central no- 
tion of cognitive consistency theories is essen- 
tially that people seek to organize their evalu- 
ations into the simplest available structures 
(cf. Eiser & Stroebe, 1972, pp. 182-184), and 
one way in which such simplification may be 
achieved is for them to see as most “salient” 
those aspects of an issue in terms of which 
they can maintain a positive self-evaluation 
(Eiser, 1971a). Similarly, the concept of cate- 
gorization presupposes that individuals will 
exploit the redundancy inherent in most stim- 
ulus situations so as to simplify the informa- 
tion with which they have to deal, 

By attending to a peripheral attribute which 
is correlated with the focal attribute of a 
stimulus series, the judge can normally pro- 
vide himself with more information on the 
basis of which to make appropriate discrimi- 
nations. But if this peripheral attribute car- 
ries connotations or associations which are 
incongruent with the language in which he 
must express his judgments, in Campbell’s 
(1956) terms, his ratings will be the product 
of competing “response tendencies.” Within 
the same experiment, we have demonstrated 
both that (a) the value connotations of the 
response scale labels mediate the relation- 
ship between judges’ attitudes and the polari- 
zation of their ratings and (b) other conno- 
tations of the same labels mediate the effects 
of an experimentally superimposed classifica- 


tion. It is our contention that both of these 
results reflect a common principle, that cate- 
gorization processes depend on the congruity 
between stimulus attributes and the language 
in which judgments are expressed. 

In addition, when the data for different 
types of scales are all combined, we found 
overall effects of judges’ attitudes on both the 
origin and unit of their reference scales. The 
inverse relationship between attitude and 
overall mean judgment is an example of the 
“contrast effect” found in a number of previ- 
ous studies. This is compatible with the gen- 
eral notion that the judge will use his own 
attitude as an “anchor” or reference point to 
which his ratings of the statements will be 
relative. The overall tendency for judges with 
more extreme attitudes to give more polarized 
ratings when all scales are averaged together 
is precisely the effect which would be pre- 
dicted for any scale taken singly by the “as- 
similation—contrast” model of Sherif and 
Hovland (1961), and by our own categoriza- 
tion principle for any scale which was free o 
evaluative bias. Although most previous re- 
search has failed to demonstrate a straight- 
forward relationship between extremity o 
attitude and polarization of judgment, it has 
also failed to control directly for any such 
evaluative bias in the response language em- 
ployed. For reasons such as these, we fee 
that a consideration of semantic factors must 
be an integral part of any adequate theory 
of social judgment. 
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Strategy Choices in Three Variants of a Threat-Vulnerable Game 


Angelo Santi and Roger Wells 
Carleton University 
Ottawa, Canada 


One hundred eighty male subjects were involved in one of three variants of 
a threat-vulnerable game for 125 trials under one of three communication 
conditions: forced, optional, or no communication, The results indicated that 
as row’s threat to column increases, row’s propensity to select his nondominant 
strategy increases. Total payoff to the row player was inversely related to 
d the amount of threat contained in the payoff matrix. Also, the average total 
payoff to row player was greater when communicative opportunities were 
available than when they were not. The stochastic measures revealed that as 
row’s threat to column increased, row’s propensities of revolt and persistence 


and column’s propensity of appeasement increased. This finding suggested that 
column players may be reinforcing what they consider to be desirable be- 
havior by row player. The failure of communication to significantly affect 
game behavior was interpreted as a schedule effect and supported by a review 


of the relevant gaming literature. 


Guyer and Rapoport (1972) have suggested 
that the orientation of experimental social 
psychologists toward two-person two-strategy 
games could be dichotomized in terms of 
whether the game itself is viewed as the inde- 
pendent variable of major interest or as an 
assessment device to generate dependent 
variables as a function of other situational 
or subject variables. The former orientation 
is characterized by a concern for the social 
behavior which occurs as a function of the 
reward—punishment contingencies defined by 
the payoff matrix, while the latter orientation 
is characterized by an emphasis on the social 
behavior which occurs as a function of the 
beliefs, values, attitudes, personality traits, 
and expectancies which an individual brings 
to the situation. Adopting the former orien- 
tation, Guyer and Rapoport examined the 
single-play strategy choices of subjects play- 
ing a variety of nonzero sum games in order 
to determine their responses to the strategic 
problems posed by the various games in the 
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absence of any interaction effects, learning 
effects, or sequential payoff patterning. One of 
the games selected for investigation from the 
taxonomy provided by Rapoport and Guyer 
(1966) was Game 21. This game has been 
termed the “Threat Game” since it contains 
a single “threat-vulnerable” equilibrium, 
Game 21 as defined by the ordinal ranking of 
the payoffs is displayed in Figure 1. It can be 
observed that response 1 is a dominant strat- 
egy for both the row and column player. If 
both row and column select their dominant 
strategy the payoffs in quadrant 1 are ob- 
tained. This outcome is referred to as an 
equilibrium outcome since neither player is 
motivated to depart from it. Note however, 
that while column player obtains his most 
preferred outcome row player obtains his 
next-to-least preferred outcome. In order to 
obtain his most preferred outcome row must 
depend on column player choosing response 
2 (i.e, his nondominant strategy). Column 
player may be induced to shift to his non- 
dominant strategy if he perceives or can be 
persuaded that it is to his advantage to shift 
rather than suffer the consequences of the 
tow player shifting. As Guyer and Rapoport 
(1970) point out, in iterated plays row’s 
unilateral departure from the equilibrium out- 
come reduces both his own and column’s pay- 
off and thus, can be regarded as the use of 
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threat against column. An operationalization 
of the magnitude of threat, in terms of col- 
umn’s payoff reduction, and the cost of threat 
usage, in terms of row’s payoff reduction, is 
readily available for experimental investiga- 
tion with this type of game. 

Guyer and Rapoport (1972) reported that 
the single-play strategy choices of individuals 
playing the “Threat Game” varied as a func- 
tion of both threat magnitude and threat cost. 
As row’s cost for unilaterally departing from 
the equilibrium outcome increased, the per- 
centage of row players carrying out the threat 
decreased. It was also found that as row’s 
threat to column increased, the percentage of 
column players selecting the nondominating 
strategy choice increased. The behavior of 
subjects playing iterated trials of the “Threat 
Game” has also been investigated. Guyer 
and Rapoport (1970) found that row’s use 
of threat was positively related to the 
gaining of concessions from column. Rapoport, 
Guyer, and Gordon (1971) manipulated pay- 
off parameters within subjects but failed to 
observe systematic differences in performance 
as a function of these payoff parameters. This 
contagion effect has also been observed with 
the Prisoner’s Dilemma Game (Rapoport & 
Chammah, 1965). It appears that manipu- 
lating payoff parameters between subjects is 
the best strategy for assessing their effects on 
responding. 

The present study was undertaken to 
determine the effects of increasing row’s 
threat to column on strategy choices in a 
threat-vulnerable game with communication 
options available. The effects which com- 
munication has on game behavior appears to 
depend on a number of variables. It depends 
on the communication frequency (Voissem 
& Sistrunk, 1971), the motivational orienta- 
tion of the subjects (Deutsch, 1958, 1960), 
the structure of the game (Swingle & Santi, 
1972), the message content (Cheney, 
Harford, & Solomon, 1972; Loomis, 1959; 
Swensson, 1967; Wyer & Polen, 1971), com- 
munication credibility (Gahagan & Tedeschi, 
1968), the communication medium (Wich- 
man, 1970; Flint, Harris, & Rector, Note 2), 
the timing of communication (Krauss & 
Deutsch, 1966; Marwell, Schmitt, & Sho- 
tola, 1971), the available threat potential 
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Ficure 1. The threat game as defined by the ordinal 
ranking of payoffs. 


(Deutsch & Krauss, 1962; Anderson & 
Smith, Note 1), and the relationship between 
the protagonists (Wallace & Rothaus, 1969). 

Swingle and Santi (1972) studied the game 
behavior of females playing a Prisoner’s 
Dilemma, a chicken game, or a power game 
with either forced, optional, or no opportunity 
to communicate. It was found that optional 
opportunity to communicate had the greatest 
enhancing effect on cooperative responding 
both prior to the first communication and 
over the entire 100-trial game. There was 
also an indication that structural variables 
played an important role in determining 
the effectiveness of communication. In order 
to determine the reliability and generality 
of these earlier findings, the present study 
included opportunities for either forced, 
optional, or no communication in an asym- 


metrical mixed-motive game where the 
structural dimension of threat could be 
manipulated. 

METHOD 
Subjects 


One hundred eighty male students enrolled in 
summer courses at Carleton University served as 
subjects. All subjects had volunteered their participa- 
tion by signing “volunteer sheets” during registration 
or during solicitation in classrooms. Subjects were 
accepted from a variety of academic backgrounds 
and levels, 
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Ficure 2. Three variants of Game 21 varying row’s threat to column. 


Experimental Situation 


Subjects were seated in Separate cubicles which 
visually isolated them from each other while 
allowing the experimenter to view both subjects 
from the control room through a one-way mirror. 
Acoustic isolation was achieved by requiring the 
subjects to wear a set of headphones throughout the 
experimental session. Each cubicle contained a game 
console on which were placed two response buttons, 
a red “go” light, a display window, and a counter 
which automatically recorded the subjects’ cumu- 
lative scores (i.e., both own and other). Variant 1, 
2, or 3 of Game 21 was affixed to the display 
window (see Figure 2). The variants manipulated 
row’s threat to column, which was defined by the 
extent to which a unilateral departure from the 
natural outcome by the row player reduces column’s 
payoff in such a way that Variant 1 represents the 
greatest threat and Variant 3 the least threat. The 
assignment of payoff values to variants of the ordi- 
nally defined game follows closely those of Guyer 
and Rapoport (1970, 1972), Communication was 
introduced as a second independent variable by 
providing standardized messages. Groups of 20 sub- 
jects were combined in a 3 X 3 design. Each group 
was characterized by a particular variant of Game 21 
and by a particular form of communication which 
included forced, optional, or no communication. 


Procedure 


Once both subjects were seated, the experimenter 
drew the subjects’ attention to a typed set of 
instructions and informed them that they could read 
along as the instructions were also presented over 
the headphones. Subjects were told that the experi- 
menter was concerned with “how people make 
simple decisions in a two-person situation which has 
certain payoffs.” They were also told that “the 
payoff depends both on what you do and what the 
other person does.” After a detailed description of 
the game, the subject’s attention was drawn to five 
stacks of IBM cards on each of which was typed 
a different message. The messages were as follows: 


Row Player 
1. I will play response 1. 
2. I will play response 2. 


3. I will play response 1, if you play response 2 
on every second trial. 

4. If you continue to play response 1, I will play 
response 2. 

5. I do not wish to disclose my intentions at this 
time. 

Column Player 

1. I will play response 1. 

2. I will play response 2. 

3. I will play response 2 on every second trial, if 
you play response 1. 

4. Even if you play response 2, I will continue to 
play response 1. 

5. I do not wish to disclose my intentions at this 
time. 


As can be observed, messages 1 and 2 are simple 
statements of intent. Statement 3 can be regarded 
as a request for cooperative alternation for 
either player. Row’s Message 4 is a statement of 
“contingent” threat to the column player, while 
column’s message 4 is a statement of resistance to 
that threat. Message 5 is a refusal to state one’s 
intent. The subjects in the forced-communication 
condition were told the following: 


At the end of every 15 trials of the game, the 
experimenter will enter the room and you will 
give him one of the messages, to be delivered to 
the other player. Both players will thus be re- 
quired to communicate with each other at that 
time. Otherwise no communication is allowed. 


In optional communication, the subjects were told 
the following: 


At the end of every 15 trials of the game, the 
experimenter will enter the room and if you wish 
you may give him one of the messages, to be 
delivered to the other player, Both players will 
thus have the Opportunity to communicate with 
each other at that time, if they wish to do so, 
Otherwise no communication is allowed. 


The subjects in this condition could give the experi- 
menter a blank card to be delivered to the other 
player if they did not wish to transmit any of the 
five standardized messages, but they had to send 
something, Thus the essential difference between the 
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forced and optional conditions is that in the former 
the subjects had to transmit one of the five stan- 
dardized messages, while in the latter the subjects 
could choose not to send any of the five standardized 
messages. In the no-option conditions, subjects were 
told the following: 


At the end of every 15 trials of the game, the 
experimenter will enter the room and you will 
give him one of the messages. The other player 
will also be required to do this at the same time. 
Choose the message which you would send to the 
other player if you had a chance to do so. Note, 
however, that your message will not be delivered 
to the other player, nor will you receive one from 
him. Only the experimenter will know which 
message you have selected. 


Thus the no-option condition differed from the 
forced and optional conditions only in that it did 
not allow actual message exchange. Consequently, 
information effects are being held constant. 

Following Guyer and Rapoport (1970), all sub- 
jects were given an individual orientation. They 
were told: “Your goal in the game is to make as 
many points as possible for yourself without regard 
to the number of points earned by the other.” All 
subjects were questioned to ensure that they under- 
stood the instructions. Specifically, the experimenter 
made sure the subjects knew how payoffs were 
determined on the basis of various response combi- 
nations. That is, the experimenter made sure they 
“understood” the payoff matrix. The experimenter 
also made certain that subjects knew what would 
happen at the end of every 15 trials of the game 
(iey message selection). Finally, the experimenter 
reminded the subjects to make a response every 
time the red light went on and to refrain from 
laughing, talking, sighing, or in any way attempting 
to communicate with the other player during the 
experimental sessions. 

When the probe was completed, the game began. 
The red “go” light went on, indicating the start 
of a trial to the subject. The subject made his 
response by pushing either Response 1 or Re- 
sponse 2. Subjects were given feedback on each 
trial by means of a light which lit up behind the 
appropriate quadrant of the matrix affixed to the 
display window. During this s-second visual display, 
the subjects’ points were automatically accumulated 
on a counter. There was a 5-second delay between 
the termination of feedback and the onset of the 
red “go” light. Thus, the interresponse interval was 
approximately 11 seconds, There were 125 trials of 
the game with five opportunities for communication. 
Trials 75-125 were played without communication 
in order to determine the stability of any “coopera- 
tive settlements” which might have developed as a 
result of communication. 

At the end of the 125-trial game, the subjects 
were encouraged to speak out concerning any aspects 
of the experimental procedure which puzzled or 
disturbed them. The experimenter answered any 
questions the subjects posed and asked them not to 
reveal the details of their participation to others. 


779 


RESULTS 
Probability of Nondominant Responding 


The probability of nondominant strategy 
choices (i.e., Response 2) during each block 
of 25 trials, is shown in Figure 3. In Variants 
1 and 2 the overall differences in row and 
column in probability of nondominant re- 
sponding are striking. Tn Variant 3, for which 
row’s threat to column is small, the difference 
is much less pronounced. Overall, the prob- 
ability of nondominant responding for the 
row player is directly proportional to the 
amount of threat to the column player which 
is contained in the payoff matrix. While the 
communication options do not appear to 
produce any stable overall differences in the 
probability of nondominant responding, there 
are some interesting localized effects to be 
noted. In Variant 1, and to a lesser extent 
in Variant 2, row’s frequency of nondomi- 
nant responding is slightly higher when no 
communication is allowed than when com- 
munication options are available. Also, there 
is an indication that the reverse may be the 
case for the column player, especially in 
Variants 1 and 3. 

The data shown in Figure 3, were arcsin 
transformed and submitted to analysis of 
variance appropriate for two between-subject 
variables (variant and communication) and 
two correlated measures (game role and 
trials) (see Table 1). The arcsin transforma- 
tion is used to obtain normality of within-cell 
distribution by correcting for skewness in 
binomially distributed data, as recommended 
by Winer (1962). The analysis indicated a 
significant main effect for game role with a 
higher average probability of nondominant 
responding for row player (p < .001), a sig- 
nificant main effect for variant condition 
(p < 001), and a significant Game Role 
X Variant interaction (p < .01), which is 
displayed in Figure 4. It can be seen that a 
test of simple main effects indicated a signifi- 
cant difference in the probability of non- 
dominant responding between row and col- 
umn in Variants 1 and 2 (p < .001) and only 
a marginal difference between them in Vari- 
ant 3 (p= .09). Thus, the probability of 
nondominant responding was greater for row 
than for column in Variants 1 and 2 of Game 
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TABLE 1 


ANALYSIS OF VARIANCE OF THE PROBABILITY OF A NonpoMINANT RESPONSE AS A FUNCTION OF 
Game ROLE COMMUNICATION, VARIANT, AND TRIAL BLOCKS 


Source SS df MS F 
Between subjects 
Communication (A) 33 2 16 14 
Variant (B) 32.53 2 16.26 14.34%" 
AXB 5.06 4 1.26 1.12 
Error (between) 91,88 81 1.13 
Within subjects 
Trials (C) 2.01 4 50 3.49* 
AXC 53 8 06 47 
BXC 67 8 -08 58 
AXBXC 2.31 16 14 1.00 
Error (C X Within) 46.66 324 14 
Game Role (D) 50.52 1 50.52 51.43** 
AXD 4.26 2 2.13 2.17 
BXD 13.60 2 6.80 6,92** 
AXBXD 1,05 4 26 27 
Error (D X Within) 79.57 81 98 
CXD NJ 4 -03 pail 
AXCXD 1.36 8 17 89 
BXCxD 1.89 8 23 1.23 
AXBXCXD 1.75 16 11 57 
Error (C X D X Within) 62.18 324 
*p <01 
** b < 001 
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GAME ROLE X VARIANT INTERACTION 
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Ficure 4. The Game Role X Variant interaction for probability of 
nondominant responding. 


21. Also, while the analysis of simple main 
effects did not indicate a significant difference 
in the probability of nondominant responding 
among the Variants of Game 21 for the 
column player (p > .10), a significant differ- 
ence was found for the row player (p < .001). 
Application of the Newman-Keuls test indi- 
cated that each variant differed from the 
other (p < .05). Thus, for the row player, 
the probability of occurrence of the nondomi- 
nant strategy was highest in Variant 1, 


TABLE 2 


MEANS AND STANDARD DEVIATIONS or TOTAL PAYOFF 
DATA FOR ROW AND COLUMN 


Variant 
Condition m 
1 2 3 
Row player 
Communication (forced 
and optional) 
846.05 994.75 | 1,101.85 
SD 435,28 256,36 345.83 
o communication 
N Mo 598.00 887.70 981,50 
SD 288.08 299.25 225.98 
Column player 
Camii nication N orced 
tional 
Mo a 4,030.70 | 1,582.85 | 2,087.40 
SD 572.54 | "387.59 | 309.35 
nicati 
Ne renee es 747.00 | 1,411.50 | 2,216.60 
SD 467.76 | 293.76 | ‘285.74 


Note. ns = 60 for communication groups; ns = 30 for no- 
communication groups. 


somewhat lower in Variant 2, and lowest in 
Variant 3. 

The analysis of variance also revealed a 
main effect for trials which was significant by 
the conventional test (p < .01) and margin- 
ally significant (p = .06) after application of 
the Greenhouse-Geisser correction (Kirk, 
1968). Trend analysis revealed a Gontfeant 
ine ies G < .01), with the overall prob- 
ability of nondominan i i 
ae NE EE t responding decreasing 


Total Payoff Data 


While communication did not appear to 
affect individual responding in a statistically 
significant manner, it seemed interesting to 
determine whether communication increased 
row’s total payoff or column’s total payoff. 
The data on total payoff are presented in 
Table 2. Analysis of variance of row’s total 
payoff indicated a significant communica- 
tion main effect, F(2,84) = 4.34, p< .05. 
Thus, when communication options are avail- 
able, row players obtain a higher total pay- 
off than when communication options are not 
available. The analysis also revealed a sig- 
nificant variant main effect, F(2, 84) = 5.95 
p< 005. Application of the Newman-Keuls 
test indicated a significant difference between 
Variant 1 and Variants 2 and 3 (p< .05). 
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TABLE 3 


MEANS AND STANDARD DEVIATIONS OF FIRST-ORDER 
STOCHASTIC MEASURES ror Row: 
REVOLT AND PERSISTENCE 


Communication 


Condition Forced Optional | No option 


M | SD M | SD | M | SD 


Revolt (R2/R1C1) 


Variant 1 -525 | .290 | .487 | 2.70 | .587 | .211 
Variant 2 «396 | .213 | .354 | .174 | .424 | (272 
Variant 3 -261 | .190 | .223 | .170 | .224 | 1143 
Persistence 
(R2/R2C1) 
Variant 1 -722 | 177 | .676 | .181 | .763 | .160 
Variant 2 -488 | .186 | 533 | .218 | .752 | .107 
Variant 3 -445 | .256 | 364 | .192 | .347 | 1160 


Note. 3.33% of persistence data is estimated. 


Thus, row’s total payoff was lower when his 
threat to column was high than when his 
threat to column was moderate or low. Analy- 
sis of variance performed on column’s total 
payoff data failed to reveal any significant 
effects of communication. However, there was 
an obvious variant main effect, F(2,84) 
= 58.58, p< .001. Since differences in col- 
umn’s quadrant 3 payoff contributes greatly 
to this effect, it is not considered further. 


Stochastic Measures 


The stochastic measures are conditional 
probabilities that a subject will choose his 
nondominant strategy on a trial, given a par- 
ticular dyadic outcome on the preceding trial.+ 
For row player, reliable estimates of the 
propensities of revolt and persistence were 
obtained. Revolt [p(R2/R1C1)] refers to 
role player’s propensity to select his nondomi- 
nant response on trial n given that both row 
player and column player had selected their 
dominant strategy on trial n — 1. Persistence 
[p(R2/R2C1)] refers to row player’s pro- 


1 Because some dyads did not have a sufficient 
number (3 or more) of certain dyadic outcomes, it 
was necessary to estimate some conditonal proba- 
bilities. This was done by taking the average level 
of a conditional probability of those dyads with 3 
or more examples of a particular state and assigning 
that value to dyads with an insufficient number. 
This procedure can result in a reduction of within- 
cell variance and possibly positive bias in an analysis 
of variance if a large number of estimates are re- 
quired. However, it is not felt that such a bias 
was operative in the analysis reported here due to 
the fact that only 3 observations out of 90 were esti- 
mated overall. 


pensity to select his nondominant strategy 
on trial n given that row player had selected 
his nondominant strategy and column player 
had selected his dominant strategy on trial 
n— 1. These propensities are presented in 
Table 3. Analysis of variance performed on 
tow’s propensity to revolt (arcsin trans- 
formed) revealed a significant variant main 
effect (p < .01). Application of the Newman- 
Keuls test indicated that the propensity for 
revolt decreased significantly from Variant 1 
to 2 and 2 to 3 (p < .05). Thus a positive 
monotonic relationship was obtained between 
row’s propensity to revolt and row’s threat to 
column. Analysis of variance on the persist- 
ence data (arcsin transformed) also revealed 
a significant variant main effect (p< .01). 
The Newman-Keuls test indicated that Vari- 
ants 1 and 2 differed from Variant 3 but not 
from each other (p < .05), A marginal com- 
munication variant interaction was also found 
(p< .10). This marginal interaction indi- 
cated that while persistence monotonically 
decreased in communication conditions as 
row’s threat to column decreased, when no 
communication was allowed, the decrease in 
persistence between Variant 1 and 2 was 
minimal. 

For column player, the appeasement and 
capitulation propensities are presented in 
Table 4. Appeasement [p(C2/R1C1] refers 
to column player’s propensity to select his 
nondominant response on trial n given that 
both row and column player had selected 
their dominant strategy on trial n—1, 


TABLE 4 


MEANS AND STANDARD DEVIATIONS or F. TRST-ORDER 
STOCHASTIC MEASURES ror CoLuMN: 
APPEASEMENT AND CAPITUALTION 


Communication 
Condition Forced Optional | No option 
M | SD| M |SD|M™M | SD 
Appeasement 
(C2/R1C1) 
Variant 1 -408 | .256 | .297 | .209 | .338 | .252 
Variant 2 -409 | .276 | 298 | .227 | 412 | 1223 
Variant 3 -167 | .152 | .239 | ,202 | .195 | :289 
Capitulation 
(C2/R2C1) 
Variant 1 +194 | .120 | .228 | .158 | .126 | .053 
Variant 2 -328 | .201 | .203 | .159 | .230 | 1146 
Variant 3 +157 | .179 | .328 | .288 | .167 | .220 


Note. 3.33% of capitulation data is estimated. 
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TABLE 5 


Proportion or Row AND COLUMN PLAYERS SENDING ONE OR MORE OF THE 
VARIOUS STANDARDIZED MESSAGES 


Message numbers 
Condition 1 2 3 4 5 6 
Row Col. Row Col. Row Col Row Col. Row Col, Row Col. 
Variant? (n = 20) 
1 30 55 A0 nae -65 ah A5 AS -60 -40 AS 50, 
2 20D: 65 50 45 .55 45 35 45 WEI .60 50 A5 
3 50 205) 230! .20 -70 -30 FEB) AO -60 65 ‘35 35 
Communication 
(n = 30) 
Forced 50 -60 .40 27 -63 53 43 aS) 83 70, =>, eee 
Optical 40 57 43 33 -63 AT 33 33 AT AO .87 87 
No option -60 -63 57 13 a7 40 50 -60 -60 -60 — — 


a Transcript of message content can be found in the Method section, Message 6 is a blank card, which could only be used in the 


optional communication condition. 


b The data summarized for the variants are based on the combination of the forced and optional conditions only. The data for 
the no-option condition are included in the lower half of the table. 


Capitulation [p(C2/R2C1)] refers to column 
player’s propensity to select his nondominant 
strategy given that column player had se- 
lected his dominant strategy and row player 
had selected his nondominant strategy on trial 
n — 1. Analysis of variance on the appease- 
ment data (arcsin transformed) indicated a 
significant variant main effect (p < .05). 
Application of the Newman-Keuls test indi- 
cated that Variant 1 and Variant 2 differed 
from Variant 3 but not from each other 
(p < 05). Thus column’s propensity to ap- 
pease row was greater in variants of Game 21 
with high or moderate threat, than in a vari- 
ant with low threat. Analysis of variance on 
the capitulation data (arcsin transformed) 
did not reveal any statistically significant 
differences. 

There are four other stochastic measures 
conditionalized on quadrant 2 and quadrant 4 
outcomes. However, these outcomes were so 
infrequent among dyads in Variant 3 that 
reliable estimates could not be obtained. For 
example, in Variant 3 with no communication 
only 3/10 and 5/10 dyads had the criterion 
number of quadrant 2 and quadrant 4 out- 
comes, respectively. This results from the 
low probability of nondominant response in 
Variant 3 by the column player, as already 
noted. 


Communication Data 


The proportion of row and column players 
who send the various standardized messages 
is presented in Table 5. The proportions for 
the 3 variants are based on a maximum of 20, 
since these data were obtained by combining 
the data in the forced and optional communi- 
cation conditions. For row players, the only 
difference of note among the variants, is the 
one for row’s unconditional statement of co- 
operation (message 1). The proportion of 
row players who send the unconditional state- 
ment of cooperation differs among the vari- 
ants, however, this difference was not statis- 
tically significant, x?(2) = 2.83, p < .20. It 
is interesting to note that overall only 60% 
(approximately) of the row players sent a pro- 
posal of cooperative alternation (message 3). 
A considerable smaller proportion, 45% or 
less, of the row players sent messages of con- 
tingent or noncontingent threat (messages 4 
and 2, respectively). 

For the column players, the most obvious 
trend is that as row’s threat to column in- 
creases the proportion of column players who 
send a request for cooperative alternation 
(message 3) increases. There were 30% of 
the column players who sent such a request 
in Variant 3, whereas 75% of the column 
players did so in Variant 1. The proportion 
of column players who sent a request for co- 


784 


operative alternation differed among the vari- 
ants, x°(2) = 8.40, p < .05. Comparing these 
proportions with those for row players, it 
appears that while more row players than 
column players sent a request for cooperative 
alternation in Variant 3, there are only small 
differences in the proportions for Variants 
1 and 2. Note also that in the optional com- 
munication condition where the blank card 
(message 6) is available, more row and 
column players chose it than chose message 5. 
This finding is consistent for both players in 
each of the variants when optional communi- 
cation is available, 

Study of the frequency of message usage 
(not shown in Table 5) suggests that for 
both row and column players a request for 
cooperative alternation occurs less frequently 
in optional communication than in forced 
communication. This difference cannot be 
attributed totally to differences in proportions 
of players sending this message and it does 
represent a small but interesting trend. Also, 
considering these data in conjunction with the 
proportion data presented in Table 5, it ap- 
pears, on average, that message 4 is used 
by row and column only once, while mes- 
sage 3 is used more than once. This supports 
the conjecture that promises of cooperative 
alternation are more preferred than threats 
or warnings as a means of exerting influence 
on the other player, 


Discusston 


The results of the present study indicate 
that in a “threat-vulnerable” game, as row’s 
threat to column increases, row’s propensity 
to select his nondominant strategy increases. 
This represents a strong indirect effect of 
payoff manipulation, in that altering the pay- 
offs to the column player produces a sig- 
nificant change in the choice behavior of the 
row player but no Statistically significant 
change in the behavior of the column player. 
This finding differs from that of Guyer and 
Rapoport (1972) who found a direct effect 
for row’s increasing threat to column when 
using a “within-subject methodology” by 
examining strategy choices with no feedback. 
That is, as row’s threat to column increased, 
column’s choice of his dominant strategy 
decreased. 
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Guyer and Rapoport (1970) indicated 
that row’s departures from the natural out- 
come (quadrant 1) enhanced row’s payoff 
relative to the maximum payoff dictated by 
the normative solution to the game. However, 
the present study indicated that payoff to the 
row player is inversely related to the amount 
of threat contained in the payoff matrix. A 
comparison of the data from the Guyer and 
Rapoport study and the “replication” condi- 
tion of the present study (i.e., Variant 2 with 
no communication) suggests that the behavior 
of the row player is the source of the differ- 
ences. In both studies the behavior of the 
column player is quite similar, in that column 
chooses his dominant strategy on 70% of the 
trials. Also, column capitulates to row’s 
threat with a probability of .23 in both 
studies. However, in the present study row 
player selects his dominant strategy on only 
47% of the trials, while in the Guyer and 
Rapoport study row does so on 86% of the 
trials. Also, whereas the propensities for re- 
volt and appeasement were .10 and .12, re- 
spectively, in the Guyer and Rapoport 
(1970) study, in the present study these 
propensities were .42 and .41, 

These data suggest that the column players, 
when confronted with a row player who 
selects the threat option frequently, maintain 
the same level of capitulation, but increase 
the level of appeasement. While a direct test 
of this notion would require the reponses of 
the row player to be experimentally manipu- 
lated, the present study presents some indi- 
rect support for this notion, in that the pro- 
pensities of revolt and appeasement both 
increase from Variant 1 to 3. Thus column 
players may be reinforcing what they con- 
sider to be desirable behavior by row player 
and not reinforcing behavior which is un- 
desirable. An important difference between 
the Guyer and Rapoport (1970) and the 
present study is that in our no-communication 
conditions subjects were required to select a 
Message even though it was not communi- 
cated to the other player. Thus, information 
about possible strategies, especially coopera- 
tive alternation, was present in this study but 
absent in Guyer and Rapoport. This might 
have contributed to the differences found 
between the two studies, 


j 
{ 
f 


The high propensity of revolt obtained in 
the present study might also be partially 
accounted for by the consideration of the 
motive to maximize relative gain. There are 
two reasons for suggesting that this motive 
may be operating. First, the subjects were 
playing for points rather than money. 
While matrix studies investigating the vari- 
able of reward size have produced inconclu- 
sive and sometimes equivocal results (Knox & 
Douglas, 1971; Shaw, 1972), it is plausible 
to suggest that the value of points is prob- 
ably tied to the social comparison process of 
the experimental situation itself. Second, sub- 
jects were given continuous rather than inter- 
mittent feedback on total points accumulated. 
Previous research has demonstrated that both 
of these are important variables (McClintock, 
1972; McClintock & McNeil, 1966; Messick 
& Thorngate, 1967). If the variants of 
Game 21 are transformed into zero sum 
games by determining the algebraic differ- 
ences of the payoffs (Rapoport, 1970), then 
it can be shown that for all three variants 
saddle points exist. In Variants 1 and 2 the 
saddle point is located in quadrant 3, whereas 
in Variant 3 the saddle point is located in 
quadrant 1, Thus, while for column re- 
sponse 1 is the maximin strategy in all 
variants, for row player response 1 is the 
maximin in Variant 3, while response 2 is the 
maximin in Variant 1 and 2. The data sup- 
port the notion that relative gain was an 
important consideration. In Variants 1 and 2 
row players selected their maximin strategy 
58.2% and 46.1% of the time, whereas this 
drops to 26.4% in Variant 3. 

While communicative opportunities did re- 
sult in significantly higher total payoffs for 
the row player, one would have to conclude 
that overall the communication variable, as 
operationalized in this study, does not signifi- 
cantly affect behavior in a threat-vulnerable 
game. This finding corroborates the results 
of several studies which found communication 
to be ineffective in increasing cooperative be- 
havior in situations where threat capability 
existed and was used (Cheney et al., 1972; 
Deutsch & Krauss, 1962; Nardin, 1968; 
Anderson & Smith, Note 1). Interestingly, 
the actual use of the threat message was quite 
low in this study. Overall, it was found that 
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promises of cooperative alternation were pre- 
ferred over threats or warnings as means of 
exerting influence on the other player. This 
is also consistent with the findings of Cheney 
et al. (1972). 

The finding that communicative opportu- 
nities did not significantly affect game be- 
havior, despite the use of messages of co- 
operative alternation, was somewhat puzzling. 
However, a review of the gaming literature 
suggests that very frequent communication 
opportunities are necessary with standardized 
modes. Numerous studies which have allowed 
free oral or written communciation (regard- 
less of the communication schedule) have 
found that communication increased coopera- 
tive behavior in a variety of bargaining 
situations (Bixenstine, Levitt, & Wilson, 
1966; Deutsch & Krauss, 1962; Krauss & 
Deutsch, 1966; Marwell et al, 1971; 
Minas, Ratoosh, & Lipetz, 1959; Swingle & 
Santi, 1972; Terhune, 1968; Wichman, 
1970). Voissem and Sistrunk (1971) using 
standardized messages found that cooperative 
behavior increased only for those who could 
communicate before every trial in a Prisoner’s 
Dilemma game. They found no significant 
increase for dyads who communicated every 
10 trials and subsequently suggested that free 
oral communication could be more effective 
than standardized message exchange in en- 
hancing cooperative interaction. Two other 
studies (Cheney et al., 1972; Loomis, 1959) 
using standardized messages found enhance- 
ment of cooperation, but in both studies 
communication could occur on every trial. 
Thus while free oral or written communica- 
tion has generally increased cooperative be- 
havior, standardized message exchange has 
been found to increase cooperation only in 
situations where opportunities for exchange 
are frequent. 
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Seeking and Avoiding Belief-Discrepant Information 


as a Function of Its 


Perceived Refutability 


Randall R. Kleinhesselink and Richard E. Edwards 
Washington State University 


Rate of momentary button-pressing to eliminate static which partially masked 


tape-recorded messages was employed 


as a dependent variable. After obtaining 


button-pressing rates on a neutral baseline message, four groups of subjects 


were exposed to either a supportive or 


a nonsupportive message on the legaliza- 


tion of marijuana. For half the subjects, the message was easy to refute, and 
for half it was difficult to refute. Results indicated that subjects were be- 
haviorally receptive to either the supportive, difficult-to-refute message or the 
nonsupportive, easy-to-refute message, but were significantly less receptive to 
either the supportive, easy-to-refute message or the nonsupportive, difficult-to- 


refute message. This was interpreted 


as supporting Lowin’s approach-avoid- 


ance model of selective exposure to information. 


While a number of investigators support 
the general hypothesis that individuals have 
a motivated tendency to seek out supportive 
information and/or to avoid nonsupportive 
information (e.g., Brock & Balloun, 1967; 
Mills, 1968), several different kinds of al- 
ternative explanations of selectivity in ex- 
posure to information have been proposed 
(e.g., Sears, 1968; Stotland & Canon, 1972), 
Other investigators fail to find any selective 
avoidance or even to find that individuals 
seek out nonsupportive information rather 
than avoid nonsupportive information (e.g., 
Lowin, 1969; McGuire, 1968). McGuire 
(1968), for example, in summing up the 
literature on the selective exposure hypothe- 
sis, considers the possibility that most studies 
which support the selective exposure hypothe- 
sis are either spurious or represent support 
for a sociological rather than a psychological 
process of selectivity. Thus, the structure of 
the social system may promote biased selec- 
tivity through community consensus or com- 
municator selectivity in presenting only one 
side of an issue. This skepticism about psy- 
chologically motivated selectivity is also 
shared by Freedman and Sears (1965) and 
Sears (1968). 

Some recent research, however, does pro- 
vide support for a kind of psychologically 
motivated behavioral selectivity. Brock and 


Requests for reprints should be sent to Randall 
R. Kleinhesselink, Department of Psychology, Wash- 
ington State University, Pullman, Washington 99163. 


Balloun (1967), in reviewing the methodol- 
ogy for the measurement of dependent vari- 
ables in selective exposure research, argue 
that many of the measures employed in 
previous research may have been “insensitive 
to the actual tuning behavior of the recipient” 
(p. 413). They point out that although in 
most previous studies (e.g., Brock, 1965; 
Freedman, 1965; Jecker, 1964; Mills, Aron- 
son, & Robinson, 1959; Sears & Freedman, 
1965) the experimenters measured either the 
subject’s preferences among messages or the 
amount of time subjects spent reading various 
messages, there was no theoretical necessity 
to employ these particular dependent vari- 
ables. By employing a static-eliminating 
button-pressing response measure, Brock and 
Balloun (1967) found that subjects attended 
significantly more to supportive messages 
than to nonsupportive messages. Smokers 
pressed a button significantly more frequently 
than nonsmokers to remove static from a 
“smoking does not cause cancer” message, 
while nonsmokers pressed a button more fre- 
quently than smokers to clarify a “smoking 
causes cancer” message. The results were con- 
sistent with their hypothesis that persons will 
press a button more frequently to eliminate 
partially masking static from belief-consistent 
rather than from belief-discrepant messages. 
This result was confirmed in four separate ex- 
periments employing different attitude issues. 

The Brock and Balloun (1967) study pro- 
vides support for a selective attention to in- 
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formation hypothesis. This conclusion is 
based upon their analysis of real-life situa- 
tions in which there is little opportunity to 
evaluate prospective information. Individuals 
confronted by ongoing belief-discrepant in- 
formation may tune out this information by 
inattention or by not eliminating other inputs 
which would mask or conceal the message. 
Conversely, individuals confronted by ongoing 
belief-consistent information may tune in this 
information by heightened attention or by 
eliminating other inputs which mask or con- 
ceal the message. Thus, Brock and Balloun 
(1967) conclude that in ongoing message 
situations, the process of selective attention 
(exposure to information with attention only 
to parts) should be distinguished from the 
process of exposure (choosing to totally ap- 
proach or avoid future information). 
Other contributors to research in the area 
of selective exposure have explicitly ques- 
tioned the propositions which predict that 
individuals will seek out all supportive in- 
formation and avoid all nonsupportive in- 
formation. For example, Canon (1964) pre- 
dicted and obtained a significant interaction 
between selectivity and the degree of con- 
fidence individuals have in their initial opin- 
ion. Canon found that highly confident sub- 
jects approach both belief-consistent and be- 
lief-discrepant information, but that subjects 
low in confidence approached only belief- 
consistent information and avoided belief- 
discrepant information. However, other re- 
searchers (Freedman, 1965; Lowin, 1967) 
have found that manipulations of confidence 
did not interact with subsequent selectivity. 
Recently, Lowin (1967, 1969) has predicted 
that the ease or difficulty with which a par- 
ticular message can be refuted will interact 
with selectivity. In his model of selective 
exposure, the conditions which may be re- 
sponsible for an individual’s avoidance of 
supportive information in one situation and 
approach to nonsupportive information in 
other situations are specified. Lowin suggests 
that a belief system may be maintained by 
avoiding nonsupportive information or by 
approaching and refuting the nonsupportive 
information. Whether a person will approach 
or avoid depends upon the individual’s sub- 
jective estimate of the vulnerability of the 
nonsupportive information to his own refuta- 
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tional efforts. Lowin also extends this analysis 
to supportive information and predicts a 
strong but reverse preference. Lowin (1969) 
proposes that “if a person draws support for 
his own position from the similar beliefs 
others, then weak consonant messages wo 
be distasteful, for they imply flimsy so 
support” (p. 265). However, difficult-to-re- 
fute, supportive information would imply 
strong social support for an individual’s belief 
system and would be highly desirable. The 
model predicts a robust interaction, Indi- 
viduals will be more strongly attracted by a 
supportive message when information is diffi- 
cult to refute and by a nonsupportive message 
when information is easy to refute. The pres- 
ent experiment was designed to test this in- 
teraction prediction employing the Brock and 
Balloun (1967) procedure. 


of 


METHOD 
Design 
The design was a 2X 2 factorial (15 subjects/ 


cell) with two levels of subjects’ attitude toward a ‘ 


given topic (pro or con) and two types of message 
(easy or difficult to refute). Thus, half of the sub- 
jects listened to a message which was consistent with 
their prior beliefs, and the other half listened to a 
message which was opposed to their prior beliefs, 
Further, half of the subjects in each of these two 
groups heard a message which was easy to refute, 
and the other half heard a message which was diffi. 
cult to refute. 


Subjects 


A 16-item attitude survey (containing 16 different 
attitude issues) was administered to a large section 
of introductory psychology students (450) at the 
beginning of the semester. This survey was one of 
four separate unrelated questionnaires administered 
during the group session. On the basis of responses 
to the item in the survey on the legalization of 
marijuana, 30 subjects were randomly selected fron 
the 87 respondents who strongly disagreed with the 
legalization of marijuana sale and use. Another 30 
subjects were randomly selected from the 94 re- 
spondents who strongly agreed with the legalization 
of marijuana sale and use, The selected subjects were 
invited to participate several weeks later in an ex- 
periment on “judgment of sincerity” for course credit. 
The experimenter had not participated in the earlier 
group session, and no mention was made at any time 
of the relationship between the earlier attitude survey 
and subjects’ participation in the experiment. 


Materials 


Two 73-min speeches for the legalization of mari- 
juana were prepared, one using the seven arguments 
which were judged (by students comparable to the 


ve 


y 
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subjects) most difficult to refute among a set of 40 
arguments, and the other using the seven arguments 
judged most easy to refute. The seven easy-to- 
refute arguments for the legalization of marijuana 
sale and use were that it would (a) substitute for 
cigarette smoking, (b) help bring the American 
family together, (c) make rock and roll musicians 
feel less paranoid, (d) create more pleasant odors 
than cigarette smoke, (e) make it more of an “in 
thing” to do, (f) vindicate George Washington’s 
cultivation of it, and (g) heighten art appreciation 
by society. The seven difficult-to-refute arguments 
for the legalization of marijuana sale and use were 
that (a) it is an effective tension reducer, (b) it 
has no damaging genetic effects, (c) it has no harm- 
ful physiological effects, (d) present marijuana laws 
socialize criminality, (e) it is better for one than 
alcohol, (f) it has the lowest user/abuser ratio of 
all popular drugs, and (g) prohibitions against popu- 
lar drugs never work. 

A short 4-min neutral speech on imprinting in 
domestic chickens was also prepared. This was de- 
signed to lend credibility to the cover story about 
making judgments of sincerity from a speaker's 
voice qualities, and also to provide a baseline level 
of button-pressing to eliminate static. The neutral 
speech was tape recorded by a female, and both 
. marijuana speeches were recorded by the same male. 

A noise generator was used to introduce a continuous 
white noise into both channels of the tape. The noise 
could be temporarily eliminated from the tape by 
pressing a small, hand-operated button, Each press 
of the button eliminated the noise for 5 sec and also 
produced a mark on one channel of an event re- 
corder. The average sound pressure level was ap- 
proximately 75 db, which was sufficient to obscure 
or mask many of the words in the 4-min and the 
74-min speeches, 


Procedure 


All subjects were tested individually during the 
latter part of the semester. The order in which sub- 
jects appeared for testing was randomized. Each 
subject was seated in front of a tape recorder and 
told to listen to two different speeches—one about 
imprinting and the other about marijuana—and to 
rate the sincerity of each speaker after hearing the 
speech. After adjusting the earphones, each subject 
was told that the speeches were recorded under less 
than ideal conditions, so that “there is quite a bit 
of static on the tape”; the subject was then shown 
a button that could be pressed to eliminate the 
static, The neutral speech on imprinting was played 
first, and the subject was given the sincerity rating 
form. Then the subject listened to the marijuana 
speech and rated the sincerity of the speaker. After 
subjects rated both speeches, the original 16-item 
attitude survey was readministered, and each student 
was given a final debriefing concerning the nature 
and purpose of the experiment. 


RESULTS AND DISCUSSION 


The analyses of primary interest are those 
concerned with the button-pressing data. In 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR THE BASELINE 
AND EXPERIMENTAL BUTTON-PRESSING RATES 


Differ- 
Condition ence 
score 
Pro 
Difficult to refute | 11.41 | 1,30 | 11.93 | 1.8: 152 
Easy to refute 0.18 | 3.17 24 81 | —1.94 
Con 
Difficult to refute | 10.15 | 3.91 53 | 5.46 | —2.62 
Easy to refute 10.56 | 2.90 | 10.00 | 3.42 | —.56 


order to make a direct comparison of per- 
formance between baseline and experimental 
conditions, the frequency of button-pressing 
in each condition was converted to a rate 
measure (number of responses/minute). 
Table 1 shows the means and standard de- 
viations for the rate of button-pressing during 
the baseline and experimental messages in 
each experimental condition. 

An analysis of variance on the rate of 
button-pressing during the baseline message 
failed to produce any significant interaction 
or main effects. A similar analysis on the ex- 
perimental messages revealed a significant 
Attitude X Type of Message interaction, 
F(1, 56) = 8.11, p < .01. The main effects 
of attitude and type of message were not 
statistically reliable. An analysis of, variance 
on the baseline~experimental difference scores 
revealed similar results, Again, a significant 
Attitude X Type of Message interaction was 
obtained, F(1, 56) = 8.03, p < .01, while the 
main effects of attitude and type of message 
were not statistically reliable. 

Additional analyses indicated that there 
were no significant effects of the baseline and 
experimental messages on the sincerity rat- 
ings. An analysis of the pre- and postexperi- 
mental change in attitude toward the legaliza- 
tion of marijuana sale and use indicated that 
there was a significant main effect, F(1, 56)= 
43.70, p < .001, for the consistency of the 
message with the subject’s initial attitude 
position. Subjects in the pro conditions hear- 
ing a pro-marijuana message maintained their 
extreme pro position, while subjects in the 
con conditions hearing a pro-marijuana mes- 
sage shifted their attitude toward the pro 
direction. Although the size of this shift was 
not anticipated, it probably indicates only 
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that “ceiling” effects prevented subjects with 
initially extreme favorable attitudes toward 
the legalization or marijuana from shifting 
further in the pro direction, However, sub- 
jects in the con conditions were not similarly 
constrained from shifting toward the pro 
direction. 

In conclusion, by employing a different 
attitude issue, the legalization of marijuana, 
the results from the difficult-to-refute argu- 
ment conditions represent a successful repli- 
cation of Brock and Balloun’s (1967) experi- 
mental finding. The data of more primary 
interest in this experiment are those which 
have implications for Lowin’s (1967, 1969) 
approach-avoidance model of selective ex- 
posure to information. The present data offer 
strong support for Lowin’s (1967, 1969) 
prediction that the ease or difficulty with 
which a particular message can be refuted 
will interact with selectivity in exposure to 
information. A significant crossover-type in- 
teraction was obtained in this experiment. 
Subjects attended more to a difficult-to-refute, 
supportive message and to an easy-to-refute, 
nonsupportive message than to either a diffi- 
cult-to-refute, nonsupportive message or an 
easy-to-refute, supportive message. Consistent 
with this analysis, it may be necessary to 
know first whether a message presented to an 
individual is supportive or nonsupportive of 
that individual’s beliefs, and second whether 
that message is judged to be easy or difficult 
to refute, before it is possible to predict an 
individual’s approach or avoidance tendencies 
toward the message. It should be emphasized 
that the generalizability of the results of this 
experiment are confined to those situations in 
which behavioral approach and avoidance 
tendencies are assessed concurrently with the 
presentation of the message. In ongoing mes- 
sage situations, behavioral selectivity toward 
information would seem to be primarily medi- 
ated by selective attention, that is, exposure 
to all presented information with heightened 
or lowered attention to selected parts. 
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Alternate Models of Family Influence on Student Political Ideology 


Robert E. Kraut 
Cornell University 


Steven H. Lewis 
Yale University 


Research on the role of the family in political socialization has often assumed 
that intergenerational continuity and conflict are opposing models of political 
development, In a longitudinal design, cross-lagged correlational analysis and 
path analysis were used to examine the causal relations among parental politi- 
cal ideology, parent-student interpersonal relations, and student political 
ideology. The results showed both parental ideology and family relations to 
be important determinants of a student’s ideology. Leftist parental ideology 
and high family conflict each led to leftist student ideology, at least as the 
family variables were reported by the students. The finding that nonpolitical 
interpersonal relations were translated into political ideologies may be under- 
stood from psychoanalytic, social learning, or historical perspectives. 


Much of the literature on political sociali- 
zation derives from either a primarily psycho- 
logical or a primarily political approach. The 
psychological approach sees an individual’s 
political beliefs as intimately bound with his 
personality, his psychological needs and de- 
velopment, and his interpersonal relationships. 
These issues are central to most aspects of 
social life, including politics. Researchers and 
commentators with this orientation stress non- 
political personality and interpersonal, espe- 
cially family, relations in explaining the 
acquisition of individual political ideology 
(Adorno, Frenkel-Brunswik, Levinson, & 
Sanford, 1950; Lane, 1959; Smith, Bruner, 
& White, 1956). Compared with the psycho- 
logical approach, a more political one views 
political life as relatively distinct from per- 
sonal affairs and attempts to explain an indi- 
vidual’s political ideology and behavior in 
terms of his social characteristics, class and 
reference group interests, and the specifically 
political values and attitudes which he has 
learned (Campbell, Converse, Miller, & 
Stokes, 1960; Hess & Torney, 1967; Hyman, 


1959). 


This research was supported by student research 
funds from the Department of Psychology, Yale 
University, and National Science Foundation Grant 
IG-73-12 to the University of Pennsylvania. The 
authors thank Kenneth Keniston and Robert Abel- 
son who commented wisely on earlier versions of 
Eoo for reprints should be sent to Robert E. 
Kraut, Department of Psychology, Uris Hall, Cor- 
nell University, Ithaca, New York 14850. 


791 


These approaches are represented by two 
contrasting hypotheses in the recent literature 
on the student movement. What has been 
called the generational conflict hypothesis 
represents the psychological approach (Bet- 
tleheim, 1969; Feuer, 1969, 1972; Ruben- 
stein & Levitt, Note 2). While the details 
may vary, these researchers agree that leftist 
student political ideology is an extension of 
nonpolitical relationships developed between 
parent and child. Drawing heavily on psycho- 
analytic theory and an analogy between the 
family system and the political system, indi- 
vidual radical protest is ascribed to unre- 
solved or poorly resolved Oedipal conflicts 
compounded by permissive child-rearing prac- 
tices. The leftist student translates rebellion 
against parental authority into political re- 
bellion. With few exceptions, a generational 
conflict hypothesis has not been used to ex- 
plain rightist student political ideology (but 
see Adorno et al., 1950; Schiff, 1964). What 
has been called a generational continuity hy- 
pothesis represents a political learning expla- 
nation of the role of socialization in leftist 
student protest. In this view the family di- 
rectly transmits explicitly political values, 
attitudes, and information to the child. Stu- 
dents of all political beliefs share values and 
political stances with their parents. Radical 
students and their parents are more politically 
liberal than the general population; rather 
than rebelling against their parents, radical 
students are identifying with them and, given 
recent political history, responding appropri- 
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ately to a socialization which predisposed 
them toward both a leftist ideology and 
political activism. 

The published literature on the student 
movement presents mixed evidence on the 
conflict-continuity question. The early studies 
consistently supported the continuity hypothe- 
sis (see Flacks, 1967; Keniston, 1967 for 
reviews), but in recent studies the data were 
ambiguous. For example, Dunlap (1970) 
reported significant correlations between 
family conflict and leftist ideology. On the 
other hand, Gamson, Goodman, and Gurin 
(Note 1) and Cowdry, Keniston, and Cabin 
(1970) found that conflict did not relate to 
political ideology but did relate to political 
inaction. Flacks (1967) and Thomas (1971) 
found no association between family conflict 
and either political attitudes or activism. 
Finally, Block (1972), Braungart (1971), 
and Lewis and Kraut (1972) found that 
family conflict and political continuity were 
both related to leftist ideology. 

Part of this inconsistency is due to meth- 
odological and conceptual shortcomings (see 
Block, Haan, & Smith, 1970; Cowdry et al., 
1970; Lewis & Kraut, 1972; Lipset, 1968; 
Silvern & Nakamura, 1971), But even in 
those studies which have attended to these 
issues, the contradictions may be more ap- 
parent than real. For example, results sup- 
porting the continuity or the conflict hypothe- 
ses were often based on different types of 
data. Political measures such as parental 
party identification and student political 
ideology generally show  intergenerational 
agreement, while more personal domains tend 
to show an association between conflict and 
leftist ideology. Continuity and conflict are 
not necessarily incompatible models of po- 
litical socialization, although few studies in 
this area have entertained that hypothesis 
and few have used longitudinal data to study 
the causal influence of either continuity or 
conflict. 

This article presents a detailed examina- 
tion of the relationship of student political 
orientations to identification with and con- 
flict and disagreement with parents. Using 
panel data, cross-lagged correlations, and 
path analysis, it focuses on the causal influ- 
ence of conflict and continuity on student 
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political ideology. The specific findings of 
this study are discussed in the context of 
literature on political socialization concerned 
with the development of political orientations. 


METHOD 
Sample 


This research is a continuation of a longitudinal 
survey focusing on the development of political 
orientations during college. An initial random sam- 
ple of 400 freshmen from an incoming class of 1,250 
at Yale College was mailed questionnaires in the 
early fall of 1969. Fifty-five percent (N = 216) re- 
sponded, but due to the small numbers of female 
and nonwhite returns, only the data from white 
males were analyzed. These students were mailed 
second and third questionnaires in the spring of 
their freshman year (1970), before and after the 
National Black Panther Rally/Kent State Weekend; 
and a fourth in the spring of 1972, their junior year. 
The number of white male respondents for the four 
waves, respectively, was 164, 133, 80, and 108. Data 
presented here are from the first, second, and fourth 
waves. 


Ideology Scale 


Along with other items, each questionnaire con- 
tained a scale of political ideology on a left-right 
dimension and measures of students’ perception of 
parents’ political ideologies. The Student Ideology 
scale (Lewis & Kraut, 1972, describes this scale and 
other aspects of the methodology in more detail), 
asked for self-ratings of the extent of agreement 
with general ideological stances and specific political 
proposals. The mean test-retest reliability of the 
scale, given four times over a two-and-one-half 
year period, is high (mean r=.76), as is its corre- 
lation with students’ self-labeling of political views 
(r= .70). Students described their parents’ political 
ideology by giving mothers’ and fathers’ party 
identification on the first wave and by comparing 
parents’ ideologies to that of an “average American” 
on the second and fourth waves. These six items 
were combined to form a Parental Ideology scale 
by transforming to equivalent response scales and 
averaging. 


Conflict Scales 


The three questionnaires contained a variety of 
questions assessing family conflict in three areas: 
students’ conflict with parents over personal issues, 
students’ disagreement with general and specific 
parental political positions, and parents’ emotional 
distance from each other. Scale construction was 
based on this a priori tripartition supported by a 
subsequent factor analysis.t To limit potential selec- 


1The factor analysis showed that students’ po- 
litical conflict with their parents was distinct from 
their personal conflict. However, items measuring 
parent-student personal conflict and parent-parent 
personal conflict loaded highly on the same factor. 


MobEts or FAMILY INFLUENCE ON STUDENT POLITICAL IDEOLOGY 


tion biases, every item that was initially included 
on a questionnaire to measure conflict and disagree- 
ment was also included on one of the three conflict 
scales, even though this lowered scale reliabilities. 
Typical items on the Personal Conflict Scale, as- 
sessing a student’s affective relations with his par- 
ents, asked about disagreements with parents, con- 
flict and anger in the father-son relationship, the 
influence of the father on the student's personal 
development, and the student’s disagreement with 
his parents over issues such as curfews, schoolwork, 
chores, future plans, vacations, employment, and 
choice of friends and clothing. Typical items on the 
Political Conflict scale asked about a student’s gen- 
eral political disagreement with his parents, poten- 
tial disagreement over specific political issues such 
as civil rights, violent political disruptions and civil 
disobedience, and the need for social change; the 
extent of his father’s influence on his political be- 
liefs; and the similarity of his political opinions to 
those of his parents. The items on the Intra-Parental 
Conflict scale, assessing the emotional climate in the 
family as a whole and the relationship between the 
two parents, asked about conflict between parents, 
their sharing of interests, the amount of time they 
spent together, and the warmth expressed openly in 
the family. Some items in each conflict scale asked 
students to remember past conflict, while others 
asked about contemporary relations. Not every 
item was repeated on every questionnaire. These 
and similar items were made into three scales by 
first weighting each item to transform it to a com- 
mon response scale and then taking the mean of 
the items across the three questionnaires. 

All of these scales are, of course, indirect assess- 
ments based on student report, but the literature on 
offspring perception of parental political party and 
ideology suggests substantial accuracy (Hyman, 
1959; Sears, 1969), and Block (1972) has recently 
collected evidence which supports the general agree- 
ment of parent-student perceptions of child-rearing 
practices and family climate, 

All questionnaires also contained items relating to 
parental and student activism and involvement in 
political matters including, for students, global self- 
reports, activity participation checklists, and reports 
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of organizational membership. While the results 
indicated similarity between parent and student 
political activity and involvement (see Lewis & 
Kraut, 1972), this dimension showed little relation- 
ship to any of the conflict scales. 


ANALYSIS AND DISCUSSION 


While the content of the scales is straight- 
forward and the analysis focuses on the inter- 
relationships of only a few variables, the 
examination of differing causal explanations 
makes the statistical analysis complex. For 
both parental political ideology and family 
conflict, the correlational results are presented 
first and then examined for causal direction 
using cross-lagged correlational techniques 
(Pelz & Andrews, 1964; Rozelle & Campbell, 
1969). Next, the possibility is examined that 
the relationship between family conflict and 
student ideology is spurious, because of an 
autocorrelation or the simultaneous relation- 
ship of conflict and ideology to third varia- 
bles, such as social desirability or a more gen- 
eral autonomous personality (Tomkins, 
1963). Finally, a path analysis (Heise, 1969) 
is used to summarize the contribution of both 
continuity and conflict to student ideology. 

Table 1 shows the correlations between 
the Student Ideology scale and the Parental 
Ideology, Personal Conflict, Political Conflict, 
and Intra-Parental Conflict scales. For these 
correlations, all ñve scales included the rele- 
vant items from all questionnaires and are 
thus summary measures of ideology and con- 
flict, collapsed over the waves of the longi- 
tudinal design. The results indicate that a 
student’s ideology was related significantly to 
both his estimate of his parents’ political 


TABLE 1 


CORRELATIONS AMONG IDEOLOGY, VALUES, AND CONFLICT 


75 hs ae 

1. Student Ideology Scale" 84 

2. Parents’ Ideology Scale* .39* .87 

3. Personal Conflict Scale 39° —.12 89 

4. Political Conflict Scale” 24" —.29* .63* 93 

5. Intra-parental Conflict Scale? 25* —.08 7 41" 87 

6. Autonomy-Empathy Values® .28* 19 24" 12 09 71 
Note. N = 163 except for correlations involving the Intra-parental Conflict scale where N = 147, Diagonal E are: split- 


half reliability estimates, 
a High Oen indicate leftist ideology. 
b High scores indicate greater conflict, j 
o High scores indicate autonomous and empathic values. 


*p <01. 
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ideology and the amount of conflict he per- 
ceived in his family. 


Parent and Student Ideology: 
Causal Analysis 


Student ideology was similar to perceived 
parental ideology, with leftists generally hav- 
ing liberal parents and rightists conservative 
ones. This finding is consistent with many 
studies of political socialization. The prior 
literature suggests that the association is due 
to both a modeling-transmission effect and 
a “distortion toward similarity” effect in 
which offspring may overestimate the extent 
of agreement. Undoubtedly, both processes 
occur, with the distortion effect elevating 
somewhat the generally positive correlations 
found even when parental ideology is based 
on self-reports (Dawson & Prewitt, 1969; 
Sears, 1969). Cross-lagged correlational analy- 
sis was used to examine the relative strengths 
of these two processes. The general logic of 
this analysis is that for any pair of variables, 
a and b, which are measured on the same 
sample at two times, 1 and 2, a pair of corre- 
lations with rar» > Tbiag is consistent with the 
hypothesis that a caused b. On the second and 
fourth questionnaires students compared their 
mother’s and father’s politics to that of an 
average American and completed the Student 
Ideology scale. 

For a student’s perception of his mother’s 
ideology, but not his father’s, the data sup- 
port a modeling-transmission hypothesis over 
a distortion one. The correlation of a stu- 
dent’s perception of his mother’s ideology in 
his freshman year with his own ideology in 
his junior year (r = .40) is significantly larger 
than the correlation of his freshman year 
ideology with his junior year perception of 
his mother’s ideology, r= .17, £ = 1.74, p< 
.05, one-tailed. For the student’s perception 
of his father’s ideology, there were no reli- 
able differences in these cross-lagged correla- 
tions (r= .29 and .33, respectively), con- 
sistent with the null hypothesis that neither 
process was more powerful during this 2- 
year period.” 

Family Conflict and Student Ideology: 
Alternative Explanations 


A student’s political ideology was related 
to perceived family conflict, as well as to his 
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parents’ political ideology. The more disagree- 
ment, conflict, and lack of cohesion reported, 
the more leftist the ideology. As shown in 
Table 1, this was the case for each of the con- 
flict scales; it was also true for auxiliary 
measures not included in the scales. For ex- 
ample, when respondents compared their own 
and their parents’ ideologies to that of an 
average American, larger absolute differences 
between student’s and parent’s ideologies were 
associated with leftist Student Ideology scale 
scores (mean r = .29, p < .01). Furthermore, 
all three family conflict dimensions are highly 
intercorrelated (mean y= .52), suggesting 
that conflict, when it occurs, is more general 
than the separate scales imply and is not 
merely political or limited to parent-student 
relations. 

Autocorrelation. One might doubt the 
causal influence of family conflict on student 
ideology if the only evidence for it were the 
political disagreements leftist students had 
with their parents. As recent commentators 
have noted, the current student generation is 
moving to the left of their parents, perhaps 
because of historical and social-structural 
changes (Starr, 1974). In the present 
study, when students compared their own 
political beliefs to those of their parents, they 


2The data are also consistent with the hypothe- 
sis that as a student becomes more rightist, he sees 
his mother as more to the left, in a contrast effect 
(Rozelle & Campbell, 1969). However, as this is 
not consistent with earlier research and discussion, 
it can be dismissed as implausible on a priori 
grounds (see Sears, 1969). 

The slightly higher correlations for mother-stu- 
dent than for father-student ideology, as well as the 
significant cross-lagged analysis only for the former 
pair, may seem somewhat puzzling, especially as all 
these students are males. One of the many ways in 
which psychoanalytic concepts have influenced 
thinking in political socialization is in the assump- 
tion of the dominance of the father-son relationship 
in the development of political ideology. Thus, for 
example, Lane states, “around the world, the story 
is much the same: father is the source of political 
orientation, especially for sons . . .” (1968, p. 481). 
But those studies which have empirically assessed 
the degree of concordance between each of the 
parents and offspring have generally indicated a 
slight but consistent tendency in the opposite direc- 
tion, with mothers rather than fathers exerting 
greater influence (Jennings & Langton, 1969; Lang- 
ton, 1969; Maccoby, Mathews, & Morton, 1954; 
Nogee & Levin, 1958). Our results are also con- 
sistent with this finding. 
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saw themselves as more leftist or liberal than 
their parents (p < .001). Within this genera- 
tional trend, leftist students, as measured by 
the Student Ideology scale, placed themselves 
further to the left of their parents than did 
rightist students (r = .24, p < .01). However, 
because data of this sort may partly reflect 
an autocorrelation, they provide only limited 
support for the generational conflict hypothe- 
sis: To some extent a student’s ideology is a 
component of his political conflict with his 
parents. 

But the association of family conflict and 
student political ideology did not merely re- 
flect political differences between parents and 
students. The associations occurred on items 
probing conflict and disagreement over spe- 
cific personal issues and on more behavioral 
measures. Leftists reported more disagreement 
with their parents over nonpolitical issues 
including curfews; future plans; vacations; 
use of the family car; choice of friends and 
clothing; and amount of time spent on school- 
work, chores, and part-time work, All 12 cor- 
relations between reported conflict on this 
type of item and the Student Ideology scale 
showed greater conflict associated with leftist 
ideology (p < .001 by the sign test), and 6 
of these were individually statistically signifi- 
cant. On more behavioral measures, leftists 
reported receiving fewer letters from their 
parents (r= —.25, p<.01) and writing 
fewer to them (r = —.29, p < .01), although 
there were no differences in contact by tele- 
phone. 

Social desirability, Nor does the link be- 
tween family conflict and student ideology 
appear to be an artifact of differential self 
presentation including the underreporting of 
socially undesirable characteristics (Adorno et 
al., 1950; Donovan & Shaevitz, 1973; Haan, 
Smith, & Block, 1968). In the present data, 
admitting conflict with parents is socially 
‘undesirable; the correlation of the Personal 
Conflict scale and the Crowne-Marlowe so- 
cial desirability scale (Crowne & Marlowe, 
1964) is —.20, p < .05. However, the partial 
correlation between Student Ideology and 
Personal Conflict, holding social desirability 
constant, was not substantially different from 
the value in Table 1 (r = .37). 

Tomkins’s left-right personality dimension. 
Tomkins’s discussion (1963) of leftist and 
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rightist personalities suggests a theoretically 
interesting third variable which might simul- 
taneously relate to both student political 
ideology and family conflict, yielding a non- 
causal covariation between them. Tomkins 
argues that all strongly held but unproven 
belief systems are ideological in nature; po- 
litical beliefs are only one ideological belief 
system. Within and across diverse content 
areas, the ideology an individual holds is in 
part a function of a set of other beliefs, val- 
ues, and feelings with which the ideology 
resonates, Leftist and rightist ideologies differ 
in many ways, but in general leftist ideolo- 
gies are individual centered, while rightist 
ideologies are norm centered. Both a student’s 
leftist political ideology and his conflict with 
his parents may derive from a more general 
leftist personality with its emphasis on free- 
dom, autonomy, independence, empathy, and 
creativity and its rejection of constraint, con- 
formity, and authority. A leftist political 
ideology stresses independence and rejects 
rigid authority and the status quo; family 
conflict often concerns issues of personal inde- 
pendence and a rejection of parental author- 
ity and many of the constraints of normal 
socialization, 

The literature on leftist students provides 
evidence that the political ideology is sup- 
ported by values similar to Tomkins’s two 
dimensions of the “ideology of individualism” 
—independence/autonomy and empathy/emo- 
tional sensitivity (p. 102). Repeatedly, re- 
searchers describe political leftists as inde- 
pendent, individualistic, different, rejecting 
traditional religion, rebellious, empathic, 
humanitarian, altruistic, and. idealistic (see 
Keniston, 1967, 1968 for discussions and 
1973 for an exhaustive bibliography). In our 
data, the correlations between Student Ideol- 
ogy and unidimensional self-description ad- 
jective scales measuring individualism (mean 
r=.22, p<.01) and empathy (mean r= 
.23, p < .01) were both significant. Also, on 
an open-ended question leftists were most 
likely to mention themselves as the major 
source of their political, social, and ethical be- 
liefs (r = .20, p < .05) and least likely to 
mention their parents (r = —.17, p < .07). 
Tomkins’s leftist values also related to family 
conflict; the individualism scale correlated 
(r = .25, p < .01) with the Personal Conflict 


796 


scale, although the empathy scale did not 
(=a). 

While leftists’ concern with autonomy may 
have led them to resist or reject parental in- 
fluence, this does not seem to be the case with 
influence from peers. Student Ideology was 
essentially unrelated to reported influence of 
friends measured on items worded parallel to 
parental items, reporting friends’ influence on 
personal development or political beliefs; nor 
was Student Ideology related to listing friends 
as a source of values on the open-ended ques- 
tion. 

Further, as noted above, our data indicated 
that leftist ideology was associated with more 
family conflict in general, parent-student 
conflict being only one part of this. While 
not inconsistent with the autonomy hypothe- 
sis, these results make it less plausible. We 
cannot hypothesize that leftists were merely 
more autonomous, but must, at the very least, 
claim that their autonomy caused conflict 
between parents and within the rest of the 
family or that a certain level of conflict was 
an antecedent of an autonomous personality. 

In summary then, the correlational data 
left the role of autonomy-empathy values in 
causing both leftist ideology and family con- 
flict ambiguous. A statistical test of the hy- 
pothesis is presented below in the discussion 
of the path analysis. 


Family Conflict and Student Ideology: 
Causal Analysis 


For the moment, let us assume that the 
association between family conflict and po- 
litical ideology implies a direct causal link. 
The causal direction has not been established. 
Conflict may cause leftist ideology, as sug- 
gested by the generational conflict hypothe- 
sis, or leftist ideology may influence the ac- 
tual or perceived nonpolitical conflict in the 
family, as implied by Dunlap’s (1970) analy- 
sis. The use of longitudinal data allows a 
choice between these two interpretations. Of 
the questions assessing family conflict, 23 
were repeated on at least two questionnaires. 
From these, it was possible to compute 31 
pairs of cross-lagged correlations of family 
conflict at one time with the Student Ideology 
scale at another. In 24 (79%) of these com- 
parisons, the correlation of conflict at a prior 
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time with ideology at a later time was larger 
than the correlation of ideology at a prior 
time with conflict at a later time (p < .001 
by the sign test). Predictions from a student’s 
perception of family conflict to his political 
ideology are better than from the reverse. 
These results are consistent with the hypothe- 
sis that family conflict leads to a leftist ide- 
ology and were found for all three conflict 
scales. 

While these effects are consistent, they are 
not powerful: In only 3 of the 31 pairs of con- 
flict-ideology cross-lagged correlations were 
the differences individually significant. The 
mean correlation of prior conflict to later 
ideology is .24, while the mean correlation of 
prior ideology to later conflict is .16, again a 
reliable difference (¢ = 3.18, p < .01), but 
not a large one. During the college years, con- 
tact and thus mutual influence between par- 
ent and student are probably diminishing. 
Many of the relationships among conflict with 
parents, parental ideology, and student ideol- 
ogy are undoubtedly the residue of earlier 
influence. 


Summary Path Analysis 


To this point, we have focused on three 
possible determinants of a student’s political 
ideology. Separately, his parents’ ideology and 
his conflict with his parents may both con- 
tribute to his ideology, and, in addition, his 
valuing autonomy and empathy may precede 
both his ideology and his conflict with par- 
ents. Given these possibilities, it is fruitful to 
use a multivariate analysis to summarize the 
argument, to disentangle and compare the 
importance of the several related, but concep- 
tually independent variables, and to test a 
theoretical model. The method chosen, path 
analysis, allows examination of the asym- 
metric and cumulative relations among varia- 
bles once the causal order among them has 
been established independently, either empiri- 
cally or theoretically (Duncan, 1966, Heise, 
1969; Land, 1969). 

The assumptions for the path analysis are 
(a) parental ideology and family conflict were 
causes of student ideology, (b) parental ide- 
ology was a cause of family conflict, and (c) 
Tomkins’s leftist personality was a cause of 
both family conflict and student ideology. The 
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previous cross-lagged correlational analyses 
supported the assumption that parental ideol- 
ogy and family conflict lead to student ideol- 
ogy. The second assumption is held more 
tentatively. Prior research has shown that 
ideology forms and stabilizes during late ado- 

“lescence and early adulthood (Lambert, 
1972). Since this generally precedes the birth 
and rearing of children, it is likely that if 
parental ideology and parent-child conflict are 
causally related, parental ideology leads to 
conflict. 

Finally, we have argued and presented evi- 
dence that an autonomous and empathetic 
personality-value system may be a cause of 
both family conflict and student political 
ideology, making the relationship between 
them at least partially spurious.* 

In the path analysis that follows, the Per- 
sonal Conflict scale was used to measure fam- 
ily conflict, since it was the conflict dimension 
most relevant to the generational conflict 
hypothesis and, at the same time, least con- 
ceptually confounded with a student’s politi- 
cal ideology. Tomkins’s values of autonomy 
and empathy were entered by combining the 
independent-different and the loving-human- 
istic adjective scales averaged over the three 
times they were given. The results given 
below were not substantially different when 
the autonomy and empathy dimensions were 
entered into the path model as distinct varia- 
bles. 

Figure 1 shows the path model and the 
standardized path coefficients for the four 
variables. It shows that reported parental po- 
litical ideology (8 = .42) and parent-student 
personal conflict (8 = .42) were equally im- 
portant determinants of a student’s political 
ideology. Altogether, the path analysis model 
explains 35% of the variance in the Student 
Ideology scale, and these two variables alone 
account for 34% of the variance. 

While the zero-order correlation shows that 
the autonomy-empathy variable was related 


3 Alternate analyses show the second and third 
assumptions are not crucial. tting the Tomkins’s 
leftist ideology from the analy and varying the 
causal order of parental ideology and family con- 
flict does not change the major conclusion: Parental 
ideology and family conflict are equally important, 
independent causes of student ideology- 
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Ficure 1. Path model for family variables, stu- 
dent values, and student political ideology. (See 
Table 1 for zero-order correlations. N equals 163 for 
all relationships. Numbers indicated are the path 
coefficients, standardized beta weights, for the 
causal relations specified by the arrows. Only those 
significant at the .05 level or better are shown,) 


to student ideology, the path model indicates 
that it had no direct causal impact. The path 
coefficient (8 = .10) was smaller than its 
standard error and not significant. Thus, the 
simultaneous relationship of autonomy to both 
parental conflict and student ideology could 
not explain their relationship to each other. 
The major effect of the autonomy-empathy 
variable on student ideology was indirect, 
contributing to conflict with parents, which in 
turn influenced ideology. 


CONCLUSIONS 


For this group of students at least, the 
family had a dual influence on student po- 
litical ideology. Both parental ideology and 
the level of interpersonal conflict and dis- 
agreement within the family contributed sig- 
nificantly to student ideology. While taken 
together they accounted for only 34% of the 
variance in student ideology, the aim of the 
analysis was to compare the two, not to 
explain as much of the variance in student 
ideology as possible; indeed, this is a large 
amount considering that only three “predic- 
tor” variables were entered into the model. 
These findings contradict the theme implicit 
in much of the recent literature on student 
activism that generational conflict and con- 
tinuity are two opposing and mutually ex- 
clusive models for explaining the origins of 
leftists students’ ideology. Instead, they are 
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consistent with Hess and Torney’s (1967) 
formulation which included both identifica- 
tion and the transfer of learned interpersonal 
relationships, as well as the accumulation of 
political information and cognitive develop- 
ment, as complementary components of po- 
litical socialization. 

As noted frequently in both the political 
socialization and student activism literatures, 
to some extent parents serve as models for 
their children’s political knowledge, attitudes, 
and beliefs. It is irrelevant for our purposes 
whether the beliefs are directly learned or the 
similarity occurs because of needs for cogni- 
tive consistency (Heider, 1958). The simi- 
larity of student ideology to perceived pa- 
rental ideology partly reflects the influence 
of what can broadly be called direct trans- 
mission or modeling. 

There is currently some discussion in the 
political socialization literature over the size 
of the modeling effect (see Jennings & Niemi, 
1968; Sears, 1969). For example, Sears, 
among others, has argued that researchers 
often overestimate modeling effects by ignor- 
ing the social and demographic characteristics 
shared by parent and offspring which are pre- 
dictive of ideology. In addition, Sears has 
proposed that modeling is strongest for salient 
and limited choices such as political party or 
candidate preference and is minimal with 
more abstract or less clearly manifest politi- 
cal opinions. Our data show moderate influ- 
ence of parental modeling, at least for intel- 
ligent, middle-class male college students, 
even when the student opinion variable is an 
abstract political ideology. 

While the modeling effects were expected on 
the basis of both the political socialization 
and student activism literature, the findings 
on family conflict were not. Indeed, most 
research and discussion within the field of 
political socialization has suggested that re- 
bellion against parents, when it occurs, is 
simply not translated into political concerns 
or political rebellion (Jennings & Niemi, 
1968; Lane, 1959). It is against this back- 
ground that the finding that nonpolitical con- 
flict within the family influences ideology is 
surprising, and more so because the influence 
of nonpolitical conflict is about equal to that 
of direct political transmission. 


How are nonpolitical relationships, personal 
conflict, and disagreement translated into a 
political ideology? There are at least three 
ways: The first is that the student, to the 
extent that he rejects his parents, also rejects 
the values they hold, including political ones. 
In some sense, his parents become negative 
role models. However, the data do not sup- 
port this view. Parents of leftist students are 
themselves more liberal than the general 
population; if leftist students were expressing 
rebellion by rejecting their parents’ political 
ideology, they should become politically more 
conservative as often as more liberal. But, as 
noted in earlier research and the present data, 
in rebelling against their liberal parents, 
leftists move further left: Conservative stu- 
dents with liberal parents are found infre- 
quently (Keniston, 1973; Sears, 1969). 

A second way in which personal relations 
can shape political ideology corresponds to 
what Hess and Torney called the interper- 
sonal transfer model, An individual patterns 
his style of relating to political authority in 
part on a basic orientation toward authority 
established within the family. According to 
a Freudian version of this model (Feuer, 
1969), a person relates to powerful authori- 
ties as substitutes for his father, and if he 
has not adequately resolved his rebellion 
against his father, he will reenact these con- 
flicts with other authority figures, including 
governmental authority. A social learning 
version of the model is similar. Since the 
family and the political system are analogous 
social institutions, both oriented to power, 
allocation of resources, and social welfare, 
much of the child’s experiences with these, 
and hence his strategies for dealing with 
them, will have been developed in the context 
of the family. If the student develops a con- 
flictful and antagonistic relationship with au- 
thority in the family, he may transfer this to 
a political realm. 

A third, and more speculative, explanation 
begins with the particular features of the his- 
torical period in which the relationships be- 
tween family process and the development of 
political orientations have been observed. The 
translation of personal rebellion into political 
disaffection may be likely only when politics 
are salient within the family or general soci- 
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ety, especially in times of social unrest. A 
student’s conflict with and detachment from 
his family may lead him to be influenced more 
by other aspects of his social environment, 
including a peer group more liberal than the 
general population (Middletown & Putney, 
1963; Sears, 1969), a liberalizing college edu- 
cation, and a specific set of political issues— 
the draft, the war in Vietnam, drug laws— 
conducive to youthful disaffection. Periods of 
political conflict may increase the likelihood 
of generational political change, with the di- 
rection modified not only by the social en- 
vironment but also by historical forces. In 
any case, this study and several other recent 
ones which found associations between leftist 
ideology and family conflict contrast sharply 
with older studies that concluded that per- 
sonal rebellion is not translated into political 
dissent. The discrepancy suggests that the 
content and form of political socialization 
may vary in different historical periods. 
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The Development of Values: 
A Perceptual Interpretation 


John Paul McKinney 
Michigan State University 


A perceptual model is proposed as a theory of the development of values. 
Just as a “body schema” develops as the result of self-produced movement, 
values were assumed to develop more strongly in individuals whose locus of 
control was internal. Subjects responded to a locus of control measure and 
rated six attitude concepts on the semantic differential. The attitude concepts 
(three pairs) had been taken from an earlier factor analysis where they were 
the highest loading positive and negative items on three value factors. The 
distance between a subject’s positive evaluation of the positive attitude and 
his negative evaluation of the negative attitude was taken as a measure of 
value clarity, which was found to be significantly higher for internally than 
externally controlled subjects. In a second study, separate sets of semantic 
differential scales were chosen to represent the attitudes’ afferent- and efferent- 
habit strengths which contributed differentially to value clarity and value 
relevance scores, both of which were found to be significantly higher for 


internal subjects. 


The concept of value can be used to ac- 
count for object-consistent behavior in the 
way Campbell (1963) uses the term template 
to identify*the “memories” or instinctive 
guides that make possible an object-consistent 
response.” (p. 143) In the present study, a 
value was conceptualized as a schematic rep- 
resentation of earlier behavior which serves 
as a guide in the execution of new behavior. 
The analogy here is to those central nervous 
system representations of active motor behav- 
ior which constitute the body schema. It has 
been demonstrated that a vital component of 
the development of the schema is the active, 
that is, self-produced, movement of the body, 
or body part, in question and the associated 
reafference, or sensory feedback (e.g, Held, 
1961). 

Translating this theory of visual perception 
into the social psychology of values, it be- 
comes apparent that for a subject to develop 
a value (schema), he must freely choose 
(self-produced movement) and perceive the 
stimulus changes contingent on his own be- 
havior (reafference). If there is an analogy 
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to the self-produced quality of an individual's 
movement, it would be the generalized ex- 
pectancy that one is responsible for his own 
behavior, that is, internal locus of control. 
The present study was designed tò test this 
hypothesis that values would be more strongly 
developed in subjects who had an internal 
locus of control, that is, perceived reinforce- 
ment as a function of their own behavior, 
than in subjects with external control, 

One study which appears to lend support to 
such a schematic interpretation of values is 
the research of Caro (1966) on the values 
which middle- and working-class youngsters 
attach to various occupations, In testing the 
notion of a lower-class “value stretch” pro- 
posed by Rodman (1963), Caro found that 
working-class youngsters did, indeed, “stretch” 
their values in perceiving less prestigious oc- 
cupations as more desirable than did middle- 
class youngsters. However, Caro found that 
the working-class subjects also rated the 
more prestigious occupations significantly 
lower in value than did their middle-class 
counterparts. These last data, however, do not 
appear to have been predicted from Rodman’s 
notion of a lower-class value stretch. Caro 
interprets these findings in terms of cognitive 
dissonance. 

An alternative interpretation might focus 
on another sort of value stretch, namely, the 
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distance between the high ratings given the 
prestigious occupations and the low ratings 
given the less prestigious occupations. This 
distance will be called “value clarity” so as 
not to confuse it with Rodman’s (1963) very 
different use of the term value stretch. By the 
term value clarity is simply meant the dis- 
tance between one’s positive attitude toward 
a desired event and his negative attitude 
toward an undesirable one. It may be that 
middle-class youngsters have a greater free- 
dom of occupational choice, analogous to 
self-produced movement, and thus have built 
up value schemata of greater clarity than the 
lower-class subjects. This squares nicely with 
the data which show that locus of control is 
more external in lower-class than in middle- 
class subjects (Lefcourt, 1966; Rotter, 1966). 

Another study which might be interpreted 
as lending support to a schematic view of 
values was done by Thompson and Gardner 
(1969). These authors compared delinquent 
and nondelinquent boys and girls on a mea- 
sure of “telenomic trends,” that is, the values 
which their subjects perceived as character- 
istic of a happy-successful or an unhappy- 
unsuccessful person. In 10 of the 13 value 
categories tested, the nondelinquent subjects 
gave more extreme ratings, that is, gave rat- 
ings which were significantly higher for the 
positive values and significantly lower for 
the negative values than did the delinquents. 
In other words, the nondelinquents showed a 
greater value clarity between their positive 
ratings for the happy-successful values and 
their negative ratings for the unhappy-un- 
successful values. 

Just as with the lower-class subjects in the 
Caro (1966) study, it would be plausible to 
argue that the delinquents, who had a nar- 
rower range of value ratings, have a less 
clear perception of the dominant values, since 
their internal control is undoubtedly lower. 
The data from studies on locus of control 
(Lefcourt, 1966; Rotter, 1966) support this 
in indicating a more external locus for de- 
linquent and criminal subjects than for non- 
delinquents. 


Stupy 1 


This study Was an attempt to test directly 
the hypothesis that subjects with an internal 
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locus of control have a greater value clarity 
(i.e., between positive and negative attitudes) 
than externally controlled subjects. Such a 
hypothesis, if confirmed, would support an 
interpretation of values as schemata which 
develop as a function of internal control, just 
as the development of a body schema depends 
on self-produced movement. 


Method 
Subjects 


The subjects were 105 undergraduate students in 
child psychology courses at Michigan State Univer- 
sity. 


Selection of Values 


A variety of distinctions has been made between 
the concepts of “value” and “attitude.” The term 
value here refers to a more global construct which 
is defined by the correlations among a set of atti- 
tudes, Thus, the value of equality might be defined 
by one’s attitude toward minorities, racial groups, 
toward the poor, toward achieving school integration, 
etc., to the extent that these are intercorrelated. 
Thus, a factor analysis of ratings for a list of atti- 
tudes would yield a set of value factors. 

The value-defining attitudes used in the present 
study were taken from such a factor analysis (Mc- 
Kinney, 1973). In an earlier study, a group of 744 
college students had rated 95 behavioral attitude 
items for social desirability on a 5-point scale. The 
attitude items had been generated by another group 
of subjects in response to the sentence stems, “I am 
proud of myself when, . . .” and “I would be 
ashamed of myself if. . . .” The universe of re- 
sponses which 67 subjects gave to these two stems 
comprised the list of 95 items then given to the 
larger group of subjects for social desirability rat- 
ings. A factor analysis of these ratings yielded eight 
factors, three of which were used in the present 
study. Two items were chosen to represent each 
factor, the item with the highest positive loading 
and the item with the highest negative loading. 
Thus, six attitude concepts were used which to- 
gether defined the three values. These were as fol- 
lows: “making a good grade” and “flunking a test,” 
defining the value of academic achievement; “help- 
ing someone out” and “doing something deceitful,” 
defining the value of interpersonal morality; and 
“accomplishing something” and “letting myself 
down,” defining the value of general competence. 


Semantic Differential 


Subjects were asked to rate each of these six 
attitudes on nine semantic differential scales. Three 
of the semantic differential scales were picked be- 
cause Osgood, Suci, and Tannenbaum (1957) had 
found them to load highly on an evaluative factor. 
These were the following: good-bad, false-true, 
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and disreputable-reputable. The other six scales 
were taken from the activity and potency factors 
and were simply included as distractors and not 
scored. 


Locus of Control 


After completing the semantic differential, sub- 
jects were given the Social Reaction Inventory 
(Rotter, 1966) as a measure of internal versus ex- 
ternal locus of control. Those subjects with the 
highest, that is, external scores (n= 22) were then 
compared to those with the lowest, that is, internal 
scores (n= 27) on value clarity, that is, difference 
between their evaluative scores on the polar atti- 
tudes defining each of the three values (see Table 
1), The uneven » reflects the fact that while ap- 
proximately the highest and lowest 27% of the 
total group were used for maximum discrimination 
(Kelley, 1939), they were selected so there was no 
overlap in scores with the remaining subjects. 


Results 


The value clarity data are presented in 
Table 1. As predicted, internal subjects rated 
the positive attitudes higher and the negative 
attitudes lower than did the externally con- 
trolled subjects. Value clarity scores were 
computed for each subject as the difference 
between the ratings for positive and negative 
attitudes defining each value. Across all three 
values these value clarity scores were signifi- 
cantly higher for internal subjects than for 
external subjects, F (1,47) = 9.14, p < .01. 


Stupy 2 


While Study 1 dealt with value clarity, one 
could also consider another dimension of val- 
ues, namely their relevance, Fishbein and 
Ajzen (1972) observed that in some recent 
studies the inclusion of relevance scores had 
only served to reduce the predictive power of 
attitude measures. However, it may be that 
relevance is a separate dimension or aspect 
of a dimension which, in its own right, has 
predictive power. In the context of the per- 
ceptual theory outlined above, “relevance” 
would appear to be a component of value 
which has more to do with the efferent as- 
pects of choice making than with the afferent 
aspects. 

The distinction here is simply that terms 
like good-bad and reputable-disreputable 
describe an event from the point of view of 
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TABLE 1 


88 ON THREE EVALUATIVE 
Pairs OF ATTITUDE 
EPTS AND MEAN DIFFERENCE 
Scores (VALUE CLARITY) 

FOR Eacu Pair 


Internals» Externals> 
Value a ien Sa ee ~ 
X s xX st 
Academic achievement 
Making a good grade 5.03 2.95 
Flunking a test = 3.48 3.22 
Difference 8.51 | 24.87 | 6.22 | 23.99 
Interpersonal morality 
Helping someone out 6.85 5.68 
Doing something deceitful | —6.78 —4,95 
Difference 13.63 | 23.78 | 10.63 | 18,15 
General competence 
Accomplishing something | 6.62 4.50 
Letting If down =5.37 —3.18 
Difference 11.99 | 20.38 7.68 | 18.61 
sn=27, 
bn = 22, 


the event, itself, while relevant-irrelevant 
describes the event in terms of some evalu- 
ator. To say that an object is good tells one 
about the object, while to say that it is rele- 
vant to X tells one about X’s orientation 
toward the object. This distinction is similar 
to that made by Staats (1968) between the 
reinforcing function and the discriminative 
function of attitude stimuli. 

Such a distinction is also similar to the 
refinement that Doob (1947) draws in defin- 
ing attitudes. In a classic article on the be- 
havior of attitudes, he refers to their afferent- 
and efferent-habit strengths. By the former 
he means “the strength of the bond between 
the stimulus pattern and the response which 
is here defined as attitude.” (p. 141) About 
efferent-habit strength he says, “Like all im- 
plicit responses, attitudes can be said to have 
stimulus-value, i.e., they arouse other re- 
sponses, . . . Here again reference must be 
made to the habit strength of the attitude, 
only this time it is a bond between attitude 
as a stimulus and a response pattern. . . .” 
(p. 142) 

Two difficulties arise in comparing Doob’s 
position with the distinction made earlier 
concerning values. First of all, Doob is talk- 
ing about attitudes, not values; second, it 
has been observed (Campbell, 1963) that 
Doob does not provide operations on the 
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basis of which one aspect of attitudes can be 
distinguished behaviorally from the other. In 
effect, he sees both as developing with experi- 
ence, and whenever he mentions an increase 
in one, he also deals with the corresponding 
increase in the other. Campbell suggests that 
by using a single term, one could be more 
parsimonious and still lose nothing by way 
of additional meaning. 

The first of these concerns, that is, that 
Doob’s position deals with attitudes, not val- 
ues, is easy to answer. As in Study 1, a value 
is here considered a more global, general term 
referring to a broader class of behaviors or 
objects than an attitude. The position here is 
that value is a personal-social factor defined 
by a set of attitudes which load on that fac- 
tor. For simplicity’s sake values were defined 
in terms of the most relevant positive attitude 
and most relevant negative attitude, that is, 
those attitudes which load highest positively 
and negatively on the factor of interest. 

The first study was designed to examine 
value clarity, as the difference between the 
afferent-habit strength of the two attitudes. 
It should also be possible to examine value 
relevance, that is, the sum of the efferent- 
habit strengths of the two value-defining at- 
titudes. 

The afferent-habit strengths were either 
positive (for the positively defining attitude) 
or negative (for the negatively defining atti- 
tude). Their range, that is, the difference 
between them, was used to define the strength 
of the value as perceived as a standard, that 
is, as external. The efferent-habit strengths, 
on the other hand, do not refer to direction 
in value space (good—bad) but rather to the 
strength of the evaluator’s involvement. Thus, 
the range of such a habit strength would be 
zero or above. 

This mathematical difference between the 
two sorts of habit strength, coupled with the 
notion of both positive and negative attitudes 
defining a value, may provide an operational 
way of showing a distinction between affer- 
ent- and efferent-habit strengths, despite the 
fact that they increase simultaneously, as 
indeed they must as two components in a 
servomechanism. 

Study 2 was an attempt to demonstrate 
the differential behavior of the two attitude 
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habit strengths and, at the same time, to 
replicate verification for the “schema” theory 
of values. 

In other words, the present study used a 
similar technique but examined not only the 
clarity of values but also their relevance. 
Value clarity was defined, as in the earlier 
study, as the difference between the positive 
and negative afferent-habit strengths of the 
deñning attitudes. Value relevance was defined 
as the sum of the efferent-habit strengths. 
Internal subjects were predicted to have val- 
ues of greater clarity and relevance than 
external subjects. 


Method 
Subjects 


The subjects were 128 undergraduate students in 
child psychology courses at Michigan State Univer- 
sity. After discarding the data from partially com- 
pleted forms and from those subjects who were 
over 25 years old, there were 106 remaining subjects. 


Instruments 


Semantic differential. The instrument used in- 
cluded fourteen 7-point scales. Six scales (three 
each) came from Osgood et al.’s (1957) activity and 
potency factors. These were not scored and were 
only included as distractors. The eight scales of 
interest were scales which Osgood had found to 
load on the evaluative factor. These were: false- 
true, good-bad, foolish—wise, disreputable-reputable, 
important — unimportant, intentional — unintentional, 
meaningless-meaningful, and accidental-controlled. 
The first four of these were found in Osgood et al.’s 
study to load only on the evaluative factor and 
could thus be considered pure evaluative items. The 
latter four (with the exception of intentional—unin- 
tentional, which was considered a synonym for ac- 
cidental-controlled, and was therefore included for 
balance, i.e., to have four items of each type) were 
found to load also on the factor of activity. In 
this dual allegiance to the two factors of evaluation 
and activity, these four items distinguish themselves 
as referring to the subjective locus of value, while 
the former scales refer simply to the objective 
behavior. 

The first four items were chosen to represent 
Doob’s (1947) notion of afferent-habit strength, 
namely, “the bond between the stimulus pattern and 
the response which is here defined as attitude.” 
(p. 141) In other words, they refer specifically to 
the quality of the incoming information about the 
concept under consideration. 

The latter four scales were chosen to represent 
Doob’s (1947) efferent-habit strength. That is, they 
refer to “the bond between attitude as a stimulus 
and a response pattern (implicit or overt). . . .” 
(p. 142) 


co 
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The concepts to be scored were the same as in 
Study 1. Each of the six concepts was presented on 
a separate sheet, followed by the 14 scales. The six 
concepts were presented in a fixed random order, as 
were the 14 scales. In addition, on half of the 
scales the polarity was reversed. Again, as in Study 
1, the subjects were then given Rotter’s (1966) 
Social Reaction Inventory as a measure of locus of 
control. 

Afferent-habit strength scores were obtained by 
adding the subject’s ratings for the appropriate four 
scales for each of the six attitudes separately. 
Efferent-habit strength scores were obtained simi- 
larly, using the appropriate four scales. 

Those subjects with the highest locus of control 
scores, that is, most external (n=26) and those 
with the lowest scores, that is, most internal (n= 
30) were compared on their value clarity and value 
relevance scores. Again, the uneven results from 
the fact that the extreme groups (27%) were se- 
lected, so there was no overlap in scores with the 
remaining subjects. 


Results 


The afferent data for both polar attitudes 
of each value are given in Table 2. As pre- 
dicted, internal subjects rated the positive 
attitudes more positively and the negative 
attitudes more negatively than did the ex- 
ternal subjects. Across all three values this 
difference was highly significant, F (1,54) = 
7.24, p < .01. 

The efferent data (see Table 3) were also 
significant. That is, when the intensity of the 
ratings for the two polar attitudes were added 
to obtain value relevance scores, the internal 
subjects had significantly higher scores across 
the three values, F (1,54) = 5.69, p < .025. 

Finally, a factor analysis was done on the 
data for the eight evaluative scales summed 
across the six attitude concepts for all 105 
subjects. A principal components analysis with 
varimax rotation was carried out to check 
the assumption that four of the scales defined 
an efferent component of value, while the 
other four defined the afferent component. 
The two factors emerged as predicted and 
accounted for 39% of the total variance. 

As predicted, subjects with internal locus 
of control scored higher on both value clarity 
and value relevance than externally controlled 
subjects. Since the semantic differential uses 
a Likert-type rating scale, one might question 
whether these results merely reflect a greater 
extreme response set among internal sub- 
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TABLE 2 


Mean TOTAL Scores on Four EVALUATIVE 
SCALES FoR THREE Pairs or ATTITUDE 
Concepts AND MEAN DIFFERENCE 
Scores (VALUE CLARITY) 
FOR EACH Parr 


Internals" Externals! 
Value AS — Eia: 
£ xX s 
Academic achievement 
Making a good grade 5.10 2.15 
Flunking a test —=4.67 =4,19 
Difference 9,77 | 37.43 6.34 | 34.16 
Interpersonal morality 
Helping someone out 8.93 8.12 
Doing something deceitful | —7.77 —5,96 
Difference 16.70 | 27.46 | 14,08 | 40.63 
General sat espe 
Accomplishing something | 8,93 742 
Letting myself down =—4,30 — 3.08 
Difference 13.23 | 28.74 | 10.50 | 34.98 
on = 30, 
bn = 26, 


jects for some unexplained reason. This was 
not the case, however. That possibility was 
checked by computing the mean frequency of 
extreme ratings (1 or 7) across all 14 scales 
for all six concepts for external subjects (X 
= 24.7) and internal subjects (X = 23.3). 
There was no significant difference in these 
means, 


GENERAL Discussion 


One way of interpreting these results is as 
support for a “template” theory of values. 


TABLE 3 


Mean Toran Scores on Four EVALUATI 
Scares FOR THREE PAIRS or ATTITUDE 
Concepts AND MEAN SUM SCORES 
(VALUE RELEVANCE) 


Value ie] e 
TARRE JE 
Academic achievement ine) [r= 
Making a good grade 6,90 17 
Flunking a test 1.17 1.42 
Sum 8.07 | 43.58 | 5.19 | 49.36 
Interpersonal morality 
Helping someone out 9.13 9,00 
Doing something deceitful | 6.83 4.62 
Sum 5.96 | 29,82 | 13.62 | 29.26 
General competence 
Accomplishing something 9,77 8.54 
Letting myself down 3.20 2.08 
Sum 12.97 | 21.07 | 10.62 
*n = 30. 
dnu = 26, 
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The specific model which generated the hy- 
potheses was von Holst’s (1954) efference 
copy theory, which interprets perceptual units 
as correlator-storage mechanisms which de- 
velop as a result of self-produced movement 
and reafferent feedback. The analogy for 
these conditions in the social sphere seemed 
to be met by internally controlled subjects 
who perceive reinforcement as contingent on 
their own behavior. In short, these results 
suggest, at least for the values tested, that 
internal subjects have developed stronger val- 
ues, both in terms of clarity and relevance, 
than have external subjects. While admittedly 
assessed only with a pencil-and-paper tech- 
nique, the particular values used were selected 
because they were found (McKinney, 1973) 
to be most relevant for the population from 
which the current subjects were drawn. 

There is also support in these data for the 
independence of the two sorts of attitude 
habit strength described by Doob (1947). 
The afferent-habit strengths of two bipolar 
attitudes were used to measure value clarity. 
It was assumed that value clarity referred to 
the bidirectionality of a value, and thus the 
difference between the afferent-habit strengths 
of the two attitude poles was employed as a 
measure. The efferent-habit strengths, on the 
other hand, were summed to obtain a measure 
of value relevance. That the two measures 
both differentiated between internal and ex- 
ternal subjects supports the independence of 
the two sorts of attitude habit strength. It 
also supports the position taken here of a 
distinction between values as perceived, that 
is, clarity, and values as expressed, that is, 
relevance. 

The independence of the two attitude di- 
mensions is further suggested by the results 
of the factor analysis of the eight scales used 
to measure the afferent- and efferent-habit 
strength of the six attitude concepts. The 
results of the factor analysis revealed, as 
expected, two main factors, one dealing with 
value as a property of the evaluated behav- 
ior (good-bad, reputable-disreputable, etc.) 
and the other dealing with value as a property 
of the active relationship between the object 
and the evaluator. 

The fact that the value scores (clarity and 
relevance) were both higher for internal sub- 
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jects supports the perceptual model of values. 
However, the different manner in which these 
two dimensions were computed supports the 
discrimination between their components. 
When items defining afferent-habit strength 
were used to rate the attitude concepts, the 
difference between the ratings of the two bi- 
polar concepts defined value clarity. When 
items defining efferent-habit strength were 
used, the sum of the ratings of the two bi- 
polar attitude concepts defined value rele- 
vance. Clarity appears to be a perceptual di- 
mension, and relevance, a motor-affective di- 
mension of values. Any prediction from values 
to behavior should probably take both into 
account, just as response certainty has been 
found to be an important moderator variable 
in predicting from attitude scales (Sample, 
Note 1). In other words, “value” for an indi- 
vidual is neither entirely objective nor en- 
tirely subjective but, like perception, lies on 
the interface between external reality and 
internal commitment. 


REFERENCE NOTE 


1. Sample, J. Response certainty: An indicator of 
the cognitive and motivational basis of an attitude 
expression. Paper presented at the meeting of the 
Midwestern Psychological Association, Cleveland, 
May 1972. 
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This study set out to investigate whether the risky shift found in group con- 
sensus decisions as compared to group members’ personal decisions extends to 
situations that combine risk and aggression, In a between-subjects design, sub- 
jects were exposed, either alone or as members of unisex triads, to an experi- 
mental task in which the risk entailed severe noxious consequences. The find- 
ings show that for both sexes, groups take significantly greater risks than 
individuals even when risk taking is mediated by aggressive means. The results 
are seen and discussed as limiting the explanatory scope of the risk-as-a-value 
hypothesis and as more consistent with the diffusion of responsibility 


hypothesis. 


In recent years there has been a growing 
interest in the observation that groups tend 
to be more risky than individuals (for re- 
views, see Clark, 1971; Dion, Baron, & 
Miller, 1970; Pruitt, 1971a, 1971b; Vinokur, 
1971). The purpose of the present study was 
to determine whether this observation would 
hold also in situations where risky decisions 
are associated with infliction of pain on a 
second party who has no voice in reaching 
these decisions. 

Except for a few investigations (e.g., Bem, 
Wallach, & Kogan, 1965; Malamuth & 
Feshbach, 1972; Pruitt & Teger, 1969; 
Wallach, Kogan, & Bem, 1964), studies of 
the risky shift have used hypothetical situa- 
tions to obtain subjects’ positions on matters 
of risk. The most popular instrument has 
been the Choice Dilemmas Questionnaire 
developed by Kogan and Wallach (1964). 
These hypothetical problems call for role 
playing rather than providing the experience 
of actually being in a risky situation and 
concretely experiencing the outcome of risky 
decisions. Freedman (1969) has cogently 
argued against the practice of role playing 
as a substitute for real experimentation, his 
principal reason being that people are very 
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poor predictors of their own actual behavior. 
The present study, in addition to studying 
risky behavior in a relatively realistic context, 
allows the examination of differential predic- 
tions derived from two hypotheses that have 
been used to explain the risky-shift phenome- 
non: diffusion of responsibility and risk as 
a value. 


The Diffusion of Responsibility Hypothes 


The diffusion of responsibility hypoth« 
has been modified several times because o! 
data disconfirming earlier formulations. The 
first modification of the original diffusion of 
responsibility hypothesis was suggested by 
Wallach and Kogan (1965), after obtaining 
the risky shift under a group discussion 
without consensus condition. This finding led 
Wallach and Kogan to formulate the “affec- 
tive bonds” hypothesis. It states that “the 
affective bonds formed in discussion . . . 
may enable the individual to feel less than 
proportionally to blame when he entertains 
the possible failure of a risky decision” 
(p. 18). The affective bonds emerge not so 
much because of the content of the discus- 
sion but due to the very social nature of 
the situation. 

The affective bonds hypothesis has been 
investigated in a number of studies (Dion, 
Miller, and Magnan, 1971; Kogan & Wallach, 
1967a; Myers, Murdoch, & Smith, 1970; 
Pruitt & Teger, 1969; Wallach, Kogan, & 


Burt, 1967; Clark & Willems, Note 1), with 
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most of the evidence failing to support this 
interpretation of the risky shift. One impor- 
tant reservation to this conclusion is that 
the risky shift found under the “discussion 
without consensùs” condition does not neces- 
sarily lead to the rejection of the original 
diffusion of responsibility hypothesis. It might 
still be argued that the group setting elicits 
people’s readiness to behave impulsively, ir- 
responsibly, or irrationally through altering 
self-blame attribution, although some other 
mediating mechanisms such as increased ex- 
citement or reduction of fear may also be 
operative, 


"Risk as a Value Hypothesis 


The risk as a value hypothesis proposed by 
Brown (1965) maintains that the group dis- 
cussion leads the participants to change their 
decisions made individually in the direction 
of the .relevant social-cultural value. This 

, change is mediated by the “self-image 
maintenance” mechanism. The argumént is 
that the person tends to consider his own 

- decisions to be more consistent with widely 
held values ‘than the decisions of similar 
others, Upon the discovery that some others’ 
decisions. are more consistent with these 
values than his.own, the person tries to main- 
tain his self-image by changing his decision 
towards the extremes. The risk as a value 
hypothesis can explain not only “risky shifts” 

` but also “conservative shifts.” 

The latter hypothesis is more consistently 
supported by the data than the diffusion of 
responsibility hypothesis (see Levinger & 
Schneider, 1969; Madaras & Bem, 1968; 
Stoner, 1968; Wallach & Mabli, 1970; 
Wallach & Wing, 1968; Willems, 1969; 
Clark & Willems, Note 1; and others) and, 
consequently, there is a tendency among 
students of the risk shift to abandon the 
diffusion of responsibility hypothesis. It 
may be argued, however, that the diffusion 
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of responsibility hypothesis has not been 
given a fair test. It seems improbable that 
feelings of guilt and responsibility for fail- 
ure are aroused by the use of the choice- 
dilemmas task, which has been used in 
most of the ©udies aimed at refuting the 
diffusion of responsibility hypothesis. When 
the responsibility was experimentally assigned 
to groups or individuals and subjects were 
led to anticipate some real consequences 
following group discussion, the results have 
tended to support the original diffusion of 
responsibility hypothesis (see Bem et al., 
1965; Wallach et al., 1964), although there 
are exceptions here as well (Pruitt & Teger, 
1969). 

The concept of diffusion of responsibility 
was originally invoked to account for certain 
group aggressive behaviors before being ap- 
plied to the risk-taking realm (see Kogan & 
Wallach, 1967b). Both agegressive- and risk- 
taking behaviors may entail negative conse- 
quences and may represent irresponsibility 
and loss of self-control on the part of the 
actor. The question remains to determir 
whether the risky shift occurs where aggres- 
sive consequences are an integral component 
of the risk. This might happen when people 
face circumstances which involve a choice 
between a safe alternative and a risky one 
which is also associated with damaging or 
hurting a second party who has no vote in the 
decision making. As a concrete example, we 
can think of a group of army commanders 
who are engaged in a war of attrition against 
the enemy. By the employment of shelling 
only, they can maintain a very low death toll 
among their soldiers, but they cannot win the 
war. If they decide to mount a major attack, 
they will undoubtedly lose many of their men 
and, in doing so, may either win the war or 
go down to a total defeat. 

Diagramatically, this situation may be 
described as shown in the following chart: 


Risk-taking decision 


+ positive outcome (victory) 
— “victimization” (high death toll) 


— negative outcome (defeat) 
— “victimization” (high death toll) 


Conservative decision 


— Loss of positive outcome (no victory) 
+ “no victimization” (very low death toll) 
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In the present study the subject is faced 
with a modest analogue of this double ap- 
proach-avoidance conflict. On the one hand, 
taking the risky alternative which may yield 
a desirable outcome (monetary reward) and 
which entails the infliction of pain may also 
lead to a loss of the promised outcome. On 
the other hand, choosing the conservative 
alternative does not require the negatively 
valued behavior of pain infliction but mini- 
mizes the desirable outcome. 

According to the diffusion of responsibility 
hypothesis, under these circumstances, the 
interaction experienced by the group mem- 
bers should facilitate both their aggressive 
and risky behaviors. As mentioned before, 
both behaviors entail possible negative conse- 
quences (suffering of the “victim” and loss 
of money) and thus raise a question of re- 
sponsibility for the dual failure. If the respon- 
sibility for both types of failure can be shared 
with the group, their joint effect should yield 
a very strong risky shift. 

Let us now examine how the “risk as a 
value” hypothesis predicts the results for 
the situation, combining aggression and risk 
taking. We can reasonably expand the risk 
as a value hypothesis to a broader social 
value hypothesis which would imply that the 
group will induce its members to be more 
consistent with any given social value under 
consideration. To be risky, according to the 
social value hypothesis, is a positive social 
value, while to inflict pain on others is nega- 
tively valued by society. Thus under a condi- 
tion in which aggression and risk taking are 
combined, these two tendencies operate in 
opposing directions. While a social value hy- 
pothesis could explain any outcome on a post- 
factum basis by assumptions regarding the 
relative strengths of the two values, the ex- 
perimental design will permit more specific 
analyses of decisions which bear upon this 
hypothesis. 

A second interest of the study is in sex 
differences. Wallach, Kogan, and Bem (1962) 
and Kogan and Carlson (1969), among 
others, have found that “the females as well 
as males show the same change toward 
greater risk-taking as a result of the group in- 
teraction” (1962, p. 86). Males have, in most 
studies, been found to be more physically ag- 


Y. Ymon, Y. JAFFE, AND S. FESHBACH 


gressive than females (Feshbach, 1970). This 
finding applies to situations employing an 
aggression-machine apparatus comparable to 
the one used in the present study (Buss, 
1963). Since pain infliction is a strong nega- 
tive value for females, evidence of a risky 
shift effect for females would have special 
relevance for the social value hypothesis. It 
should also be mentioned that the design used 
in the present study differs from previous 
studies of the risky shift which have used 
the same subject both as an individual and 
as a group member and avoids some of the 
methodological problems entailed in a within- 
subjects as compared to between-subjects 
design. 


METHOD 
Subjects 


Subjects were 91 male and 80 female undergradu- 
ate students at the University of California, Los 
Angeles, who volunteered to take part in this ex- 
periment to fulfill a requirement of the introductory 
psychology course they were taking. Twenty-five 
males and 20 females were randomly assigned to an 
individual condition, and 22 male triads and 20 
female triads grouped at random participated in a 
group condition. Under both conditions, males were 
joined by a male confederate and instructed by a 
male experimenter; females were joined by a female 
confederate and instructed by a female experimenter. 


Apparatus and Paradigm 


The Buss Aggression Machine and the “learning 
experiment” paradigm associated with it (Buss, 1961) 
constituted the core of the experimental procedure. 
The machine has two components, a teacher’s panel 
and a learner’s panel located in adjacent rooms. On 
each trial, the teacher presents to the learner a 
certain sequence of lights (light switches and light 
bulbs numbered 1, 2, 3, and 4 are available for this 
purpose). The learner can respond by either pressing 
a light switch marked A or a light switch marked B. 
The concept he had to “learn” in the present experi- 
ment was the association between Response A and 
light sequences that include Light Number 1; 
sequences without this light required Response B. 
Subjects were instructed to go through 20 trials 
unless the learner performed to a criterion of 4 
consecutive correct trials. Confederates’ responses 
were programmed to be incorrect on Trials Number 
1, 2, 5, 7, 8, 11, 14, and 18, that is, to show progress 
but not to reach the criterion within the 20 trials 
of the experiment. Correct responses were signaled 
to the learner by means of a light switch located on 
the teacher’s panel. For incorrect responses, the 
teacher could use either a flashing light signal or 
punishment by means of electric shock. By pressing 
any 1 of the 10 shock buttons marked 1-10 and 
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arranged in a line on his panel, the teacher could 
ostensibly administer to the learner (through a pair 
of electrodes fastened to the learner's fingers) vari- 
ous shock intensities, from a very mild shock at 
Level 1 through a painful shock at Level 5, to a very 
painful shock at Level 10. In actuality, the confed- 
erate learner took the electrodes off his fingers once 
he got out of the teacher's sight. A concealed device 
on his panel allowed him to record the shock level 
administered by the teacher, 


Safe and Risky Alternatives 


Real-life situations often present two main choices ; 
one guarantees a safe outcome, whereas the other, 
the risky choice, offers a much more desirable out- 
come but with much less assurance for its attain- 
ment. Ostensibly, as explained to the subjects by 
means of the Table of Contingencies (see Table 1), 
the safe course of action in this experiment was to 
use only the flashing light to signal incorrect re- 
sponses. Subjects were assured that on the basis of 
Previous experiments, it was certain that through 
this means the learner would be able to at least 
articulate the concept at the end of the experiment 
(i.e. after the 20th trial). Yet, since the full objec- 
tive was to teach the learner to criterion as fast as 
possible within the 20 trials, the teachers were told 
not to expect any monetary reward if this goal was 
not accomplished. 

The risky course of action was to use electric 
shock as punishment for incorrect responses. Subjects 
were instructed that, in general, the use of electric 
shock, particularly the higher shock levels, was found 
to be more effective in promoting fast learning than 
the use of a light signal. In order to make the risk 


“features of this course of action more salient, the 


experimenter emphasized that this use of shocks 
depends on the individual learner and that with 
some learners the use of strong, painful shocks may 
turn out to have disruptive effects and yield worse 
results than when a light signal or very mild shocks 
are used. To further characterize the teaching task 
as one that could involve considerable risk, subjects 
were promised that they would share monetary 
rewards in direct proportion to the speed of learning 
to criterion. The different payoffs and the proba- 
bilities associated with their attainment were sum- 
marized in a table (see Table 1) which was presented 
to the subjects, 

Following the rationale advanced by Bem et al. 
(1965), the entries in the Table of Contingencies 


` were calculated to yield a constant expected payoff 


value (i.e., magnitude of payoff probability of at- 
tainment) for all levels of risk taking. In other 
words, in the present experiment, the higher the 
shock levels chosen by subjects, the more they are 
inclined toward risky aggression per se. 


Procedure 


Group condition. For each experimental Session, 
four subjects, one of whom was the confederate, 
signed up for a “teaching-learning” experiment. 


TABLE 1 
TABLE OF CONTINGENCIES 


Probability | Monetary 


Variable used (in %) payoff 
Shock 
Teaching to criterion 
within the: 
First 10 trials 10 32 (16)" 
First 15 trials 40 8 (4) 
By the 20th trial 80 4 (2) 


Light signal 
Teaching only to 
articulate the concept | 100 0 (0) 


^ Sums in parentheses refer to payoffs (in dollars) offered 
in the individual condition. Since the monetary rewards are 
shared equally with the learner, the amount of money each 
subject can earn under group or individual conditions is the 
same, 


Upon arrival at the lab, they were met by the 
experimenter who told them that the purpose of the, 
experiment was to examine two main teaching meth- 
ods, one of which uses electric shock as punishment. 
A rigged lottery procedure assigned the confederate 
the role of learner, whereas the other three subjects 
were to serve as the group of teachers. The learner 
was then handed a sheet titled “Instructions for 
learner” and was asked to read it carefully in an- 
other room while the experimenter explained the task 
to the teachers. After the learner left the room, the 
experimenter showed the subjects to their seats 
around the teacher’s panel of the apparatus, handed 
each of them a sheet titled “Schedule of light 
Sequences and teacher's record sheet,” and went on 
to explain and demonstrate the teaching procedure 
as described above. 

It was repeatedly emphasized to the subjects that 
they should try to do their best in teaching the 
learner to reach the criterion of four consecutive 
correct trials as fast as possible and that monetary 
rewards, which would be shared equally with the 
learner, would be given accordingly, Using the Table 
of Contingencies (see Table 1; this table was left 
with the subjects throughout the experiment), the 
experimenter thoroughly explained the safe and risk- 
involving teaching methods available. Two Steps pre- 
ceded each administtation of shock: First, each sub- 
ject was to write down privately his personal 
decision; then the three of them were to exchange 
these decisions and discuss the issue until they 
reached a unanimous decision (i.e, a consensus), 
which was then to be administered to the learner 
each time by a different subject, Since eight incor- 
rect responses were programmed, the group had eight 
occasions on which to make a decision, However, 
if the first group discussion (ie, the discussion 
following the first incorrect response) ended with a 
decision to use the safe alternative (i.e, the flashing 
light method), they were to use it throughout the 
experiment; no combination of the two methods 
was allowed. 

Just before asking the learner to come in, the 
experimenter asked each subject to sample Shock 
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Levels 1, 3, and 5 “in order to give you a concrete 
idea of the different shock levels and their gradua- 
tion.” Actually, the main purpose of this procedure 
was to reinforce the credibility of the cover story. 
After the learner was called in, the experimenter 
asked him if he had any questions or objections to 
the experiment. He voiced no objections and was 
escorted by the teachers to the adjacent room with 
the learner’s panel in it, whereas the experimenter 
left the room. The teachers demonstrated the panel 
to the learner, put the electrodes on his fingers, and 
after closing the door behind them, returned to their 
seats in front of the teaching panel to start the 
experiment. At the end of the 20th trial they called 
the experimenter in, and together they went into the 
learner's room and asked him to try to verbalize the 
concept he was supposed to learn. The confederate, 
of course, spelled it out correctly. Each subject was 
then thoroughly interviewed to find out about any 
suspicions he might have had regarding the experi- 
ment. The experimenter then debriefed the subjects 
and thanked them for their cooperation. 

Individual condition. Only two subjects, one of 
whom was the confederate, appeared for each ex- 
perimental session. The procedure was identical to 
that used with groups, except for the absence of 
group discussion whenever a decision had to be 
made about feedback for incorrect responses. After 
each incorrect response, the subject was to record 
his decision and then proceed to administer it to 
the learner, 

Suspicion of and failure to follow properly the 
experimental procedure resulted in elimination of 
three males and one female from the individual 
condition and two male triads and two female triads 
from the group condition. 


RESULTS 


As was indicated earlier, subjects could 
avoid risk taking altogether by using the 
flashing light signal instead of the electric 
shock. Two male and five female subjects 
under the individual condition and three fe- 
male groups chose the safe alternative. No 
male group avoided the risk completely. 
Although there were no significant differences 
between experimental conditions in the fre- 
quency with which the safe alternative was 
chosen, it was felt that inclusion of subjects 
who chose the latter might appear to bias the 
results in the direction of less risk in the indi- 
vidual condition inasmuch as the mean shock 
level for these subjects over trials would have 
to be zero. (Actually, inclusion or exclusion 
of the subjects who chose the flashing light 
signal in the data analysis did not materially 
affect the experimental outcome, the sig- 
nificance of experimental differences for the 
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whole group and for males being somewhat 
enhanced, and for the females, only slightly 
reduced.) Consequently, only one set of 
analyses, based on those subjects who took 
at least a minimal level of risk, will be 
presented. 

Mean shock level. Table 2 summarizes the 
results for males and females separately over 
the eight decision trials. A two-way analysis 
of variance showed that there were two highly 
significant main effects. The mean of group 
consensus decisions was higher than the 
mean of individual decisions for both sexes, 
F(1,65) = 20.25, p < .001. The mean of 
decisions for the male subjects was higher 
than the mean of decisions for the female 
subjects, no matter whether the decision was 
made individually or group induced, F (1, 65) 
= 34.58, p < .001. No interaction between 
the experimental condition and sex was found, 
F(1,65) < 1. 

A similar analysis of variance was per- 
formed separately on each of the eight deci- 
sion trials. In all cases the above two main 
effects were highly significant with no inter- 
action effects. It is noteworthy that for males, 
a significant difference was obtained on the 
very first trial (group M = 3.82; individual 
M = 2.75, p < .01), before any group dis- 
cussion took place. The results thus support 
the notion that groups are more risky than 
individuals, even when aggression is the 
medium of risk taking. 


Dynamics of Group Decisions 


The data concerning the individual recom- 
mendations (made under the group condition) 
made it possible to classify the group deci- 
sions according to how they were reached. 
This analysis is important because it permits 
one to determine whether the group exchange 
of information and discussion led subjects to 
be influenced by the most risky or the most 
cautious group member according to the value 
made salient in the situation, as the social 
value hypothesis assumes. 

In Table 3 the distribution of 280 group 
decisions made by male and female subjects, 
according to the different rules by which they 
have been determined, can be seen (8 Deci- 
sions X 35 Groups). A chi-square analysis 
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pe TABLE 2 ations that involve victimization of a person 

ri c, *~* Mean SHOCK ADMINISTERED over THE Excur Decision who has no part in the decision-making 

Taiats Ee aA AND Grove process. In light of the tendency in the litera- 

ture (see Pruitt, 1971a, 1971b) to reject the 

Condition Males Females diffusion of responsibility hypothesis, it is 

: Indivi pertinent that the results of the present study 
$ ndividual “ S 5 ; 

S Mean shock 3.95 2,38 are more consistent with this hypothesis than 

ai 20 14 with several of the value theories. 

Men shock 5.20 3.58 The experimental task used in the study 

# n 20 15 entails real outcomes for both the “victim” 

or Tore = crstommne g #24 the subjects. Thus, the usual criticism 

ee RG eet Eie indigiu chiens pe pointing out the improbable application of the 

tion, refers to number of groups in that condition, diffusion of responsibility concept to studies 


using the hypothetical choice-dilemmas task 

showed that there was no significant differ- does not hold. The basic tenet of the diffusion 

ence between the distributions of group deci- of responsibility hypothesis (i.e., implication 

sions for males and females, x*(5) = 10.665, of others leads to perception of shared re- 

p > .05; therefore the data for both sexes sponsibility which, in turn, brings about a 

could be combined. shift toward risk) is consistent with the ex- 

It was found that on 35% of these deci- perimental findings. At the same time, it must 

sions the group members agreed at the outset be cautioned that the data do not provide 

» even before their discussion. The majority direct evidence that the perception of shared 

~ rule was used in 39% of these cases. In 11% or diffused responsibility is the critical mecha- 

of the cases, the groups decided to take the nism mediating the greater risk-taking be- 

average of the individuals’ suggestions, In havior by the group. Thus, the data can also 

15% of the cases, one of the group members be explained by Rettig’s (1966) suggestion 

carried the other two members along with his that the essential mediating factor may be 
opinion. In 9% of the cases, he was the most fear reduction through affiliative behavior, 

conservative one, and in 6% of the cases, he On the other hand, in order to interpret the 

was the most risky one. It is, of course, only data according to risk as a value theory, it 

Vas in these latter 15% of the decisions that the must be assumed that risk taking associated 

Ne group did not act on the basis of some kind with aggression is, in our culture, a widely 

of a compromise, held positive value and that the average indi- 

vidual considers himself to be at least as will- 

ing to engage in risky aggression as his peers. 

The results of the present study provide Itis difficult to accept this assumption, espe- 

evidence that in a between-subjects design, cially in the case of females for whom pain 

groups of both sexes take higher risks than infliction has a clear negative social value. 

individuals. This conclusion is limited to situ- Apart from the dubiousness of the assumption 


Discussion 


TABLE 3 


7 k DISTRIBUTION OF GROUP DECISIONS ACCORDING TO THE RULE USED TO Reacu THEM 
FOR MALES AND FEMALES 


Majority rule Surrender to: 
Sex Agreement ————| Average rule — < Total 
Risky Conservative Risky Conservative 
Male 47 39 25 18 11 20 160 
Female 51 32 13 13 6 5 120 
‘Total 98 71 38 31 17 25 280 
% for total 35 25 14 11 6 9 100 
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that risk has a greater social value than avoid- 
ance of pain infliction, the finding of a risky 
shift effect on the very first decision, before 
any discussion or knowledge of the decision 
of others in the group was possible, casts 
considerable doubt on the social value hy- 
pothesis. Furthermore, the data collected on 
the dynamics of the group decisions show that 
out of 280 decisions, there were 182 cases in 
which the group members did not agree be- 
fore starting the discussion. Out of these 182 
decisions, the groups acted on the basis of 
some kind of a compromise (majority rule, 
average) in 77% of the cases. Only in a few 
cases the mechanisms of “self-image mainte- 
nance” or “release” could operate, and even 
then their influence worked toward more 
caution as well as toward more risk (see 
Table 3). Thus, the social comparison and 
release theories (which are both value 
theories) would have difficulty in accounting 
for these data. 

The assumption that “groups shift in a 
direction toward which most members are 
already attracted as individuals” (Pruitt, 
1971b, p. 498), is shared not only by all 
value theories but also by the diffusion of re- 
sponsibility hypothesis. The difference between 
the theories lies in pointing at the specific 
mechanism which causes the shift. According 
to value theories, it is the social comparison 
or the example of the high-risk-taking mem- 
ber of the group, and according to the dif- 
fusion of responsibility hypothesis, it might 
be the anonymity or lack of affective bonds 
(Dion et al., 1971) existing among group 
members which facilitates the shift toward 
risk. While the results of the present study 
are consistent with the diffusion of responsi- 
bility hypothesis, further research with more 
experimental realism than achieved by using 
the hypothetical choice dilemmas is needed to 
clarify the group processes leading to the dif- 
ference in risk-taking behavior between indi- 
viduals and groups. Also needed is further 
analysis of and experimental support for the 
observation that a group setting tends to 
override restraints against aggressive actions, 
particularly in view of the many group situa- 
tions, ranging from a street gang to commit- 
tees on national policy, in which decisions 
entailing aggressive alternatives are made. 
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REFERENCE NOTE 


1. Clark, R. D., Ill, & Willems, E. P. The risky- 
shift phenomenon: The diffusion-of-responsibility 
hypothesis or the risk-as-value hypothesis. Paper 
presented at the meeting of the Southwestern 
Psychological Association, St. Louis, April 1970. 
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Effect of Importance of Freedom and Attraction to Group 
Members on Influence Produced by Group Pressure 


Jack W. Brehm and Millard Mann 
Duke University 


An analysis of group influence processes indicates that the functional relation- 
ship between attraction to a group and influence exerted by the group de- 
pends on the importance to the individual member of any freedom that is 
threatened by group pressure to conform. This proposition was tested by an 
experiment in which the effect of group conformity pressure on public and 
private opinions was measured under two levels of attraction to the group 
crosscut by two levels of importance of freedom. The results were as pre- 
dicted. It was found that when the importance of freedom was relatively low, 
the greater was the attraction to other group members, the greater was the 
amount of both public and private influence. But when the importance of 
freedom was relatively high, the greater was the attraction to other group 
members, the less was the amount of both public and private influence. The 
condition of high importance of freedom with high attraction to the group 
produced a boomerang effect on private opinions and a trend toward a boom- 
erang on public opinions. A sleeper effect was found when the group pressure 
to conform was removed; subjects in the condition of high importance of 
freedom and high attraction to the group changed positively on a measure of 
private opinion. It was concluded that the evidence supported a reactance 
theory analysis of group influence processes. Limiting conditions and other 


qualifications were discussed. 


Early work on group influence processes 
(Back, 1951; Berkowitz, 1957; Festinger, 
1953; Schachter, Ellertson, McBride, & 
Gregory, 1951) held that when a member is 
in disagreement with the rest of his group, 
his tendency to come into agreement increases 
as his attraction to the group increases. 
Somewhat later work (French, Morrison, & 
Levinger, 1960; Raven & French, 1958a, 
1958b; Zipf, 1960) pointed out a variety of 
factors that may increase a group member's 
resistance to change. And, as Cartwright and 
Zander (1968) noted in their review of work 
on group influence processes, it is generally 
agreed that resistance to change decreases as 
attraction to the group increases. Thus, an 
increase in attraction both increases pressure 
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of a conference on Processes of Minority Influence 
organized by the Social Science Research Council 
and funded by the National Science Foundation. The 
research was supported by NSF Grant GS 3205 to 
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W. Brehm, Department of Psychology, Duke Uni- 
versity, Durham, North Carolina 27706. 


to change and decreases resistance against 
doing so. However, an analysis of group influ- 
ence processes in terms of reactance theory 
(Brehm, 1966) leads to a quite different con- 
clusion. The purpose of the present article is 
to show how reactance can reverse the effect 
of group attraction on the tendency to agree 
with the group. 

Briefly, reactance theory assumes that indi- 
viduals believe themselves to have behavioral 
freedoms, such as being able to hold a par- 
ticular position on an opinion issue. It- is 
hypothesized, then, that when a person feels 
a freedom has been threatened or eliminated, 
reactance is aroused. Reactance is defined as 
a motivational state that is directed toward 
the restoration of whatever freedom has been 
threatened or eliminated. One way to restore 
a threatened freedom is to exercise it, that is, 
to engage in the behavior in question. For ex- 
ample, if a person thought he was free not 
to engage in a particular behavior X, and 
were then subjected to pressure to do X, he 
could restore his freedom by doing not-X. 
Furthermore, the more important to him is 
the freedom not to do X, the greater is the 
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magnitude of reactance when pressured to do 
it, and the greater is his consequent tendency 
to do not-X. 

As previous theory and research indicate, 
the amount of pressure to come into agree- 
ment with a group increases as the member’s 
-attraction to the group increases. Thus, for a 

group member who disagrees with the group, 
increases in attraction to the group increase 
the perceived threat to his freedom to decide 
for himself on the issue. Where the impor- 
tance of this freedom is relatively low, little 
reactance is aroused and the individual’s 
tendency to come into agreement is primarily 
a function of the pressure to agree. The 
greater is his attraction to the group, the 
greater is the pressure to agree and the more 
he tends to change toward the group’s posi- 
tion. But when the importance of the threat- 
ened freedom is relatively great, the magni- 
tude of reactance is relatively large, and the 
individual’s tendency to come into agreement 
is determined by reactance as well as by the 
pressure to agree. With high importance of 
freedom, increases in attraction to the group 
produce more pressure to come into agree- 
ment but also produce large increases in 
reactance, Large amounts of reactance tend to 
cancel out any tendency to come into agree- 
ment with the group. If the magnitude of 
“reactance is greater than the pressure to 
agree, the individual changes away from the 
i group’s position. In general, then, increases in 
bine to the group produce increases in 


{change toward the group position only to the 
, . (extent that the freedom threatened by the 


g group pressure is low in importance, The more 

` important is the freedom, the less does in- 

= » Crease in attraction produce increases in 

change toward agreement. If a freedom is 

sufficiently important, increases in attraction 

result in decreases in change toward the 

<, group, or result in increases in boomerang 
attitude change. 

In summary, two hypotheses can be stated. 

» If a member finds himself in disagreement 

with the rest of his group, and the group 

>$ explicitly pressures him to move toward agree- 

ment, (a) when the pressure threatens a 

freedom of relatively low importance, the 

i a 8teater his attraction to other group mem- 

bers, the greater is his change toward the 
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group’s position, and (b) as the importance of 
the threatened freedom increases, the positive 
relationship between attraction and change 
tends to reverse. The first hypothesis asserts 
that the long-accepted relationship between 
attraction and change holds only when the 
freedom threatened by the pressure to change 
is relatively low in importance. The second 
hypothesis asserts that there is an interaction 
between importance of freedom and attraction 
to the group; when the importance of freedom 
is relatively high, increased attraction results 
in little or no increase in positive change and 
can result in increased negative change. 

An interesting question may be raised 
about the nature of any reactance-produced 
resistance or boomerang effect. It is possible 
that any resistance or boomerang effect would 
be confined to the individual's public posi- 
tion, and that his private position would at 
the same time change toward the group posi- 
tion. That is, the attempt to restore freedom 
might be purely public rather than both pub- 
lic and private, since it is only the public 
behavior that is subject to surveillance by the 
group. To obtain evidence on this point, both 
public and private measures of opinion must 
be obtained. 

A second point of interest concerns how the 
individual responds once explicit pressure to 
change has been removed, Theoretically, re- 
moval of the explicit pressure reduces reac- 
tance and leaves intact the motivation to 
change toward agreement with the group that 
derives from disagreement and attraction to 
the other group members, Thus, a third hy- 
pothesis is: (c) Even when the freedom to 
make one’s own judgment is important, once 
the explicit pressure to change has been 
removed, the greater the attraction to other 
members, the greater is the change toward 
their position (a sleeper effect). 


METHOD 


In order to test the hypotheses, it was first neces- 
sary to give each subject the impression that he 
was a member of a group, that he was unanimously 
opposed by other members of the group on an 
issue of importance to the group, and that he was 
under explicit pressure by the other members of the 
group to change his position in favor of the major- 
ity. Within this general context, it was necessary to 
create two levels of the subject’s attraction to other 
group members and two levels of the importance of 
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the freedom to hold one’s own position on the issue, 
Finally, it was necessary to measure both public and 
private opinion change due to the group’s pressure 
to change and to measure any further private opin- 
ion change after the group pressure to change had 
been removed. These needs were met by adapting 
methods used by Burdick (1955) and McBride 
(1954). 

Procedure 

Male undergraduates from elementary psychology 
courses at Duke University volunteered for an 
experiment entitled, “Group Decision Making in 
Ecology.” While these students are required to 
participate in a certain amount of research, they 
may select freely among a variety of research activi- 
ties. Subjects were scheduled in groups of five or six, 
and when they arrived they were seated around a 
hexagonal grouping of trapezoidal tables.t On each 
table was a pad of note paper, a pencil, and a card 
with an identifying letter lying face down. After 
being seated, subjects were told that they were going 
to make an evaluation from reports on the Alaska 
oil pipeline—from environmentalists, oil interests, 
and the Defense Department—and decide what im- 
pact the environmentalists’ paper would have in the 
final recommendations made by a group of Rand 
Corporation scientists to the Administration and 
Congress. 

Before proceeding to the business of the group, 
each subject was asked to identify himself to the 
rest of the group and to tell a little about his 
interests, hobbies, etc. This was done in an effort to 
have subjects feel they were members of a group 
and in order to serve as the basis for the manipu- 
lation of attraction to the group. After each sub- 
ject had introduced himself, all subjects were asked 
to fill out a questionnaire that required rating each 
other group member on a scale ranging from —5 
(dislike extremely) to 5 (like extremely). These 
questionnaires were then collected. 

To provide a rationale in which importance of 
freedom could be manipulated and explicit pressure 
to change might be brought to bear by group mem- 
bers, subjects were told that a foundation was 
subsidizing this research. The experimenter explained 
that the foundation funds were to be used to ap- 
proximate a real-life situation as closely as possible. 
It had therefore been decided that each group would 
receive a $15 participation fee that they could divide 
among themselves as they saw fit. In addition, an 
individual prize was to be awarded to the subject 
whose judgment came closest to the correct posi- 
tion, that is, the position picked by the group of 
scientists. 

Manipulation of importance of freedom. A booklet 
of reports on the issue was then handed to. each 
subject. The covering page gave additional instruc- 
tions and included the manipulation of the im- 
portance of freedom. Some subjects learned that the 

1A confederate was used to bring two groups up 
to the size of five. 
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prize for individual accuracy of judgment was $2 
(low importance), while others learned that it was 
$10 (high importance). A 

When the reports had been read, the experimente: 
explained that it was necessary to keep a record of 
the discussion and to ensure against all nonverbal 
communications. Shields were then erected so that 
subjects could not see each other, and the experi- 
menter explained the system of balloting, writing 
notes to discuss the issue, and reballoting. The 
subjects were instructed to turn over their cards in 
order to learn their identifying letters, which would 
be used to show the distribution of opinion within 
the group and with which they would address and 
sign their discussion notes. All subjects were actually 
given the letter, B. 

Manipulation of attraction. At this point the ex- 
perimenter noted that other groups had expressed 
interest in learning the outcome of the attraction 
questionnaire, and so, he explained, he had prepared 
a summary sheet to show each person how he had 
been rated by others in the group. Low attraction 
was created by informing subjects that their ratings 
on the 11-point scale were —2, —1, —1, 0, and 0; 
high attraction was created by informing subjects 
that their ratings were 3, 3, 2, 2, and 1 (one score 
would be omitted in groups consisting of five per- 
sons). 

Balloting and discussion. Following the attraction\’ 
manipulation, the first ballot was passed out, and») 
subjects were asked to indicate their position and 
sign their letter so that the results for the wholé 
group could be recorded. The ballot consisted of the 
question, “How much weight do you think the 
environmentalists will have in the final decision?” 
followed by a 30-point response scale labeled “very 
little” at one end and “determining factor” at the 
other. The ballots were collected, ostensibly tallied, 
and a form presumably showing the resulting grou 
distribution was then given to each subject. Ea 
form was actually designed to give the subject the 
impression that he was opposed by all other mem-, 
bers of the group. The other letters, Aj C, D, E, 
and possibly F, were all shown at the opposing side 
of the scale within a range of 7 points and with an 
average distance from the subject of about 15 points. 

When the ballot results had been distributed subs 
jects were told they could begin their note writing. 
Within a few minutes the experimenter began, — 
circling the table to pick up completed notes and 
to pass out notes. All notes distributed were pre- 
written by the experimenter. Each subject first 
received a note from C that asked his reason for his 
choice of position. He then received a note from A 
that stated he (B) did not seem to agree with the 
group and stated an argument for the group’s side 
of the issue. The third and fourth notes were 
designed to make explicit the group pressure to 
agree. The third note, signed by D, said, “Why 
don’t you change your mind? If you do, T'il vote 
$6 of the $15 group fee for you.” The fourth note. 
signed by E, said, “I’ve been in contact with D andl 
agree with his offer. Why don’t you change?” After 
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delivery of the fourth note, the experimenter an- 
` nounced that the discussion was over. The whole 
discussion period lasted about 15 minutes. 

Public and private measures, A second ballot was 
then given to each subject, and they were reminded 
+, to sign their letters to their ballots so that the results 
could be shown to the group. Upon completion and 
/ collection of this public measure, the experimenter 
passed out a two-page questionnaire and asked sub- 
jects, as a favor, to fill it out in order to aid another 
experimenter in the department who was doing 
research on the same general topic. The experimenter 
-explained that the questionnaire was of no interest 
to him, that they should put neither their names 
nor their letters on it, and that the questionnaires 
would simply be collected and turned over to this 
other researcher for his own use. One item on this 
questionnaire was identical to the group opinion 
ballot, and it constituted the measure of private 
opinion on the issue. The second page of the ques- 
tionnaire contained questions designed to measure 
the success of the experimental manipulations. 

Removal of group pressure and final ballot. When 
the questionnaires had been collected, the experi- 
menter announced that it was time for the group to 
vote on the distribution of the $15 group partici- 
pation fee, and he distributed a form for that 
purpose. This form asked the subject to write in 
the amount of money he thought should be given 
to each person, including himself, identified by 
letter. Subjects were told not to put any identifica- 
Apion on this sheet. These sheets were then collected. 
Finally, the experimenter asked for another ballot 
to be filled in, stressing that it was this vote that 
would count for the individual prize. Subjects were 
told to put their full names on the sheet so that 
they could be contacted if they won. This, of course, 
was the measure of opinion after the group pressure 
had been removed. The ballots were collected, the 
shields removed, and a careful debriefing session 
followed, 


Summary of Design 


. Two levels of importance of freedom were crosscut 
by two levels of attraction to other group members. 
. Fifty-two subjects were run in groups of 5 or 6, 
and they were randomly assigned to the four experi- 
mental conditions within these groups with the 
restriction that condition numbers be equal. The 
initial measure of opinion -was taken after the in- 
duction of importance and attraction but prior to 
| the: introduction of group pressure. After introduc- 
tion of the explicit group pressure to change, there 
was a public and then a private measure of opinion. 
After the group pressure had been removed, a final 

s private measure of opinion was taken, 


RESULTS 


Thorough pretesting was done to ensure 
hat the rationale was believable, the issue 
was of concern to the subjects, and the 


“amounts of money were credible. The post- 
= 


= 


en 


BY GROUP PRESSURE 819 
experimental interviewing revealed very little 
suspicion on the part of subjects, and no 
subject data were deleted from the analysis 
of results. Although subjects were run in 
groups, and there was a small amount of 
group interaction at the beginning of each 
experimental session, subjects were assumed 
to be independent of each other for the sta- 
tistical analysis of results. This assumption 
of independence seems justified because sub- 
jects were isolated from each other prior to 
the introduction of any influence processes 
and because subjects within groups were ran- 
domly assigned to the four experimental con- 
ditions. ; 


Success of Manipulations 


The second page of the anonymous ques- 
tionnaire contained questions designed to 
measure the success of the experimental ma- 
nipulations. Two questions were relevant to 
the manipulation of attraction to the group, 
and two questions were relevant to the ma- 
nipulation of importance of freedom, 

To measure attraction, one question asked, 
“How much did the group like you?” Re- 
sponses were made on an 11-point scale: run- 
ning from —5 (labeled Dislike Extremely) 
to 5 (Like Extremely). The mean perceived 
liking by others in the low-attraction condi- 
tion (—1.04) was clearly less than the mean 
perceived liking by others in the high at- 
traction condition (1.31). An analysis of 
variance revealed a strong main effect be- 
tween high- and low-attraction conditions, 
F (1,48) = 22.37, p< .001, and no other 
effects. 

The second question, which used an iden- 
tical response scale, asked, “In general, how 
much did you like the other members of the 
group?” The mean response was —1.19 in 
low-attraction groups and 1.38 in high-at- 
traction groups. An analysis of variance of 
these data yielded a strong main effect for 
the attraction manipulation, F (1,48) = 
23.48, p < .001, and no other significant ef- 
fects. Thus, the evidence from these two 
questions indicates that the attraction manip- 
ulation was reasonably successful. 

The first question designed to measure im- 
portance of freedom was “How important was 
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it to take your own stand on the issue?” 
Responses were made on an 11-point scale 
that ran from 0 (labeled No Importance) to 
10 (Very Important). While the means were 
4.46 in the low-importance and 5.20 in the 
high-importance condition, neither this dif- 
ference nor any other effect was found to be 
significant by an analysis of variance. It may 
well be that subjects interpreted “take your 
own stand” to mean a public defense of their 
position. This interpretation is essentially 
irrelevant to the importance of freedom di- 
mension, and its use by subjects would ac- 
count for the lack of effect obtained on this 
question. 

The second question to measure importance 
asked, “How concerned were you that the 
group make an accurate estimate?” Responses 
were made on an 11-point scale that ran 
from 0 (Didn’t care) to 10 (Very concerned). 
It was expected that concern for accuracy of 
the group estimate would vary inversely with 
the importance to the individual of the free- 
dom to make his own judgment. In accord- 
ance with that expectation, these means were 
6.34 in the low-importance and 4.38 in the 
high-importance condition. An analysis of 
variance of these responses revealed a weak 
though reliable main effect for importance, 
F (1,48) = 6.79, p< .05. Taken together, 
the responses to the two questions concerning 
the importance of freedom indicate that the 
manipulation of importance may have been 
weak. However, it may also be that these 
questions were not good measures, for, as 


TABLE 1 


Means or [INITIAL Position AND OF PUBLIC AND 
PRIVATE CHANGES UNDER PRESSURE TO CHANGE, 
AND OF PRIVATE CHANGE AFTER PRESSURE 
1s REMOVED 


Under pressure 


iti to changed Pressure 
ii Initial 
iti removed 
Condition positiona =a =| SCARE 
| Public | Private 
7.15 | 3.70 2.39 1.31 
ion] 8.00 8.62 6.38 3.77 
ce 
Low attraction 7,62 5.61 3.69 1.15 
High attraction 8.00 —2.31 —2.85 4.92 
MSe 15.93 31.45 24.10 50.24 


ion was tabulated as a deviation from the 
center of the scale, Possible scores ranged from 1 to 15. 

b A positive value indicates change toward the group position, 
All measures were on a 30-point scale. 
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can be seen below, the major results indicate 
that the manipulation of importance of free- 
dom was effective. 

A filler question on the questionnaire asked 
subjects, “How useful did you find the dis- 
cussion?” While it was not anticipated that 
this question would show a difference, an 
analysis of variance yielded a main effect for 
the attraction variable, F (1,48) = 7.29, p 
< .01. Subjects in low-attraction conditions 
thought the discussion less useful (3.34) than 
did subjects in high-attraction conditions 
(5.12). It seems plausible that subjects in 
the low-attraction conditions may have 
judged the discussion to be relatively useless 
because they were relatively uninterested in 
what the group was doing. 


Influence Processes 


The subjects were about equally divided 
initially regarding whether they thought the 
environmentalists would have much or little 
influence. Because we were primarily inter- 
ested in the amount of change in opinion 
shown in each condition and because all sub- 
jects were led to believe that they were on 
the other side of the scale from the rest of 
the group, initial opinions were computed as 
deviations from the center of the scale. As 
can be seen in the first column of Table 1, 
these means were moderate and similar among 
the four experimental conditions. 

Public and private responses to group 
pressure, During the discussion, each subject 
received a note ostensibly from each of four 
other group members. The first note asked the 
subject why he held his position, the second 
argued for the group position, and the third 
and fourth notes promised to give the sub- 
ject a larger than normal share of the group 
participation fee if he would only agree with 
the rest of the group. At the end of the dis- 
cussion, a ballot was taken that ostensibly 


would be made public so that everyone in the 5 


group could see how each other person, identi- 
fied by letter, stood. 
It was expected that, where the importance 


of freedom was low, the greater was the at- 


traction to the group, the greater would be 
the tendency of the individual to comply 
publicly. But where the importance of the 
freedom was high, group pressure to comply 


AŻ 


į 
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_ should produce reactance and a consequent 
reduction in the positive relationship between 
attraction and compliance. As can be seen in 
the-second column of Table 1, the changes in 
") public position conform quite well to this 
) analysis. Where the importance of freedom 
was low, the high-attraction condition pro- 
duced greater compliance (8.62) than did the 
low (3.70), F (1,48) =5.01, p<.05. In 
contrast, where the importance of freedom 
gs high, the high-attraction condition pro- 
j duced less compliance (—2.31) than did the 
Slow (5.61), F (1,48) = 12.97, p < .001. The 
Í interaction of attraction and importance of 
freedom was highly significant, F (1,48) = 
17,05, p < .001. 
The measure of private change was taken 
} immediately after that of public, and it was 
taken in order to see whether the reactance 
effects due to the group pressure were limited 
to public responses. As can be seen in the 
third column of Table 1, the means for pri- 
vate change are quite similar to those for 
public. Again, the interaction between attrac- 
tion and importance was highly significant, 
F (1,48) = 14.98, p < 001. There were no 
reliable differences between the public and 
f private measures. 
4 The negative change obtained in the condi- 
tion of high importance of freedom and high 
attraction to the group was submitted to 
further analysis to see if it was reliable. On 
the public measure, 8 out of 10 subjects who 
showed any change at all showed a boomerang 
effect (p= .11), and the F value (2.20) fell 
short of significance. On the private measure, 
however, 10 out of 11 subjects who showed 
any change at all showed a boomerang ($ = 
.02), and the F value (4.37) was also signifi- 
-cant (p < .05). Thus, there was a reliable 
boomerang effect on private change, and a 
trend toward a boomerang on public change. 
Acceptance of the inducement. When sub- 
jects were given a chance to distribute the 
group participation fee as they saw fit, they 
could indicate their overt acceptance of the 
inducement for conformity by giving them- 
selves a disproportionately large amount. It 
was found that six of the subjects in each of 
the low-importance conditions accepted the 
inducement, eight accepted it in the condition 
of high importance and low attraction, and 


By GROUP PRESSURE 821 
only one accepted it in the condition of high 
importance and high attraction, These data 
are entirely consistent with the opinion change 
data. 

Removal of group pressure. Once the group 
members had filled out the forms to distribute 
the group participation fee, the explicit pres- 
sure to conform to the group was removed. 
At this time a private measure of opinion was 
taken that would ostensibly be used to deter- 
mine if there were a winner of the individual 
accuracy prize. This measure allows an as- 
sessment of what position was thought to be 
correct in the absence of both public conform- 
ity and reactance pressures. If private opinion 
change in the absence of explicit pressure is a 
direct function of attraction to the group, 
then the high-attraction condition should 
show more positive change than the low even 
where the importance of freedom is great. The 
mean changes from initial opinions to the last 
measure are shown in the last column of 
Table 1. While an analysis of variance of 
these data shows that the main effect for 
attraction fails to reach significance, F (1, 
48) = 2.51, p <.20, it is notable that the 
highest mean (4.92) is found in the condi- 
tion of high attraction and high importance 
of freedom. In other words, those subjects 
who showed a boomerang tendency to the 
group pressure to conform actually showed 
somewhat more positive change than did 
subjects in other conditions once the pressure 
to conform had been removed. As can be seen 
in Table 2, the differences between private 
measures while the group pressure was applied 
and after it was removed are quite dramatic. 
All conditions showed a slight decrement in 
acceptance of the group position except for 
those in the condition of high attraction and 
high importance of freedom, who showed a 
highly reliable, F (1,48) = 22.54, p < .001, 
positive change. This effect strongly supports 
the contention that the boomerang tendency 
produced by the joint effects of group pres- 
sure, attraction to the group, and importance 
of freedom is due to reactance. 


Discussion 


The present experiment has shown that 
when a member finds himself in disagreement 
with the rest of his group, and he is offered 
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TABLE 2 


MEAN CHANGES IN PRIVATE OPINION DUE TO REMOVAL 
OF GROUP PRESSURE 


Attraction to group 
Importance | 
of freedom 


High 


Low 
High 


Note. MSc = 34.82. 


special rewards by the group for compliance, 
the degree of his public compliance and pri- 
vate acceptance is determined by a joint func- 
tion of his attraction to the group and how 
important is his freedom to hold his own posi- 
tion on the issue. Where the importance of 
freedom is relatively small, both public and 
private compliance are a direct function of 
attraction to the group. However, as the im- 
portance of freedom increases, the magnitude 
of the direct function between attraction and 
compliance tends to decrease. Where the im- 
portance of freedom is relatively great, both 
public compliance and private acceptance de- 
crease as attraction to the group increases. 

This experiment also demonstrated that 
the negative relationship between attraction 
and compliance that is obtained when the 
importance of freedom is relatively great can 
be greatly reduced by removal of the ex- 
plicit pressure to comply. Though statistically 
nonsignificant, the evidence suggests that, 
once the explicit group pressure has been 
removed, there remains a direct relationship 
between attraction and private acceptance. 
The lack of a stronger relationship between 
attraction and positive influence may be due 
to the fact that pressure to agree with other 
members was not totally removed. After the 
allocation of rewards, members presumably 
still felt pressure to come into agreement 
with others to whom they were attracted, and 
this pressure too should arouse some reac- 
tance. Theoretically, there will almost always 
be some reactance when there is pressure to 
change, and it is not surprising that many 
studies have reported marginal positive influ- 
ence effects (e.g., Back, 1951; Berkowitz, 
1957). 

The results are clearly consistent with the 
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analysis of group influence processes in terms 
of reactance theory that was presented in the * 
introduction of this article. Nevertheless, ty 
points should be noted about the particular 
conditions under which the present test was 
carried out. First, the manipulation of attrac- 
tion to the other members of the group was 
carried out by leading subjects to believe they 
themselves were liked or disliked by the other 
members. While this operation apparently 
was successful in creating differing degrees of 
attraction to the group, it doubtless had other 
effects as well. For example, those in the 
high-attraction condition probably felt more 
accepted by other members than did those in 
the low-attraction condition. It is therefore 
possible that the boomerang tendency ob- 
served in the condition of high attraction 
with high importance of freedom depends, at 
least to some extent, on the feeling of being 
accepted. Just as Hollander (1958) has postu- 
lated that “idiosyncrasy credits” may allow 
a leader to deviate from group norms, so in 
the present case may the feeling of accepta- 
bility allow subjects to show opinion change 
away from the group. Further research is 
needed to determine whether the acceptability 
factor is important for the pattern of results 
obtained. 

Second, the group pressure to change was 
made explicit to subjects by the offer of a 
reward for compliance. Theoretically, the 
arousal of reactance does not require that 
either rewards for compliance or threats for 
noncompliance be involved in order for group 
pressure to serve as a threat to freedom. The 
offer of rewards was used in the present ex- 
periment in order to dramatize the group 
pressure as a threat to freedom. Since it was 
used, however, further research is needed to 
show whether such explicit group pressures 
are necessary for the effects obtained. 

Social power analysis. French and Raven 
(1959) have stipulated conditions under which 
boomerang attitude change should obtain, and 
it may be useful to contrast their analysis 
with the present experiment. They hold that 
negative influence may occur when coercive 
power (use of threats), illegitimate power, or 
negative referent power is used to obtain 
change. Each of these uses of power intro- 
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duces a negative valence that tends to be- 
come associated with the behavior that is the 
target of inducement. 

It is apparent that neither coercive power 
nor negative reference power was involved in 
producing the boomerang tendency found 
here. The offer of an extralarge share of the 
group participation fee would constitute re- 
ward power rather than coercive power. Simi- 
larly, the condition of high attraction to the 
group constitutes high positive reference 
power rather than negative reference power. 

The question of the legitimacy of the in- 
fluence attempt is less easily determined on a 
priori grounds. While the subjects were told 
at the beginning of the session that they 
would be able to divide the group participa- 
tion fee among themselves in any way that 
they saw fit, it is possible that subjects would 
not normally have thought of making an 
uneven distribution, and they may have 
thought that the offer to give them more was 
an unfair way to put pressure on them. How- 
ever, the results of this pressure are difficult 
to understand as a product of illegitimate 
power. First of all, the effects differed as a 
function of whether freedom was important, 
an effect not clearly predicted in terms of 
the legitimacy concept. Second, once the group 
pressure had been removed, private attitudes 
of subjects who had shown a boomerang 
tendency became relatively positive. If the 
boomerang effect had been due to negative 
valence associated with an illegitimate influ- 
ence attempt, it is not apparent that the nega- 
tive valence would have disappeared after the 
illegitimate distribution of rewards had pre- 
sumably been carried out. Thus, an explana- 
tion in terms of illegitimacy of the influence 
attempt does not seem compelling. 


Implications 
The reactance theory analysis presented 


. here can be extended to other forms of pres- 


sure toward uniformity in groups and to other 
dimensions of social power. It may be ex- 
pected, for example, that other determinants 
of cohesiveness aside from attraction would 
also have the capability to increase the threat 
to freedoin that results from group influence 


attempts. 


However, there remains a question about 
when group influence attempts arouse reac- 
tance. Grabitz-Gniech (1971) interpreted her 
experiment to mean that when a group decides 
upon an issue of relevance to the group, 
members would give up their freedom to 
differ. It is apparent that her generalization 
does not hold for the present experiment, and 
it remains unclear what conditions determine 
whether freedom is given up or reactance is 
aroused. 

A sleeper effect. Those subjects for whom 
both importance of freedom and attraction 
to the group were great revealed not only a 
boomerang to group pressure to conform, but 
also positive change once the group pressure 
had been removed. This kind of sleeper effect 
should be latent in all influence attempts that 
arouse reactance. If an influence attempt has 
aroused reactance, and because of that has 
been ineffective, positive influence can still be 
brought about by removing the pressure to 
change. For example, a therapist who has had 
little or no success in getting a patient to 
adopt new styles of behavior may find that 
those styles are adopted after the therapeutic 
relationship is terminated. Or to take another 
example, a group of demonstrators may find 
that the attitudes of people they are trying 
to influence will change favorably only after 
their demonstration has ceased. And finally, 
the forced busing of children to schools has 


obviously threatened or eliminated the free” 


dom of parents to send their children to 
neighborhood schools. While the resistance of 
parents to forced busing has been apparent, 
if the forced busing has positive effects on 
the attitudes of the parents, these effects 
may remain hidden until the “force” involved 
in busing is removed. In summary, the para- 
dox is that influence attempts tend to fail to 
the extent that they arouse reactance, and to 
that same extent, they may succeed as soon 
as they stop. 
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The unexpected nature of the findings of 
this experiment and those of a similar, follow- 
up investigation (Baron & Lawton, 1972) 
suggest that the relationship between ambient 
temperature and aggressive behavior may be 
far more complex than has generally been 
assumed. Thus, the need for a theoretical 
framework within which the effects of this 
and other environmental factors (e.g., crowd- 
ing, noise, etc.) may be interpreted is clearly 
indicated. One possible basis for such a frame- 
work is provided by the social learning theory 
of aggression proposed in recent years by 
Bandura (1973), 

According to this theory, exposure to aver- 
sive treatment of almost any type serves to 
induce heightened emotional arousal among 
the individuals involved. Such heightened 
arousal may then, in turn, enhance the fre- 
quency or strength of later aggressive behav- 
ior under conditions where these persons are 
“prone to behave aggressively” (i.e., where 
aggression represents a strong or dominant 
behavior in their response hierarchies; see 
Bandura, 1973, p. 56). In view of the fact 
that adult human subjects report finding 
high ambient temperatures to be extremely 
unpleasant and aversive (see, eg., Baron, 
1972a; Griffitt, 1970), it would be expected, 
on the basis of Bandura’s theory, that such 
environmental conditions will tend to facili- 
tate subsequent aggression when this type of 
behavior is dominant in subjects’ response 
hierarchies but will fail to produce such ef- 
fects or possibly even tend to inhibit subse- 
quent aggression under conditions where some 
other, incompatible form of activity is domi- 
nant instead. 

The present experiment was designed to 
investigate the validity of these suggestions. 
Tn order to accomplish this task, two factors 
which would be expected to exert a powerful 
influence upon the strength or dominance of 
subjects’ aggressive tendencies, (a) the degree 
of prior provocation subjects experienced and 
(b) exposure to the actions of a highly ag- 
gressive model, were varied systematically 
along with ambient temperature. It was as- 
sumed, on the basis of previous research, that 
both would influence the strength or domi- 
nance of aggressive behavior, so that subjects’ 
tendencies to engage in such actions would be 
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weakest under conditions of low provocation 
and lack of exposure to an aggressive model 
but strongest under conditions of high provo- 
cation (e.g., Baron, 1971; Geen, 1968) and 
prior exposure to the actions of such a model 
(e.g., Baron, 1972b; Turner & Berkowitz, 
1972). In accordance with these suggestions 
and the theory proposed by Bandura (1973), 
it was predicted that high ambient tempera- 
tures would be most effective in facilitating 
later aggression when subjects had both wit- 
nessed the actions of a highly aggressive 
model and been exposed to strong provoca- 
tion from the victim, but least effective in 
this regard when they had neither witnessed 
the actions of an aggressive model nor been 
exposed to such instigation. 


METHOD 
Subjects 


Sixty-four undergraduate males enrolled in sections 
of Elementary Psychology at Purdue University 
participated in the experiment. Subjects took part in 
the study in order to satisfy part of a course re- 
quirement. 


Design 


A 2X2 2 factorial design based upon two levels 
of ambient temperature (cool, hot), two levels of 
prior anger arousal (nonangry, angry), and two 
levels of exposure to the behavior of an aggressive 
model (no model, model) was employed. Subjects 
were randomly assigned to each of the cells of this 
design as they appeared for their experimental 
appointments. 


Apparatus 


The apparatus consisted of a modified Buss “ag- $ 


gression machine” (Buss, 1961) and a Lafayette stop 
clock (Model 20225A). The aggression machine was 
identical to apparatus employed in a number of 
previous investigations (e.g., Baron, 1971, 1972a, 
1972b) and contained 10 push-button switches which 
could be employed by subjects to deliver electric 
shocks of varying intensity to another individual. 
The stop clock was employed to measure the dura- 
tion of shocks administered by subjects to this per- 
son. 


Procedure 


The experiment consisted of three distinct parts 
conducted in the following order: (a) a time-filler 
phase employed to ensure that subjects spent suffi- 
cient time in the study for any influence of ambient 
temperature upon their behavior to become ap- 
parent; (b) an anger arousal phase during which 
their level of anger toward the person who would 
later serve as the victim of their attacks was varied; 
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and (c) an aggression phase, during which subjects 
were provided with an opportunity to aggress against 
the victim. 

Time-filler phase. When subjects arrived for the 
experiment, they found two male confederates al- 
ready present in the waiting room. Shortly there- 
after, the experimenter entered and conducted all 
three individuals to the first experimental room, 
where she explained that the major purpose of the 
study was that of investigating the effects of two 
environmental factors, temperature and humidity, 
upon performance and physiological reactions.1 She 
then went on to indicate that the first phase of the 
experiment would be concerned with the influence 
of these two factors upon the performance of a 
complex task involving both motor and perceptual 
factors. This task consisted of the assembly of a 
jigsaw puzzle, and all three individuals were then 
conducted to separate rooms where they worked on 
the puzzles in isolation for a period of 10 minutes. 
At the end of the allotted time, the experimenter 
returned, pretended to record carefully each indi- 
vidual’s performance, and collected all materials. 
This marked the conclusion of the initial time- 
filler phase. In order to eliminate the possibility that 
some subjects would experience frustration during 
this portion of the study as a result of difficulties in 
performing the task, the puzzle employed was se- 
lected through pretesting to ensure that it was one 
which could be completed by virtually all individuals 
within the time period allowed. Observations of 
subjects’ actual performance during the study sug- 
gested that, as anticipated, no individuals experienced 
difficulty in successfully performing this task. 

Anger arousal phase. Following the completion of 
the above procedures, the experimenter conducted 
all three individuals back into the first experimental 
room and explained that the second part of the 
study would concern the effects of temperature and 
humidity upon the performance of a task involving 
complex social judgments. The subject and the con- 
federate who would later serve as the victim were 
then chosen to participate in this task, while the 
second confederate (who would later serve as the 
model) was designated a “control subject” and 
asked to “sit out” this portion of the study. The two 
participants were then moved to separate cubicles 
where they were first asked to write brief sketches 


1No attempt was made to conceal the experi- 
menter’s interest in the impact of ambient tempera- 
ture because (a) the equipment employed to vary 
this factor (and to hold relative humidity constant) 
could not safely or readily be disguised and (b) 
pilot data suggested that all subjects exposed to high 
temperatures would guess that this was one of the 
factors under investigation, regardless of any at- 
tempts by the investigators to disguise this fact. As 
a result, it was deemed more effective to openly 
admit concern with ambient temperature but then to 
divert subjects’ attention away from the major 
purposes of the investigation by means of appropri- 
ate cover stories. 
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or descriptions of their own personalities. These 
sketches were then exchanged, and on the basis of 
this information, both individuals were asked to 
rate their partner on a series of traits (e.g., intelli- 
gence, maturity, likableness, etc.). Finally, these 
ratings, too, were exchanged, so that both were able 
to learn what kind of impression they had made on 
their partner. In reality, both the personality sketch 
and ratings subjects received from their partner 
were provided by the experimenter and varied in a 
systematic manner so as to influence their level of 
anger toward the future victim. Thus, in the non- 
angry condition, the personality sketch supposedly 
written by the confederate suggested that he was a 
pleasant, modest, and friendly individual. Moreover, 
the ratings he assigned to the subject were quite 
favorable and flattering (e.g., he described the sub- 
ject as being highly intelligent, likable, sincere, and 
mature). In the angry condition, by contrast, the 
self-description ostensibly prepared by the confed- 
erate indicated that he was a nasty, conceited, and 
hostile person. Further, his ratings of the subject 
were highly unfavorable and quite derogatory. These 
procedures were adapted from those employed in a 
number of previous investigations (e.g., Baron, 
1972a; Doob & Wood, 1972) in which subjects have 
been angered by insulting or derogatory comments 
from others. In addition, they were found, in a 
preliminary pilot study, to be highly successful in 
inducing contrasting levels of anger among subjects. 

Aggression phase and exposure to the model, After 
completing the above procedures, the experimenter 
conducted all three individuals to an additional ex- 
perimental room (they had performed the second 
phase of the study individually in separate cubicles), 
where she explained that the third part would be 
concerned with the effects of temperature and 
humidity upon physiological reactions to electric 
shock. She then stated that in order to investigate 
this topic, one of the three individuals would serve 
as a responder and receive a series of electric shocks 
of varying intensity from the remaining individuals, 
who would act as stimulators. The responder’s 
physiological reactions to the shocks he received 
would then be monitored continuously in order to 
determine the precise impact of these stimuli upon 
his responses. These procedures were employed 
instead of the usual “teacher—learner” paradigm 
because recent evidence (Baron & Eggleston, 1972) 
suggests that they provide a measure of aggression 
less contaminated by various altruistic motives 
(eg. a desire to help the “learner” master the 
experimental materials) than the more commonly 
used technique. 

The experimenter then asked the confederate who 
had served as the subjects’ partner during the anger 
arousal phase if he would now agree to play the 
role of responder. He consented to this request and 
was then conducted to a third room where the 
experimenter presented instructions for his task in 
the study and attached shock and physiological 
recording electrodes to his wrists and arms. The 
physiological recording electrodes were connected to 
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an impressive-looking polygraph (Lafayette Model 
76014 E) which was located in full view of the 
subjects and ostentatiously switched on at this time. 
In reality, however, no records of the responder’s 
physiological processes were actually made during 
the study, and this apparatus was employed solely 
to lend credibility to the experimenter’s statement 
that this part of the experiment was concerned pri- 
marily with the effects of temperature and humidity 
on physiological reactions to electric shock. 

After completing these activities, the experimenter 
returned to the room where the subject and the 
second confederate were waiting and provided de- 
tailed instructions for their role as stimulators. These 
instructions indicated that each time a red signal 
light on the apparatus was illuminated, the stimu- 
lator whose turn it was should select and depress 
1 of the 10 buttons on the apparatus in order to 
deliver electric shock to the responder. The experi- 
menter further explained that this light would be 
illuminated according to a fixed random schedule 
specifically designed to prevent the responder from 
“getting ready” for each shock, and so influencing 
his physiological reactions to them. She then added 
that because (a) any shocks they employed would 
be perfectly adequate from the point of view of 
fulfilling the major objectives of the study and (b) 
a very large number of individuals would eventually 
participate in the experiment, subjects should feel 
completely free to choose any shock buttons they 
wished and to depress these buttons for as long a 
period as they desired each time the red shock signal 
was illuminated.* It was further explained that the 
higher the number of the buttons chosen, the 
stronger the shocks to the responder would be, and 
that the longer any button was depressed, the longer 
the shock to this individual would last. In order to 
demonstrate the magnitude of the shocks which 
could be delivered to the responder, the experimenter 
then administered “samples” from Buttons 4 and 5 
to each of the stimulators. These shocks were pro- 
duced by an inductorium located inside the appa- 
ratus and lasted for .5 sec. The shock delivered by 
Button 5 was noticeably stronger than that produced 
by Button 4, but both were judged by. subjects to 
be moderately noxious and unpleasant. 

At this point in the procedures, the experimenter 
designated either the subject or the confederate to 
serve as the first stimulator. In the no-model condi- 
tion, the subject was chosen to play this role. Thus, 
in this group, he aggressed against the victim in 
the absence of prior exposure to the model’s actions. 
In the model condition, however, the confederate 
was chosen to serve as the first stimulator. He then 
acted in a highly aggressive manner toward the 
responder, pushing Buttons 8, 9, or 10 for 2.0-3.0 sec 
on all occasions when the red shock signal was 
illuminated. As a result, subjects in this group were 
exposed to the behavior of an aggressive model 
before obtaining an opportunity to attack the victim 
themselves. The red shock signal was flashed a total 
of 20 times during both the model’s and the sub- 
ject’s turns, thus providing each individual with 20 
opportunities to shock the victim. 


Postexperimental questionnaire. Following the last 
shock trial, subjects completed a brief questionnaire 
on which they rated (on 7-point scales) the tem- 
perature in the experimental rooms (very cool- 
very warm), their feelings of comfort while par- 
ticipating in the study (unpleasant-pleasant), their 
degree of anger toward the victim (not angry at 
all-very angry), and their degree of liking for this 
person (dislike very much-like very much). 

Ambient temperature. The manipulation of am- 
bient temperature was quite simple and straight- 
forward. In the cool condition, all phases of the 
experiment were conducted under comfortably cool 
temperatures, while in the hot condition, all phases 
took place under uncomfortably high temperatures. 
These variations in ambient temperature were ob- 
tained by means of air conditioners and electric 
heaters, employed in appropriate combination. 
Temperature readings were taken both before and 
after each experimental session in both experimental 
rooms (i.e., the rooms occupied by the subject). In 
the cool condition, the average before and after 
readings were 74.0°F and 73.8°F, respectively, in 
the first room, and 73.5°F and 73.2°F in the second. 
In the hot condition the corresponding readings were 
92.4°F and 92.9°F in the first room and 94.5°F and 
95.5°F in the second. Subjectively, the experimental 
rooms seemed to be pleasantly cool and comfortable 
in the cool condition but uncomfortably hot and 
stuffy in the hot condition.® 


RESULTS 


Preliminary analyses revealed that the two 
basic measures of aggression obtained in the 
present study, shock intensity and shock dura- 
tion, yielded identical patterns of results. 
Thus, as has been the case in a number of 
previous investigations (e.g., Hartmann, 
1969; Parke, Ewall, & Slaby, 1972), these 
measures were combined into a single index 
of aggression through multiplication (i.e., 
Shock Intensity X Duration). In order to 
eliminate heterogeneity of variance, the scores 


2 These instructions were included in an attempt 
to reduce any possible influence of demand char- 
acteristics on subjects’ behavior. More specifically, 
they were designed to lessen any feelings on the 
part of subjects that they were “expected” or 
“required” to deliver any particular pattern of 
shocks to the confederate. 

SIn an attempt to hold relative humidity con- 
stant as temperature was varied, two humidifier- 
vaporizers were also employed. In general, this 
attempt to maintain uniform conditions was suc- 
cessful, although humidities tended to be slightly 
higher in the cool (M=49.6%) than in the hot 
(M = 43.5%) condition. It appears that much more 
elaborate equipment and facilities than were avail- 
able for the present experiment would be required 
to achieve even more precise control over this factor. 


* square 


so obtained were then subjected to a simple 
root transformation (X= Vx+ 
_Vx +1). Figure 1 presents the mean level 
of aggression (as assessed by this measure) 
directed against the confederate by subjects 
in each of the eight experimental groups. In- 
spection of this figure suggests that the 
strength of subjects’ attacks against this per- 
son was influenced both by their level of prior 
anger arousal and exposure to the actions of 
the model. More specifically, subjects in the 
angry condition directed higher levels of ag- 
gression against the confederate than those in 
the nonangry group, and those in the model 
condition directed stronger attacks against 
this person than those in the no-model group. 
In addition, and of somewhat greater interest 
within the context of the present research, 
ambient temperature and level of prior anger 
arousal appeared to interact in a totally un- 
expected manner in influencing the strength of 
subjects’ attacks against the confederate. 
More specifically, high ambient temperatures 
served to facilitate later aggression by indi- 
viduals in the nonangry condition but actually 
appeared to inhibit such behavior by subjects 
in the angry group. Further, with respect to 
this latter finding, there was also some indi- 
cation that high temperatures were more 
effective in reducing subsequent aggression 
by subjects in the angry—no model group than 
by those in the angry-model condition. 

An analysis of variance performed upon the 
data shown in Figure 1 yielded significant 
main effects for anger arousal, F (1,56) = 
45.48, p < .001, and exposure to the model, 
F (1,56) = 7.45, p < 01. Thus, as suggested 
by Figure 1, aggression was facilitated by 
přior provocation and exposure to the behav- 
ior of a highly aggressive model. In addition, 
the interaction between Ambient Temperature 
and anger arousal also attained significance, 
F (1,56) = 9.94, p< .005. Simple effects 
analyses undertaken to examine this interac- 
tion more closely revealed that, as suggested 
by Figure 1, it arose from the fact that ex- 
posure to high ambient temperatures signifi- 
cantly facilitated aggression by subjects in 
the nonangry group ($ < .005) but actually 
inhibited such behavior on the part of indi- 
viduals in the angry condition (p < .05). Ap- 
parently, then, the influence of such unpleas- 
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Ficure 1. Mean level of aggression (transformed 
Shock Intensity X Duration) delivered to the victim 
by subjects in each of eight experimental groups. 


ant environmental conditions upon subse- 
quent aggression was strongly affected by the 
degree of provocation previously experienced 
by subjects. 

Despite the suggestion in Figure 1 that 
high ambient temperatures were more effec- 
tive in inhibiting aggression by subjects in 
the angry-no-model group than by those in 
the angry-model condition, the triple inter- 
action between the anger, model, and tem- 
perature factors failed to attain conventional 
levels of statistical significance, F (1,56) = 
2.93, p = .09. Thus, this apparent pattern of 
findings should be viewed as only suggestive 
in nature and interpreted with extreme cau- 
tion. 


Postexperimental Questionnaire 


The first three items on the postexperi- 
mental questionnaire were designed to assess 
the effectiveness of the temperature manipula- 
tion. More specifically, these items required 
subjects to rate, on 7-point scales, the tem- 
perature in the experimental rooms (very 
cool-very warm), the pleasantness of cendi- 
tions existing in these rooms (unpleasant— 
pleasant), and subjects’ personal feelings of 
comfort while participating in the study (un- 
comfortable-comfortable), Separate analyses 
of variance performed on the data for each of 
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these items indicated that, as expected, sub- 
jects in the hot condition rated the experi- 
mental rooms as significantly warmer, F (1, 
56) = 173.80, p < .001, and less pleasant, 
F (1,56) = 62.59, p< .001, and reported 
feeling significantly less comfortable, F (1, 
56) = 18.82, p < .001, than those in the cool 
condition. Apparently, then, the experimental 
variations in ambient temperature produced 
the anticipated effects upon subjects. 

Two additional items on the questionnaire 
were employed to assess the effectiveness of 
the attempted manipulation of anger arousal. 
The first required subjects to rate their de- 
gree of anger toward the victim, while the 
second required that they rate their degree of 
liking for this person. Separate analyses of 
variance performed on the data for each ques- 
tion revealed that, as expected, subjects in 
the angry group reported significantly greater 
anger toward, F (1,56) = 45.85, p < .001, 
and dislike for, F (1,56) = 77.91, p < .001, 
the victim than subjects in the nonangry con- 
dition. Thus, there was some evidence that 
the attempted manipulation of anger arousal 
had indeed been successful. 


DISCUSSION 


It was expected, both on the basis of Ban- 
dura’s (1973) social learning theory of ag- 
gression and informal observation of events 
occurring under naturalistic conditions, that 
high ambient temperatures would be more 
effective in facilitating subsequent aggression 
by angry than nonangry subjects. However, 
contrary to these predictions, uncomfortably 
hot environmental conditions were found to 
facilitate subsequent aggression by nonangry 
individuals while actually inhibiting such be- 
havior by subjects who had previously been 
provoked. In view of the unexpected nature 
of these results, a replication of the present 
investigation (V = 36) was then performed, 
with omission of the model variable repre- 
senting the only change in procedure. The 
results of this follow-up study were highly 
similar to those reported above. That is, once 
again, high ambient temperatures facilitated 
aggression by nonangry subjects but inhibited 
such behavior by angry individuals.* Given 
the apparent reliability of this surprising pat- 


tern of findings, it seems important to con- 
sider possible explanations for its occurrence. 
Fortunately, one such explanation is sug- 
gested by certain findings of both investiga- 
tions. 

First, it was found in both the present and 
follow-up studies that subjects in the hot- 
angry group reported a higher level of dis- 
comfort than those in any other condition (p 
< .05). Further, in the follow-up investiga- 
tion, this group reported greater feelings of 
irritation and annoyance than any other (p 
< .05). Apparently, then, the combination of 
strong insult from the confederate and stifling 
environmental conditions induced extremely 
high levels of negative affect among subjects. 

Second, there is evidence that subjects in 
the hot-angry group found these conditions 
so unpleasant that they lost interest in every- 
thing but obtaining release from the study as 
quickly as possible. This is suggested both 
by comments made by these individuals dur- 
ing the postexperimental interviews which 
followed each session (e.g., “The only thing I 
thought about was getting the hell out of 
here”) and the finding that in the follow-up 
investigation, angry subjects exposed to un- 
comfortably warm environmental conditions 
reported that they were significantly more 
eager for the study to end than nonangry 
individuals exposed to similar conditions (p 
< .05). 

Combining these diverse findings, it seems 
reasonable to suggest that both in the initial 
and follow-up studies, subjects in the hot- 
angry group found the experimental situa- 
tion so unpleasant and aversive that escape 
or minimization of present discomfort became 
the dominant tendency in their behavior hier- 
archies. As a result, they may have reduced 
the duration of the shocks they employed and 
also lowered the intensity of these attacks in 
order to avoid any delays which might result 
from their use of strong shocks (e.g., protests 
from the victim, possible censure by the ex- 
perimenter). In contrast, subjects in the hot- 
nhonangry group appeared to find the experi- 


4Shortly before this manuscript was submitted 
for review, a second replication (N = 64) was com- 
pleted. The results of this investigation confirmed 
those of the present study once again. 


. to obtain release from the study. Thus, the 
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‘mental situation somewhat less aversive and 
so experienced (and reported) weaker desires 


‘feelings of irritation or discomfort they suf- 
feted as a function of exposure to the hot 
environmental conditions may have actually 
served to enhance their aggression against 
the. confederate (cf. Griffitt, 1970; Griffitt & 
Veitch, 1971). 

` Of course, despite the fact that such an 
interpretation is supported by and based 


- upon evidence from both the initial and fol- 


low-up studies, it must be verified in further 
„research. For example, evidence regarding the 
validity of these proposals may be obtained 
by the inclusion of a warm condition in which 
subjects are exposed to temperatures in the 
80’s (eg., 82°F-84°F). If the suggestions 
offered above are indeed accurate, it would 


_ be expected that such conditions will facilitate 


aggression by angry as well as nonangry indi- 
viduals, since the level of discomfort induced 
among subjects will not be sufficiently intense 


~to make escape rather than aggression the 
- dominant response in their behavior hier- 


archies. Further experiments along these lines 
are currently planned. 

Before concluding, the finding that high 
ambient temperatures facilitated aggression 
by nonangry subjects in the present study 
but actually inhibited such behavior by non- 
angry individuals in a previous investigation 
(Baron, 1972a) should be considered. Unfor- 
tunately, due to continued attempts at refine- 
ment in experimental technique (see Baron, 
1973, 1974; Baron & Eggleston, 1972) the 
present and previous investigations differed in 
a number of procedural details. Thus, the 
task of explaining the contrasting results ob- 
tained with nonangry subjects in both experi- 
ments is truly complex. However, one par- 
ticularly suggestive explanation is provided 
by the fact that the Baron (1972a) study 
was conducted in South Carolina during July, 
when typical daytime temperatures were in 
the mid to upper 90’s, while the present 
experiments were carried out in Indiana dur- 
ing months when considerably cooler tempera- 
tures prevailed. These marked differences in 
prevailing climatic conditions, coupled with 
evidence that the arousing or irritating influ- 
ence of high ambient temperatures increases 
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over time (Macpherson, 1956), suggest that 
subjects in the earlier study may have en- 
tered the laboratory in a considerably more 
negative affective state than those in the 
present investigations. As a result, even those 
in the nonangry group may have found the 
hot condition so unpleasant that they concen- 
trated on minimizing their discomfort and so 
evidenced lowered aggression against the con- 
federate. The validity of this suggestion may 
be examined in future research by (a) obtain- 
ing a measure of subjects’ affective state im- 
mediately prior to their participation in the 
experiment and (b) systematically replicating 
the present investigation during different sea- 
sons of the year. 

Regardless of the outcome of such experi- 
ments, however, the findings of the present 
research are clear in suggesting that the rela- 
tionship between ambient temperature and the 
occurrence of overt aggression is quite com- 
plex. More specifically, the influence of this 
environmental factor on the frequency or 
strength of aggression seems to depend, to an 
important degree, upon the level of provoca- 
tion previously endured by potential aggres- 
sors. Further, there is some indication that 
the impact of high ambient temperatures 
upon aggression is also mediated by a number 
of additional variables, including the dura- 
tion of subjects’ exposure to such conditions, 
and the presence of other sources of irrita- 
tion or negative affect (see Griffitt, 1970; 
Griffitt & Veitch, 1971). Clearly, then, 
further research designed to examine the po- 
tential influence of such factors is necessary 
before any definitive conclusions regarding the 
existence or precise form of the frequently 
cited “long, hot summer” effect can be at- 
tained. 


REFERENCE NOTES 


1. Goranson, R. F., & King, D. Rioting and daily 
temperature: Analysis of U.S. riots in 1967. Un- 
published manuscript, York University, Toronto, 
Canada. 


REFERENCES 


Bandura, A. Aggression: A social learning analysis. 
Englewood Cliffs, New Jersey: Prentice-Hall, 1973. 

Baron, R. A. Magnitude of victim’s pain cues and 
level of prior anger arousal as determinants of 
adult aggressive behavior. Journal of Personality 
and Social Psychology, 1971, 17, 236-243, 


832 ROBERT A. BARON AND PAUL A. BELL 


Baron, R, A. Aggression as a function of ambient 
temperature and prior anger arousal. Journal of 
Personality and Social Psychology, 1972, 21, 183- 
189. (a) 

Baron, R. A. Reducing the influence of an aggres- 
sive model: The restraining effects of peer censure. 
Journal of Experimental Social Psychology, 1972, 
8, 266-275. (b) 

Baron, R. A. Aggression as a function of victim’s 
pain cues, level of prior anger arousal, and ex- 
posure to an aggressive model. Journal of Person- 
ality and Social Psychology, 1974, 29, 117-124. 

Baron, R. A. Threatened retaliation from the victim 
as an inhibitor of physical aggression. Journal of 
Research in Personality, 1973, 7, 103-115. 

Baron, R. A., & Eggleston, R. J. Performance on 
the “aggression machine”: Motivation to help or 
harm? Psychonomic Science, 1972, 26, 321-322, 

Baron, R. A., & Lawton, S. F. Environmental in- 
fluences on aggression: The facilitation of model- 
ing effects by high ambient temperatures. Psycho- 
nomic Science, 1972, 26, 80-83. 

Buss, A. H. The psychology of aggression. New 
York: Wiley, 1961. 

Doob, A. N., & Wood, L. Catharsis and aggression: 
The effects of annoyance and retaliation on ag- 
gressive behavior. Journal of Personality and 
Social Psychology, 1972, 22, 156-162. 

Geen, R. G. Effects of frustration, attack, and prior 
training in aggressiveness upon aggressive behavior, 


Journal of Personality and Social Psychology, 
1968, 9, 316-321. 

Griffitt, W. Environmental effects on interpersonal 
affective behavior: Ambient effective temperature 
and attraction, Journal of Personality and Social 
Psychology, 1970, 15, 240-244. 

Griffitt, W., & Veitch, R. Hot and crowded: Influ- 
ence of population density and temperature on 
interpersonal affective behavior. Journal of Per- 
sonality and Social Psychology, 1971, 17, 92-98. 

Hartmann, D. P. Influence of symbolically modeled 
instrumental aggression and pain cues on aggres- 
sive behavior, Journal of Personality and Social 
Psychology, 1969, 11, 280-288. 

Macpherson, R. K, Environmental problems in 
tropical Australia, Canberra, Australia: Common- 
wealth Printer, 1956, 

Parke, R. D., Ewall, W., & Slaby, R. G. Hostile and 
helpful verbalizations as regulators of nonverbal 
aggression. Journal of Personality and Social Psy- 
chology, 1972, 23, 243-248. 

Turner, C. W., & Berkowitz, L. Identification with 
film aggressor (covert role taking) and reactions 
to film violence, Journal of Personality and So- 
cial Psychology, 1972, 21, 256-264. 

United States Riot Commission. Report of the Na- 
tional Advisory Commission on civil disorders. 
New York: Bantam Books, 1968. 


(Received October 26, 1973) 


i 


Journal de Personality and Social Psychology 
1975, Vol. 31, No. 5, 833-839 


Increasing Adherence to a Stressful Decision via a Motivational 
Balance-Sheet Procedure: A Field Experiment 


Michael F. Hoyt and Irving L. Janis 
Yale University 


During a short telephone interview, women who were enrolled in a gymnasium 
exercise class completed motivational balance sheets that induced careful con- 
sideration of the consequences of a personal decision. In a 2 X 2 factorial de- 
sign, subjects filled out either (a) relevant (i.e, concerning attending exercise 
class) or (b) irrelevant (i.e., concerning cutting down or abstaining from 
smoking cigarettes) balance sheets and were engaged in either (a) high or (b) 
low self-disclosure by having them read their balance-sheet entries aloud or 
silently, respectively. All subjects then heard a short tape recording strongly 
recommending that they regularly and frequently attend exercise class. As 
predicted, the subsequent attendance of relevant balance-sheet subjects (as- 
sessed for a 7-week period) was much greater than that of irrelevant balance- 
sheet subjects, whose rate of attendance did not differ from that of a no- 
treatment (baseline) control group. Degree of self-disclosure did not affect 
attendance, most probably because the brief self-disclosure treatment was not 


strong enough to engender a meaningful sense of self-disclosure. 


Stemming from Lewin’s (e.g., 1951) topo- 
logical analysis of conflict resolution, various 
gain—loss models of the decision-making pro- 
cess have been proposed (e.g., Blau, 1964; 
Homans, 1961; Janis, 1959; Thibaut & 
Kelley, 1959). As Janis (1959; Janis & 
Mann, 1968) has suggested, the anticipated 
gains (or benefits) and the anticipated losses 
(or costs) can be exhaustively categorized 
into four major types of consequences: (a) 
utilitarian gains or losses to self; (b) utili- 
tarian gains or losses for significant others; 
(c) approval or disapproval from significant 
others; (d) self-approval or disapproval. 

Briefly, the first category refers to the ex- 
pected instrumental effects of the decision 
with respect to the satisfaction of personal 
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objectives, while the second category pertains 
to the goals of important individuals and 
groups with whom the decision maker is 
associated or identified. The anticipated social 
approval or disapproval of reference groups 
and reference persons constitutes the third 
category of consequence. The final category 
pertains to self-appraisals based on inter= ~~ 
nalized standards and ego ideals. 

The decision-making process optimally in- 
volves consideration of all relevant informa- 
tion. Attention to possible negative outcomes, 
which oftentimes are overlooked, is especially 
important because it minimizes postdecisional 
regret when unanticipated consequences ma- 
terialize (Janis, 1968a). Several studies point 
to the value of emotional inoculation (e.g., 
Egbert, Battit, Welch, & Bartlett, 1964; 
Janis, 1958, 1971; Janis & Leventhal, 1965; 
Vernon & Bigelow, 1974) and informational 
preparation (McGuire, 1964) for equipping 
persons to deal with stress. Janis (1972) has 
recently documented the deleterious effects 
that incomplete consideration of possible con- 
sequences has had in the realm of American 
foreign policy decision making. 

A tool that is useful in helping persons to 
consider more thoroughly the consequences of 
an impending decision is the “decisional bal- 
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ance-sheet procedure” developed in a series 
of pilot studies by Janis (Note 1). Yale 
College seniors were asked to describe all the 
alternatives they were considering for the 
subsequent year and to specify all the pros 
and cons for each alternative. They were then 
presented with a balance-sheet grid with the 
headings we have listed above to describe the 
four major types of decisional consequences 
(“utilitarian gains or losses to self,” etc.). 
The interviewer explained each of the four 
categories and then helped the subject fill in 
the pro and con entries for his two most 
preferred alternatives. 

Randomly assigned to conditions, 18 Yale 
seniors underwent this procedure, while 18 
others participated in an equivalent control 
interview, but without a balance-sheet pro- 
cedure. The results were encouraging: All the 
balance-sheet subjects added considerations 
not mentioned in the standard part of the 
interview. More than half the balance-sheet 
subjects reported changing their evaluations 
of the alternative they had most preferred, 
whereas only one control interview subject 
did so. This observation suggests that aside 
from whatever long-term effect the balance- 
sheet procedure might have in preparing the 
person for coping with subsequent negative 
feedback, it can have an immediate self- 
persuasion effect similar to that which may 
occur in certain types of role playing when a 
person writes an essay in favor of a counter- 
attitudinal position and becomes more aware 
of the supporting arguments (see Collins & 
Hoyt, 1972; Janis, 1968b). 

This pilot work was followed up by Mann 
(1972) in a similar study using high school 
seniors who were trying to decide which col- 
lege to attend. The balance-sheet procedure 
again proved effective: On posttest measures 
taken approximately 6 weeks after the stu- 
dents had notified the colleges of their deci- 
sion, balance-sheet subjects showed somewhat 
less postdecisional regret and were able to 
be more objective about the consequences of 
their decision than were control interview 
subjects. 

A recent study by Colten and Janis (in 
press) employed a 2 X 2 factorial design, 
wherein women who had come to a diet clinic 
underwent (a) either a high or low self-dis- 


closure interview (with the interviewer pro- 
viding constantly positive feedback) and then 
(b) either a balance-sheet procedure (per- 
taining to the pros and cons of going on the 
recommended diet versus not doing so) or a 
control interview that gave the clients essen- 
tially the same information but without sug- 
gesting that they consider the pros or cons of 
alternative courses of action. Those subjects 
who underwent both the high self-disclosure 
interview and completed decisional balance 
sheets showed significantly more behavioral 
conformity than the others: They sent in 
more weekly reports concerning their dieting 
and, most importantly, were significantly 
more successful in losing weight. 

This evidence for the influence of self-dis- 
closure was predicted from Janis’s (1975) 
model for increasing the effectiveness of con- 
tact with supportive persons on adherence to 
stressful decisions. He posits that the first 
critical phase in successful dyadic helping 
relationships involves the helper acquiring 
motivating power. Drawing upon his own ob- 
servations as well as those of several other 
psychologists (e.g., Bennis, Schein, Berlew, 
& Steele, 1968; Rogers, 1961), Janis suggests 
that a counselor’s effectiveness is greatly en- 
hanced if the client’s initial self-disclosures 
are responded to in an explicitly accepting 
manner. This type of positive feedback serves 
to increase the perceived attractiveness and 
trustworthiness of the counselor and fosters 
an attitude of high reliance upon the helper 
that increases his or her effectiveness in the 
role of a norm-sending communicator. 

In addition to the Colten and Janis (in 
press) study, three other recent field experi- 
ments also show greater behavioral compliance 
to the interviewer’s recommendations follow- 
ing a self-disclosure interview when subjects 
were given positive rather than neutral feed- 
back. Two of the experiments dealt with 
women in a weight-reduction clinic (Conolley 
& Janis, in press; Dowds & Janis, in press) 
and a third used potential blood donors as 
subjects (Mulligan, in press). 

The present study used a 2 X 2 design to 
obtain additional information regarding the 
effects of (a) the decisional balance-sheet pro- 
cedure versus a control procedure and (b) 
high versus low self-disclosure interviews. 


— 
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INCREASING ADHERENCE TO A STRESSFUL DECISION 


METHOD 
Design Overview 


The setting for our research was a women’s early- 
morning exercise class conducted at the Yale gym- 
nasium and we used records of class attendance to 
obtain an unobtrusive behavioral measure as the 
dependent variable. Attendance at such classes is 
strictly yoluntary. As the initial excitement of the 
class wanes and other morning activities beckon, a 
decisional conflict arises for many participants re- 
garding whether to continue attending the class on 
a regular basis. 

Early in the semester, during the third week, 40 
members of the exercise class were interviewed via 
telephone. During the telephone conversation, we 
presented two different conditions that corresponded 
to our two independent variables. After receiving an 
introductory explanation and a request to have a 
pencil and paper at hand, each subject was first 
assigned on a random basis to a relevant or an 
irrelevant balance-sheet procedure. Those in the 
relevant balance-sheet condition were asked to con- 
sider carefully and to write out all the advantages 
_ and disadvantages that would result, for themselves 
and significant others, from their regular participation 
in the exercise class. Subjects randomly assigned to 
the irrelevant balance-sheet condition, on the other 
hand, were asked to consider carefully and to write 
out all the pros and cons involved in another health- 
oriented decision, reducing or abstaining from ciga- 
rette smoking. (Nonsmokers completed balance sheets 
concerning their ongoing decision not to smoke.) 
Thus, in contrast to the balance-sheet studies cited 
above, wherein comparisons were made between 
balance-sheet experimental and no-balance-sheet con- 
trol groups, the present study provided for com- 
parison of two types of balance-sheet treatments. 
This may be a somewhat superior procedure in that 
there is a greater equivalence across treatment con- 
ditions with respect to holding constant the nature 
of the subject-interviewer interaction and the amount 
of cognitive work required of the subject. 

Immediately after completing the relevant or ir- 
relevant balance-sheet procedure, each experimental 
subject was randomly assigned to either the high or 
low self-disclosure treatment. Subjects in the high 
self-disclosure condition were asked to read their 
balance-sheet entries aloud to the telephone inter- 
viewer (with the interviewer responding in a posi- 
tive, accepting manner). Those in the low self-dis- 
closure condition were instructed to scan their entries 
silently. While our telephone intervention was brief 
and superficial in contrast to the more intimate, gen- 
erally face-to-face interactions used in other self- 
disclosure studies, it seemed worthwhile to add this 
minimal self-disclosure treatment in order to test 
the limits of self-disclosure effects. 

All interviews were conducted in the same way, 
including the same standard introduction and the 
same concluding remarks. At the end of every inter- 
view, a brief tape-recorded message in the inter- 
viewer’s own voice was played recommending regular 
and frequent attendance at the exercise class. 
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Behavioral records were kept of every subject’s 
attendance for a 7-week period following the tele- 
phone interview. The exercise class met three times 
weekly and class participants were asked to sign an 
attendance sheet each time they came. Similar sheets 
were posted outside most of the gymnasium facili- 
ties, Care was taken to avoid any perceptible con- 
nection between the attendance sheets and the tele- 
phone intervention; subjects gave no indication to 
the class instructors of having surmised any such 
connection. 

It was hypothesized that subjects in the relevant 
balance-sheet condition, having given careful atten- 
tion to the advantages to be gained and the disad- 
vantages to be overcome, would attend the exercise 
class more than would irrelevant balance-sheet sub- 
jects who had focused their attention on the pros 
and cons involved in another decision, We were also 
interested in determining whether the minor form of 
self-disclosure used in the telephone interview was 
sufficient to have a discernible effect. 

Ten additional members of the same exercise class 
were randomly assigned to a no-treatment control 
group. The purpose of collecting attendance data for 
this supplementary group was to obtain a baseline 
rate of attendance for comparison with the experi- 
mental treatment groups in the 2 X 2 after-only ex- 
perimental design. 


Subjects 


Fifty women, including a few graduate students 
and wives of graduate students, with the majority 
being wives of Yale faculty members, served as sub- 
jects. Each subject had signed a class enrollment 
sheet during the second week of class, on which she 
had indicated her phone number, Using this list, 
10 subjects were randomly assigned to each of the 
four experimental treatment conditions (Relevant vs. 
Irrelevant Balance Sheet X High vs. Low Self-Dis- 
closure); the remaining 10 were in the untreated 
control group. 


Procedure 


All 40 of the experiment subjects were telephoned 
during the third week of the semester. Two experi- 
menters, one male and one female, each contacted 
half of the subjects. After introducing himself or 
herself as “a graduate student doing a study in 
cooperation with the Yale Physical Education De- 
partment . . . concerning people's decisions to engage 
in health-improvement programs,” the experimenter 
went on to ensure the subject of the confidential 
nature of the study and asked if the subject would 
give 10-15 minutes to answer some questions. All 
agreed to do so. (A few subjects had to be called 
back at a more convenient time.) 

After asking a few background questions (age, 
other physical education activities, whether subject 
smoked cigarettes, etc.) and then giving subject the 
time to get some writing materials, the experimenter 
continued: 


{Balance-sheet manipulation] Past research has 
shown that a very useful way for people to con- 
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sider some decision they’ve made or are making— 
like deciding to take an exercise class or not smok- 
ing cigarettes [Lor cutting down on smoking, if 
subject had indicated that she smoked]—is to 
write down the advantages and disadvantages that 
will result from their decision. Oftentimes, in going 
through the process of actually writing down the 
considerations involved in a decision, people think 
of benefits and difficulties they did not originally 
realize were involved—and it’s useful for them to 
be aware of these. What I would like you to do 
is spend the next couple of minutes writing down 
the benefits you hope to gain from taking the 
exercise class [“not smoking” or “cutting down on 
smoking”], and the difficulties and problems you 
expect to encounter, 

The way I'd like you to do that is this: At the 
top left of your page, write down GAINS TO SELF 
and list all the gains and benefits that will result 
from your regularly attending exercise class [cut- 
ting down or not smoking]. Don’t tell me what 
you're writing, just try to include as many con- 
siderations as you think are relevant. Be sure to 
be frank and candid, including all personal con- 
siderations that may affect your regularly attending 
the class [cutting down or not smoking]. Take 
about a minute to do that; PII signal you when a 
minute’s up. 


After one minute, the experimenter went on: 


Now, on the top right side of your page, write 
LOSSES TO SELF, and again list all you can think 
of, being frank and honest. Here you'll want to 
include any difficulties, inconveniences, or expenses 
you may encounter. 


In similar fashion, in turn, the subject was in- 
structed to make the headings GAINS TO IMPORTANT 
OTHERS, LOSSES TO IMPORTANT OTHERS, APPROVAL 
FROM OTHERS, DISAPPROVAL FROM OTHERS, SELF-AP- 
PROVAL, and SELF-DISAPPROVAL. The experimenter gave 
a brief explanation of what kinds of entries might 
go under each heading and paused one minute for 
each while the subject thought and listed her entries, 
Finally, the experimenter instructed the subject to 
make a last category, EXTRAS, and to enter there any 
considerations she had not already placed in one of 
the other categories. 


[Self-disclosure manipulation] [Then, to induce 
either high or low self-disclosure, the experimenter 
went on to say:] OK, now that you're done with 
the list, I'd like you to spend a couple of minutes 
reading the list over aloud [silently], to me [your- 
self], reviewing the benefits you can gain from the 
exercise program [not smoking], and the diffi- 
culties you will have to overcome. The way Pd 
like you to do this is to concentrate on one cate- 
gory at a time—eg., the GAINS TO SELF category— 
thinking about the benefits to be gained. When 
you think about the losses and disapprovals cate- 
gories, I’d like you to realize that to be successful 
in regularly attending class [not smoking], you 
will have to deal with or overcome the losses or 
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disapprovals, if they arise. I’d like you to con- 
centrate on one category until I signal you to go 
on to the next one. If you don’t have any entries 
under some category, spend the time trying to 
think of possible considerations you may have 
overlooked the first time through, and jot them in. 
OK, let’s start with the GAINS TO SELF category— 
review aloud [silently] the benefits yowve listed, 
and try to vividly imagine the benefits and diffi- 
culties you’ve listed, We'll move through all the 
categories, spending about 30 seconds on each. 


The experimenter responded to high self-disclosure 
subjects’ disclosures in an affirmative, accepting man- 
ner, offering invariably positive feedback. Balance- 
sheet entries that were favorable to following the 
desired course of action were met with responses 
such as “Good,” “Uh-huh,” and “That’s an advan- 
tage,” whereas negative entries were responded to 
with phrases such as “Yes, that can be a problem” 
and “It’s good that you’re aware of that.” 


[Communicating the recommendation] [After the 
experimenter and subject had proceeded through 
all the categories, the experimenter said:] Now 
that you've given some thought to one aspect of 
your personal health program, I would like to give 
you some information from prior studies about 
the importance of regular physical exercise. As you 
may imagine, I’m especially interested in health 
and physical education, so I hope you'll listen 
carefully, and realize the importance of attending 
exercise classes on a regular basis, and as often as 
possible. 


The experimenter then went on to play a stan- 
dard 2-minute tape recording (of his or her own 
voice) that strongly recommended regular and fre- 
quent attendance at exercise class, Following the 
tape, the experimenter answered any questions that 
the subject had and concluded the interview by re- 
questing that the subject not discuss the phone call 
with any members of her class, since “it’s important 
for each person to generate her own list of balance- 
sheet entries, without priming or forethought.” 


RESULTS 


Did our various treatments affect subjects’ 
attendance? A 2 (Balance-Sheet) x 2 (Self- 
Disclosure) x 2 (Experimenter) analysis of 
variance was conducted. This analysis indi- 
cated that one variable, the balance-sheet 
treatment, had a significant main effect, 
F(1, 32) = 13.79, p<.001. The self-dis- 
closure and experimenter variations had no 
significant main effects and there were no 
significant interaction effects, 

As seen in Table 1, the balance-sheet pre- 
diction was dramatically borne out: The mean 
weekly attendance of subjects in the relevant 
balance-sheet condition was double that of 
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the irrelevant balance-sheet subjects. For the 
untreated control group, attendance was 
about the same as for the irrelevant balance- 
sheet group (p> .50) and was significantly 
less than for the relevant balance sheet group, 
t(28) = 2.87, p < .01. 

The solid lines in Figure 1 show the week- 
by-week attendance for the relevant and the 
irrelevant balance-sheet groups. An additional 
analysis of variance (2 X 2X 2X7) was 
carried out for the data from these two 
groups, using subjects’ weekly attendance 
during the 7 consecutive weeks as a repeated 
measures factor. The results for the two ex- 
perimental variables and the experimenter 
variation were the same as reported above, 
the only significant finding being a main effect 
for the balance-sheet variable. A significant 
main effect for variations in weekly attend- 
ance was also obtained, F(6, 192) = 4.51, 
p < 001. The latter F pertains to the obser- 
vation that for both the relevant and the ir- 
relevant balance-sheet conditions there was a 
gradual decrement in attendance from the 
first to the seventh week. There was no sig- 
nificant interaction effect (F < 1), indicating 
no difference between balance-sheet conditions 
with respect to the rate of decrement. Figure 
1 also includes the data for the untreated con- 
trol group (shown by the broken line), which 
deviates only slightly over the 7-week period 
from the data for the irrelevant balance-sheet 
group. 


UNTREATED CONTROLS 
(N = 10) 


NO. ATTENDANCES PER SUBJECT 
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TABLE 1 


MEAN NUMBER or TIMES SUBJECTS ATTENDED CLASS 


Balance sheet 


Condition 
Relevant Irrelevant 
Self-disclosure 
High 11.6 5.4 
Low 12.1 6.2 
Nole. n = 10 for each cell. The mean for untreated control 
subjects = 5.6. 
DISCUSSION 


The results offer straightforward evidence 
for the effectiveness of the balance-sheet pro- 
cedure. Merely hearing the 2-minute “pep 
talk” advocating regular and frequent attend- 
ance was not enough, nor was considering an 
irrelevant (although health-related) decision 
enough, to increase class attendance—a con- 
clusion supported by the finding that there 
was essentially no difference between the at- 
tendance of irrelevant balance-sheet and no- 
treatment (baseline) control subjects. Com- 
parison of the untreated control subjects with 
those in the relevant balance-sheet condition, 
however, shows that, as predicted, the rele- 
vant balance sheet had a substantial effect 
on subsequent behavior. Subjects who com- 
pleted balance sheets relevant to attending 
exercise class actually attended class about 
twice as often as did other subjects. 


RELEVANT BALANCE SHEET (N = 20) 


WEEK 


Ficure 1. Week-by-week attendance for the relevant balance-sheet group, the 


jrrelevant balance-sheet group, 


and the untreated control group. 
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It is not surprising that the self-disclosure 
variable had no discernible effect. Previous 
studies (e.g., Colten & Janis, in press; 
Jourard, 1968; Mulligan, in press) of the 
effects of self-disclosure have involved con- 
siderably more intense, interpersonally more 
meaningful encounters with the interviewer 
than the brief telephone interaction in the 
present experiment. We noticed that the bal- 
ance-sheet entries of the high self-disclosure 
subjects were so abbreviated (e.g., “Better 
health,” “Inconvenience of driving to gym,” 
“Friends will admire,” etc.) that it was prob- 
ably unlikely that the subjects felt any sig- 
nificant sense of self-disclosure or that the 
interviewer’s responses would be perceived as 
being particularly empathic and accepting. 
Moreover, Janis’s (1975) model of the effects 
of self-disclosure in helping relationships sug- 
gests that after having received acceptance 
for their initial self-disclosures, subjects are 
more motivated to follow recommendations 
made by a counselor as a way of continuing 
to receive the counselor’s approval. In the 
present experiment no expectation was created 
that there would be any future interviewer— 
subject interaction or that the interviewer 
would be monitoring the subject’s attendance; 
therefore, it does not seem reasonable to ex- 
pect that subjects would attend exercise class 
as a way of winning the (unknowing) inter- 
viewer’s aproval. 

The accumulated evidence points to the 
effectiveness of balance-sheet procedures. Past 
studies (Colten & Janis, in press; Mann, 
1972; Janis, Note 1) have shown that when 
people are making personal decisions they 
tend to benefit from the careful consideration 
of pros and cons induced by a systematic 
balance-sheet procedure. With the success of 
the present study—dramatic behavioral ef- 
fects over a 7-week period as a result of a 
single 15-minute telephone interview—we ex- 
pect to see further investigations and applica- 
tions of the decisional balance-sheet tech- 
nique in a variety of decision-making situa- 
tions. 

Earlier we mentioned two alternative medi- 
ating processes that could account for the 
positive effects of the relevant balance-sheet 
procedure: (a) an emotional inoculation ef- 
fect, whereby participants are better able to 


tolerate subsequent negative feedback (in the 
present study, for example, embarrassment 
from failing to perform certain of the exer- 
cises correctly, inconveniences, fatigue, and 
muscular aches resulting from attending the 
exercise classes) or (b) a self-persuasion 
effect, whereby the participants become aware 
of more positive incentives that outweigh the 
negative incentives for carrying out the given 
course of action (in this case, attending the 
exercise class). If emotional inoculation were 
the predominant effect, we would expect to 
find that the superiority in attendance result- 
ing from the relevant as compared with the 
irrelevant balance-sheet procedure would ap- 
pear gradually, week by week, as the partici- 
pants encountered more and more instances 
of minor negative consequences and perhaps 
one or two major setbacks that might tempt 
them to quit. If the positive effects are attri- 
butable primarily to self-persuasion, however, 
we would expect to find an immediate effect, 
just as typically occurs when role playing or 
authoritative persuasive communications are 
used to promote a recommended course of 
action. The week-by-week attendance data in 
Figure 1 show that there was, in fact, an im- 
mediate effect that appeared in the very first 
week following the telephone interview and 
that continued without being augmented dur- 
ing the subsequent 6 weeks. These findings 
suggest that self-persuasion was one of the 
mediating effects of the relevant balance-sheet 
procedure. 

We know from the content of the balance 
sheets read to the interviewer by subjects in 
the self-disclosure condition that generally 
they listed many favorable consequences of 
the decision and few unfavorable ones. This 
mobilization of predominantly positive incen- 
tives is similar to what happens in role-play- 
ing experiments when a person is induced to 
write an essay in support of an assigned posi- 
tion on an issue with which he is familiar 
under conditions of favorable sponsorship 
(see Collins & Hoyt, 1972; Elms & Janis, 
1965; Janis & Gilmore, 1965). In the present 
experiment, the subjects were asked to fill 
out the balance sheet for only one alternative 
and it was one to which they had already 
committed themselves. Furthermore, the in- 
terviewer was associated with a prestigeful 
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educational institutfon and was unambigu- 
ously in favor of the decision. Under less 
favorable conditions, a relevant balance-sheet 
procedure would probably be less effective and 
might even have the reverse effect. The limit- 
ing conditions for «the positive effect of the 
relevant balance-sheet procedure observed in 
this study obviously require further investiga- 
tion. f 


REVERENCE NOTE 


1. Janis, I. L. Pilot studies on a balance-sheet pro- 
cedure and other-interventions for improving per- 
sonal decision’ making. Unpublished manuscript, 

- 1969, (Available from I. L, Janis, Department of 

* Psychology, Yale University, 2 Hillhouse Avenue, 

New Haven, Connecticut 06510.) 
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Effects of a Child’s Imitation versus Nonimitation 
on Adults’ Verbal and Nonverbal Positivity 


John E. Bates 
University of California, Los Angeles 


In response to the need for more information about how children can affect 
their own social development through the effects they have on adults, the 
imitativeness of child confederates in a basketball teaching situation was ex- 
perimentally varied. The responses of 48 male college subjects to this manipu- 
lation were measured on 21 verbal and nonverbal variables. The variables were 
factor analyzed, and an analysis of variance was done on the positivity factor 
scores, Subjects who had been imitated by the child were significantly more 
positive in their factor scores than subjects who had not been imitated (F = 
20.85, p<.01). There was also an interaction effect (F= 4.55, p< .05), in 
which subjects who were instructed to explicitly ask the child to imitate were 
less positive when the child did so than were subjects who had simply 
modeled. Results with the positivity factor scores were supported by separate 


analyses of variance on the 21 variables. 


Understanding adult-child relationships is 
obviously crucial for the understanding of 
children’s social development. A great deal of 
research in this area has attempted to show 
the effects of adults, such as parents and 
teachers, on the behavior or personality de- 
velopment of children. However, the possi- 
bility that the child is active in his social 
development is being admitted more and 
more. Writers such as Bell (1968, 1971), 
Gewirtz (1969), Staats (1971), and the New 
York Longitudinal Study group (e.g., Chess 
& Thomas, 1973; Thomas, Chess, & Birch, 
1968; Thomas, Chess, Birch, Hertzig, & 
Korn, 1963) have argued that the child has 
influence over the adult and, by the kinds of 
responses he produces, plays an appreciable 
role in his own social development. However, 
very little actual research has yet been done 
to examine the adult-child relationship as a 
two-way influence process. 

In recent years, a few studies of the effects 
of children on adults have begun to appear. 


This article is based on a dissertation submitted to 
the University of California, Los Angeles in partial 
satisfaction of the requirements for the doctoral 
degree. The author wishes to thank his doctoral 
committee, B. Guilani, O. I. Lovaas, A. Mehrabian, 
W. H. Miller, and especially his chairman, P. M. 
Bentler, for their helpful comments. 

Requests for reprints should be sent to John E. 
Bates, who is now at the Department of Psychol- 
ogy, Indiana University, Bloomington, Indiana 47401. 


Berberich (1971) found that he could shape 
various teaching behaviors of college-student 
females by manipulating the trial-by-trial 
response correctness of a fictitious 4-year-old 
male pupil. In a similar context, Osofsky 
(1971) found that nonrelated mothers’ en- 
couragements and reinforcements varied ac- 
cording to the dependent, independent, or 
stubborn roles played by girl accomplices 
(aged 10-12). Later, Osofsky and O’Connell 
(1972) manipulated the dependency of 4—6- 
year-old daughters in an achievement task. 
They found that girls’ parents tended to be 
more physically and verbally interactive and 
more controlling when the girls were depen- 
dent than when independent. 

Yarrow, Waxler, and Scott (1971) also 
found, in a study designed to measure adult 
effects on children, that teachers’ behavior 
depended not only on the teacher’s nurturant 
versus nonnurturant set, but also on the be- 
havior of the children. For example, when the 
teacher had a nonnurturant set, she herself 
actually initiated more negative contact with 
attention-seeking children. It was also found 
that when a child’s response to a teacher- 
initiated positive contact was coded by raters 
as positively reinforcing, the teacher initiated 
another positive contact with the child much 
sooner than if the child’s response had been 
unreinforcing or nonresponsive. 

Finally, Stevens-Long (1973) varied, on a 
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videotape, a child’s activity level, aggression 
toward an adult, and label of normal versus 
emotionally disturbed. Adults viewing the 
tape reported how they felt toward the child 
and how they would discipline him. Stevens 
found that, despite the mitigating effects of 
the label of emotional disturbance, adults 
were more negative and punishing toward an 
over- or underactive child than toward a nor- 
mally active child. 

With the exception of the Osofsky and 
O’Connell (1972) study, all of the experi- 
ments on the effects of the child on adults 
have used unrelated adult-child pairs. This 
allows for the desirable control of past rela- 
tionship history which could affect current 
interaction. However, it also raises the prob- 

“lem of generalizability to more naturally oc- 
curring adult-child dyads. Halverson and 
Waldrop (1970) investigated this problem of 
“ownness” by measuring the consistencies of 
mothers’ behavior toward their own and oth- 
ers’ nursery school children in an achievement 
task situation. They obtained correlations 
ranging from .48 to .63 between behavior 
with own and other’s child on amount of talk- 
ing, number of positive encouraging state- 
ments, and number of negative controlling 
statements. In addition, the mothers tended 
to be more positive and encouraging with 
someone else’s child and more negative and 
controlling with their own, It is risky to 
generalize too far from one study, especially 
considering that the children themselves may 
have influenced the level of positivity of the 
adults, but it appears possible that the prob- 
lem of “ownness” can be compensated for 
in studies using nonrelated adult—child pairs. 
One might, in generalizing from results with 
nonrelated pairs to natural relationships, 
predict more negativity and less positivity in 
the natural relationship. Obviously, however, 
further research is needed on this issue. 
The present study is an experimental ana- 
logue of a child-teacher relationship, which 
means that it has the problem of “ownness” 
discussed above. However, judging from the 
Halverson and Waldrop (1970) study, if 
effects were obtained here, they would also 
likely be seen in more natural, long-term 
child-teacher relations, although one would 
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perhaps expect more negative responses to 
the children in the natural relationships. 

The aspect of child behavior selected to be 
experimentally varied in this study concerned 
responsiveness to the adult’s teaching efforts. 
A child’s response to adult influence attempts 
seemed likely to be important on the basis of 
Yarrow’s (1963) observations of the effects 
of the levels of responsiveness of two foster 
infants and Berberich’s (1971) earlier men- 
tioned finding that teacher behavior was 
shaped by the learning success of the pupil. 
Clinical experiences also indicated the im- 
portance of responsiveness. To give one such 
clinical example, fathers of extremely femi- 
nine-identified boys were urged by a thera- 
pist to teach their sons some stereotypically 
masculine skills and interests. In cases where 
their efforts were met with lack of behavior 
change on the part of the children, these 
fathers’ efforts rapidly fell back to the origi- 
nal low level. 

The specific form that responsiveness to 
teaching attempts took in the present study 
was imitation versus nonimitation by chil- 
dren. Selection of imitation as the independent 
variable has interesting benefits. First of all, 
imitation or observational learning has been 
generally considered to be a very crucial and 
frequently occurring factor in the child’s 
learning, especially in his social development 
(e.g., see Bandura & Walters, 1963; Mischel, 
1970). So imitation is a child behavior that 
almost certainly occurs a great deal in adult- 
child relationships and is probably noticed by 
the adult. Second, imitation has been very 
well studied, experimentally and otherwise, 
as a dependent variable which is at least 
partially under the control of adult behavior, 
The most relevant examples of such studies 
concern the effects of adult rewardingness and 
nurturance on child imitation, Bandura and 
Huston (1961) found that children who had 
previously interacted with a rewarding, nur- 
turant model imitated the model’s incidental 
behaviors more than children who had inter- 
acted with the same model acting nonnur- 
turantly. Mischel and Grusec (1966) found 
that the rewardingness of the model, along 
with the model’s possibilities for control over 
the child in the future, influenced children’s 


degree of imitativeness. Finally, Yarrow and 
Scott (1972) found that a nurturant model 
produced more model-absent, delayed imita- 
tion and more thematically elaborated, well 
organized imitation than a nonnurturant 
model—taken by the authors as an opera- 
tional definition of greater internalization of 
the nurturant model. To summarize, imitation 
is regarded as an important behavior of chil- 
dren, and the independent variable of degree 
of nurturance shown by an adult model 
toward children affects the dependent varia- 
ble of degree to which children imitate. 

Thus, the results of the present study 
might offer information about the formation 
and maintenance of adult-child feedback 
loops. If the adult subject’s level of positivity 
(which can be related to nurturance) is af- 
fected by the child’s level of imitation, as 
hypothesized, here would be a good experi- 
mental model of how upward or downward 
relationship spirals or learning spirals might 
be formed and maintained. As an example 
of a negative spiral, an adult models some 
behaviors which the child fails to imitate; 
the adult responds to this by being less posi- 
tive and less rewarding in his subsequent 
interactions with the child; the child then is 
somewhat less likely to imitate the adult the 
next time he has the opportunity, and so 
forth. 

A second independent variable in the study 
was that of directness of instruction. In real 
life, two kinds of modeling situations fre- 
quently occur—those in which an adult 
simply models behaviors without specifically 
asking a child to imitate and those in which 
an adult models but also directly asks the 
child to imitate. It was felt that experimental 
variation of directness of the adult’s instruc- 
tion of the child might suggest whether and 
how such implicit versus explicit demand 
contexts are important in natural adult-child 
relationships, 

In outline, this study was designed to mea- 
sure the effects of a child’s imitation or non- 
imitation on adults. Male college student 
subjects attempted to teach a child confeder- 
ate some basketball moves. Their teaching 
instructions were indirect (in which they only 
modeled the behavior) or more direct (in 
which, in addition to modeling, they asked 
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the child to pay attention and to reproduce 
the behavior later). 

The results of the manipulations of imita- 
tion versus nonimitation and direct versus 
indirect instruction contexts were assessed in 
a number of ways. There were several brief 
periods of unstructured adult-child inter- 
action during which the adult was observed. 
The behaviors of interest here were his verbal 
and nonverbal indicators of positivity, for 
example, his enthusiasm, the distance he 
chose to stand from the child, and the amount 
of talking to the child. As a measure of the 
delayed effects of the experimental manipula- 
tions, the adult’s reinforcingness toward the 
same child was measured. In this situation, 
the adult “taught” the child some word pairs 
by using monetary rewards and punishments 
as feedback after each trial. Finally, as an- 
other delayed measure the adult evaluated the 
child’s intellectual and social characteristics 
and indicated his motivation to interact with 
the same child (or a different one—as the 
subject chose) in a similar, future study, It 
was planned that the different measures 
would be combined in some way according to 
their intercorrelations to obtain an overall 
index of positivity. 

It was expected that there would be greater 
positivity on all measures when the child 
imitated than when the child failed to imitate. 
There was no hypothesis as to differences be- 
tween the coaching (direct instruction) and 
modeling only (indirect instruction) condi- 
tions when the child imitated. But it was 
predicted that there would be less positivity 
when the child failed to imitate after direct 
instruction, as opposed to indirect instruction 
(modeling only) because, presumably, sub- 
jects would see refusal of a direct request as 
more negative on the part of the child and 
would reciprocate that negativity. In gen- 
eral, there is good evidence that the non- 
verbal and verbal communication variables 
selected, such as eye contact, proximity, posi- 
tivity of speech content, amount of talking, 
etc., do vary according to the degree of 
positivity of the subject toward an other 
(Mehrabian, 1972). Furthermore, viewing the 
imitation and nonimitation of the child con- 
federate in this study as positive and nega- 
tive behaviors, respectively, one can find prior 
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support for the prediction that imitation by 
the child leads to reciprocal positivity or 
negativity in the subject (Heller, Myers, & 
Kline, 1963; Kelly & Stahelski, 1970; 


`> Mehrabian, 1971b; Raush, 1965; Shannon & 


Guerney, 1973). However, in view of the pos- 
sibility that personality variables might affect 
` the nonverbal responses of subjects in this 
kind of situation (e.g., Exline & Messick, 
1967; Mehrabian, 1971a, 1971b), Mehrabi- 
an’s (1970) scales of affiliative tendency and 
sensitivity to rejection were administered to 
the subjects. 

As implied in earlier discussion, the major 
N of this study was to increase knowledge 
abðut ways in which children can affect 
adults. Because of the particular variables 
selected for study, it was hoped that this 
knowledge might have relevance, at least of 
an hypothesis-suggesting nature, for the 
better understanding of reciprocal influence 
and feedback loop processes in adult-child 
relationships. 


METHOD + 
Design 


This study used basically a 2 X 2 factorial design, 
with 2 levels of imitation (IMIT), complete versus 
none, and 2 levels of specificity of instruction 
(SPEC), no specific instructions (i.e., modeling 
only) versus modeling plus direct instructions to 
attend and imitate. However, in order to test for 
effects due to differences in confederates, a third 
2-level factor (CONF) was added, making the design 
a2X2X2 factorial. 


Subjects 


There were 48 male, undergraduate college stu- 
dents who acted as subjects, randomly assigned, 6 
each, to the 8 cells ‘of the design. Five subjects, 
relatively evenly distributed among conditions and 
between confederates, expressed suspicion in debrief- 
ing that the child’s behavior was not completely 
natural. These subjects’ data were retained, since the 
suspicions were mild and in the nature of post hoc 
explanations of what had happened and since the 
subjects had continued to cooperate fully in their 
assigned roles as teachers. 


Confederates 


Two 13-year-old boys, who were both in the 
seventh grade, served as the experimenter’s paid 
confederates. The confederates’ task was to interact 


More detailed method information, such as pre- 
cise instructions can be found in Bates (1973). 
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with the adult subjects in a neutral to quietly 
friendly way, to respond to but not take conversa- 
tional initiatives, to exactly imitate or completely 
fail to imitate the adult subjects according to the 
experimenter’s directions, and to read word associa- 
tion responses to subjects in a natural way at a later 
time. Each boy participated equally in each of the 
four conditions. 


Overview of Procedure 


The subject was met at the experimenter’s office 
and given the basic plan of the study and the im- 
portance, for the child’s future skill development, of 
his teaching the child basketball moves. Then he 
was taken to the clinic playroom (where a basket- 
ball backboard, hoop, and net were installed at 
almost regulation height). Here he was shown the 
sequence of basketball moves he was to teach and 
then told the precise method (i.e., coaching versus 
modeling only) he was to use in teaching. These 
instructions also tried to give the subject the expec- 
tation that the boy would imitate. The subject was 
next introduced to the child confederate, who was 
brought in from another room. The experimenter 
left the room supposedly to observe the confederate’s 
attending responses, but actually to leave the subject 
and confederate alone together. The subject was in- 
structed to wait about 30 seconds (to allow the ex- 
perimenter time to “set up to observe the confeder- 
ate’s attending responses”), then “teach” the con- 
federate the moves, and, when finished, was removed 
from the playroom by the experimenter. On leaving, 
the experimenter covertly signaled the confederate as 
to whether he should imitate. Both the experimenter 
and the subject went immediately into the observa- 
tion room to observe the confederate’s imitation or 
nonimitation, which constituted the introduction of 
the independent variable of imitation level, After 
watching the confederate for 45 seconds, the experi- 
menter took the subject back into the playroom, 
This teach-leave-observe-return sequence was re- 
peated three times. The confederate either imitated 
every trial or did not imitate at all with any one 
subject. On the subject’s third return to the play- 
room the ball was taken out and the subject and 
confederate were left alone for 2 minutes while the 
experimenter prepared for the next phase of the 
study, 

An observer rated the quality of the subject’s be- 
havior toWard the confederate during all four pe- 
riods when the subject and confederate were in the 
playroom alone together. The rater did not see the 
confederate’s imitation or nonimitation (he was be- 
hind a partition at those times), nor did he hear 
any comments the subject happened to make to the 
experimenter about the confederate’s behavior (he 
kept his ears plugged). The subject did not see the 
rater. 

For the second phase of the study the subject and 
the confederate were taken to another room where 
they sat, visually separated by a plywood screen, at 
a table. Here, the subject’s task was to “teach” the 
confederate a short list of paired associates with 
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signals of monetary reward and punishment as feed- 
back after each response by the confederate. The 
confederate read his responses from a prearranged 
schedule which was on a card taped to the confed- 
erate’s side of the screen. 

The word pairs were chosen to represent a range of 
difficulty of association. The confederate’s responses 
were 25%, 50%, and 75% correct on subsequent 
trials. On some words, once correct, he continued 
correct, but on a few others, he showed lapses (a 
manipulation intended as a possible inducer of 
frustration in the subject). 

Then, the subject was taken to another room where 
he rated the confederate and the study and com- 
pleted a form requesting that he volunteer for future 
participation in a similar study. Finally the subject 
was debriefed, which included asking the subject 
whether he had any comments or questions that he 
had not written where asked this question on one of 
the questionnaires. No subject appeared disturbed by 
the events of the experiment, even where he had not 
been imitated. A letter discussing the reason for 
deception, the background, the major results, and the 
relevance of the results to adult-child relationships 
was mailed to each subject’s summer address about 
1 month after the last subjects were run. 


Basketball Teaching Phase Measures 


Because the measures have already been described 
in greater detail elsewhere (Bates, 1973), they are 
listed rather briefly here. Altogether there were 12 
different measures of the subject’s social interaction 
behavior toward the confederate throughout the 
basketball phase of the study. Positive expression 
rate, negative expression rate, and the derived index 
(net expression rate; ie, positive minus negative) 
were all based on continuous tallies and taken from 
Mehrabian (1972). Nonverbal expression of enthu- 
siasm, scored on a 5-point scale, and distance be- 
tween the subject and the confederate (who stood on 
a standard spot), scored on a 4-point scale, were 
both ratings which were made after the end of each 
period the subject was alone with the confederate. 

Other measures of nonverbal behavior included: 
face observation rate, a stopwatch timing of the sub- 
ject’s looking at the confederate’s face; segment time, 
the time the subject actually spent each segment be- 
tween the experimenter’s departure and the comple- 
tion of the subject’s two demonstrations; and ele- 
ments omitted, a count of how many elements of the 
standard sequence of basketball moves the subject 
left out of each demonstration (as a measure of the 
subject’s involvement in his modeling task). 

There were four different measures of the subject’s 
verbal behavior, each made once for a segment. 
First, tone is a 9-point scale which includes positive, 
neutral, and negative scores and which was taken, 
with a few modifications, from Mehrabian (1972) 
and scored using an electronic filter to make speech 
content unintelligible. The second measure, content, 
also adapted from Mehrabian (1972), was scored on 
a 13-point scale from very positive (e.g., segments 
containing statements such as “So you play the 
drums too; that’s really fantastic”) to very negative 
(e.g., segments containing statements such as “God, 
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this room is really a dump”). The third verbal be- 
havior measure, conversation rate, was based on a 
count of the number of words spoken by the subject 
(other than teaching instructions in the coaching 
condition). Finally, the subject’s sighs were tallied, 
both from the audio tape and by the experimenter 
while in the observation room with the subject. 
(The experimenter observed 7 sighs in the observa- 
tion room; 59 were heard in the recordings of the 
subject-confederate interaction.) 

There were four different ratings on each of the 
variables above (except for elements omitted, be- 
cause the fourth segment did not involve the basket- 
ball sequence), one for each segment the subject 
was alone with the confederate. Those variables with 
“rate” in their names consisted of a summation of 
scores for the last three segments divided by the 
number of seconds in those segments. Variables that 
were not rates consisted of sums over the middle two 
segments (elements omitted), sums over the last 
three segments (enthusiasm, segment time, distance, 
and sighs), or averages over the last three segments 
(content and tone), It was planned that, where 
analyses of variance showed significant or trend main 
effects or interactions involving imitation level in the 
first segment scores, that is, before the imitation 
manipulation had been introduced, the final analysis 
of these variables would be done on difference 
scores. Such scores consisted of the first segment 
score subtracted from, depending on the variable, 
the sum, average, or rate of the later segments. 

Nonverbal behavior ratings were made from 
behind an observation mirror; verbal behavior was 
rated from audio tapes and transcriptions of the 
tapes. The scales were all regarded, largely on the 
basis of prior research (see Mehrabian, 1972), as 
measures of positivity of feeling toward the confed- 
erate, although their results were examined sepa- 
rately as well as in an overall positivity index. 


Behavior Ratings? 


One observer’s ratings were used for the facial 
expression, enthusiasm, face observation, distance, 
and elements omitted measures. Reliability was 
checked on 10 of the subjects by another, indepen- 
dent rater. For positive expressions the Pearson r 
between the two raters was .90 j for negative expres- 
sions, a relatively rare event, occurring in only 9 
of the 40 jointly rated segments, r= .42 (85% exact 
agreement, 100% within difference of 1 or less) ; for 
distance between the subject and the confederate, 
r= 87; for face observation, r=.99; and for ele- 
ments omitted, r= .80. The raters were unable to 
EN pi agreement in their ratings of enthusiasm 
r= .38). 

Two independently made sets of content ratings 
correlated r = .79 and were averaged together. Reli- 
ability on tone checked on 12 randomly selected 
subjects (a total of 48 different segments) was 
r=.94. One rater’s scoring of tone was used for 
all subjects. 


*The author is grateful to Larry Bjorklund, 
Donna Kramer, George Speckart, and Frank Wagner 
who made the behavior ratings. 
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All ratings were, of course, done without knowl- 
edge of the imitation level condition. Raters of 
verbal content did know whether subject was coach- 
ing versus modeling only, however, bias was not 
likely since the raters were only minimally aware 
of the two specificity of instruction conditions and 
since the hypotheses regarding this variable were 
not obvious and not disclosed to the raters. Raters 
also could not help knowing which child the subject 
was working with, however, again, the raters were 
unaware of hypotheses regarding the two confed- 
erates’ behavior. 


Confederates’ Behavior 


There needed to be some assurance that the con- 
federate was not producing any changes in the sub- 
ject’s attitude by means other than the responsiveness 
versus nonresponsiveness via imitation level. For 
example, the confederate might have become less 
positive toward a subject whom he failed to imitate 
than one whom he did imitate. Although both 
subjects were carefully trained to be consistently 
courteous and attentive and respond to verbal 
initiatives of the subjects without taking initiatives 
themselves, it was important to have some measures 

` of the confederates’ positively behaviors. Since the 
confederate had his back to the mirror and stood in 
the same place with every subject and since the be- 
havior rater was busy rating the subject’s behavior, 
the most feasible measures of the confederate’s be- 
havior were the verbal ones, Thus, each of the con- 
federate’s responses in the second interaction segment 
was scored for verbal tone and content as being 
positive, neutral, or negative. It appears that the 
confederates were able to remain relatively neutral 
in all conditions. There were only 3 statements non- 
neutral in content by the confederates in the sec- 
ond segments, out of 128 total, 2 in the nonimitate- 
modeling, and 1 in the imitate-coaching condition. 
Furthermore, ratings of the confederates’ voice tone 
in the second segments showed no segments in which 
they were other than neutral. One confederate did 
break out of his role several times to make state- 
ments not in reply to the subject or to ask the sub- 
ject personal questions (total of 9 words in the 
imitate-modeling condition, 15 in nonimitate-model- 
ing, and 22 in imitate-coaching). 

It was expected that there could be differences in 
the subjects’ responses due to differences in the two 

“children, beyond effects due to the possible system- 
atic biases discussed above and effects due to imita- 
tion level and specificity of instruction. While this is 
not a centrally important variable, it does relate to 
the generalizability of the findings. So confederate 
level was included as a separate factor in the data 


analysis. 


Word Association Teaching Phase Measures 


Since the confederate behaved in exactly the same 
way in the word association phase with all sub- 
$ jects no matter what his behavior in the basketball 
-® mhase, the subject’s behavior here was viewed as a 
delayed measure of his positivity during the basket- 
ball phase. The measures were very simple ones: 
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Total rewards and punishments over all responses 
over all three trials were analyzed separately; in 
addition reward and punishment were combined 
into a single index, net reward. 


Subject’s Evaluation of the Confederate 
and the Study 

At the end of the study the subject answered 
multiple-choice questions concerning his appraisal of 
the confederate’s intellectual and social abilities, and 
subject’s own reactions to the confederate and the 
study in general. These ratings were examined in four 
categories: intelligence (two items), social charac- 
teristics (six items), subject’s liking of the study 
(one item), and subject’s preferred phase of the 
study (one item). 


Volunteer Measure 


As another measure, indicating the subject’s desire 
to participate in another similar study, the subject 
was asked to complete a volunteer form. The subject 
was asked whether he would volunteer 1 hour with- 
out credit or pay for a similar study in the future, 
and if so, whether he would prefer to work with the 
same or a different boy. The main piece of data 
here was the number of different hours during the 
week subject felt he could arrange to be free, 
starting with O (in the case of those who would 
not volunteer). 


RESULTS 
Combined Positivity Index 


In order to obtain an overall index of the 
subjects’ positivity toward the child confed- 
erates, a factor analysis was performed on 
the dependent measures for all 48 subjects. 
Measures of positive minus negative expres- 
sions and net reward were excluded since the 
two measures are linearly dependent on their 
components. 

In the analysis of the 19 variables, squared 
multiple correlations were used as initial com- 
munality estimates and iterated to stability. 
Since all variables were a priori considered 
to be measures of positivity, an unrotated, 
single-factor solution was sought. The first 
five eigenvalues for this analysis were 2.86, 


8The objective of this analysis was to obtain a 
weighted combination of the variables and not to 
explore the factor structure of the different mea- 
sures. However, a two-factor solution was also 
obtained and orthogonally rotated using varimax. 
The two-factor solution did not reveal any in- 
terpretable dimension beyond that of the one-factor 
solution. In addition, analyses of variance on the 
two-factor solution factor scores yielded essentially 
identical results as the same analysis of the one- 
factor solution scores (described below). 
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TABLE 1 
Factor ANALYSIS 

Variable Loading 
1. Positive expression rate 63 
2. Negative expression rate — 44 
4. Enthusiasm 39 
5. Elements omitted difference —.22 
6. Face observation rate 59 
7. Distance —.34 
8. Segment time difference 4 
9. Conversation rate difference 4 


10. Content 

11. Tone difference 
12. Sighs difference 
13. Reward 

14. Punishment 
16. Intelligence 


17. Social characteristics 38 
18, Enjoyment of study 30 
19. Preferred phase ats 
20. Preferred boy AT 


21. Volunteered hours 1 


1.31, 1.00, .91, and .69. The factor loadings 
are presented in Table 1. (The variable 
numbers correspond to those in Tables 2 
and 3.) 

The loadings suggest that the factor is a 
bipolar positivity factor, as was expected. 
There are positive loadings on such measures 
as positive expressions, face observation, and 
conversation rate. There are negative loadings 
on measures such as negative expressions, dis- 
tance the subject stands from the confederate, 
and sighs. Each subject’s factor score was 
estimated by the regression method and 
yielded a variable having a mean of O and a 
variance of 1. This variable was then 
regarded as the combined positivity index. 

Data presentation format. Before reporting 
the results it would be useful to explain how 
they are presented in the tables. A three- 
factor analysis of variance was performed on 
the positivity index, as well as on each of the 
21 separate measures. There were two levels 
of specificity of instruction (i.e., modeling 
only vs. modeling plus coaching), two levels 
of imitation (i.e., imitate vs. nonimitate), and 
two levels of confederate (i.e., Child 1 vs. 
Child 2). Cell means for the Specificity x 
Imitation conditions in these analyses can be 
seen in the rows of Table 2. There were too 
few significant effects or interactions in- 
volving confederates (see below) to necessi- 
tate presenting the means for this variable. 
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Table 3 contains a summary of the results 
of the analyses of variance and ¢ tests 
(which were used to assess the relevant grov 
differences wherever an analysis of varianc 
showed a main or interaction effect or trend) 
Only the significant or trend Fs and żs are 
included. One-tailed ż tests were used for 
predicted differences, and two-tailed tests 
were used otherwise. For all analyses of vari- 
ance dj = 1,40; and for the ¢ tests df = 46 
in tests of main effects means, 22 in first- 
order interaction tests and 10 in second-order 
interaction tests. 

Positivity index analysis. To summarize the 
results of the analysis of the combined posi- 
tivity index (see Row 22 of Tables 2 and 3 
for more detail), there was a highly signifi- 
cant main effect for imitation level (F = 
20.85, p<.01) with imitated subjects 
being significantly more positive than non- 
imitated subjects, as was hypothesized. In 
addition, there was a significant Specificity of 
Instruction X Imitation interaction effect (F 
= 4.55, p < .05). Here, in both the model- 
ing and coaching conditions, subjects who 
were imitated by the child were significantly 
more positive than those who were not. How- 
ever, the hypothesized greater negativity in the 
nonimitate-coaching condition relative to the 
nonimitate-modeling condition was not ob- 
tained, and in fact, the nonimitate-coaching 


TABLE 2 
CELL MEANS 


Imi- | Non- | gmi- | Non- 

rae tate- | f™E | tate | ime 
Variable model- | tate- | coach- | tate- 
k model- | °° coach- 

AA RE A aa aT 


1. 043) 056 045 
2 014) 004| 012 
E 029| 052| 032 
4. 8.75 | 10.33 | 8.67 
at ‘ments omitted 
ifference 917| 167| .250 
6. 368| 242| 1341 297 
Te 7.33 | 9.33 | 8.00] 8.50 
8. Segment time difference | 39,9 46.4 42.5 43.6 
9. Conversation rate 
difference 30 | —.24 16 19 
10. Content 3.78 | 3.28 | 3.78 | 3.40 
11. Tone difference —.32 | —.36 | —.29 | —.52 
12. Sighs difference 25) 1.92] 1.75] 1.42 
13. Reward 147.5 |144.6 | 137.1 | 145.4 
14. Punishment 86.2 | 93.3 | 97.5 | 84.5 
15, Net reward 62,2 $1.2 39.6 60.8 
16. Intelligence 7.42 TAZ 7.58 6.58 
17. Social characteristics 23.0. | 21.2 | 21.1 | 20.1 
18. Enjoyment of study 4.17 3.67 4.00 4.00 
19. Preferred phase 1.50 1.50 1.67 1,42 
20. Preferred boy 1.25 -92 | 1.33] 1.00 
21. Volunteered hours 10.50 | 4.17 | 8.50] 7.33 
22. Total positivity score 77 | —.68 21 — 31 


Be 


| 


X 


EFFECTS OF A CHILD’S IMITATION VERSUS NoNIMITATION 847 
TABLE 3 
SUMMARY OF ANALYSES OF VARIANCE AND / TESTS: SIGNIFICANT AND TREND FS AND (S 
Analyses of variance t tests 
Variable 
Source F Comparison t 
1. Positive expression rate IMIT 9.18*** I>N 3.09° 
2. Negative expression rate IMIT 10.06*** I<N 3.17s 
SPEC X CONF 2.86* C1> C2 1.94» 
3. Net expression rate IMIT LEST esas I>N 3.925 
4. Enthusiasm IMIT 13.98*** I>N 3.44¢ 
IMIT X CONF 4.18** I1 >Ni 3.89° 
N1 < N2 1,98 
ó. Face observation rate IMIT 5.02** I>N 226° 
7. Distance IMIT CASS jaa I<N 2.95° 
9. Conversation rate difference IMIT 4.01* I>N 1.96° 
i SPEC X IMIT 5.04** IM > NM 2.78° 
5 NC > NM 2.274 
10, Content IMIT 3.28* I>N 1,88° 
11. Tone difference IMIT X CONF 9,430" 2>N2 2,978 
` Ii < I2 2.204 
N1 > N2 2.364 
12. Sighs difference SPEC X IMIT 3.44* IM < NM 2.68° 
IM < IC 2.534 
14, Punishment SPEC X IMIT X CONF 3.24" IM1 < NMI 1,89° 
IMi <IM2 2.00» 
15. Net reward SPEC X IMIT 3.39* IM > IC 2.25» 
16. Intelligence IMIT 3.81* I>N 1.99° 
17. Social characteristics SPEC 6.06** M>C 2.364 
IMIT 4.82** I>N 2.08° 
CONE 3:23* 
22. Total positivity score IMI" 20.85*** I>N 4.71° 
SPEC X IMIT 4.55** IM > NM 5.24° 
IC > NC Lise. 
IM > IC 1,88 


Note, SPEC = independent variable of specificity of instruction; IMIT = imitation level; CONF = confederates factor; 


I = imitate; N = nonimitate; C = coaching; M = modeling; 1 = first confederate; 2 = second confederate, 


ap <.1, one-tailed, 

bp <.01, two-tailed. 
< .05, one-tailed. 
.05, two-tailed. 


b 
e p <.01, one-tailed, 
* 

ek 


<. 
< .05. 
< 0t. 


< 
< 
b 
b 
ae 


mean positivity was greater (but not signifi- 
cantly so) than the nonimitate-modeling 
mean. No difference between the imitate- 
modeling and imitate-coaching condition was 
hypothesized. However, there was a trend 
with the imitate-modeling subjects being 
somewhat more positive than the imitate- 
coaching subjects. Thus, on the combined 
positivity index, a subject’s level of positivity 
was clearly affected by whether the child 
imitated or not, but this effect also partially 
depended on whether the subject was merely 
modeling the basketball moves as opposed to 
modeling plus giving direct coaching instruc- 
tion. The effects were independent of any 
differences in the two different children. 


Individual Positivity Measures 


The hypotheses of the study were also 
tested for each of the 21 different dependent 
variables separately. For each variable, a 
separate three-way analysis of variance was 
performed and ¢ tests were done, just as for 
the combined index. (See Tables 2 and 3, 
described above, for the cell means, analysis 
of variance, and ¢ test results.) 

Initial differences or trends involving imita- 
tion level were found for the following 
variables: elements omitted (IMIT x CONF 
F = 3.16), segment time (SPEC xX IMIT 
F = 3.24), conversation rate (SPEC X IMIT 
F = 3.61), tone (IMIT Xx CONF F = 3.03), 
and sighs (IMIT F = 4.85, p < .05). Thus, 
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in order to control for the initial noncompara- 
bility among conditions, for these five vari- 
ables all further analyses (including the 
factor enalysis reported above) were done on 
difference scores, as described in the Method 
section. The results of the analyses of vari- 
ance are summarized below in terms of the 
effects of the study’s independent variables. 
Confederates. Although the factor analysis 
and subsequent analysis of variance on the 
factor scores do a psychometrically better 
job of summarization for this and the other 
independent variables, the results of the 
analyses of variance for confederates may also 
be summarized in this way: Out of 84 F tests 
of main or interaction effects involving con- 
federates for the 21 dependent variables, only 
2 reached significance (IMIT X CONF for 
enthusiasm and IMIT x CONF for tone dif- 
ference, the first showing greater positivity 
for one child and the second for the other 
child), and 3 others showed trends. This 
small number of significant effects combined 
with the absence of confederate effects in the 
analysis of the positivity factor scores indi- 
cated that individual differences between 
the two children in the experiment did not 
appreciably affect the results. 
Specificity of instruction. There was one 
significant specificity of instruction main ef- 
fect (for social characteristics, with subjects 
in the modeling condition perceiving the chil- 
dren as having more positive social attributes 
than did subjects in the coaching condition). 
There was also one significant interaction 
involving specificity (SPEC X IMIT for con- 
versation rate difference) and two trends for 
this same interaction. Here, there was a 
slight hint, confirmed by the analysis of vari- 
ance on the positivity factor scores, that the 
effect of imitation by the child depended on 
how directly the subject had to state his de- 
sire to be imitated. These results definitely 
failed to support the secondary hypothesis 
that nonimitation would have a greater nega- 
tive effect in the coaching than in the model- 
ing condition. The major source of this result 
is that on many of the variables imitate- 
coaching subjects were less positive than they 
had been expected to be. Finally, there were 
also two trends, one for SPEC X CONF and 
one for SPEC X IMIT x CONF. Apparently, 
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other than in interaction with imitat el, 
specificity of instruction had little in m 
subjects’ behavior. 

Imitation level. In contrast to the : 
for the other two independent variables 
21 separate analyses of variance show: 
substantial number of imitation level ma'r 
effects. There were 7 significant IMIT Fs, f 
positive expression rate, negative expressior 
rate, positive minus negative expression rate, 
enthusiasm, face observation rate, distance, 
and social characteristics. All ¢ tests on the 
imitate versus nonimitate comparisons for 
these variables were significant in the ex- 
pected directions. Furthermore, main effects 
tests for 3 other variables approached 
significance, and the corresponding cell com- 
parisons all showed significant ¢s in the hy- 
pothesized direction. As verified by the posi- 
tivity index analysis, the main hypothesis of 
the study, that there would be greater posi- 


tivity in the imitate rather than in the non- _ 


imitate condition, received strong support. 

An attempt was made to determine why 
certain variables showed a significant effect 
for imitation level, while other variables did 
not. One possibility was that variables that 
proved to be good measures of positivity 
would show a significant effect of imitation, 
while those that were poor measures of posi- 
tivity would not show a significant effect. 
The absolute sizes of the factor loadings for 
the positivity factor (right part of Table 1) 
represent a measure of the extent to which a 
given variable measures positivity. The sizes 
of the imitation level main effect F ratios 
represent a measure of the extent to which 
the variable is affected by the independent 
variable of imitation level. If variables that 
are better measures of positivity yield more 
significant effects, then the factor loadings 
and Fs should be positively correlated. The 
Pearson correlation coefficient was .55, sig- 
nificant at the .01 level, thus verifying this 
interpretation. 


Personality Measures 


The two Mehrabian scales of affiliative 
tendency and sensitivity to rejection were 
analyzed in four ways. First, they were cor- 
related with each other across all 48 subjects. 


a 


EFFECTS oF A CHILD’S Imitation VERSUS NoNIMITATION 849 


The r here is 30 (p < .05). Thus, to some 
extent the higher a subject’s score on affili- 
ative tendency, the higher his score tended 
to be on sensitivity to rejection. Second, these 
two measures were correlated, over all sub- 
jects, with each of the dependent measures 
of the experiment, including the combined 
positivity index. With the combined index, r 
for affiliative tendency = —.14 and r for 
sensitivity to rejection = —.12. Out of 42 
correlations with the separate measures, there 
was only one significant one, that of —.36 
between sensitivity to rejection and net re- 
ward. Subjects who described themselves as 
more sensitive to interpersonal rejection 
tended to be less rewarding toward the child 
whom they were attempting to teach word 
associations. Third, in order to see whether 
subjects responded differently in the different 
experimental conditions according to person- 
ality, rs between the combined index and the 
personality measures were computed for 
the imitate-modeling, nonimitate-modeling, 
imitate-coaching, and nonimitate coaching 
conditions separately. Of the 8 correlations, 
none reached the size required for signifi- 
cance. Despite the one somewhat ironic cor- 
relation between sensitivity to rejection and 
net reward, the two personality measures 
were not appreciably related to subjects’ re- 
sponses to the child in this study. As a final 
analysis, to see whether the personality vari- 
ables were randomly distributed among the 
different experimental groups, a three-way 
analysis of variance was performed on each 
of these variables. For affiliative tendency 
there was a trend toward significance for 
specificity of instruction, F(1,40) = 3.02, 
p<.1, where the subjects in the modeling 
condition had slightly but not significantly 
higher affiliative tendency than those in the 
coaching condition, #(46) = 1.99, p < .1. For 
sensitivity to rejection there were no signifi- 
cant effects or trends. Thus, the personality 
measures were unrelated to the experimental 
conditions. 


Discussion 


Although the experimental manipulation of 
imitation level did not have the expected ef- 
fect on every dependent measure, the results 
for the combined positivity index and about 


half of the separate variables did support the 
main hypothesis of the study. College student 
adult males were indeed more positive toward 
a child who imitated them than toward one 
who did not imitate. They not only tended 
to rate an imitative child higher on intel- 
lectual and social characteristics on question- 
naires that they did not expect would be seen 
by the child, they also tended to display 
more nonverbal cues of positiveness and re- 
sponsiveness (Mehrabian, 1972) within the 
actual presence of an imitative child. They 
talked more, showed more positive regard for 
the imitative child in what they said to him, 
remained in closer proximity to him, looked 
at his face more, and showed more positive 
and fewer negative expressions. (They were 
rated as more enthusiastic with the imitative 
child, too, but because of the low reliability 
on this variable, conclusions based on it must 
remain tentative.) 

The secondary hypothesis of the study, 
dealing with the effect of imitation level 
within the context of directness of the adult’s 
demand to imitate, were not supported. As 
shown by the combined index, imitation 
tended to produce more positiveness in both 
the modeling and coaching conditions, but 
there was a greater effect due to imitation in 
the modeling condition, that is, where the 
adult did not make his desire that the child 
imitate known to the child. A number of 
interpretations of this finding are possible, 
and which is the best would have to be dis- 
covered by further research. However, one 
fairly simple interpretation is that the adults 
did not “give the child any credit” for imi- 
tating when they had explicitly asked that he 
do so. As attribution theory (e.g., Beckman, 
1970; Kelley, 1967) would suggest, the adults 
may have seen the child’s imitation in the 
modeling condition as due to some positive 
feelings toward them or positive quality of 
the child; whereas those in the coaching 
condition attributed it merely to their instruc- 
tions. On the other hand, it is also plausible 
that, as learning theorists might suggest, the 
imitation in the modeling condition was more 
reinforcing because obtaining it required less 
effort. 

It would not be very fruitful to discuss all 
of the findings on the individual variables. 
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However, it might be pointed out that for 
most variables not showing the expected ef- 
fects, the means for the imitate versus non- 
imitate cells were in the expected directions 
relative to one another, but the variances 
were too high for statistical significance to be 
obtained. Where the means were not ordered 
as expected, usually it was the case that the 
imitate-coaching condition mean was lower or 
the nonimitate-coaching condition mean was 
higher than predicted. This generalization 
must be made cautiously, though, since the 
data relevant to it are not strikingly clear. 

As a further comment on the findings, the 
issue of generality across children should be 
mentioned. To what extent were the indi- 
vidual characteristics of the child confeder- 
ates important? The general lack of signifi- 
cant analysis of variance interaction effects 
involving the confederates variable indicates 
that child differences did not significantly af- 
fect the subjects’ responses to imitation level 
or specificity of instruction. 

As with the individual differences in the 
children, it appears that the individual per- 
sonality differences of adults on affiliative 
tendency and sensitivity to rejection had no 
clear relationship to the dependent measures. 
Perhaps if there had been larger numbers 
within the different cells of the experiment 

. there might have been significant relation- 
ships shown between these personality vari- 
ables and the adults’ response to the child’s 
imitativeness; or, perhaps other personality 
variables would show such relationships. 
However, further research has to be done to 
answer these questions. 

Although it must be done tentatively, im- 
plications can be drawn from this study for 
more naturally occurring adult-child relation- 
ships. In this study, when a child responded 
imitatively to an adult’s implicit or explicit 
desire to be imitated, the adult was likely to 
react more positively than when the child 
responded nonimitatively. One can consider 
this positiveness toward the child to be 
closely analogous to the nurturance of the 
adult in the imitative learning paradigm men- 
tioned earlier. It can be recalled that one of 
the main findings of studies with such a 
paradigm is that children imitate a nurturant 
model more than a nonnurturant one. The 
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main finding of this study suggests - 


sibility of a reverse side to this pl 
non—when the child imitates, the adı 
comes more nurturant than when the 

does not imitate. While one must be cai 

in generalizing from a study involving y: 
male adults interacting with children 
were strangers to them, the findings 
suggest the good possibility, at least, of on 
way in which a child may affect his own 
development. 

Ultimate causality in such a stream of inter- 
personal behavior in real life is difficult or 
impossible to decide, but if a child fails to 
imitate an adult who has modeled some h 
haviors the adult expects to be copied, the 
adult is likely to be less positive, less 
nurturant with that child. This, in turn, is 
likely to reduce the child’s imitativeness on 
subsequent occasions. Of course, the reverse 
may also occur—an imitative child produces 
more positivity in the adult, which then 
would lead to greater likelihood that the child 
will imitate. In practical terms, if the gen 
eralization above does indeed apply to natu 
rally occurring relationships, the findings of 
the study have potential implications for 
improving adult-child relationships. First, 
while there probably needs to be some level 
of responsiveness by an adult to a child’s 
behavior, it may be beneficial to develop ef- 
fective ways to inform parents and teachers 
in advance of some of their possible negative 
responses to child behaviors so that they can 
prevent the formation of increasingly nega- 
tive relationships. While the adults probably 
should not dissemble, some sort of formalized 
training or even simple foreknowledge ought 
to lead to some genuine attenuation of nega- 
tive reactions or a more beneficial way of 
expressing these reactions to the child. 
Second, and even more important, when 
faced with an already ineffective, unsatisfy- 
ing relationship between parent or teacher 
and child, a clinician might use his awareness 
of the importance of child effects in choosing 
a locus for intervention. Part of the interven- 
tion might involve, for example, an attempt 
to increase directly the child’s imitative or 
other forms of positive responsiveness rather 
than simply an attempt to change the adult’s 
techniques of influencing the child. 
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Helping and Self-Attributions;: 
A Field Experiment 


Seymour W. Uranowitz 
University of California, Berkeley 


A field experiment was conducted to test the derivations of Bem’s self-percep- 
tion theory as described by Lepper, who investigated the behavioral effects of 
self-attributions formed as a result of previous behavior. Women in a shopping 
center were asked by an experimenter to watch his packages while he re- 
trieved either his wallet (high justification) or a dollar (low justification). It 
was hypothesized that low justification subjects would make self-attributions 
of “helpfulness,” while high justification subjects would attribute their be- 
havior to external factors and would thereby make no such self-attribution. 
To test this hypothesis, subjects were given the opportunity to help a second 
experimenter, who crossed the subjects’ path and dropped a package, appar- 
ently unnoticed. The prediction was that low justification subjects would show 
more subsequent helping behavior than high justification subjects or control 
subjects, who had no initial contact. This prediction was confirmed, thereby 


lending additional support to the self-perception analysis, 


Lepper (1973) has recently attempted to 
broaden the scope of Bem’s (1967, 1972) 
self-perception theory to include behavioral 
consequences of self-attributions, a phenome- 
non largely unexplored except for an in- 
triguing finding by Freedman and Fraser 
(1966). The authors demonstrated what they 
called the “foot-in-the-door” phenomenon, by 
which they meant that a person’s initial com- 
pliance with a small, inconsequential request 
would increase the likelihood that he would 
comply with a later, more substantial request, 
possibly because he identifies himself as a 
“doer.” Lepper (1973) attempted to system- 
atically explore behavioral consequences of 
self-attributions. He found that children who 
initially resisted the temptation to play with 
a toy under mild threat showed more resist- 
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ance to cheat in a game to obtain prizes 3 
weeks later, presumably due to the self- 
attribution of honesty, 

The purpose of the present study was to 
bring Lepper’s paradigm back into the field, 
where Freedman and Fraser had first ob- 
tained the foot-in-the-door result confirmed in 
Lepper’s study. If the precision of a labora- 
tory setting could be maintained in the field, 
using an important social behavior of wide- 
spread interest, significant results would have 
clear implications both for the problem of the 
effects of self-attributions upon subsequent 
behavior and in interpreting relevant areas of 
the behavior itself. The behavior thus chosen 
for study was helpful behavior, and the pre- 
sumed mediating self-attribution would be 
helpfulness or altruism. The hypothesis is 
that subjects who comply with an innocuous 
request under conditions of minimal justifica- 
tion show more helpful behavior in a subse- 
quent encounter than either subjects in the 
high justification or control conditions, the 
latter involving no initial contact. The basic 
rationale is that under low justification, sub- 
jects are more likely to make self-attributions 
of “helpfulness,” in accordance with the 
insufficient justification studies by Lepper 
(1973) and (through inference) Freedman 
and Fraser (1966). 
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HELPING AND SELF-ATTRIBUTIONS 


METHOD 
Subjects 


The subjects were 60 women aged from about 
mid-20s to mid-60s who were approached in a 
shopping center in Concord, California. There were 
20 subjects in each justification condition and 20 
control subjects, who received only the dependent 
measure, 


Procedure 


Justification variation. Women walking alone out 
of a shopping center department store who did not 
appear to be in a great hurry were approached by 
the first experimenter, a college-aged male, who 
was carrying five bulky grocery bags in a very 
awkward manner. He asked the subject if she would 
watch his bags for a moment while he went back 
into the store from which they had both just 
emerged to retrieve something he claimed to have 
lost while shopping in the store. In the high justifi- 
cation condition, he appeared to be in a highly 
a ted state with a worried look on his face and 
hurriedly explained that he had lost his wallet, which 
contained “a lot of money.” In the low justification 
condition, he appeared calm and seemed to be in 
no great hurry as he explained that he had dropped 
a dollar bill in the store. All 40 experimental sub- 
jects (as well as 4 who were lost because of in- 
advertent path changes between experimenters) 
complied with this request. Upon the subject’s 
compliance with the request, the experimenter placed 
his bags on the ground near the subject’s feet and 
ran back into the store, taking the same route each 
time to a far corner of the store. This procedure 
standardized the amount of time each subject waited 
alone with the experimenter’s packages, a period of 
approximately 1 minute. In both experimental condi- 
tions, the experimenter returned with a smile on his 
face and reported that he had found what he 
had lost, 

Dependent measure. The dependent measure was 
adapted from a field experiment by Regan, Williams, 
and Sparling (1972), in which the dependent vari- 
able was whether subjects informed a confederate 
that candy was dropping unnoticed from a torn bag. 
In the present study, after thanking the subject, the 
first experimenter spent several minutes retrieving 
the grocery bags from the ground and arranging 
them in his arms in order to enable him to continue 
without dropping them. The subject continued on 
her way as he was engaged in this activity, and in 
fact the first experimenter did not leave the scene 
of the encounter until the subject could no longer 
see him. This was done in order to prevent any 
confounding of the subject’s subsequent behavior due 
to the notion that she was being observed by an 
individual whom she had just helped. 

A few stores away, at a distance of about 50 feet 
from the initial encounter and in the direction the 
subject was walking, the second experimenter, a 
college-aged female, waited in an inconspicuous 
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manner, ostensibly looking in a store window. A 
fairly heavily traveled area was used for the en- 
counter between the second experimenter and the 
subject, as it has been shown that the likelihood of 
a victim receiving help from a bystander is inversely 
related to the number of onlookers (cf. Darley & 
Latané, 1968; Korte, 1969). 

The second experimenter observed the encounter 
between the first experimenter and the subject, but 
remained blind at all times to the conditions of all 
experimental subjects. She held a large grocery bag 
in her arms, as well as a small flat bag, measuring 
about 4X 6 inches (10.16 X 15.24 cm), which was 
wedged between her body and the large bag in her 
arms. As the subject approached, she left her vantage 
point and crossed the subject’s path at a point where 
it was assured that she would be seen by the subject 
(about 8-10 feet or 2.44-3.05 m). She then released 
pressure on the large bag imperceptibly, allowing the 
small bag which was wedged next to her body to 
drop, apparently unnoticed, and continued on her 
way. The measure of helpfulness was whether the 
subject stopped the second experimenter to return 
the dropped package or to inform her that she 
had dropped it. 


RESULTS 
Justification Manipulation 


The observed effects of justification are 
clear-cut. It can be seen in Table 1 that a 
much greater number of low justification sub- 
jects helped the second experimenter than did 
subjects in either the high justification or 
control conditions. The differences are signifi- 
cant by the chi-square test (Siegel, 1956), 
indicating the significant effect the minimum 
justification manipulation had on subsequent 
behavior, x°(2) = 9.2, p < .02. Additional 
chi-square analyses demonstrated that the 
number of helping subjects in the low justifi- 
cation condition differed significantly from 
those in both the high justification condition, 
x°(1) = 3.84, p < .05, using Yates’ correc- 
tion for continuity, and the control condition, 
x°(1) = 6.55, p < .02, using the correction 


TABLE 1 


Number or Supyects WHO HELPED THE 
SECOND EXPERIMENTER 


Justification 
Variable 
Low High Control 
Helped 16 9 7 
Did not help 4 11 13 
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for continuity. No significant difference was 
found between high justification and control 
subjects, x?(1) = .104, ns. 


Discussion 


The major hypothesis that compliance with 
an initial request under minimal justification 
conditions produces greater compliance with 
a later more substantial task was strongly 
supported by the present results, thus pro- 
viding further experimental evidence of a 
“foot-in-the-door” effect. The disparate para- 
digm and setting used here which yielded 
results consistent with those of other studies 
in this area attest further to the apparent 
robustness of the effect and also provide a 
new direction for field research in helping 
behavior. The results obtained here, when 
combined with the others, provide firm 
ground from which new self-perception and 
“foot-in-the-door” effects can be explored not 
only in the laboratory, but also in the field, 
as was done here. What is now clearly needed 
is self-report data which yields evidence of 
the hypothesized changes in self-attributions. 
Although previous attempts have not been 
fruitful (e.g., Lepper, 1973), further explora- 
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tion is certainly called for in light of th 
nificant results obtained using this parac: 
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` Evaluative Reactions of Adolescents Toward Speakers of Standard 
English and Mexican American Accented English 


Ellen Bouchard Ryan and Miguel A. Carranza 
University of Notre Dame 


Sixty-three Mexican American, black, and Anglo adolescent female subjects 
rated the personalities of male speakers of standard English and Mexican 
American accented English. In order to demonstrate that the functional separa- 
tion of speech styles would be reflected in these evaluative reactions, two speech 
contexts (home and school) and two sets of rating scales (status stressing and 
solidarity stressing) were employed. Although standard English speakers re- 
ceived more favorable ratings in every case, the differences were significantly 
greater in the school context than in the home context and for status ratings 
than for solidarity ratings. Contrary to expectations, Mexican Americans did 
not prefer accented English in the home context or on solidarity scales. 


Accented English speech has traditionally 
been the trademark of language minority 
groups as they adapt to the majority culture 
and language. In particular, Ortego (1969- 
1970) stated that the vast majority of bi- 
lingual, and even monolingual, Mexican 
Americans in the Southwest speak English 
with an unmistakable influence from Spanish 
phonology (see Metcalf, 1972; Ornstein, 
1971). According to Ortego and Carter 
(1970), an accent can reduce chances for 
educational and occupational success if it 
serves to evoke a prejudicial attitude in the 
listener toward the speaker. 

There are many references in the literature 
to the diligent attempts of some Mexican 
Americans to eliminate any trace of accent 
from their own speech and that of their chil- 
dren in the hope of obtaining a greater num- 
ber of opportunities. For example, Barker 
(1947) noted in his study that many Mexican 
American bilinguals in Tucson, Arizona, mani- 
fested a feeling of inferiority due to their 
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accented English. The association of inferior 
status with this accent was so strong that 
some parents spoke only English to their 
children so that they would not grow up with 
an accent and thus have fewer job possibili- 
ties. According to Krear (1969), when parents 
with a poor command of English use only 
that language with a child, the child’s person- 
ality and speech suffer. In an investigation of 
the English spoken by Spanish-speaking in- 
formants in Texas, Sawyer (1964) observed 
two types of evidence of the social stresses 
felt by speakers who saw no advantage in 
being identified as members of the Spanish- 
speaking minority. First, Spanish words 
which could be avoided were not employed, 
although they were regional terms commonly 
used in the English-speaking community. Sec- 
ond, unavoidable Spanish words were pro- 
nounced differently for English-speaking lis- 
teners than for bilingual listeners. 

In contrast to the drawbacks of accented 
speech, some positive functions served by an 
accent can also be noted. The encouragement 
of ethnic pride which is part of the Chican- 
ismo movement may lead to increased ac- 
ceptance of accented speech as a badge of 
ethnic loyalty. Furthermore, maintaining an 
accent may be associated with maintaining 
one’s identity and separateness from the domi- 
nant cultural group. Taking on a new lan- 
guage and an associated personal identity 
involves the fear of failing to succeed and 
the threat of losing one’s original identity 
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(Lambert, 1967; Huntsberry, Note 1). In 
the light of these negative and positive social 
and psychological attributes of accepted 
speech, the present study explored the reac- 
tions of adolescents toward speakers of stan- 
dard and accented English. 

Evaluative reactions toward speakers of 
several coexistent language varieties have 
frequently been investigated with adaptations 
of the matched guise technique developed by 
Lambert, Hodgson, Gardner, and Fillenbaum 
(1960). In studies employing different varie- 
ties of the same language (d’Anglejan & 
Tucker, 1973; Anisfeld, Bogo, & Lambert, 
1962; Giles, 1970; Shuy, 1969; Tucker & 
Lambert, 1969), listeners tend to rate the 
speakers of standard or upper-class speech 
styles more favorably than speakers of lower- 
class, regional, or ethnic varieties. For exam- 
ple, Williams, Whitehead, and Miller (Note 
2) found that teachers assigned more positive 
ratings to the speech of Anglo children than 
to the speech of black or Mexican American 
children, Furthermore, within each ethnic 
group, the speech of middle-class children 
was favored over that of lower-class children. 
Although the extent to which the reactions 
depend on speaker and listener characteristics 
has received much attention, previous research 
has not considered the importance of speech 
context and the role of the status and solidar- 
ity dimensions of social interaction discussed 
by Brown (1965) and Fishman (1971). 

Agheyisi and Fishman (1970) have criti- 

cized the typical studies of evaluative reac- 
tions for not considering the importance of 
the context of speech. The studies based on 
reading samples from a group of speakers 
have typically used formal, or technical, pas- 
sages, whereas those based on spontaneous 
speech usually involve informal topics. Since 
the language varieties of a speech community 
often serve different functions, reactions elic- 
ited in one context may not generalize to 
other contexts. 

In situations where two stable language 
varieties exist, Fishman (1971) indicated that 
one speech variety (“high language”) may be 
associated with status, high culture, and as- 
piration toward social mobility, while a sec- 
ond variety (“low language”) may corre- 
spond to solidarity, comradeship, and inti- 
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macy. The speaker usually reflects this value | 


clustering of his speech community by choos- 
ing the appropriate language variety for spe- 
cific situations. Most frequently, the values 
of solidarity (associated with the low lan- 
guage) are enacted in the domains of home 
and neighborhood, while those of status (as- 
sociated with the high language) are enacted 
in the domains of school and work. 

It is reasonable to expect that the various 
speech styles used in the Mexican American 
community served different functions. Clearly, 
Spanish is more associated with the home and 
family, whereas standard English is stressed 
most in school and work situations. In re- 
sponse to English-speaking-only school poli- 
cies and other pressures to learn English, 
dominantly Spanish-speaking Mexican Ameri- 
can families tend to shift toward greater use 
of English in the home and friendship con 
texts (Grebler, Moore, & Guzman, 1970 
Krear, 1969). As this shift occurs, the | 
lish heard in Mexican American homes and 
neighborhoods is frequently accented. 7 
English spoken with a Mexican American 
cent, in contrast to standard English, may 
also be associated with the home context, 
group identification, and solidarity values. 

In addition to speech context, then, the 
different values associated with each speech 
style must be considered. Several researchers 
(Cheyne, 1970; Giles, 1971, 1972; Lambert, 
1967) have attempted to distinguish rating 
dimensions in evaluative reaction studies. The 
general finding is that ratings related to a 
dimension of status, prestige, or competence 
tend to reflect the relative socioeconomic 
status of the language groups represented and 
are not affected greatly by language loyaliy. 
On the other hand, ratings along dimensions 
such as personal integrity and social attrac- 
tiveness tend to be less clear-cut and quite 
susceptible to language loyalty. For example, 
in the Cheyne study, both Scottish and En 
lish groups rated English speakers as more 
competent than Scottish speakers, but t} 
Scots judged their own group higher on scale 
of integrity and friendliness, 

Tn the present study an attempt is made to 
distinguish between the status and solidarity 
values which Fishman (1971) claimed should 
be associated with different language varie 
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ties. Brown (1965) discussed status and soli- 
darity as two important dimensions of social 
interaction affecting language use. According 
to Brown, status differences are marked by 
demonstrable influence, power, and control. 
For all Americans it seems reasonable to hy- 
pothesize higher ratings on a status scale 
for speakers demonstrating greater proficiency 
in standard English. On the other hand, 
Brown characterized solidarity as being at- 
tributed to a person who is similar to the 
rater, Solidarity is marked by frequent inter- 
action, self-disclosure, and intimacy. Since 
accented speech allows for the identification 
of ethnic group, Mexican Americans, in con- 
trast to Anglos, might be expected to rate 
speakers of Mexican American accented Eng- 
lish higher on solidarity scales than standard 
English speakers. 

On the basis of Fishman’s (1971) work 
with contextual domains, the status values 
would be expected to be highlighted in school 
situations, while solidarity values would be 
emphasized in home situations. Fertig and 
Fishman (1969) did find that bilingual Puerto 
Ricans rated home words (both English and 
Spanish) more positively on a solidarity- 
stressing dimension and school words more 
positively on a status-stressing dimension. 

Incorporating two variables related to func- 
tional separation, Carranza and Ryan (1975) 
investigated evaluative reactions of bilingual 
Anglo and Mexican American adolescents 
toward male speakers of standard English and 
standard Spanish. Two contexts, home and 
school, were selected as those which are func- 
tionally most separate for the bilingual 
Mexican American adolescent; these contexts 
correspond to the two main potentially con- 
flicting value clusters for the Mexican Ameri- 
can (Ulibarri, 1968). Spanish, the language 
of the home domain, is taken to be the low 
language reflecting solidarity values, while 
English, the language of the school, is taken 
to be the high language reflecting status 
values. In this study, a passage concerning a 
home situation and a passage concerning a 
school situation were employed. Each of 16 
speakers, who read one passage in English or 
in Spanish, was rated on 15 bipolar adjective 
scales, 4 of which were designated a priori as 
status scales and 4 as solidarity scales. 
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Factor analysis supported the claim that 
the status and solidarity scales represented 
different rating dimensions. Both Anglo and 
Mexican American students rated English 
speakers more favorably than Spanish speak- 
ers for the school situation; for the home 
situation, however, Spanish speakers were 
preferred slightly. Although English was pre- 
ferred on both status and solidarity scales, 
the difference between the languages for soli- 
darity scales was significantly smaller than 
for status scales. Unexpectedly, Mexican 
Americans did not rate Spanish speakers 
higher than English on solidarity scales, nor 
were any group differences observed. Confirm- 
ing the importance of manipulating variables 
associated with functional separation, these 
results indicate that speakers of English are 
preferred over speakers of Spanish only in 
the school domain and much more strongly 
with respect to status scales than solidarity 
scales. 

In the present investigation, the method- 
ology developed in Carranza and Ryan (1975) 
has been used to study the evaluative reac- 
tions of Anglo, black, and Mexican American 
female high school students toward male 
speakers of standard English and Mexican 
American accented English. The ratings of 
black subjects are of interest because they 
frequently interact with the other two groups 
in urban settings. Like Mexican Americans, 
black adolescents are members of a minority 
group with associated nonstandard speech 
varieties, but, like Anglos, they are outside 
the group associated with Mexican American 
accented speech. 

It was predicted that standard English 
speakers would be preferred by Mexican 
American subjects in the school domain with 
accented English speakers preferred in the 
home domain, whereas standard English 
would be preferred in both domains by Anglo 
subjects. In addition, it was expected that 
the Mexican American ratings of accented 
English, relative to standard English, would 
be more favorable for solidarity scales than 
for status scales, whereas Anglo ratings 
would be similar for both types of scales. 
Like the Anglo raters, the Black subjects were 
not expected to differentiate between contexts 
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and scale types, although they were expected 
to be more tolerant of accented speakers. 


METHOD 
Subjects 


Three groups of volunteer students in an all-female 
Catholic high school in Chicago participated in this 
study. The groups were as follows: 21 bilingual 
Mexican Americans who had learned Spanish at 
home, 21 Anglos, and 21 blacks. None of the Anglos 
or blacks had Spanish as their mother tongue, al- 
though all were enrolled in Spanish classes. 

All three groups included students ranging in 
grade from freshman to senior, with an average 
grade of beginning sophomore and with an average 
age of approximately 154 years. The parents of 
Anglo and Mexican American subjects had attained 
a median level of education equivalent to high school 
graduation, whereas the parents of the black sub- 
jects had attained a median level corresponding to 
an eighth-grade education. In order to provide in- 
formation concerning socioeconomic status, a 7-point 
scale of fathers’ occupations was devised similar to 

those used by others (cf. Hatt & North, 1947; U.S. 
Bureau of the Census, 1960; Warner, Meeker, & 
Eells, 1949). Whereas the median occupation of 
Anglo fathers was middle-middle class, the median 
level of black and Mexican American fathers was 
lower-middle class. Thus, Anglo subjects were of 
slightly higher status than the Mexican American 
subjects on the basis of father’s occupation and 
slightly higher than the Black subjects on both 
father’s occupation and parental education. 


Materials 


Two narrative paragraphs, slightly revised versions 
of those employed in Carranza and Ryan (1975), 
were employed; each contained approximately 140 
words and specific opportunities for accented pro- 
nunciations. The first, with person, place, and topic 
selected from the home domain, was a discussion 
between mother and son in the kitchen at break- 
fast time; while the other, with person, place, and 
topic from the school domain, concerned a teacher 
giving a history lesson to her class. 

Taped readings of these two paragraphs were 
collected from two samples of high school males, 
17 native speakers of English and 17 Mexican 
American speakers of accented English. After fa- 
miliarizing themselves with the text, the students 
read the paragraphs aloud several times. The most 
fluent readings for each speaker were considered for 
selection as stimuli. In several cases where the 
subject repeated part of a sentence, reading errors 
were edited out. From the tapes of the first group, 
the readings of 3 students were selected for the 
“home” paragraph (standard English-home pas- 
sages) and another three readers for the “school” 
paragraph (standard English-school passages). Like- 
wise, from the Mexican American group, the read- 
ings of 3 students were chosen for the home para- 
graph (accented English-home passages) and an- 


other 3 readers for the school paragraph 1 
English-school passages). The readings ave 
minutes in length and were taped with a Wo! 

5730 tape recorder. 

The two stimulus tapes contained 2 practice sp 
ers followed by the 12 test speakers. Each 1 
contained a different randomized block ordering 
the test speakers and was used for two testi 
sessions. 

The rating sheets contained 15 bipolar adjectiy 
scales which can be used to describe persons; the 
scales were identical to those employed by Carranza 
and Ryan (1975). Four adjectives were selected to 
represent each of two dimensions: status stressing 
(educated—uneducated, intelligent-ignorant, success- 
ful-unsuccessful, and wealthy-poor) and solidarity 
stressing (friendly—unfriendly, kind-cruel, trust- 
worthy-untrustworthy, and good-bad). Since all of 
these, except wealthy-poor, have been considered 
highly evaluative, the 7 additional adjectives were 
selected to represent the activity and potency fac- 
tors also found to be important in semantic differ- 
ential ratings (Osgood & Suci, 1955). 

Each adjective pair was placed along a 7-point 
scale. On each of two rating sheets, with different 
random orderings of the adjectives, half of the 
favorable adjectives were on the left and half on 
the right. The test booklets, which contained 14 
sheets (for rating 2 practice speakers and 12 test 
speakers), included 7 of each rating sheet mixed 
randomly. For the analysis, the responses were 
scored with integers from 1 to 7, with the value 
of 1 assigned to the most negative judgment and 
the value of 7 assigned to the most Positive judg- 
ment. 


Procedure 


The subjects were tested in four sessions in high 
school classrooms with about 20 subjects per ses- 
sion. There were students from all three ethnic 
groups in each testing session. The test packet, given 
to each subject, included written instructions, 14 
rating sheets, a simple bilingual task sheet (for 
another study), and a language background ques- 
tionnaire. The experimenter read over the written 
instructions with the subjects and answered ques- 
tions which arose before the task began. The sub- 
jects were told that this was a study of first im- 
pressions of personalities based on limited informa- 
tion. In particular, they were requested to rate each 
taped speaker on the basis of voice cues only, just 
as they might judge the personality of an individual 
speaking on the telephone or the radio, The first 
two speakers rated were practice speakers to acquaint 
the subjects with the procedure. During the task, 
after each speaker read a passage, the tape recorder 
was stopped until all subjects had rated the speaker 
on the 15 adjective scales. After evaluating the 
speakers, subjects filled out a one-page rating sheet 
for several Spanish and English concepts (a task 
not related to this experiment) and a language back- 
ground questionnaire. The sessions were approxi- 
mately 1 hour in length. 
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RESULTS 


The basic data consisted of the total rat- 
ings of each subject on each adjective for the 
four sets of speakers (standard English- 
home, accented English-home, standard Eng- 
lish-school, and accented English—school). 
One subject was eliminated due to incom- 
plete data based on the criterion of omission 
of more than 8 ratings from the entire set of 
12 speaker sheets or more than 3 from any 
given page. Where other omissions occurred 
in the ratings, a neutral score of 4.0 was as- 
signed. 

Data analysis included three steps: (a) 
analyses of variance comparing total ratings 
on the four a priori status scales with the 
four a priori solidarity scales, (b) factor 
analyses of the ratings for all of the adjec- 
tives, and (c) analyses of variance with the 
factor scores. 

Using the total ratings of the four adjec- 
tives designated as status and the four desig- 
nated as solidarity scales, a mixed-design 
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Ficure 1. Mean ratings of standard English and 
accented English speakers in home and school con- 
texts. 
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Ficure 2. Mean group ratings of standard English 
and accented English speakers on status and soli- 
darity scales. 


analysis of variance with the following four 
factors was employed: group (Anglo, black, 
and Mexican American), scale type (status 
stressing and solidarity stressing), context 
(home and school), and language (standard 
and accented). The major prediction con- 
cerned an interaction among group, context, 
and language factors. Although this interac- 
tion was not significant, the Context X Lan- 
guage interaction was significant, F (1, 60) 
= 29.4, p < .001. The mean ratings in Fig- 
ure 1 indicate that whereas speakers of ac- 
cented English received lower ratings in both 
contexts, the difference was much larger in the 
school context than in the home context. 
Figure 2 provides the means for the only 
significant effect involving group differences, 
the Group X Scale Type X Language inter- 
action, F (2, 60) = 8.45, p < .001. The Dun- 
can multiple-range test (Edwards, 1968) was 
used to determine how the 12 means involved 
in the interaction differed from each other. 
For both scale types, all group ratings of 
accented English were significantly lower than 
those for standard English speakers, but only 
status ratings were unfavorable. Anglo sub- 
jects rated accented speakers significantly 
lower on status scales than either black or 
Mexican American subjects. For standard 
English, the status ratings of Mexican Ameri- 
can subjects were lower than those of Anglos, 
which in turn were lower than those of 
blacks. Thus, a smaller distinction between 
the status characteristics of the two types of 
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speakers was made by the Mexican Americans 
than by the other groups. On the solidarity 
scales, the black raters assigned significantly 
higher scores to accented speakers than did 
the Mexican Americans, while neither group 
differed from the Anglo students. Black stu- 
dents also rated standard English speakers 
more favorably on solidarity scales than did 
the other two groups. Considering the data 
from another point of view, status ratings of 
accented speakers were lowest for all groups, 
followed by solidarity ratings of those speak- 
ers. Thus, it is evident that the solidarity 
ratings of the two speech types were less di- 
vergent than the status ratings. None of the 
solidarity means were below the neutral score 
4.0, and they were closer together than status 
means both across groups and between lan- 
guage varieties. Related to this interaction, 
there was a significant interaction between 
scale type and language, F (1,60) = 62.4, 
p < .001, as well as main effects for language, 
F (1, 60) = 118.3, p < .001, and for scale 
type, F (1,60) = 42.5, p< .001. Ratings 
were higher for standard English than for 
accented English, but the difference was 
larger for status scales than for solidarity 
scales, Furthermore, the ratings over all pas- 
sages were higher for solidarity scales than 
for status scales. Also, there was a significant 
Scale Type x Domain interaction, F (1, 60) 


TABLE 1 


Factor Loaprncs FROM THE ROTATED FACTOR 
MATRIX OF SEMANTIC DIFFERENTIAL RATINGS 


Factor 2 Er 3 


Adjective Factor 1 
Educated” -173 -325 | .191 
Wealthy’ 654 209 | .202 
Successful 736 414 | 149 
Tntelligent® 664 -546 | .098 
Trustworthy? 357 749 | 168 
Friendly 199 779} 211 
Good» 528 636 | .129 
Kind’ .161 -776 | .081 
Strong .726 142] 321 
Energetic 434 307 | .576 
Athletic .266 272 | .607 
Active 426 256 | 494 
Aggressive 337 128 | .526 
Brave -587 170 | .328 
Tndividualistic 001 | —.023 | .397 
Cumulative proportion of 
variance 467 541] 586 


a Scales designated as status stressing. 
» Scales designated as solidarity stressing. 
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= 7.3, $ < .01, which indicated that 
ference between overall ratings for hoi 1 
for school contexts was smaller for soli y 
scales than for status scales. 

In order to utilize the data for all 15 
jectives and to confirm the a priori dist 
tion between status and solidarity scales, í 
tor analysis of the data was employed, 

Factor analysis was applied to the inte 
correlations among the 15 adjectives; the cor 
relations were computed using the total rat 
ings given by the subjects for each set oi 
three speakers on each scale. The resultin 
15 X 15 correlation matrix was subjected t 
a principal-factor analysis using an eigen- 
value equal to or greater than .6 as criterion 
for extraction. The extracted factors were 
then rotated according to the varimax prin- 
ciple. 

Table 1 provides the factor loadings of the 
adjective scales on the three extracted factors, 
The cumulative proportion of variance ac- 
counted for by these factors was slightly less 
than 60%. The first two factors provide for 
a separation between the designated status 
adjectives and the solidarity adjectives. The 
activity/potency scales loaded high on the 
third factor, except for strong and brave, 
which were more highly related to the status 
scales and individualistic, which was not re- 
lated to any of the three factors, 

With factor scores as the dependent varia- 
ble, another similar four-factor analysis of 
variance (with three factors rather than two 
scale types) was performed. In Table 2 are 
presented the mean factor scores necessary to 
interpret the significant Factor x Domain x 
Language interaction, F (2, 120) = 4.2, p< 
-05. Main effects indicated that the overall 
ratings were much lower for accented speech 
than for standard English, F (1, 60) = 82.7, 
p < .001, and that ratings for the school pas- 
sage were less favorable than for the home 
passage, F (1,60) = 7.01, p < .05. A signifi- 
cant Context x Language interaction revealed 
that the difference between ratings of the two 
speech styles was greater in the school than 
in the home context, F (1,60) = 16.4, p< 
-001. The Duncan multiple-range test was em- 
ployed to clarify the meaning of a significant 
Scale Type x Language interaction, F (2, 
120) = 24.6, p<.001, and the three-way 
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TABLE 2 


MEAN FACTOR SCORES ON THREE FACTORS FOR STANDARD AND ACCENTED ENGLISH IN HOME 
AND SCHOOL CONTEXTS 


Home School 
Dimension 
Standard Accented Standard Accented 
English English English English 
Factor 1; status 5184 —.3014 -6041 —.8210 
Factor 2: solidarity 0232 —.1812 3407 —.1827 
Factor 3: activity/potency -2000 -0070 -0033 —.2098 


Scale Type X Context X Language interac- 
tion. Although the ratings for all scale types 
were significantly lower for the accented Eng- 
lish than for the standard English speakers, 
the ratings of accented speakers on the first 
factor (status) were significantly lower than 
on the other two factors, and ratings of stan- 
dard English speakers on the status factor 
were significantly higher than ratings of stan- 
dard English on the other factors. The results 
of the Duncan test for the three-way interac- 
tion can best be described factor by factor. 
For the first factor (status), ratings of ac- 
cented speakers in the school context were 
lower than in the home context, and in both 
contexts, ratings of accented speakers were 
lower than those of standard English speakers, 
which were equivalent across contexts. For 
the second factor (solidarity), ratings of 
standard English passages in the school con- 
text were significantly higher than accented 
passages in both contexts. No significant dif- 
ferences among the four passages were found 
for the third factor (activity/potency). Sig- 
nificant interactions also occurred between 
factor and context, F (2,120) = 5.8, p < .01, 
and among group, factor, and context, F (4, 
120) = 4.2, p < 05. 


Discussion 


The results of this study confirmed the 
prediction that the functional separation of 
language varieties would be reflected in eval- 
uative reactions toward speakers of standard 
and accented English. Since all the groups 
rated the accented speakers higher in the 
home context than in the school context, it is 
evident that even students who do not asso- 
ciate the accented speech with their own 


group are more tolerant of deviations from 
standard speech in a home context. This 
effect of speech context demands a reinter- 
pretation of the nature of evaluative reac- 
tions. Lambert (1967) and Robinson (1972) 
have suggested that such reactions to speak- 
ers involve a two-stage process in which a 
speaker’s group is identified on the basis of 
speech characteristics and then the associated 
stereotypes applied to him. The present study 
indicates, however, that the rater takes into 
account the appropriateness of the speaker’s 
language behavior as well as his group iden- 
tity. 

The expectation that Mexican Americans 
might prefer Mexican American accented 
speakers in the home context and with re- 
spect to solidarity scales, or that this group 
would at least be more favorable than other 
groups, was not confirmed. It may be that 
Spanish, rather than accented English, is the 
language preferred in the home and used more 
commonly as a symbol of ethnic identity. It 
should be noted, however, that although 
Carranza and Ryan (1975) found Spanish 
to be the preferred home language, they did 
not observe an overall preference of Mexican 
Americans for Spanish on the solidarity scales, 
These unexpected findings concerning speech 
preference of Mexican American adolescents 
clearly highlight the need for more research 
in this area. 

In order to clarify the restrictions on gen- 
eralizability of results and to suggest im- 
provements or alternatives for further re- 
search, it is relevant to consider the limita- 
tions of this study. First of all, since only 
three speakers for each of the four passages 
were rated, it is possible that some true per- 
sonality differences among them affected the 
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ratings as well as the speech styles. Clearly, 
replications with other speakers must be per- 
formed. Second, in this study, middle-class 
female judges from a Catholic high school 
rated male speakers. Since previous research 
(reviewed by Lambert, 1967) has established 
that evaluative reactions are affected by sex 
and social class as well as age and regional 
background of both listener and speaker, the 
relationship of these various social categories 
to the ratings should be explored through 
future systematic variation of such character- 
istics. 

Although the importance of the speech con- 
text was established here, manipulation of 
that variable was very limited. In addition to 
varying the topic, one could also vary place 
of testing. The ratings on status and solidar- 
ity scales did differ somewhat; however, 
further study is needed to ensure the selec- 
tion of the best adjectives to represent the 
two sets of values. Whereas the distinction 
between speech styles was limited to reading 
pronunciations of English, eventually it would 
be desirable to employ the whole range of 
English and Spanish speech styles available to 
Mexican Americans. 

Finally, the lack of expected group differ- 
ences could be attributable to individual dif- 
ferences within ethnic groups. In addition to 
the social categories already mentioned, a 
number of personality and background char- 
acteristics (e.g., linguistic abilities, language 
history, degree of ethnic loyalty, ethnocen- 
trism, and prejudice) may be relevant. 
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Reciprocity and Concession Making in Bargaining 


James K. Esser and S. S. Komorita 
University of Indiana 


In a dyadic bargaining situation, it was hypothesized that bargainers expect 
concessions to be reciprocated, and violations of this expectation result in 
fewer and smaller concessions. Subjects were led to believe that they were 
bargaining with another subject but actually bargained with a preprogrammed 
strategy used by the experimenter. Frequency of reciprocated concessions was 
systematically varied, and subjects were also classified according to their moti- 
vational orientation (competitive vs. cooperative). Using offers on the last 
trial as the main dependent variable, the following results were obtained: 
(a) Mean concessions were smaller in the incomplete reciprocity conditions, 
(b) subjects classified as cooperative conceded more than subjects classified 
as competitive, and (c) the effect of violation of reciprocity was greater for 
competitive subjects. It was suggested that a consistent pattern of recipro- 


cated concessions may be the most effective way of inducing concessions. 


A critical problem in bargaining is the 
effects of concession making. Although con- 
cession making may be desirable because it 
maximizes the probability of reaching agree- 
ment, it may also be undesirable because it 
can lead to an unfair agreement for the party 
who concedes most. Thus, there are two op- 
posing motivational forces operating in a 
bargaining situation—the motive to reach 
agreement and the motive to achieve an ad- 
vantageous bargain—and the participants 
must somehow resolve this conflict. 

With regard to theory and research on the 
effects of corcession making, Siegel and 
Fouraker’s (1960) level of aspiration theory 
assumes that concessions by one bargainer 
will raise the aspiration level of the other 
bargainer, thus evoking a tough bargaining 
stance. The implication of their theory is that 
one should make small concessions, if any, 
and that “it pays to be tough.” In addition 
to the data reported by Siegel and Fouraker, 
there seems to be considerable evidence to 
support their position (cf. Bartos, 1966; 
Komorita & Brenner, 1968). Moreover, there 
is also evidence to suggest that a moderately 
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tough strategy may be more effective i 
ducing concessions than an extremely í 
strategy (Cann, Esser, & Komorita, Note 1). 
In opposition to the Siegel-Fourak i 

pothesis is the reciprocity hypothesis. 

ner (1960) has postulated the existenc: 
universal norm of reciprocity, and the appli 
cation of this norm to the bargaining situa- 
tion yields a prediction opposite to that of 
Siegel and Fouraker (1960). According to the 
reciprocity hypothesis, a bargainer expect 
concessions to be reciprocated. If an indi- 
vidual’s expectations are disconfirmed by a 
tough bargainer who does not reciprocate his 
concessions, the individual may react against 
the norm violation with fewer and smaller 
concessions. The reciprocity hypothesis, there- 
fore, predicts that a tough strategy will evoke 
a tough strategy in reply. A related view, 
Proposed by Osgood (1959) assumes that 
given a stalemated negotiation, a deescalation 
of the confiict may be achieved by imple- 
menting a small unilateral concession which 
is subsequently increased if reciprocat 

the opponent. Again, there is an implic 
sumption that reciprocated concessions can be 
expected. There is some indirect support for 
the reciprocity hypothesis in the bai 
literature, which shows that subjects ten: 
reciprocate the frequency of concessions by a 
programmed opponent (Chertkoff & Conley 
1967; Pruitt, 1968; Pruitt & Johnson, 197 
In addition, there is also some support for 
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the Osgood hypothesis (Crow, 1963; Hamner, 
1974; Komorita & Barnes, 1969; Pilisuk & 
Skolnick, 1968). 

Finally, a third alternative (Kelley, Beck- 
man, & Fischer, 1967) assumes that the op- 
ponent’s concession making has no effect on 
the bargainer’s concessions. Rather, each bar- 
gainer tends to follow a predetermined offer 
schedule which is largely unaffected by the 
opponent’s concessions. According to this hy- 
pothesis, toughness is irrelevant. Evidence in 
support of this position have been reported 
by Pruitt and Drews (1969) and Liebert, 
Smith, Hill, and Kieffer (1968). 

The fact that all of these opposing hy- 
potheses have some empirical support sug- 
gests that the validity of these hypotheses 
may vary with the situation. Under certain 
conditions—or with certain types of individ- 
uals—a tough strategy may be optimal. 
Under other conditions, however, a tough 
strategy may induce toughness in return, thus 
escalating the conflict. Another possible rea- 
son for the lack of consistent support for a 
sihgle hypothesis is that there are several 
parameters underlying toughness, for exam- 
ple, size of opening offer, concession fre- 
quency, concession magnitude, and respon- 
siveness to the opponent’s concession making. 
Thus, one can be very tough on the opening 
offer but relatively yielding on subsequent 
trials, or one can be yielding on the opening 
offer and be very unyielding on subsequent 
trials, 

Similarly, a test of the reciprocity hypothe- 
sis requires an operationalization of reciproc- 
ity, especially for a bargaining situation. In 
the present study, three parameters were used 
to define reciprocity: (a) frequency of recip- 
rocated concessions, (b) magnitude of re- 
ciprocated concessions, and (c) the timing of 
such concessions (immediate vs. delayed). 
Based on this specification of reciprocity, the 
purpose of this study was to determine the 
effects of varying degrees of reciprocity in 
inducing concessions. 

In order to vary the degree of reciprocity 
on a systematic basis, pairs of subjects were 
led to believe that they were bargaining with 
each other, but in actuality, both bargained 
with a preprogrammed feedback strategy by 
the experimenter. In all conditions, the total 
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(cumulative) magnitude of concessions of the 
subject was matched by the program, so that 
an agreement would be reached only if the 
subject conceded to the “focal point” solu- 
tion of $80. At the same time, the proportion 
of reciprocated concessions was varied across 
the conditions. This procedure was used so 
that subjects could easily perceive violations 
of reciprocity, while retaining the property 
that in all conditions the same solution 
(agreement price) would be reached. 

It was hypothesized that in a bargaining 
situation, subjects expect a give-and-take sys- 
tem of reciprocated concessions, and when 
this expectation of reciprocity is violated, 
subjects will react against this disconfirma- 
tion by making fewer and smaller conces- 
sions. Thus, it was predicted that the greater 
the frequency of reciprocated concessions, 
the more effective it would be in evoking 
concessions from the subject. 


EXPERIMENT 1 (Pitor STUDY) 


Method 
Subjects 


The subjects for the pilot study were 55 male 
undergraduate students enrolled in introductory psy- 
chology classes. Participation in the experiment ful- 
filled part of the requirements of the course. Three 
subjects who were suspicious of the deception were 
dropped from the analysis, leaving a total of 52 
subjects. 


Procedure R 


Subjects were scheduled in groups ranging from 
four to eight persons, and after arriving at the 
appointed time, they were randomly assigned to two 
rooms. Each subject was instructed to assume the 
role of a buyer in a bargaining situation and was 
led to believe that a subject in the other room, 
paired at random, would assume the role of the 
seller. The object of the bargaining was presented 
in the scenario as a used appliance in average condi- 
tion. A comparable new appliance could be pur- 
chased for $120, and an alleged “second-hand dealer” 
would buy it for $40. In order for both the buyer 
and the seller to make a profit, the agreement price 
must be between $50 and $110. The midpoint of 
both of these ranges provided a focal point of $80. 

The following additional restrictions were imposed: 

1. Offers and counteroffers would be made by 
written bids. All offers would be restricted to a 
range of from $40 to $120. Once made, a concession 
could not be retracted (all offers had to be repetition 
of the previous offer or a concession). 

2. There would be 20 trials, with the seller making 
the first offer. If an agreement were not reached 
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by the 20th trial, the seller would have the oppor- 
tunity to accept or reject the buyer's last offer. 

3. If an agreement were reached, the lower the 
agreement price, the more money the buyer would 
earn, and the higher the agreement price, the more 
money the seller would win. If no agreement were 
reached, neither the buyer nor the seller would earn 
any money. 

4. Subjects could earn anywhere from zero to 
$2.00, depending on the agreement price. A conver- 
sion scale would be used at the end of the experi- 
ment to determine how much they had won. 


Independent Variables 


The experimenter, as the seller, began each bar- 
gaining session with an offer of $120. Following the 
subject’s initial offer, counteroffers were made ac- 
cording to a preprogrammed feedback strategy based 
on Equation 1 (used by Komorita & Brenner, 1968) : 


En = En-1 — (Sn — Sn-1), (1) 


where En and S, denote offers made on the nth trial 
by the experimenter and subject, respectively, and c 
denotes a concession parameter. It can be seen that 
if c is equal to 1, the program would reciprocate 
(match) the concessions made by the subject (the 
seller would come down in price as much as the 
subject went up in price). 

Three feedback conditions were used: 100%, 50%, 
and 25% reciprocity. In the 100% reciprocity condi- 
tion, concessions by the subject were immediately 
reciprocated on the next trial by concessions of equal 
magnitude (¢=1). In the 50% and 25% conditions, 
no concession was made except on “catch-up” trials. 
A catch-up trial consisted of a single cumulative 
concession which was equal in magnitude to the 
sum of the subject’s concessions since the previous 
catch-up trial. These catch-up trials were adminis- 
tered on random trials but with the restriction that 
they always followed a concession by the subject. 
In the 50% condition, there was an average of one 
catch-up trial for every two concessions made by 
the subject, while in the 25% condition, there was 
an average of one catch-up trial for every four 
concessions by the subject. It can be seen that the 
50% and 25% reciprocity conditions involve delayed 
matching strategies because they match the cumu- 
lative concessions of the subject but do not match 
concessions from trial to trial. 

The purpose of using catch-up trials was to con- 
trol the total magnitude of concessions in the three 
conditions. With this procedure, the frequency of 
reciprocated concessions was varied, but at the same 

time, the cumulative concessions of the program 
matched the cumulative concessions made by the 
subject. Moreover, the subjects in all three condi- 
tions would reach agreement at a price of $80, 
the “focal-point” solution based on “splitting the 
difference.” 

Any offer by the subject which was greater than 
or equal to $80 was immediately accepted and the 
bargaining terminated. There were 20 subjects in 
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the 


the 100% condition and 16 subjects in ea 
two partial reciprocity conditions. 


Dependent Variables 


Because previous research (cf. Benton, Ke) & 
Liebling, 1972; Cann et al, Note 1) suggests that 
two stages of bargaining may be involved, vo 


dependent variables were used, As the end o! jhe 
bargaining draws near, subjects may respond to ‘he 
increased pressure to agree by conceding 
readily. The effect of these final concessions is 
Previous differences in conceding, as a function 
different feedback conditions, may be “washed 

This is called an “end effect.” Accordingly, 

the subject’s 19th offer (before the end effect) 

the subject’s final offer (including the end eff 
were analyzed. In addition, following the bargair 
session, each subject was asked to rate his oppor 

on a series of 7-point semantic differential scal 


Results and Discussion 


The means and standard deviations for the 
19th and final (20th) offers are shown in 
Table 1. In 14 cases where subjects had 
already reached agreement by Trial 19 (at 
$80), $80 was used as the value for both the 
19th and final offer, An analysis of variance 
of the 19th offer showed a significant overall 
effect of conditions, F(2, 49) = 6.98, p< 
-005. The two degrees of freedom were also 
partitioned into two planned comparisons: 
(a) The first was the difference between the 
100% condition versus the two incomplete 
reciprocity conditions; this comparison was 
significant at the .01 level, F(1, 49) = 8.69, 
(b) Second was the difference between the 
two incomplete reciprocity conditions (50% 
vs. 25%); this was also significant, F(1,49) 
= 5.26, p < .025. 

Similar analyses for the final offer showed 
a significant effect of conditions, F(2,49) = 
6.20, p <.005. Both planned comparisons 
were also significant, F(1,49) = 5.26, p< 
05; and F(1,49) = 5.59, p < .025, respec- i 
tively. For the final offer data, the variances 
were heterogeneous, but Mann-Whitney U 
tests were also significant for these data. 
Thus, the predicted end effect was not ob- 
tained, and these differences between condi- 
tions seem to persist into the final offer, 

The data were also analyzed for frequency 
of reaching agreement at $80. A chi-square 
test of the data for all three conditions 
(2X3 table) was not significant, x7(2) = 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF NINETEENTH 
AND FINAL OFFER, EXPERIMENT 1 


Proportion reciprocity 


Offer 


100% 50% 25% 
(n = 20) (n = 16) (n = 16) 
Trial 19 
M 74.90 71.31 64.62 
SD 6.62 8.11 10.04 
Final 
M 78.05 76.00 70.75 
SD 3.66 5.06 9.29 


3.60. However, since there are two degrees of 
freedom, these data were also analyzed with 
the two partial reciprocity conditions com- 
bined. In the 100% condition, 13 of 20 sub- 
jects (65%) reached agreement, while in the 
two partial reciprocity conditions, only 13 of 
32 (41%) reached agreement. The chi-square 
value for the resulting 2 X 2 table was 2.93, 
and for a one-tailed test (df = 1), this was 
significant at the .05 level. This result indi- 
cates that agreements are more likely in the 
100% condition than in the two partial 
reciprocity conditions. 

Analyses of the postquestionnaire data also 
revealed that opponents who violated reci- 
procity were viewed with displeasure. Com- 
pared with subjects in the two partial reci- 
procity conditions, subjects in the 100% 
condition rated the other person as more co- 
operative, reasonable, fair, warm, and yielding 
(all significant at the .05 level). 

Overall, these results support the prediction 
that frequency of reciprocated concessions is 
a significant factor in inducing concessions: 
the greater the frequency, the greater the 
concessions by the subject. Moreover, the 
results also suggest that a reciprocity norm 
may be invoked by bargainers, and violations 
of the norm may evoke a perception of the 
opponent as competitive and exploitative. 
More importantly, subjects seem to resist 
these attempts at exploitation and respond to 
toughness with toughness. 

Despite this support for the reciprocity 
hypothesis, an alternative interpretation must 
be considered. Though the cumulative magni- 
tude of concessions was constant across the 
three conditions, the size of concessions in the 
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partial reciprocity conditions (on catch-up 
trials) were generally larger than in the 100% 
condition. Hence, it could be argued that in 
the partial reciprocity conditions, subjects 
may have perceived these unexpectedly large 
concessions as a sign of weakness, raised their 
level of aspiration, and thus evoked a tough 
bargaining stance. The results of Experi- 
ment 1, therefore, could be interpreted as sup- 
porting both the reciprocity hypothesis as well 
as the Siegel-Fouraker (1960) hypothesis. 

In order to resolve this problem of inter- 
pretation and to replicate the results of Ex- 
periment 1, a second experiment was con- 
ducted in which a control group was added to 
the design. In the control group all conces- 
sions were reciprocated (100% reciprocity), 
but the magnitude of the concessions were 
one half the size of concessions made by the 
subject. Since the control condition is a tough 
strategy which does not involve catch-up 
trials, a comparison of the control condition 
and the incomplete reciprocity conditions 
should provide a test of the two hypotheses. 


EXPERIMENT 2 


In addition to the use of a control group, 
an attempt was made to determine the effects 
of the motivational orientation of subjects 
and the possible interaction between reciproc- 
ity and orientation. Research with the Pris- 
oner’s Dilemma Game has shown that a sub- 
ject’s initial response is predictive of his 
subsequent behavior in the game (Komorita, 
1973; Terhune, 1968). Moreover, in a bar- 
gaining situation, Cann et al. (Note 1) have 
shown that subjects’ initial offers are highly 
correlated with their final offers. 

It was assumed that a cooperatively ori- 
ented subject is primarily motivated to reach 
agreement, whereas a competitively oriented 
subject is primarily motivated to achieve an 
advantageous bargain. Hence, it was pre- 
dicted that competitively oriented subjects 
would be more likely to react to incomplete 
reciprocity by making fewer concessions. The 
cooperator, on the other hand, was expected 
to continue conceding in an effort to induce 
cooperation. 

Finally, two procedural changes were made. 
First, the modal final offer for all subjects in 
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Experiment 1 was $80, and it is plausible 
that many of them would have continued to 
concede if the offer of $80 had not yielded 
an agreement. To reduce the number of sub- 
jects who would reach an agreement prior to 
the allotted number of trials, the agreement 
value demanded by the program was changed 
to $90. Second, instead of using a fixed num- 
ber of bargaining trials, the number of trials 
in Experiment 2 was unspecified (ambiguous). 
This change was made to replicate the end 
effect found in previous bargaining studies 
by increasing the time pressure to reach 
agreement. 

Given these changes, the following hy- 
potheses were tested: 


1. Mean offers will be a direct function of 
frequency of reciprocated concessions (repli- 
cation of the results of Experiment 1). In 
addition, according to the reciprocity hy- 
pothesis, subjects in the control condition 
(who receive concessions of smaller magni- 
tude) will concede less than subjects in the 
complete reciprocity condition. 

2. Subjects classified as competitive—on 
the basis of their initial offer—will concede 
less than subjects classified as cooperative. 

3. The effect of violations of reciprocity will 
be greater for subjects who are initially com- 
petitive than for those who are cooperative 
(an interaction between orientation and feed- 
back conditions). 


Method 
Subjects 


The subjects were 78 male undergraduate students 
enrolled in introductory psychology classes, and 
participation in the experiment fulfilled part of the 
requirements of the course. 


Procedure 


The instructions and procedure were identical to 
those used in Experiment 1 except for the following 
changes: 


1, Subjects (as buyers) made the first offer and 
were classified as cooperative or competitive on the 
basis of this initial offer. Based on the data of 
Experiment 1 (median initial offer of $45), subjects 
were classified as cooperative if the initial offer was 
greater than $45 and as competitive if the initial 
offer was equal to or less than $45. Subjects in each 
class were then randomly assigned to the various 
experimental conditions (randomized blocks design). 
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2. Subjects were instructed that the | ng 
would last a fixed amount of time, unless i ed 
by reaching an agreement, but the time li vas 
left unspecified. After the 19th trial, they were 
warned that the time was almost up and that iere 
would be time for “only a few more trials.” “he 


bargaining then continued for 3 more trials. jI 
to the 22nd trial, subjects were told that this s 
the last offer and that the seller would have 5 
opportunity to accept or reject this last offer. 

3. Following the warning to the subject 
Trial 19, a final concession was made to $90 (if 
level had not been reached by Trial 19), and 
further concessions were made after Trial 19. 


As in Experiment 1, three levels of recipro 
were used: 100%, 50%, and 25%. In the coni 
group, each concession by the subject was imm 
ately followed by a concession based on a c parai 
eter of .50. Hence, in the control condition, fr: 
quency and immediacy of concessions were identic 
to the 100% reciprocity condition, but the magni 
tude of concession was one half the size of the 
subject’s concession, 

As in Experiment 1, two dependent variables were 
used: subject’s 19th offer (prior to the warning) and 
his final offer on Trial 22. In addition, two question- 
naires were administered. After receiving all instruc- 
tions—but before the bargaining began—each subject 
was asked to give his estimate of what he considered 
to be a “fair” agreement price, This question was 
included as a check on the salience of the focal- 
point solution ($80) provided in the instructions. 
A postexperimental questionnaire was also adminis- 
tered to test the effectiveness of the deception. 
Responses on this questionnaire indicated that two 
subjects were suspicious and that one subject did 
not understand the instructions. The data for these 
three subjects were not included in the data analy- 
sis. Unfortunately, due to the time factor, ratings 
of the opponent were not obtained in Experiment 2, 


Results 


The means and standard deviations of the 
19th offer are shown in Table 2. These data 
were analyed by a 4 X 2 (Reciprocity x Ori- 
entation) analysis of variance. The results of 
this analysis indicated that the main effect of 
orientation was significant at the .01 level, 
F(1, 67) = 16.94. Table 2 shows that sub- 
jects who were competitively oriented con- 
ceded less than those who were cooperatively 
oriented, thus confirming Hypothesis 2. The 
main effect of feedback conditions was also 
significant at the .01 level, F(3, 67) = 5.44. 
The nature of the differences between the 
reciprocity conditions seem to replicate the 
results found in Experiment 1, thus confirm- 
ing Hypothesis 1. 


| 


RECIPROCITY AND CONCESSION MAKING 


There also seems to be an interaction be- 
tween orientation and feedback conditions, as 
hypothesized, but the interaction is only of 
borderline significance, F(3, 67) = 2.30, p< 
.10. Finally, three planned comparisons were 
made: (a) First was the 100% reciprocity 
condition against the three partial reciprocity 
conditions; this comparison was significant at 
the .01 level, F(1,67) = 13.31. (b) Second 
was the control condition (100%; c= .50) 
versus the combined 50% and 25% condi- 
tions; this comparison was not significant at 
the .05 level, F(1,67) = 2.50. (c) Finally, 
the 50% versus the 25% conditions compari- 
son also was not significant (F < 1). 

The comparisons with the control condition 
suggest, therefore, that the critical component 
of the feedback program is not strictly fre- 
quency of reciprocated concessions but a 
combination of frequency and magnitude. Any 
deviation from complete reciprocity, either in 
terms of frequency or magnitude, does not 
seem to be optimal in inducing concessions 
through the 19th trial. 

Table 3 shows the means and standard de- 
viations of offers on the final trial. The main 
effect of orientation was significant at the .01 
level, F(1, 67) = 10.43, but the main effect 
of feedback conditions was not significant, 
F(3,67) = 1.48. Thus, the differences due to 
feedback, which were significant on the 19th 
trial (prior to the warning), were “washed 
out” on the final trial. These results, as ex- 


TABLE 2 
MEANS AND STANDARD DEVIATIONS Or NINETEENTH 
OFFER, EXPERIMENT 2 


Proportion reciprocity 
Inia Res 
100% | 50% 25% | Control 

>$45 

M 77.89 | 78.70 | 73.63 | 72.44 | 75.06 

SD 5.69 3.20 |} 11.12 | 11.27 

n 9) 10 8 9 36 
<$45 

M 77.70 | 64.11 | 65.90 | 61.40 | 67.36 

SD 4.79 | 10.34 | 10.57 | 10.02 

n 10 9 10 10 39 
Totals 

M 77.79 | 71.79 | 69.34 | 66.63 

n 19 19 18 19 


869 


TABLE 3 


MEANS AND STANDARD DEVIATIONS OF FINAL OFFER 
(TRIAL 22), EXPERIMENT 2 


rs Proportion reciprocity 
oh Total 
100% 50% 25% Control 

>$45 

M 79.22 | 81.40 | 81.13 | 81.22 | 80.75 

SD 4.38 3.60 4.49 2.49 

n 9 10 8 9 36 
<$45 

M 82.20 | 73.22 | 76.10 | 75.60 | 76.87 

SD 6.29 5.12 3.99 8.46 

n 10 9 10 10 39 
Total 

M 80.79 | 77.53 | 78.34 | 78.26 

n 19 19 18 19 


pected, replicate the end effect found by 
Benton et al. (1972). 

Finally, the Orientation X Feedback Condi- 
tions interaction was significant at the .05 
level, F (3,67) = 3.96, thus supporting Hy- 
pothesis 3. The nature of this interaction is 
shown in Figure 1. It can be seen that the 
partial reciprocity conditions had marked ef- 
fects for those subjects who were competi- 
tively oriented, as hypothesized, but had 
negligible effects for those with a cooperative 
orientation. The major implication of these 
results is that the end effect found in previous 
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Ficure 1. Mean final offers of cooperative and com- 
petitive subjects as a function of reciprocity. 
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studies may be characteristic of cooperative 
subjects but not of competitive subjects. 
Competitive subjects seem to persist in their 
resistance to violations of reciprocity even 
when subjected to high time pressure. 
With regard to frequency of reaching 
agreement, only three subjects reached $90 
as demanded by the programmed opponent, 
and because of the small number of subjects 
who reached agreement, no attempt was made 
to analyze these data. With respect to the 
analysis of the prebargaining estimates of 
fair price, the mean fair price estimate was 
$72.01, clearly different from the intended 
focal-point solution at $80. This suggests that 
the bargaining situation was somewhat am- 
biguous in that no single solution was ac- 
cepted as fair, and although the intended 
focal-point solution may have been salient for 
many subjects, not all of them accepted it as 
an equal profits solution. As expected, a 
4 X 2 (Reciprocity X Orientation) analysis of 
variance of fair price estimates yielded no 
significant effects of reciprocity, F(3, 67) = 
1.51; orientation, F(1, 67) = 1.75, or the 
interaction, F (3,67) = 1.27. Therefore, the 
obtained differences in concessions between 
the conditions cannot be attributed to differ- 
ences in fair price estimates. 


GENERAL DISCUSSION 


The results of this study provide consider- 
able support for the proposed reciprocity hy- 
pothesis. Though the results of Experiment 1 
are subject to the alternative interpretation 
that catch-up concessions may have raised the 
subject’s level of aspiration, comparisons in- 
volving the control group in Experiment 2 
clearly indicate that this was not the case. 
The control condition represents a tough 
Strategy which does not involve catch-up 
concessions; hence, this condition is not sub- 
ject to the level of aspiration interpretation. 
Since the control condition evoked behavior 
very similar to the other partial reciprocity 

conditions—but clearly different from the 
complete reciprocity condition—the results 
support the reciprocity hypothesis but not the 
Siegel-Fouraker hypothesis. 

These results, moreover, are consistent with 
the results of studies using the Prisoner’s 
Dilemma Game. A review of studies with pre- 
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programmed strategies in the Prisone jj- 
lemma Game indicates that the most ef 


f e 
(optimal) strategy of inducing cooperat is 
one which imitates the subject’s previous e- 
sponse, a “tit-for-tat” strategy (Oska 
1971; Wrightsman, O'Connor, & Ba 


1972). A tit-for-tat strategy, of course, i a 
reciprocal strategy, and these results based : » 
the Prisoner’s Dilemma Game lend furt! 
Support for the reliability of the pres: 
findings. 

Despite this support for the reciprocity 
pothesis, several qualifications are necessa 
First, with regard to the generality of the 
sults, attempts should be made in future : 
search to specify the conditions under whic 
the reciprocity hypothesis is valid. In th: 
present study, the montary incentives were 
quite small, and no losses (costs) were in- 
volved if an agreement could not be reached; 
however, such losses may be an important 
factor underlying the effects of “toughness.” 
In addition, subjects in the present study had 
full knowledge of their opponent’s profit 
scheme. This is another important variable 
restricting the generality of the results, be- 
cause the reciprocity hypothesis assumes that 
a bargainer must first perceive violations of 
reciprocity and then react against this at- 
tempt at exploitation. Hence, the validity of 
the reciprocity hypothesis may be restricted 
to situations in which there is sufficient infor- 
mation to evaluate a reciprocal strategy as 
being fair or unfair. 

Second, the significant interaction between 
orientation and reciprocity suggests that reac- 
tions to violations of the reciprocity norm 
may be much stronger for those who are com- 


petitive than for those who are cooperative, . 


Though all subjects, regardless of orientation, 
tended to respond favorably to a strategy of 
complete reciprocity, in the incomplete reci- 
procity conditions, only cooperative subjects 
continued to concede to a comparable level. 
The final qualification involves the results of 
the control condition in Experiment 2, which 
suggests that the effects of reciprocity is not 
simply frequency of reciprocity. In terms of 
the operational definition of reciprocity, all 
three dimensions—frequency, magnitude, and 
immediacy of concessions—must be present 
in order to induce competitive bargainers to 
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continue conceding. For competitive subjects, 
violation of reciprocity on one or more of 
these dimensions induced less concession 
making. 

These results are consistent with Kelley 
and Stahelski’s (1970) attribution interpreta- 
tion that a cooperative person perceives 
others as being cooperative or competitive. 
He may cooperate initially, but if the other 
person does not reciprocate his cooperative 
behavior, he quickly reciprocates the other’s 
competitive behavior. A competitive person, 
on the other hand, views others as being uni- 
formly competitive and behaves competitively 
toward both cooperative and competitive 
persons. Hence, all person’s confronted by a 
competitive person eventually reciprocates his 
competitiveness. A competitive person, there- 
fore, is more likely to make an attribution 
error by failing to note how his own behavior 
affects the other’s behavior. In the present 
study, all feedback programs converged 
toward the same agreement value, but com- 
petitive subjects yielded less than cooperative 
subjects only in the incomplete reciprocity 
conditions. Hence, it appears that competitive 
subjects failed to perceive the cooperative 
nature of catch-up trials and did indeed make 
attributional errors in the incomplete reci- 
procity conditions. 

This attribution interpretation is, moreover, 
consistent with the end effect found in Experi- 
ment 2. Though the main effect of reciprocity 
conditions was not significant on the final 
trial (the end effect), the significant interac- 
tion indicates that the differences between 
conditions were maintained for competitively 
oriented subjects but not for cooperative sub- 
jects. It is plausible, therefore, that the end 
effect typically found in bargaining studies is 
characteristic of cooperative subjects and can 
be attributed to an averaging across both 
types of individuals. This interpretation is 
consistent with the hypothesis that a coopera- 
tively oriented individual is primarily moti- 
vated to reach an agreement, and regardless 
of the earlier behavior of the other party, he 
is likely to make a final offer which he be- 
lieves the other party will accept. It is plau- 
sible that a competitive person, on the other 
hand, because he is primarily motivated to 
achieve an advantageous bargain, is likely to 
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exploit the situation by making a tough final 
offer and is willing to take a greater risk that 
the other party will reject his final offer. 
For such subjects, achieving a “good” bargain 
is more important than reaching an agreement. 


REFERENCE NOTE 


1. Cann, A., Esser, J. K., & Komorita, S. S. Equity 
and concession strategies in bargaining. Paper 
presented at the meeting of the Midwestern 
Psychological Association, Chicago, May 1973. 
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Effect of Expectations of Future Interaction on Reward 
Allocations in Dyads: Equity or Equality 


E. Gary Shapiro 
Department of Sociology 
University of Iowa 


Subjects divided a reward between themselves and another where they had 
made unequal work contributions. It was found that for allocators with high 
inputs perceptions of future interaction affected the choice of a principle of 
reward allocation. When future interaction was expected high input allocators 
divided the reward equally, when no future interaction was expected rewards 
were divided according to equity. Low input allocators divided rewards accord- 
ing to equity regardless of their expectations of future interaction. The results 
support the view that subjects who expect future interaction with another, 
who would be affected by the allocation, have greater self-presentation con- 
cerns which effect their distribution of rewards. 


Equity theory (Adams, 1965) states that 
persons tend to maintain proportionality 
between their inputs and outcomes and the 
inputs and outcomes of a comparison other. 


Consequently, equity theory proposes that 


persons who allocate rewards within a group 
distribute them in proportion to each mem- 
bers’ contributions. While this proposition 
has been supported by a number of studies, 
other studies have failed to support equity 
indicating that rewards are distributed 
equally. The purpose of this article is to clar- 
ify these conflicting results. 

Early studies supporting equity in situa- 
tions of reward allocation (Leventhal, Allan, 
& Kemelgor, 1969; Leventhal & Bergman, 
1969; Leventhal, Weiss, & Long, 1969) 
showed, that within a dyad, when inputs 
were equal that persons react to an uneven 
division of rewards by reallocating the re- 
wards to restore a more nearly equal distribu- 
tion. These studies fail to differentiate equity 
and equality since when inputs are equal the 


The research reported in this article is part of the 
author’s dissertation submitted in partial fulfillment 
of the doctoral degree at the University of Michi- 
gan. I would like to thank Dorwin Cartwright, 
Howard Schuman, Michael Flynn, and Robert Cole, 
the members of my doctoral committee, for their 
assistance on this research. An earlier version of this 
article was presented at the meetings of the Ameri- 
can Sociological Association, 1972, in New Orleans. 

Requests for reprints should be sent to E. Gary 
Shapiro, Department of Sociology, University of 
Towa, Iowa City, Iowa 52242. 


equity allocation is the same as an equality 
allocation. However, other studies (Leven- 
thal & Anderson, 1970; Leventhal & Lane, 
1970; Leventhal & Michaels, 1969) have sup- 
ported equity over equality. In these studies 
where members of a dyad made unequal work 
contributions, subjects allocated the rewards 
unequally. In contrast to these studies sup- 
porting equity over equality, there are stud- 
ies (Morgan & Sawyer, 1967; Wiggins, 
1966) and coalition studies (reviewed by 
Gamson, 1964) which indicate, contrary to 
equity, that even. when inputs are unequal, 
subjects do allocate rewards equally. 

These conflicting results can be understood 
by reference to situational factors which dif- 
fered across these studies. In the studies 
where equity occurred, minimal social inter- 
action existed between subjects. For example 
in the Leventhal and Lane (1970) study the 
subjects had as “partners” fictitious others 
who were being tested in another room by 
another experimenter. This contrasts with 
studies where equal distributions were made 
in which subjects were in relationships for 
which the reward allocation had future impli- 
cations. In the Wiggins (1966) study the 
subjects were informed that they would spend 
1 hour together for each of 12 consecutive 
days, while in Morgan and Sawyer’s study 
(1967) the subjects were classmates. 

A critical difference in reward allocation 
appears to be whether the allocator expects 


873 


874 


to have future interaction with the persons 
affected by the allocation. This seems impor- 
tant in that when future interaction is ex- 
pected the image one projects to others be- 
comes more salient. Therefore, persons may 
allocate rewards in a manner which would 
lead to social approval on the part of persons 
being affected by the allocation. This study 
manipulated expectations of future interaction 
to determine its effects on reward distribu- 
tions, 

The input level of the allocator vis-à-vis 
the other was also manipulated in this study 
to determine if expectations of future inter- 
action affect the allocation behavior of both 
high and low input subjects in a similar man- 
ner. In the studies where equal distributions 
occurred, the allocation behavior of low and 
high input subjects could not be differentiated 
since the allocation of rewards were deter- 
mined by a group decision. There is some evi- 
dence (Cohen, 1972) which suggests that 
these group decisions predominantly reflect 
the wishes of subjects with high inputs as 
these subjects have more influence on the 
decision reached than do subjects with lower 
inputs. If in the studies where equal distribu- 
tion of rewards occurred high input persons 
had greater influence on the group decision, 
then future interaction need affect only the 
allocation behavior of high input subjects to 
resolve the conflict between studies support- 
ing equity and those supporting equality as 
principles of allocation. 

The specific hypotheses of this study were 
that in situations where equity and equality 
do not produce the same allocation of re- 
wards, that is, when inputs are not equal, 
when future interaction is anticipated there is 
a tendency for allocators to choose an alloca- 
tion principle which is more beneficial to the 
other’s outcomes and less beneficial financially 
to oneself. Therefore, when future interaction 
is anticipated high input allocators should use 
equality, while low input allocators should 
use equity. Conversely, when no future inter- 
action is expected allocators should allocate 
rewards according to the principle which is 
more advantageous to themselves financially. 

Therefore, when future interaction is not 
expected high input allocators should prefer 
equity and low input allocator equality. Stat- 
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ing the hypotheses in terms of the am: of 
money the allocator keeps for himsel gh 
input allocators should keep more th: W 
input allocators and allocators anticips iing 
future interaction should keep less than se 


who do not expect future interaction, 

In addition to studying the effect of fu: re 
interaction on reward allocation, this pres: t 
study attempted to determine if persons fac] 
with a conflict between equity and equa 
resolve it by following one or the other p 
ciple in their allocation or whether they a 
cate rewards so as to compromise between 
two principles. Previous studies have report 
data only on the mean allocations of grou 
This does not allow one to interpret the p: 
tern of allocations for individual subjeci 
There is some speculation (Meeker, 1971) 
that when individuals are placed in an am- 
biguous situation where more than one prin- 
ciple of behavior may be applicable, they re- 
act by adopting one or the other rather than 
compromising, 


METHOD 
Subjects 


Male undergraduates enrolled in introductory 
sociology and social psychology courses at The Uni- 
versity of Michigan served as subjects. They had 
volunteered to participate in a paid experiment con- 
cerned with “simulating certain features found in 
business and industrial settings.” Twenty subjects 
were randomly assigned to each experimental condi- 
tion. 


Overview 


The experiment consisted of having subjects per- 
form two experimental tasks for which they were to 
earn points. The points were to be combined for two 
subjects and were to earn a joint bonus at the rate 
of .6¢ per point. Each subject then allocated this 
joint bonus alone, believing that he was the only 
member of the dyad to do so. 


Procedure 


Subjects entered the laboratory, were seated in 
their appropriate place, were designated as Subject 
A or Subject B, and were read the following in- 
structions: 


The purpose of this study is to simulate some 
features found in business and industry and to 
assess their effects on worker performance. Today 
we are simulating a number of factors, 

The first is patterns of remuneration or as they 
are probably better known, wage and payment 
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plans. In all the wage plans we are simulating, 
each subject receives a $1.00 participation fee. 

You are to be in a variable bonus plan where in 
addition to the $1.00 participation fee, subjects 
have an opportunity to receive a bonus calculated 
in the following way. 

Subjects work with another subject who serves 
as his partner. Both subjects work on two tasks. 
Based on their performance on these two tasks 
each receives points. After the completion of these 
tasks, the points are combined from the two 
subjects and a joint monetary bonus is calculated 
by multiplying the total number of points by .6¢. 

One of the subjects is then asked to allocate the 
money. He may do so any way he wishes. He may 
give all the money to his partner, he may give all 
the money to himself, or he may divide the money 
in any way in between. 

The reason we are doing things like this is to 
simulate what often occurs in business. In business 
certain people can often determine how much 
they and other people will get paid. Other people 
have these decisions made for them and can not do 
so themselves. 

It will be left to chance to determine who will be 
the allocator. You will know whether you are the 
allocator before you begin to work on the ex- 
perimental tasks. Just before you begin to work 
on the first task you may open the envelope on 
your desk. The. envelope has an index card on 
which has been typed one of two words. If you 
are the subject whose card has allocator typed on 
it, you will be in charge of dividing the bonus. 
If your card has the word nonallocator on it, the 
bonus will be divided by your partner. 

The allocator will make his decision on this 
form. While the allocator is making his decision, 
the nonallocator will fill out this form on worker 
preferences for wage plans. 


The two forms, the allocation form and the worker 
preference for payment questionnaire, were shown to 
the subjects. The worker preference for payment 
questionnaire was a questionnaire which asked the 
nonallocator to indicate how unskilled workers 
would react to a variety of different wage plans. 
Since all the subjects participated as allocators, this 
form was never used and was included only to 
provide a rationale for why all subjects were given 
a form to complete after the experimental tasks 
were completed. The instructions then continued as 
follows: 


Let me tell you a little more about how you will 
be working during the experiment. All subjects 
work on the same tasks and have the same amount 
of time. You will work on two tasks for which 
you will earn points toward the bonus. After each 
of these tasks, you will work on a nonpoint scoring 
task. Even though you will not receive points 
for these tasks, you should work as hard on these 
as you do on the other two. This nonpoint scoring 
task is an important part of the experiment. 
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You will work on the first task for 6 minutes, 
then work on the nonpoint scoring task for 4 
minutes. During this 4 minutes I will evaluate 
your performance on the first task and award 
points. I will tell you the points scored before you 
begin to work on the next task. 

You will have 6 more minutes to work on the 
second point scoring task followed by 4 more min- 
utes on the nonpoint scoring task. I will score the 
second experimental task, total the points, and 
calculate the amount of the bonus during the time 
you work on the nonpoint scoring task. I will 
announce the points and amount of the bonus at 
the end of the time for the nonpoint scoring task. 
That will end the work session. 

After this, if you are the allocator I will ask 
you to distribute the bonus. If you are the non- 
allocator, then you will fill out the questionnaire 
on worker payment plans. Then I will ask you to 
fill out some other forms that are important 
parts of the experiment. 

OK, before we begin open the envelope on your 
desk and see if you are an allocator or a non- 
allocator. 


The subjects then worked 6 minutes on the first 
experimental task. The experimenter pretended to 
score this while the subjects worked 4 minutes on a 
nonpoint scoring task, an intervening task. The 
scores for the first task were then announced and 
the subjects worked 6 minutes on the second experi- 
mental task. The subjects then worked 4 minutes 
on the intervening task while the experimenter pre- 
tended to score the second experimental task. The 
experimenter announced the points scored and the 
amount of the joint bonus, $3.02. 

Each subject was then given a folder which con- 
tained the allocation form on which the experimenter 
had written the number of total points scored, 503, 
and the amount of the joint bonus, $3.02. The ex- 
perimenter then repeated that: 


The allocator may allocate the money any way 
he wishes. He may give all the money to his 
partner, he may give all the money to himself, or 
he may divide the money any way in between. 


The subjects then allocated the bonus by writing 
on the allocation form how much money they would 
keep for themselves and how much should be given 
to their partner. They were told to check that the 
two amounts added up to the total of the joint 
bonus. Subjects believed that they were the only 
member of the dyad who was allocating the money 
and that their partner was filling out (or had filled 
out) the worker preference for payment question- 
naire. 

The experimenter then picked up the folders with 
the allocation form and handed the subjects a post- 
experimental questionnaire. After the subjects had 
completed the postexperimental questionnaire, they 
were told the true purpose of the study and were 
asked not to disclose this to others. They were 
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then each paid an additional $1.50 for their partici- 
pation in the study. 


Manipulation of Independent Variables 


The independent variables in this study were the 
allocator’s level of inputs relative to the other 
member of the dyad and the expectation of future 
interaction. These variables formed a 2 X 2 factorial 
design. 

Level of input. Manipulation of differential inputs 

was accomplished by providing false feedback on 
the number of points each subject had earned on 
the basis of his performance on the experimental 
tasks. The experimenter announced the scores on 
the first task by stating that Subject A had scored 
— points and Subject B __ points. The high input 
subject was told he had scored 143 points, while the 
low input subject was told he had scored 84 points, 
Each subject heard his own score and the score of 
his partner and was asked to record the points scored 
by themselves and their partner on a score sheet 
which was provided for that Purpose. The process 
was repeated after the second task with the high 
input subject being told he had scored 184 points 
for a total of 327 points and the low input subject 
being told he had scored 80 points on the second 
task for a total of 176 points, 

Future interaction. Subjects were not directly 
informed whether they would have future interac- 
tion with their partner, rather the experimental 
situation was set up so the subjects would infer 
this. When subjects were not to expect future inter- 
action with the other member of the dyad, they 
participated alone and had as a partner “someone 
who has participated earlier.” Subjects in this no- 
future interaction were told the following: 


You will be separated from your partner, by 
both time and space. What this actually means 
is that you will have as a partner someone who 
has already participated or you will have as a 
partner someone who will participate after you, 
If you are selected to be the allocator, then you 
will have as a partner someone who has already 
participated. In this case of course we will know 
how many points he has received for his per- 
formance, They will be listed on the form inside 
an envelope. We have a few subjects who have 
already participated as non-allocators and who 
you may choose from to serve as your partner. 

If you are selected to be non-allocator, then 
you will have as a partner someone who will 
participate as an allocator. In this case we will 
put your scores on a form and place them in an 
envelope. Your envelope then will probably be 
picked in a few days by a subject who is partici- 
pating as an allocator. 


Before the subjects were given the first experi- 
mental task, they were asked to open the envelope 
on their desk and were then asked whether they 
were allocators or nonallocators. In response to 
their replies that they were allocators, they were 
told the following: 
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Since you are the allocator, you will ! sa 
partner someone who already has pa ted. 
Pick one of these to serve as your part His 


scores are posted inside. I'll tell you his at 
the same time I tell you yours, 


The subjects picked an envelope with th: res 
of his “partner” listed on the score sheet le. 
Subjects were allowed to choose from a num of 
smaller envelopes placed within a larger envelope. 


The subjects were then handed the first experime: tal 
task, 

The subjects who were to expect future inter 1 
with their partner participated in pairs and 3 
told: “You are to be separated from your pa 
by working separately in these adjacent roc 
The two subjects had been previously placed ir 
adjacent rooms which opened onto a large com 
room. The doors to these small rooms remai 
open during the experiment. The experimen 
stood in the large common room while reading t 
experimental instructions to both subjects simul 
taneously. While the subjects did not see each othe 
before or during the experiment, they heard each 
other respond to the experimenter’s questions and 
could also hear each other working on the experi- 
mental tasks. The situational arrangements were 
such that subjects would perceive that they would 
meet each other briefly at the end of the experi- 
mental session. Pretests confirmed that this was so, 


Experimental Tasks 


The first experimental task Was represented as a 
task of simple mathematical manipulation, it con- 
sisted of adding together five-digit numbers, The 
second experimental task was tepresented as a task 
of cognitive flexibility and consisted of forming 
words of four or more letters from the letters 
RCHALKED. The intervening task was to cut out fig- 
ures matching figures in a booklet. 


RESULTS 
Manipulation Checks 


In order for subjects to follow equity and 
distribute the rewards unequally, it was nec- 
essary that they had perceived that their 
performances on the experimental tasks had 
differed from that of their partner. Subjects 
were asked to compare their performance on 
each experimental task with the performance 
of their partner. Their responses indicated 
that subjects were aware of the differential 
contributions made by each member of the 
dyad. For both experimental tasks, high input 
subjects reported superior performance, while 
low input subjects reported inferior perform- 
ance. Differences between the reported per- 
formances for low and high input subjects 


REWARD ALLOCATIONS IN Dyaps 


were highly significant, Fs (1,76) = 552.8 
and 957.2, p < .001. Subjects were also asked 
to report the proportion of points they had 
earned. High input subjects reported they had 
contributed 64.3%, while low input subjects 
reported a mean contribution of 33.8% of the 
total points. 

To determine the effect of the future inter- 
action manipulation subjects were asked to 
indicate whether they expected to meet and 
talk with their partner at the end of the ex- 
periment. On a scale of 1 to 5 with 5 indicat- 
ing that they were certain they would not 
meet their partner and 1 indicating that they 
were certain they would meet their partner, 
subjects in the no-future interaction condi- 
tion reported a mean score of 4.4, while sub- 
jects in the future interaction condition re- 
ported a mean score of 2.0, F (1,76) = 83.4, 
p < 001. This result indicates that the ma- 
nipulation of future interaction was success- 
ful. 


Allocation of Rewards 


The major dependent variable in this study 
was the subjects’ allocation of the $3.02 bo- 
nus. According to the equity principle, sub- 
jects in the high input condition should have 
allocated themselves exactly 65% of this 
bonus, while subjects with low inputs should 
have given themselves exactly 35%. Such 
distributions would have been strictly pro- 
portioned to the inputs each contributed. An 
allocation was considered to be based on the 
equity principle if the allocation was within 
2% of the exact proportional distribution. 
This 2% leeway allowed for some minor 
errors by the subjects in the calculation of 
inputs and/or reward distribution. According 
to the equality principle, all subjects regard- 
less of inputs should have taken exactly 50% 
. of the bonus. An allocation was considered to 
be based on the equality principle if it was 
within 2% of equality.t An allocation was 


1 Reflecting the fact that an equal distribution 
involved less chance for calculation errors, all the 
allocations based upon the equality principle were 
within 1¢ of the exact equal distribution. When the 
rewards were distributed according to equality, 
there was no need for the subjects to calculate the 
exact proportion of inputs contributed by each sub- 
ject nor to divide correctly the reward according to 
the 65%-35% split. 
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TABLE 1 


PRINCIPLE OF ALLOCATION USED IN REWARD 
DISTRIBUTION, BY LEVEL or INPUT or 
ALLOCATOR AND EXPECTATION OF 
FUTURE INTERACTION 


Input level 
High Low 
Principle of 
Allocation Expect future | Expect future 
interaction interaction 
No Yes No Yes 
Equity 14 5 13 12 
Compromise 
(between equity 
and equality) 1 3 3 2 
Equality 4 12 3 5 
Other 1 = 1 1 


considered to be a compromise distribution if 
it was between the equity and equality distri- 
bution, while any distribution outside the 
equity-equality interval was considered to be 
an “other” distribution. 

As had been predicted, subjects used either 
equity or equality in their allocations rather 
than compromising between the two princi- 
ples. As can be seen in Table 1, 68 of the 
80 subjects followed either equity or equality. 
Forty-four allocated the rewards consistent 
with equity, while 24 followed equality in 
their distribution. Only 9 subjects appeared 
to compromise between the two principles in 
their allocation. Three subjects allocated the 
reward outside the interval produced by 
equity and equality; 1 high input subject 
took more than his equitable share, 1 low 
input took more than an equal share, while 1 
low input subject took less than his equitable 
share. 

Comparing those subjects who followed 
equity in their allocation to those subjects 
who did not follow equity, there was no sig- 
nificant effect for expectations of future inter- 
action on the allocation behavior of low input 
subjects, x’ (1) =.11, p<.50, However 
among subjects with high inputs the effect of 
future interaction was as predicted, x? (1) = 
8.12, p < .01. Among high input subjects 
who did not expect to have future interaction 
with the other subject, 70% used the equity 
principle in their allocation. For those high 
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TABLE 2 


MEAN PERCENTAGE OF REWARD ALLOCATED To SELF, 
BY FUTURE INTERACTION AND 
LEVEL OF INPUT 


Expect future interaction 


Level of input 


No Yes 
High (65%) 62.36" 55.11 
Low (35%) 38,39¢ 38.410 


Note. Entries with similar superscripts do not differ sig- 
nificantly from each other (p <.05) on a Newman-Keuls 
test of means. 


input subjects who did expect to have some 
future interaction with their partner only 
25% used the equity principle, while 60% 
used the equality principle. 

The effects of expectations of future inter- 
action and level of input on the allocation of 
the rewards can also be seen in Table 2. 
Table 2 presents the mean percentage of the 
$3.02 bonus that subjects allocated to them- 
selves.” 

As can be seen from this table, the sub- 
jects’ level of inputs was a major determinant 
in how the reward was allocated. As predicted 
by equity theory, high input subjects allo- 
cated themselves more than did low input 
subjects, F (1,76) = 173.2, p < .001. How- 
ever contrary to a straight equity model, ex- 
pectations of future interaction also signifi- 
cantly affected the allocation, F (1,76) = 
5.48, p < .05, and there was a significant 


2It can be noted that in this study’s no-future 
interaction condition, the condition which is a near 
replication of Leventhal’s procedures, the results are 
quite similar to those obtained by Leventhal and 
Lane (1970). In this study with 65% of the inputs 
contributed by the high input subject and 35% by 
the low input subjects, the mean allocations were 
62.36% and 38.39%. In the Leventhal and Lane 
study with high input subjects contributing 63% to 
the low input subjects 37%, high input subjects 
allocated themselves 60.94% and low input subjects 
allocated themselves 42.33% of the money. In this 
study the ratio of outcomes to inputs was .95938 
for high input subjects and 1.09685 for low input 
subjects. This compares to the ratios in the Leyen- 
thal and Lane study of .96730 for high input sub- 
jects and 1.14054 for low input subjects. The allo- 
cation behavior of high input subjects in both stud- 
ies is virtually identical, while low input subjects in 
this study allocated themselves slightly less than 
the low input subjects in the Leventhal and Lane 
study. 
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interaction effect, F (1,76) = 5.53, 05, 
A Newmans-Keuls test of means shows: ihat 
the effect of expectations of future inte:. tion 
was due to differences only among high ` vut 
subjects. High input subjects took re 
when future interaction was not expect an 
when future interaction was anticipated. No 


difference occurred between low input «llo- 
cators. 

In addition to comparing the allocation. of 
high and low input allocators, one can 
pare the actual allocations to the specific ore- 
dictions generated by equity theory. Equicy 
theory not only claims that rewards are 
tributed unequally when inputs are unec: |, 
but also that the distribution of rewards 
proportional to the inputs of the participan 
These more specific comparison have not be: 1 
presented in previous studies. 

In this study the mean allocation made by 
the high input subjects expecting some future 


interaction with their partner was signifi- , 


cantly different from the equity prediction 
t (19) = 6.35, p < .001. In addition, sub 
jects in both the low input conditions had 
mean allocations higher than the allocations 
predicted by equity, ts (19) = 1.94 and 1.92, 
2 < .10. While allocations are different than 
equality, they are also not totally consistent 
with equity. 


Discussion 


In this study where subjects were faced 
with a conflict between two principles of 
reward allocation, they responded by choos- 
ing either one or the other principle in their 
allocation rather than compromising. Such 
behavior may be a way to reduce intraper- 
sonal conflict. For a person who compromises 
between equity and equality in his allocation, 
it is likely that he recognizes that these two 


conflicting principles could both be applicable 5 


in the situation. For subjects who did not 
compromise but who chose to allocate th: 
rewards according to one or the other princi 
ple, intrapersonal conflict might be avoided 
by having the belief that only one principie 
of reward allocation is applicable in the situ- 
ation. 

The results of this study indicate that for 
subjects with high inputs the choice between 
equity and equality as a principle of reward 
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allocation is affected by whether the allocator 
expects to have future interaction with the 
other. This result supports the explanation 
presented to account for the conflicting results 
in studies of reward allocation. If high input 
subjects predominately influence a group 
decision on reward allocation, as was found 
by Cohen (1972), then their preference for 
equal distributions in situations where future 
interaction is expected accounts for the equal 
distributions found by Wiggins (1966) and 
by Morgan and Sawyer (1967). In the Lev- 
enthal studies where the situation does not 
lead subjects to expect future interaction, 
then equity is found. 

It was argued that when future interaction 
is anticipated, persons become more concerned 
about how they appear to others. This in- 
creased concern for favorable self-presenta- 
tion could be expected to overcome one’s 
interest in increasing the amount of money 
one would allocate to himself. Such a process 
has been reported by Brown and Garland 
(1971) who found that self-presentation ef- 
forts increased at the expense of money when 
future interaction was anticipated. Similar 
self-presentation arguments have also been 
advanced in equity studies; to account for a 
preference for equity over self-interest (Law- 
ler, 1968), to account for self-depriving be- 
havior (Leventhal & Bergman, 1969), and to 
account for a lack of stronger reallocation 
motivations (Leventhal, Weiss, & Long, 
1969). It has also been reported (Leventhal, 
Michaels, & Sanford, 1972) that subjects 
recommending rewards for others alter their 
reward distributions so as to avoid inter- 
personal conflict. 

There is some evidence in this study which 
is consistent with this explanation. As part of 
the postexperimental questionnaire, subjects 
were asked to indicate, “how satisfied do 
you believe the other subject will be with the 
amount of the bonus he will receive?” Neither 
future interaction nor level of inputs signifi- 
cantly affected the subjects’ responses. How- 
ever, deviation from equity in the alloca- 
tion, the difference between the amount of 
money subjects allocated to themselves and 
the amount predicted by equity, was found to 
be related to how satisfied subjects believed 
their partners would be. A regression analy- 
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sis which controlled for future interaction 
indicated that for high input subjects the 
greater the deviation less than equity, the 
more subjects felt that their partners would 
be satisfied with the amount of the bonus 
they would receive (p< .001). A similar 
analysis for low input subjects indicated that 
the greater the deviation more than equity 
the less satisfied they felt their partners would 
be (p < .01). 

Furthermore for high input subjects there 
were significant relationships between how 
subjects believed they would be perceived by 
their partners and the deviation from equity 
in their reward distributions. The subjects 
were presented, as part of the postexperi- 
mental questionnaire, with a semantic differ- 
ential of 11 paired personality traits. They 
were asked to indicate, “how you believe 
your partner would evaluate you a day after 
the experiment?” Regression analysis con- 
trolling for future interaction indicated that 
for high input subjects, taking less than an 
equitable share of the bonus was associated 
with perceptions of being seen by one’s part- 
ner as more sociable (p < .01), more sincere 
(p < .05), kinder (p < .02), warmer (p < 
.002), more pleasant (p < .001), and less 
selfish (p < .001). Among high input sub- 
jects, deviation from equity in the allocation 
was not found to be significantly related to 
perception of being seen as self-assured, in- 
telligent, sincere, motivated, or weak, Among 
low input subjects, similar regression analy- 
sis found that the only significant relationship 
was between taking more than one’s equitable 
share and perceptions of being seen as less 
sincere (p < .05). 

The fact that high input subjects who took 
less than their equitable share of the money 
were aware that such allocations would be 
more satisfying to their partners and recog- 
nized that the allocations would affect their 
partner’s evaluation of themselves in a more 
favorable way is consistent with this self- 
presentation argument in accounting for the 
allocation behavior of these high input sub- 
jects. 

In this study, the amount of future inter- 
action subjects might expect to have with the 
other was very minimal, merely meeting and 
talking briefly. Yet this was enough to lead 


880 


many high input subjects to choose equality 
rather than equity, thereby decreasing their 
share of the money. Where the consequences 
of future interaction are more substantial 
than in this study, one might expect that the 
trend toward equality would be even stronger. 
The generalizability of this trend to other 
situations must be made cautiously, for in 
this study the amount of reward to be di- 
vided was relatively small. The difference in 
this study between an equity allocation and 
an equal allocation was only 51¢. For 51¢ 
subjects may be willing to be accommodative 
to another, however, as the level of rewards 
becomes large and the monetary loss associ- 
ated with choosing equality rather than 
equity also becomes large, this trend to equal- 
ity might be reduced. 
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The diffusion of responsibility hypothesis states that antisocial behavior occurs 
when individuals are motivated to engage in socially taboo behavior and find 
themselves in a group of similarly motivated individuals. A competing hy- 
pothesis, the social comparison hypothesis, requires that in addition to group 
membership, communication concerning intended or actual antisocial behavior 
must occur. To test these two hypotheses, subjects singly or in groups of 
three were exposed to a situation eliciting either strong or weak desires for 
revenge and then given an opportunity to take revenge. Half of the groups 
were allowed communication and half were not. Subjects in groups strongly 
desiring revenge took increasing revenge over trials, while isolated subjects 
strongly desiring revenge showed an initial increase in revenge behavior fol- 
lowed by a decrease. Communicating groups did not take more revenge than 
noncommunicating groups. The results provide support for the diffusion of 


responsibility hypothesis. 


The diffusion of responsibility hypothesis 
states that antisocial behavior will occur 
whenever individuals are motivated to engage 
in socially taboo behavior and find themselves 
in a group of similarly motivated individuals. 
The mechanism by which this antisocial be- 
havior is produced is the spreading or “dif- 
fusion” of feelings of personal responsibility 
for the consequences of the antisocial be- 
havior throughout the group. This hypothesis 
has been used, ex post facto, to account for 
crowd behavior (Le Bon, 1968), the “risky 
shift” (Wallach, Kogan, & Bem, 1962), and 
bystander apathy (Darley & Latané, 1968). 
Attempts to experimentally test the hypothe- 
sis have only occurred within the context of 
the risky shift (Dion, Miller, & Magnan, 
cited in Dion, Baron, & Miller, 1970; Kogan 
& Wallach, 1967; Lamm, 1967; Marquis, 
1962; Pruitt & Teger, 1969; Teger & Pruitt, 
1967; Wallach, Kogan, & Bem, 1964; Wal- 
lach, Kogan, & Burt, 1967) and have been of 
questionable validity in assuming that risky 
behavior is antisocial and that people “na- 
turally” desire to engage in such behavior. 


The authors are grateful to Harry Lando for his 
assistance and suggestions in preparing this manu- 
script. 

Requests for reprints should be sent to Eugene W. 
Mathes who is now at the Department of Psychol- 
ogy, Western Illinois University, Macomb, Illinois 
61455. 


Jellison and Riskind (1970) present data sug- 
gesting that risky behavior is more socially 
desirable than cautious behavior, and the 
assumption that people naturally desire to 
engage in risky behavior has not been tested. 
Furthermore, measures of the variable most 
central to the hypothesis, feelings of respon- 
sibility, have not been attempted. The present 
experiment was designed to provide a more 
adequate test of the hypothesis by investi- 
gating a behavior that is clearly antisocial 
(revenge), manipulating the strength of the 
desire to engage in such behavior, and mea- 
suring feelings of responsibility as well as 
amounts of antisocial behavior. 

Subjects either singly or in groups of three 
were exposed to a situation eliciting either 
strong or weak desires for revenge and then 
were given an opportunity to actually take 
revenge. Two dependent measures were ob- 
tained, amount of actual revenge and amount 
of responsibility felt for the revenge. On the 
basis of the diffusion of responsibility hy- 
pothesis it was predicted that groups of sub- 
jects with strong desires for revenge would 
take more actual revenge than isolated sub- 
jects having similar desires, It was further 
expected that groups of subjects having 
strong desires for revenge would feel less re- 
sponsible for the revenge taken. No differ- 
ences in actual revenge or felt responsibility 
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were expected for groups or isolated indi- 
viduals experiencing weak desires for revenge. 
An alternative explanation of extreme 
group behavior is Brown’s (1965) cultural 
value and social comparison hypothesis. Ac- 
cording to Brown, in some situations extreme 
behavior is valued, and in such situations in- 
dividuals in groups compete with each other 
to show the most extreme behavior and thus 
exemplify the value. The net effect is more 
extreme behavior on the part of groups than 
isolated individuals. This hypothesis views 
group communication, at least in the form of 
knowing another’s position, as essential for 
extreme group behavior while the diffusion of 
responsibility hypothesis does not. In order 
to rule out this alternative explanation, half 
of the three-person groups were allowed to 
communicate and half were not allowed such 
an opportunity. If no difference in amount of 
actual revenge were found between communi- 
cating and noncommunicating groups, the 
diffusion of responsibility hypothesis, and not 
the cultural value and social comparison hy- 
pothesis, would be supported. 


METHOD 
Design and Subjects 


Group membership (individuals alone, groups with 
communication, groups without communication) was 
crossed with desire for revenge (weak, strong) to 
create a 3X2 factorial design. Ninety volunteers, 
38 men and 52 women, from the introductory psy- 
chology course at Iowa State University were ran- 
domly assigned to the six treatment conditions. Nine 
subjects were in each cell except for the two group 
communication conditions to which nine groups of 
three subjects each were assigned. All groups were 
homogeneous with respect to sex. 


Overview 


The study was presented to subjects as consisting 
of two experiments, one studying the effects of 
punishment on creative learning and the other study- 
ing the effects of punishment on rote memorization. 
For the “first experiment,” subjects, either singly 
or in groups of three, wrote essays which a con- 
federate read and either complimented and charged 
small fines to elicit a weak desire for revenge, or 
criticized and charged large fines to elicit a strong 
desire for revenge. The “second experiment” gave 
subjects an opportunity to actually obtain revenge. 
Roles were exchanged and the subjects taught non- 
sense syllables to the confederate. Revenge consisted 
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of the size of the fines exacted from the er for 
making errors. 
Procedure and Manipulations 

Individual and group membership mani ion, 
Subjects were contacted by phone and arra ents 
made to come to the experiment in group two 
or four strangers. In all conditions one subje- was 
subsequently recruited as a confederate, thus ing 
single subjects or groups of three subjects. 

Cover story. Upon arrival at the laborato 1b- 


jects in the individual conditions listened to 
which presented instructions and the cover s 
the experiment. Subjects were told that they 
going to participate in two experiments, one 
to do with the effects of punishment on c ve 
learning and the other with the effects of punish: ont 
on rote memorization, They were told that each a 
ject would receive $1.00 at the beginning of the 
periment and that punishment would consist >f 
monetary fines. Creative learning would invo e 
learning how to write creative essays, and r 
memorization would consist of learning a list of fi 
pairs of nonsense syllables. Subjects were told th: 
lots would be drawn to determine who would b 
teacher and learner for the first experiment and that 
the roles would be reversed for the second experi- 
ment. In the first experiment the learner would be 
given 5 minutes to write an essay on “perfect 
beauty.” The teacher would then make helpful com 
ments on it and exact a fine, The learner would 
then rewrite the essay. For the second experiment 
the learner would be given 1 min to learn a list of 
five pairs of nonsense syllables followed by four 
testings. For each error the teacher would fine the 
learner from 0 to 10 cents. Subjects were told that 
the testing would be carried out by the experimenter 
in a separate room and that the learner’s answer 
would be signaled to the teacher by means of a pair 
of lights labeled correct and incorrect. They were 
told that the amount of the fine would be com- 
municated by means of a dial in the teacher's room 
which was electrically connected to an appropriately 
labeled meter in the learner’s room. 

The tape for the group conditions was similar 
except that subjects were told that the first experi- 
ment would involve individual teaching with one 
subject teaching the other three subjects how to 
write creative essays, and the second experiment 
would involve team teaching with three subjects 
teaching one subject nonsense syllable pairs. After 
the tape had been played each subject was given 
$1.00, and slips were drawn to determine who would 
be the teacher. 

Revenge-eliciting manipulation. In the indiv iual 
conditions, while the learner wrote his essay, the 
teacher was taken into a separate room. There he 
was told that the experiment involved deception and 
that he would not read the learner’s essay or make 
a punishing fine. Rather, the experimenter would 
attach an arbitrary comment and fine to the essay. 
In the strong-desire-for-revenge conditions, the ex- 


of 
are 


ng 


| 
| 
| 
i 


DIFFUSION OF RESPONSIBILITY AND EXTREME BEHAVIOR 


perimenter returned to the learner’s room, picked up 


>the essay, and then returned to the teacher’s room 


where he attached a comment and fine to the essay 
such as: “I have doubts about this essay. It doesn’t 
make much sense. Maybe you could improve it by 
giving it more organization. I have no choice but 
to fine you 60 cents.” The weak-desire-for-revenge 
manipulation involved attaching a complimentary 
comment to the essay and exacting only a token 
fine such as: “I think that this essay is rather good. 
Tt is well organized. The only suggestion for im- 
provement that I can make is that maybe you should 
elaborate the last point a bit more. I am going to 
fine you five cents.” After the essay had been re- 
turned and the fine collected, the learner was asked 
to write a second essay. 

For group conditions, the strong and weak re- 
venge manipulations were similar to those in the 
individual conditions except that in the group con- 
ditions each member received a different insulting 
comment and fine or complimentary comment and 
token fine. For a given group, all members were 
either insulted or praised, and fines were collected 
publicly. 

Communication manipulation. In the individual 
conditions, after the subject had completed his essay, 
he was informed that the second experiment would 
be carried out. After the instructions concerning 
fining were repeated, the experimenter left the sub- 
ject and went into the learner’s room. The subject 
was then fed a bogus series of correct and incorrect 
responses, A randomly selected 10 of the 20 responses 
were called incorrect. The amount fined by the sub- 
ject for each incorrect response was recorded. In 
the group-membership-with-communication condi- 
tions, groups were asked to discuss the learner’s pro- 
gress and come to a joint decision concerning how 
much he should be fined. One of the group members 
then signaled this information to the learner. In 
the group-membership-without-communication con- 
ditions, partitions were placed between group mem- 
bers, and each member was given a dial. They were 
informed that each group member should decide 
individually how much the learner should be fined. 
The amount that the learner actually would be fined 
would be equal to the average of the three separate 
fines. 

Dependent measures. Revenge taken was measured 
by the amount of money the subjects fined the 
learner on each error trial, Felt responsibility was 
assessed by means of a questionnaire item given to 


“the subjects after the experiment was completed: 


“How responsible do you feel for the punishments 
you chose as a teacher?” Responses were made by 
checking an 8-point scale anchored at one end with 
“very responsible” and at the other with “not very 
responsible.” To check the effectiveness of the re- 
venge-eliciting manipulation subjects responded to 
two items: “To what extent did you see the ad- 
ministration of punishment as a chance to get re- 
venge?” and “How fair did you think the comments 
were that were made on your essay by your 
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teacher?” Responses were made on 8-point scales 
ranging from “to a small extent” to “to a large 
extent” and from “not very fair” to “very fair,” 
respectively. 


RESULTS 


The effectiveness of the revenge-eliciting 
manipulation was confirmed by responses to 
the revenge and fairness questions, Subjects 
who had previously been heavily fined saw 
their own fining activity as a chance to obtain 
revenge to a greater extent than did subjects 
who had previously been lightly fined: ¥ = 
4.15, 1.78, respectively; F(1, 48) = 23.54, 
p < .005. The main effect for group member- 
ship and the interaction was not significant. 
Subjects who had been heavily fined saw the 
fines exacted from them as less fair than did 
subjects who had been lightly fined: X = 
3.96, 6.78, respectively; F(1, 48) = 58.05, 
p < .005. The main effect for group member- 
ship was also significant: X = 6.06, 5.22, and 
4.83 for individual subjects, groups with com- 
munication, and groups without communica- 
tion, respectively; F(2, 48) = 3.81, p < .05. 
Subjects fined alone felt that the fines were 
fairer than did subjects fined in groups. 

The results for amount of money fined 
were analyzed by a 3 X 2 X 10 analysis of 
variance, with fine trials as the last factor; 
the summary is presented in Table 1. The 
main effect for desire for revenge provides 
further confirmation of the success of the 
revenge manipulation, with subjects in the 
weak revenge conditions fining less (2.96 
cents per trial) than subjects in the strong 


TABLE 1 


ANALYSIS OF VARIANCE FOR Fines EXACTED 
(REVENGE TAKEN) 


Source af MS F 

Group membership (A) 2 14.79 | 1.03 
Desire for revenge (B) 1 | 1,269.60 | 88.29*** 

Error 48 14.38 — 
Fine trials (C) 9 TEKER EES E an 
AXB 2 1.16 .08 
AXC 18 2.99 -90 
BXC 9 7.25 | 2,18** 
AXBXC 18 SAO RLS 

Error 432 3.32 — 
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revenge conditions (6.03 cents per trial). The 
main effect for fine trials shows that fines in- 
creased over trials. 

The marginally significant three-way in- 
teraction, plotted in Figure 1, provides some 
support for the diffusion of responsibility hy- 
pothesis, Individuals strongly desiring revenge 
took increasingly greater fines across trials 
through the seventh trial, after which fining 
behavior decreased rapidly until the last trial, 
where fines were less than on Trial 1. In con- 
trast, groups strongly desiring revenge con- 
tinued to exact increasingly heavy fines across 
all trials. The means for the strong-desire- 
for-revenge conditions across the last four 
trials are presented in Table 2. It can be seen 
that on Trial 10 groups were exacting sig- 
nificantly greater fines than individuals, and 
that on Trials 8 and 9 groups with communi- 
cation, but not groups without communica- 
tion, exacted greater fines than individuals. 

On the question assessing felt responsibility 
for fines exacted, a marginally significant 
effect for group membership was found: X = 

6.56, 5.56, and 5.89 for individual subjects, 

groups with communication, and groups with- 

out communication, respectively; F(2, 48) = 

2.57, p < .10. Groups with communication 

felt significantly less responsible for the re- 
venge they took than did isolated individuals 

(p < .05), with groups without communica- 
tion falling in between and not significantly 
different from the other two means. The main 
effect for revenge and the interaction was not 
significant. 


DISCUSSION 


Although both the fining and the responsi- 
bility results reach only marginal levels of 


TABLE 2 


Mean Fines Exacren (Cents) on Last FOUR TRIALS 
FOR STRONG-DESIRE-FOR-REVENGE CONDITIONS 


Condition Trial7 | Trial8 | Trial9 | Trial 10 
Individuals 7.11% | 6.22»e | 6.00 | 4.67° 
Groups with 

communication 6.89% | 8.228 | 8.44" | 8.22a 
Groups without 
communication 6.44> | 7.119» | 7.338 | 8.008 


Note. Means having different letter superscripts are signifi- 
cantly different from each other at the .05 level of significance. 
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significance, taken together they provide sup- 
port for the diffusion of responsibi 


pothesis. Subjects who desired to eng:ve in 
the antisocial behavior of revenge anc who 
found themselves in a group of similarly moti- 
vated individuals diminished their feei) -s of 
personal responsibility for their behavin and 
engaged in greater revenge. On the c'her 
hand, subjects desiring to engage in revonge 
but not finding themselves in a group of 


similarly motivated individuals maint. ied 
greater feelings of responsibility for thei 
havior and consequently engaged in less ve- 
venge. The three-way interaction suggests 
that diffusion of responsibility does not hve 
an instantaneous effect on revenge behav 

As Figure 1 demonstrates, the revenge be- 
havior of individuals and groups did not differ 
until the last few trials. One way of explain- 
ing these results might be in terms of guilt. 
Feelings of responsibility produced guilt in 
individuals for the revenge which they took 


during the early trials, and as a result the ~ 


revenge they took dropped drastically on th: 
later trials. Since individuals in groups felt 
less responsible for their revenge and thus felt 
less guilt, they continued to take more and 
more revenge until the trials were completed. 

Since the revenge behavior of communicat- 
ing and noncommunicating groups did not 
differ significantly, diffusion of responsibility 
appears to be a sufficient cause of group anti- 
social behavior; however, it may not be the 
only cause of such behavior. The fact that 
communicating groups showed slightly but 
consistently greater fining than groups with- 
out communication and the fact that com- 
municating groups did fine significantly more 
than individuals on Trials 8, 9, and 10, while 
noncommunicating groups differed from indi- 
viduals only on Trial 10, suggest that com- 


munication might well play a facilitatory role» 


in releasing antisocial behavior. Using 
Brown’s (1965) social comparison hypothesis 
to explain these results, it is likely that sub- 
jects in communicating groups discovered 
that the other group members also favored 
revenge and a prorevenge norm was thus es- 
tablished. The legitimacy produced by this 
norm further encouraged taking revenge (Le 
Bon, 1968: Nordhgy, Note 1). Group anti- 


hy- 7 
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o—o Strong desire for revenge, 
individuals 

&-A Strong desire for revenge, 
groups w. comm. 

o---0 Strong desire for revenge, 
groups w/o comm. 

@—e Weak desire for revenge, 
individuals 

4a Weak desire for revenge, 
groups w. comm. 

m@.-.m Weak desire for revenge, 

groups w/o comm. 


CENTS FINED 


FINE TRIALS 


Ficure 1. Fines exacted as a function of desire for revenge, group membership, and 
trials. Abbreviations are as follows: w. comm. = with communication; w/o comm, = 


without communication. 
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social behavior can probably stem from sev- 
eral causes including diffusion of responsi- 
bility and social comparison. 

At this point only a small number of 
studies have been carried out on group anti- 
social behavior (Festinger, Pepitone, & New- 
comb, 1952; Singer, Brush, & Lubin, 1965; 
Zimbardo, 1969), and these have focused on 
the deindividuating effects of group member- 
ship. The anonymity offered by group mem- 
bership is seen as providing protection against 
prosecution by external enforcers of social 
norms. The present study compliments this 
work in that it deals with the mechanism by 
which individuals in groups protect them- 
selves against the inner enforcer of social 
norms, feelings of personal responsibility for 
behavior. Future research should look at how 
diffusion of responsibility, social comparison, 
and deindividuation all contribute to group 
antisocial behavior. 


REFERENCE NOTE 
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z Chance Outcomes and the Just World: A Comparison 
of Observers and Recipients 


Robert Apsler and Henry Friedman 
Boston University 


An experiment was conducted in order to compare observers’ and recipients’ 
reactions to the recipients’ receipt of chance outcomes. It was predicted that 
(a) observers rate the performances, but not the personal characteristics, of 
rewarded recipients more positively than those of nonrewarded recipients, (b) 
recipients’ ratings of their performances are relatively unaffected by their out- 
comes, (c) observers tend to attribute the outcomes to the recipients’ behavior, 
while recipients tend to attribute their outcomes to chance, and (d) non- 
rewarded recipients are more likely than rewarded recipients to attribute their 
outcomes to chance. Consistent with a belief in a just world, observers rated 
rewarded recipients’ performances as superior to those of nonrewarded recipi- 
ents. They also rated some personal characteristics of rewarded recipients more 
positively than those of nonrewarded recipients. Contrary to the predictions 
derived from Jones and Nisbett, recipients’ ratings of their own performances 
were influenced by the outcomes in the same manner as were observers’ ratings, 
and none of the expected differences in attributions of the outcomes were 
found. An internal analysis showed that subjects who attributed the outcome 
to performance (approximately 20% of the subjects) rated the performances 
and personal characteristics of rewarded recipients much more favorably than 
those of nonrewarded recipients. In contrast, the ratings of subjects who 
attributed the outcome to chance or to some other source did not differ 
between rewarded and nonrewarded recipients. 


Lerner (Lerner, 1970, 1971; Lerner & Mat- 
thews, 1967; Lerner & Simmons, 1966) has 
proposed that individuals have a need to 
believe that they live in a just world; they 
believe in a world where people get what they 
deserve and where people deserve what they 
get. Individuals believe, for instance, that 
those among us who work hard or who perform 
good acts obtain rewards for their actions, 
while the sinners and the laggards receive 
punishments instead. Similarly, individuals 
want to believe that positive outcomes, 
whether money, success, or happiness, are 
obtained only by good people and, conversely, 
that negative outcomes only happen to bad 
<; persons. This belief in a just world is invoked 
by Lerner as a possible explanation for our 
society’s willingness to tolerate the suffering 


Portions of the results were presented at the 
1973 Eastern Psychological Association convention. 
The authors wish to thank Ronald Abeles, Frederick 
Agatstein, and Zick Rubin for their comments on 
an earlier draft. 

Requests for reprints should be sent to Robert 
Apsler, Department of Psychology, Boston Univer- 
sity, Boston, Massachusetts 02215. 


of many disadvantaged individuals. Perhaps, 
for example, we allow children to starve and 
permit patients at various state institutions 
to suffer from grossly inadequate budgets be- 
cause we feel that their suffering is deserved. 
The existence of undeserved suffering is in- 
compatible with a belief in a just world, 
argues Lerner, and so individuals persuade 
themselves that victims deserve their fate. 

In an early demonstration of the belief in a 
just world, Lerner (1965) showed that even 
explicitly random outcomes can influence ob- 
servers’ perceptions of how deserving recipi- 
ents’ behaviors must be. Subjects observed 
two people working at a task for which only 
one person could be paid. The observers, but 
not the workers, learned which worker was 
randomly selected for the payment. The ex- 
perimenter explained that the fairest way of 
selecting one of the workers for payment was 
by chance and that the workers had drawn 
numbers from a hat. Workers who drew even 
numbers would receive payment at the end of 
the study. Next, the observers listened to 
the workers solve anagrams, and then they 
rated the performance and personal character- 
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istics of both workers. Consistent with a 
belief in a just world, observers rated the 
performance of the paid worker as superior to 
that of the unpaid worker. Apparently, sub- 
jects imposed justice on the situation by per- 
suading themselves that the paid worker de- 
served to be rewarded —he must have con- 
tributed more than the unpaid worker. An 
additional finding was that the payment had 
no effect on subjects’ ratings of the workers’ 
personal characteristics. This latter finding is 
consistent with Lerner’s (1970) assumption 
that individuals prefer to attribute the cause 
of a person’s outcomes to his behavior. Only 
if no link is perceived between outcomes and 
behavior do individuals then see the recipi- 
ent’s personal characteristics as justification 
for his outcomes. 

Lerner’s research shows that observers’ 
reactions to an individual’s behavior can be 
influenced by that individual’s outcomes even 
when those outcomes occur by chance. But 
what about the individual who receives such 
a chance outcome? Will his perceptions of his 
own behavior be influenced by his outcomes 
in the same manner as the reactions of ob- 
servers? Not likely, is the answer suggested 
by Jones and Nisbett’s (1971) analysis of the 
perception of the causes of behavior. Jones 
and Nisbett contend that observers watching 
an actor attend primarily to the actor’s be- 
havior, while the actor’s attention focuses on 
his environment; the actor’s behavior is the 
active figure against the more stable ground 
(situation) for the observer, while the actor 
more effectively records variations in the 
environment than in his own behavior due to 
the location of his receptors. In a situation 
where the actor is the recipient of a chance 
outcome, therefore, it is likely that the dif- 
ference between the observer’s and the recipi- 
ent’s focuses of attention will result in differ- 
ing interpretations for the cause of the out- 
come. An observer attending closely to a re- 
cipient’s behavior, for instance, may miss sit- 
uational cues involving the arbitrary nature 
of a chance outcome and, instead, “see” an 
explanation for the outcome within the re- 
cipient. The Lerner (1965) study described 
earlier is certainly consistent with this no- 
tion. Apparently, the information concerning 
the arbitrariness of the payment was not 
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salient for observers, since they rat 
contribution of the paid worker m« 
tively than that of the unpaid work 
servers must have attributed the paym 
lack of payment to the workers’ behav! 

A recipient, on the other hand, whose 
tion focuses on situational cues shou! 
more likely than an observer to note the 
trary nature of a chance outcome an 
turn, to attribute his outcome to situati: 
factors, rather than to himself. A recent 
by Apsler (1972) indicates that recipien 
chance outcomes do indeed attribute 
outcomes to situational factors. Subjects 
received low numbers (negative outcomes) 
the 1970 draft lottery disagreed more t: 
subjects receiving high numbers with | 
principle of conscription. Presumably, su’ 
jects felt that conscription, a situational fac- 
tor, was responsible for their fate in the loi 
tery, and hence, their reaction to conscriptio: 
varied with the outcomes they received, If 
subjects had attributed their outcomes in t 
lottery entirely to themselves, then the: 
should have been no relationship between 
outcomes and reactions to conscription. Rubin 
and Pelpau (1973), however, found some 
evidence that the same subjects used by 
Apsler may also have attributed their out- 
comes to themselves. Subjects whose numbers 
were above the group median (all subjects 
heard their numbers drawn while sitting in 
groups of six) tended to experience an in- 
crease in self-esteem, while subjects with 
numbers below the median tended to experi- 
ence a decrease in self-esteem. These changes 
in self-esteem could possibly reflect subjects’ 
feelings of being at least in part responsible 
for their lottery numbers. Thus, studies of 
the draft lottery suggest that recipients of 
chance outcomes attribute their outcomes both 
to situational factors and to themselves, N 

Jones and Nisbett’s (1971) analysis of the 
perception of the causes of behavior suggests 
a second reason why observers and recipients 
may differ in their explanations for the causes 
of chance outcomes. Different information is 
available to the observer and recipient, and 
as a result, they draw different conclusions 
concerning the outcome. The observer, for 
instance, typically has little knowledge of the 
recipient’s intentions and history; he has lit- 


$ 
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tle information, therefore, that can contradict 


‘ his tendency to attribute the outcome to the 


recipient. The observers in Lerner’s study, for 
example, could easily have concluded that the 
unpaid worker did not try hard enough. But 
the recipient, on the other hand, does have 
information about his own intentions and 
history. His attributions, therefore, will likely 
depend upon the match between his outcomes, 
on the one hand, and his intentions and his- 
tory on the other. A recipient who tried hard 
and has a history of past accomplishments, 
for instance, might easily convince himself 
that he deserved an arbitrary positive out- 
come, After all, in the past he has repeatedly 
earned rewards when he made similar efforts. 
But an identical recipient who obtains a 
chance negative outcome would surely be sur- 
prised by the discrepancy between the out- 
come and his positive efforts and past success, 
He would conclude that he did not deserve 


_ the negative outcome and that some other 


agent must have been responsible. Thus, if 
recipients ever attribute chance outcomes to 
themselves, they would likely do so only 
when the outcomes are congruent with their 
intentions and histories. Results consistent 
with this line of reasoning were obtained by 
Johnson, Feigenbaum, and Weiby (1964), 
though they used a setting where outcomes 
are typically not attributed to chance. They 
manipulated the outcome of subjects’ teach- 
ing efforts and found that subjects who “suc- 
ceeded” in teaching their student (congruent 
outcome) attributed the success to themselves. 
Tn contrast, subjects who “failed” in teaching 
their student (incongruent outcome) attrib- 
uted the failure to the student. Thus, recipi- 
ents saw themselves as deserving of congruent 
outcomes, but they felt that incongruent out- 
comes were caused by something other than 


L themselves. 


In order to test the ideas discussed above, 
an experiment was conducted which resem- 
bled Lerner’s (1965) study in that observers 
watched while a worker was arbitrarily as- 
signed either a reward or no reward after 
performing a task. The first goal was to repli- 
cate Lerner’s finding that (a) the perform- 
ances of rewarded workers are rated by ob- 
servers as superior to those of nonrewarded 
workers, and (b) observers’ ratings of the 
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personal characteristics of rewarded and non- 
rewarded recipients do not differ. The second 
and major goal of the present study was to 
compare observers’ and recipients’ reactions 
to the receipt of chance outcomes. To ac- 
complish this comparison, the workers in the 
present study, as well as the observers, were 
subjects. It was predicted that (a) recipients’ 
ratings of their own performances are rela- 
tively unaffected by chance outcomes, (b) ob- 
servers tend to attribute outcomes to the 
recipients’ performances, while recipients tend 
to attribute their outcomes to chance, and 
(c) the tendency of recipients to attribute 
their outcomes to chance is greater for non- 
rewarded recipients than for rewarded recipi- 
ents. 


METHOD 


Female subjects (W = 101) participated in the 
study in order to fulfill a requirement or to earn 
extra credit in their introductory psychology course 
at Boston University. Separate, but nearly identical, 
sign-up sheets were used for subjects who became 
observers and for subjects who became recipients of 
an outcome so that observers would not feel they 
were in the same study as recipients., The sign-up 
sheets differed only in the experiment number, in 
the room assignment, and in the restriction that 
subjects must not have participated in the other 
experiment. One observer and one recipient were 
scheduled for each session, but at the sessions where 
only one subject arrived for the experiment, the 
single subject was always run as a recipient. 

The observer subject was ushered into a small 
viewing room and seated in front of a one-way 
mirror, The experimenter told the observer that she 
would watch another subject perform some tasks and 
said that the details would be explained on tape- 
recorded instructions that she would hear through 
the one-way mirror, The observer was further 
informed that the other subject would be assigned 
a reward or no reward between trials, and finally, 
she was given a questionnaire to turn over and 
fill out when she saw the other subject filling out a 
form. No explanation for the purpose of her role 
or of the study was given. 

The recipient subject was led to a room fitted 
with two one-way mirrors and was seated at a large 
table. The experimenter told her that all the in- 
structions had been tape-recorded and would be 
played through twice. The instructions explained 
that the recipient would be assigned to either a 
reward or no-reward condition, but they were in- 
tentionally ambiguous in discussing how the assign- 
ment would be made. The procedure of determining 
whether a recipient would be assigned to the re- 
ward or no-reward condition was made ambiguous 
because (a) it was feared that an assignment which 
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was too clearly random might diminish the just- 
world effect, and (b) the ambiguous procedure al- 
lowed subjects more freedom in interpreting the 
cause of the assignment, thus facilitating a test of 
the attribution predictions derived from Jones and 
Nisbett (1971). The taped message stated: 


This experiment is designed to test performance 
under neutral, positive, and negative conditions. 
The experiment consists of building with a mag- 
netic play kit. Upon the completion of each 5- 
minute period of construction, a photograph will 
be taken to be judged later by a series of people. 
The experiment is divided into two parts. In the 
first part, the experiment is done without positive 
or negative conditions applied. In the second part, 
the experiment takes place after the assignment 
of reward or punishment. The reward consists of 
receiving an extra hour of credit for the experi- 
ment, while the punishment consists of receiving 
only one hour of credit as posted on the sign-up 
sheets. The assignment of reward or punishment 
will be disclosed at the end of the first trial. 


At the conclusion of the taped instructions, the 
experimenter handed the recipient a Magnastics toy 
consisting of three magnetized metal plates on a 
cardboard surface and numerous small, metal geo- 
metric forms. The experimenter explained that as 
each metal piece is placed on one of the metal 
plates or added to a chain originating on one of 
the plates, it too becomes magnetized, thereby 
enabling the player to erect an endless assortment 
of constructions. The experimenter told the recipient 
to begin playing with the toy and then ignored her 
for the next 5 min. At the end of the 5 min, the 
recipient was asked to stop building, and the experi- 
menter took a photograph of the constructions. Dur- 
ing the 5-min building period, the experimenter 
randomly assigned the recipient to either the reward 
or the no-reward condition, and after taking the 
photograph said, “You have been assigned a reward 
of an extra hour credit (punishment of only 1 hour 
credit) for the next trial. Before you start will you 
please fill out this form.” 

The observer and recipient simultaneously filled 
out their one-page questionnaires. The first question 
on both forms served as the main dependent mea- 
sure. It measured reactions to the recipient’s per- 
formance during the first trial and read, “How good 
a job did the subject (you) do on the task just com- 
pleted?” Subjects rated the job on a good-bad scale 
ranging from 1 to 7. Observers’ ratings of recipients’ 
personal characteristics were obtained by asking 
them to “Circle the number which best describes 

your subject” on eight bi-polar adjective scales. 
Instructions to recipients, on the other hand, were 
intended to produce a check of the immediate im- 
pact of the outcome manipulation on recipients. 
They were instructed to describe how they felt on 
the eight adjective scales. Next, both the observer 
and recipient were asked to explain the process by 
which the outcome was assigned, Specifically, the 
question asked, “How was the assignment made? 
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(Circle one): Random, Due to Performar ther.” 


Subjects who circled “Other” were asked to porate, 
Finally, both subjects were brought toget quer- 
ried for suspiciousness, and extensively efed, 
RESULTS AND DISCUSSION 
Replication of Lerner (1965) 
Observers’ performance ratings in the ves- 
ent study were influenced by recipient 1t- 


comes in the same manner as were obse rs’ 
ratings in Lerner’s (1965) study. Th p 
row of means in Table 1 shows that o! 
ers rated rewarded recipients’ perform 
on the task significantly higher than 
rewarded recipients’ performances (= 27, 
P < .01).* Observers’ personal characteris. cs 
ratings, however, only partially replicai-d 
Lerner’s finding that the outcome manipu! i- 
tion had no effect on observers’ impressio: s 
of recipients’ personal characteristics. Scor: 

on five of the eight bi-polar adjective scale: 
used by observers to judge recipients’ personal 


characteristics intercorrelated highly and were 


combined into a single measure. The fiv 
scales were pleasant-unpleasant, worthless 
valuable, happy-unhappy, irritable-good-na 
tured, and fair—unfair. The intercorrelations 
ranged from .4 to .7 and all were significant 
at the .05 level or higher. Consistent with 
Lerner’s results, means on the five-scale mea- 
sure of personal characteristics did not diffe: 


significantly for observers of rewarded and “ 


nonrewarded recipients. Similarly, means fo: 
one of the other uncorrelated scales, cold- 
warm, did not differ significantly between 
observers of rewarded and nonrewarded re- 
cipients. Scores on the remaining two bi-polar 
adjective scales, on the other hand, were sig- 
nificantly influenced by the outcome manipu- 
lation, Rewarded recipients were seen as 
more “good” than nonrewarded recipients 
(t= 2.1, < .05). (The means on a 1-7 
bad-good scale were 5.6 and 4.8, respec- 
tively.) Surprisingly, rewarded recipients were 
also rated as more “Jucky” than nonrewarded 
recipients (ź = 3.6, p < 001). (The means 
on the 1-7 unlucky—lucky scale were 4.9 and 
3.0, respectively.) Thus, the outcome manip- 
ulation in the present study influenced ob- 
Servers’ ratings of recipients’ behaviors (the 


* All significance tests are two-tailed, 
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TABLE 1 
RATINGS OF THE RECIPIENT'S PERFORMANCE 


Outcome 
Role Reward No reward t 
M n M n 
Observer 5.8 18 44 19 Pi i 
Recipient 4.9 34 4.1 OS) Pot fe 


performance measure) and also of some per- 
sonal characteristics (goodness and luck). 
Basically, the present study replicated Ler- 
ner’s (1965) findings. Observers judged re- 
warded recipients to have performed better 
than nonrewarded recipients, even though the 
outcomes were assigned arbitrarily. At the 
same time, observers tended to rate the 
personal characteristics of rewarded and non- 


“© rewarded recipients similarly. 


Comparison of Observers and Recipients 


Recipients’ descriptions of their feelings 
indicated the degree to which the outcome 
manipulation had an impact on them. Means 
on all eight bi-polar adjective scales were 
more positive for rewarded recipients than for 


„ nonrewarded recipients, and half of the dif- 


ferences were highly significant. Rewarded 
recipients described their feelings as more 
“lucky” (p < .001), more “pleasant” (p < 
004), more “good” (p< .005), more 
“happy” (p < .02), more “good-natured” (p 
< .07), more “fair” (p< .2), more “valua- 
ble” (p< .3), and more “warm” (p < .5). 
The receipt of, or the failure to receive, the 
1 hour of extra credit, therefore, had a sub- 
stantial effect on some of the recipients’ feel- 


"i ings. 


Unexpectedly, the outcome manipulation 
also had a strong effect on recipients’ ratings 
of their performances. The bottom row of 
means in Table 1 shows that rewarded recipi- 
ents rated their own performances signifi- 
cantly higher than did nonrewarded recipients 
(t= 2.1, p < .05). The predicted interaction 
between role (observer vs. recipient) and 
outcome (reward vs. no reward) was not 
found. A 2X2 analysis of variance per- 
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ormed on the performance scores produced 
a significant main effect for the outcome 
variable (F = 11.1, p< .01), but no inter- 
action was found between role and outcome 
(F <1). Both observers and recipients, 
herefore, were influenced by the outcome 
manipulation. The arbitrary assignment of a 
reward resulted in observers and recipients 
alike rating recipients’ performances more 
ositively than when the assignment was no 
reward. 


Attribution Predictions 


Consistent with the absence of an interac- 
tion on the performance measure, but con- 
trary to predictions, there were no differences 
between observers’ and recipients’ explana- 
tions for the assignment of outcomes. It had 
been predicted that observers would attribute 
the assignment to recipients’ performances, 
while recipients would attribute it to chance. 
Instead, approximately 20% of both the ob- 
servers and the recipients (19% and 22%, 
respectively) responded that the assignment 
was based on recipients’ performances, while 
62% of the observers and 47% of the recipi- 
ents answered that the assignment was ran- 
dom. The remaining 19% of observers and 
31% of recipients circled the alternative 
“Other” and typically explained to the ex- 
perimenter that they did not know, thought 
that a reward could not be given, or they 
repeated the instructions they had received 
from the tape recording. A chi-square analy- 
sis performed on the data showed no signifi- 
cant effects. Nor did inclusion of the outcome 
variable produce a significant effect. Tt was 
not the case, as was predicted, that non- 
rewarded recipients were more likely than 
rewarded recipients to attribute their out- 
comes to chance. Thus the predictions derived 
from Jones and Nisbett’s (1971) analysis of 
the perception of the causes of behavior were 
totally unsupported. The test of the predic- 
tions seems to have been appropriate. Recipi- 
ents (actors) performed a behavior and sub- 
sequently received an outcome. Then, both 
the recipient and an observer explained the 
cause of the outcome. Ambiguous instructions 
gave both recipients and observers the op- 
portunity to see the cause as either in the 
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situation or in the recipient. And the variety 
of explanations offered by subjects for the 
cause of the assignment indicates that the 
instructions were successfully ambiguous. 
Nevertheless, there was no evidence of a 
trend for observers to attribute the outcome 
to recipients and for recipients to attribute 
the outcome to the situation. 


Alternative Interpretations 


Despite disconfirmation of the attribution 
predictions, subjects’ explanations for the 
assignment permit an internal analysis of the 
data which provides additional information 
about the mechanisms involved. Table 2 
shows the ratings of the recipients’ perform- 
ances separately for (a) subjects who felt 
that the assignment was based on recipients’ 
performances, and (b) subjects who thought 
that the assignment was random. The upper 
row of means in Table 2 shows that the ten- 
dency for subjects to rate performances of 
rewarded recipients more positively than 
those of nonrewarded recipients was espe- 
cially pronounced for subjects who perceived 
the assignment of outcomes to be based on 
recipients’ performances. For these observers 
who saw the assignment as based on perform- 
ance, the mean difference in performance rat- 
ings between reward and no-reward condi- 
tions of 4.5 (t= 7.6, $ < .001) is much 
larger than the mean difference of 1.4 for the 
full samples of observers. Similarly, for. this 
subset of recipients, the mean difference be- 
tween reward and no-reward conditions of 
2.0 (t = 2.7, p < .05) is considerably greater 
than the mean difference of .8 for the full 
samples of recipients. In contrast, the lower 
row of means in Table 2 shows that for sub- 


TABLE 2 


Ratines or Reciprent’s PERFORMANCE ACCORDING 
TO PERCEPTIONS or How THE 
ASSIGNMENT Was MADE 


Perception of Observers Recipients 
how the 
assignment 
No s No 
E RET | reward | Reward:| No, 
Performance 6.5 2.0 5.7 3.7 
(n=4) | =3) |a =8) |@ a6 
Random 5.6 4 6 eis 


. i 7 45 
( = 10) | ( = 13) | (z = 17) | (n = 13) 
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jects who saw the assignment as random, the — 
tendency to rate performances of rewarded ‘4 
recipients more positively than performances 
of nonrewarded recipients was substantially 
diminished. The mean differences between 
reward and no-reward conditions for sub- 
jects who said the assignment was random 
were not significant for either observers (¢ = 
1.4, p < .2) or recipients (¢ < 1). The paral- 
lel differences for the remaining subjects, 
those who gave the response “other,” were 
also not significant. Observers’ personal char- < 
acteristics ratings showed much the same 
pattern as the performance ratings when di- 
vided on the basis of observers’ perceptions 
of the assignment. The overall tendenc y, for 
instance, of observers to rate rewarded recipi- 
ents as more “lucky” and more “good” than 
nonrewarded recipients was much greater for 
“performance” observers and much less for 
“random” observers. On the other hand, the 
overall, negligible mean difference beween! 
reward and no-reward conditions for U: per- 
sonal characteristics measure consisting of 
five items remained unchanged by the in- 
ternal analysis. In sum, the approximate 20% 
of the subjects who believed that the outcome 
was based on performance rated perform- 
ances and personal characteristics muc! 


positively for rewarded recipients th Ji 
nonrewarded recipients. The ratings o` sib- 
jects who gave some other explanation for 
the outcome, however, did not differ for re- 


warded and nonrewarded recipients. 
There are several possible explanations for 
the results of the internal analysis. Perhaps, 
for example, only a few of the subjects in 
the study possessed a high need to believe in 
a just world. Perhaps in Lerner’s (1965) 
study, where 73% of the subjects gave re- 
sponses consistent with a belief in a just 
world, subjects simply had a greater nee! i 
believe in a just world. Or, the subjects who 
did not attribute the assignment to recipients’ 
performances could have expressed their 
need to believe in a just world in some man- 
ner other than through reevaluation of recipi- 
ents’ performances or personal characteristics. 
They may, for instance, have become angry 
or upset at the “injustice” of a random as- 
signment of outcomes, An internal analysis of 
recipients’ descriptions of their feelings, how- 
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ever, does not support this latter interpreta- 
tion. There were essentially no differences 
between the feelings responses of recipients 
who attributed the assignment to “perform- 
ance,” “random,” or “other.” 

There are still other explanations for the 
limited support of the just-world predictions. 
The outcome manipulation, for example, may 
simply have been too weak to alter the im- 
pressions many subjects probably formed dur- 
ing the 5-min period in which the recipient 
worked with the game. All subjects had am- 
ple opportunity to judge the recipients’ per- 
formances, and observers were able to form 
impressions of the recipient based on the 
approximately 10 min of observation time 
that preceded the outcome manipulation. The 
snment of an extra hour of experimental 
credit may just not have been a sufficiently 
powerful event to substantially influence 
many subjects’ premanipulation impressions. 
Another possible explanation for the limited 
support of the predictions is that there may 
be situations in which most individuals do not 
believe that persons deserve their outcomes. 
Individuals may not believe, for example, 
that persons deserve their outcomes in set- 
tings, such as that of the present study, 
where most people see the outcomes as ran- 
dom. But in Lerner’s (1965) study and in 
Rubin and Peplau’s (1973) research on the 
draft lottery, outcomes were explicitly ran- 
domized, and still, results consistent with a 
belief in a just world were obtained. The 
critical factor, therefore, does not seem to be 
whether or not the assignment is random. 

Finally, the internal analysis suggests sev- 
eral different interpretations for the specific 
role of the outcomes in the formation of sub- 
jects’ attributions. Lerner’s interpretation of 
the role of the outcomes is quite straight- 


, forward; the outcome triggered a need for 


justice which, in turn, distorted subjects’ per- 
ceptions. After learning of the outcome ob- 
tained by a recipient, subjects then justified 
the outcome by believing that the recipient 
deserved the outcome because of her per- 
formance or her personal characteristics. The 
most troublesome finding for Lerner’s inter- 
pretation is that for many of the subjects, 
neither performance nor personal character- 
istics ratings were influenced by the outcome. 
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Another possible role of the outcome was as a 
validator of subjects’ judgments. Many sub- 
jects probably formed judgments about the 
recipients’ performances before learning of the 
outcome, Assuming that subjects tended in 
general to associate the reward with good 
performances and no reward with bad per- 
formances, the assignment of the outcome 
must have been consistent with some sub- 
jects’ judgments of the performances and 
inconsistent with others’. Subjects for whom 
the outcome was consistent with their judg- 
ments may have reacted in a manner such as, 
“Ah ha, I was correct in my judgment of the 
performance, and I now see that the assign- 
ment was based on the recipient’s perform- 
ance.” This subject would then have re- 
sponded on the questionnaire with her judg- 
ment of the performance, or perhaps with an 
even more extreme rating because of the vali- 
dation, and she would have chosen “perform- 
ance” as the basis of the assignment. In con- 
trast, subjects for whom the outcome was in- 
consistent with their initial judgment of the 
performance may have been somewhat con- 
fused. They may have become more tentative 
(less extreme) in their judgments and may 
have concluded that the assignment was 
either due to chance or to some unknown 
process. The outcome, in other words, may 
have either supported or failed to support sub- 
jects’ hypotheses about the quality of the 
performance. And last, the role of the out- 
come may have been as an indicator of the 
experimenter’s judgment of the recipient’s 
performance. Assume, first of all, that those 
subjects who attributed the assignment to the 
recipient’s performance based their conclusion 
on their interpretation of the instructions; 
their attribution was not influenced by the 
assignment of outcomes. For these subjects, 
the outcome then indicated the experimenter’s 
evaluation of the recipient’s performance. A 
reward for instance, meant that the experi- 
menter had decided that the performance 
was a good one. Since the criteria for judging 
performances were never explained, these sub- 
jects may merely have imitated what they 
inferred were the experimenter’s ratings. 
Thus, the outcome may have served any of 
several different roles, and it may have served 
different roles for different subjects. 
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In conclusion, the support of the initial 
analyses for the belief in a just world must 
be qualified. The finding of the internal analy- 
sis, that it was primarily the 20% of the 
subjects attributing the outcome to perform- 
ance who rated rewarded recipients far more 
positively than nonrewarded recipients, raises 
questions concerning the types of individuals 
and situations necessary to demonstrate ef- 
fects consistent with a belief in a just world. 
The internal analysis also raises questions 
involving the mechanism through which the 
results were produced. 
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Moral Judgments One Year After Intentional 
or Consequence Modeling 


Jeffrey L. Sternlieb and James Youniss 
Center for Research in Thinking and Language 
Catholic University of America 


Children originally modeled toward 
in moral judgment were observed 12 


intentional and consequence orientations 
months later on the same kind of moral 


situations. Changes in orientation which had been obtained initially, 2 weeks 
after a modeling session, were no longer noticeable. After 1 year, all subjects 
tended toward the intentional orientation with the group originally lowest in 
intention now being like the transitional or mixed subjects had been on the 
initial pretest. Subjects who were successful on a conservation task were more 


likely to be consistently intentional 


There is some evidence that children who 
are trained to judge moral dilemmas accord- 
ing to actors’ intentions maintain this orien- 
tation for a longer time than children trained 
to judge according to consequences of actions 
(Cowan, Langer, Heavenrich, & Nathanson, 
1969). The phenomenon in question refers to 
the fact that immediately after training both 
intentional and consequence orientations 
showed significant modeling effects but after 
2 weeks the consequence orientation alone 
declined. This differential persistence over 
time suggested to Cowan et al. (1969) that 
prodevelopmental (intention) and contrade- 
velopmental (consequence) modeling were 


. initially alike quantitatively but qualitatively 


different. They argued that modeling of an 
intentional orientation was more effective be- 
cause it corresponded to the direction of a 
cognitive reorganization already going on 
within the child. This view is much like 
Turiel’s (e.g., 1966). 

Glassco, Milgram, and Youniss (1970) 
presented corroborative data concerning train- 
ing children toward intention. They observed 
subjects 6 months after they were trained 
(Crowley, 1968) and found differences be- 
tween intentionally trained and control sub- 
jects of the magnitude obtained on the origi- 
nal posttest. Since there was no group trained 
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(n=18) than not (n= 4). 


to judge according to consequence, it was not 
possible to evaluate relative persistence of 
orientations over 6 months. 

In the present study, children who had 
been modeled toward intentional or conse- 
quent orientations (Ahr, 1971) were reob- 
served 12 months later. The subjects had been 
grouped in the original study according to 
the consistency of their moral orientations. 
They had been given eight dilemmas con- 
trasting two actions—in one, an actor pur- 
posely performed some action which led to a 
small consequence that was bad or harmful 
and in the other an actor accidently did 
something with a larger or more harmful 
consequence. Examples of accidental acts 
were spilling a bottle of ink on a rug while 
trying to fill a pen or opening a kitchen door 
coming to dinner but knocking over a tray 
holding 15 cups. Intentional acts with small 
consequences were, for instance, purposely 
putting a drop of ink on a tablecloth or in 
climbing on a cupboard to get cookies from a 
shelf, breaking one cup. 

Subject groupings were made on the basis 
of consistency of choices and explanations of 
naughtiness in terms of actor’s intention. 
Some children judged at least seven of eight 
dilemmas according to intention (High group) 
and other children hardly used intention at 
all only mentioning intention for two or 
fewer dilemmas (Low group). A third group, 
which was eventually subdivided, judged be- 
tween three and six of the eight dilemmas ac- 
cording to intention (Mixed group). The 
High, Mixed, and Low division was designed 
to signify three levels of moral orientations 
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representing developmental structuring in the 
moral sphere (Ahr, 1971). 
Modeling was accomplished with the same 
dilemmas and procedure used by Cowan et 
al. (1969). An adult female model responded 
with a consistent moral orientation on alter- 
nate trials with a subject. The High group 
and half of the Mixed group were exposed to 
a model who based her judgments of naughti- 
ness on consequence alone (Down training). 
The other half of the Mixed subjects and the 
Low group were exposed to a model who 
based her judgments solely on intention (Up 
training). Two weeks after modeling, all chil- 
dren were assessed on a posttest in which the 
dilemmas used in the pretest were repeated. 


METHOD 
Subjects 


Sixty-three of Ahr’s experimental group of 70 sub- 
jects were enrolled in the same school 1 year later; 
they now ranged in age from 7 to 11 years. 


Tasks 


Four of the eight moral dilemmas used by Ahr 
and by Cowan et al. (1969) plus two new dilemmas 
were employed. In each dilemma there was a pair 
of stories in which one actor intentionally did wrong, 
his action producing a small consequence, and an- 
other actor did something accidentally, his action 
producing a large consequence. In addition, Ahr’s 
(1971) test of conservation of weight was read- 
ministered in a modified form. 


Procedure 


Names and year in school of subjects were known 
to the experimenter but he did not know to which 
experimental group each belonged in the original 
study. Once a subject was located he was admin- 
istered the two tasks—half of the children judged 
dilemmas first, while the other half were given con- 
servation first. 

The conservation of weight task involved four 
standard transformations with clay balls and two 
check items. At appropriate points in the task, one 
of the two balls was transformed into a “pancake,” 
“sausage,” “two pieces,” and “four pieces.” Between 
each transformation, except between the change from 
two to four pieces, the clay was returned to two 
equal looking balls. Check items, which were used 
to force the child to use a response other than the 

“same weight,” involved adding and taking away 
clay from one of the balls. Given that a child was 
correct on all items involving equal looking balls 
plus the check items, he could receive scores from 0 
to 4 on the conservation transformation items per se. 

All six dilemmas contrasted an intentional act of 
small consequence with an accidental act of large 
consequence in paired format. The subject was to tell 
which actor was naughtier and why. 


og 
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Reference Groups 


There were four experimental groups in th 
study defined by their judgments on eight d - 
mas and by the kind of modeling to which they w-re 
subsequently exposed. Subjects in the Low-Up 
group (original n =20) had judged the larger < 
sequence as the naughtier action at least seven ti 
in eight dilemmas. They were trained Up by a 
model who consistently judged naughtiness on 
basis of intention in a 24-dilemma training seri 
Children in the Mixed-Up group (original n = 15 
had given from three to six judgments based or 
consequence; they were also exposed to an intention 
oriented model. The Mixed-Down group subjects 4 
(original n = 14) had given three to six judgmen 
based on intention and they were exposed to 
model who consistently judged naughtiness accord 
to consequence. Children in the High-Down gr 
(original »=21) had given seven or eight ju: 
ments based on intention and were exposed to 
consequence oriented model. The High, Mixed, a: 
Low groups were meant to signify three levels « 
consistency in orienting toward intention as a mo 
important basis of badness than consequence. 


RESULTS 


Mean percentages of intentional judgmen 


on the original pretest, modeling series, post- ; 


test, and on the present 1-year posttest are 
reported in Table 1. The first set of ż test: 
were computed to evaluate pretest to 1-year 
posttest change within each group. Significant 
differences were obtained for the Low-Up 
group, ¢ (14) = 5.2, p < .01, the Mixed-Up 


group, ¢ (13) = 8.12, p < .01, and the Mixed- +~ 


Down group, ¢(12)=7.2, p<.01. The 
High-Down group was not different from its 
original pretest level. Thus, both Up-trained 
groups, who were found to change in the 
upward direction on the initial 2-week post- 
test, maintained the same upward direction 1 
year later. However, both Down-trained 
groups which had significantly moved down- 
ward in accord with the model on the 2-week 
posttest, were also now making intentional 
judgments. After 1 year, the Mixed group 
was making more intentional judgments than 
they had on the pretest and the High group 
was again at its pretest level. 

Between group differences, which had been 
created experimentally on the original pre- 
test, partially followed the same pattern 1 
year later. The Low-Up group made fewer 
intentional judgments than each of the other 
groups: Mixed-Up, Mixed-Down, and Hi” 
Down (és > 25.00; ps < .001). 


The relation of conservation success to 
moral judgment was evaluated by dividing 
subjects into pass and fail on each task. Sub- 
jects who passed conservation (+) had given 
correct responses on all four transformation 
items plus all other trials; three or fewer 
correct answers were scored as failure (—). 
Subjects were classified as intentional (+) if 
they judged all six dilemmas on an intentional 
basis; five or fewer intentional answers were 
scored as failure (—). Because all 10- and 
11-year-old subjects gave 100% intentional 
answers and succeeded on conservation they 
were excluded from analysis. Of the 22 re- 
maining subjects who passed the conservation 
criterion, 18 fulfilled the + intentional cri- 
terion, while 4 did not (x? = 10.7, p < .01). 
Of the 18 nonconservers, 10 were consistently 
intentional (+) and 8 were not (—). 

As in other studies in which time after 
intervention has been used as a measure of 
persistence, the intentional orientation was 
found to be sustained. The spans have been 
2 weeks (Ahr, 1971; Cowan et al., 1969), 6 
months (Glassco et al., 1970), and now, 1 
year, Further, as Cowan et al. (1969) found, 
training toward the consequence orientation 
does not appear to be sustained; they observed 
a drop-off after 2 weeks and the present re- 
sults show a marked decline after 1 year. The 
Mixed-Down group, which had significantly 
shifted toward consequence 2 weeks after 
modeling, was giving more intentional answers 
than they had on the original pretest after 1 
year. The High-Down group which also had 
modeled to consequence were back at their 
pretest level after 1 year. 

These data confirm the oft repeated result 
that intentional judgments to these types of 
dilemmas increase with increasing age (e.g., 
Bandura & McDonald, 1963). The present 
1-year longitudinal finding with respect to 
intentionality is that the youngest subjects 
(Low-Up) gained the most, the Mixed sub- 
jects gained less, and the High group which 
was originally at ceiling maintained its maxi- 
mum performance. Further, intentional judg- 
ments were correlated, although not deci- 
sively, with the general developmental indi- 
cator of conservation. 

It is not possible to tell more precisely how 

ains in intention were made since intention- 


Š 
Cid 


et 
D. 


MORAL JUDGMENTS 897 


TABLE 1 


MEAN PERCENTAGES OF INTENTIONAL JUDGMENTS ON 
THE PRETEST, MODELING SESSION, TWO-WEEK 
POSTTEST, AND ONE-YEAR POSTTEST 


Biip Pretest | Modeling | Two-week | One-year 


session | posttest | posttest 
Low-Up 11 66 65 67 
Mixed-Up 53 79 91 95 
Mixed-Down 63 43 36 96 


High-Down 97 66 56 98 


ality represents the direction in which chil- 
dren are developing (Piaget, 1932/1948) and 
is probably the direction in which adult social 
agents are directing children as they grow 
older. It can be said, however, that modeling 
influences obtained 1 year earlier are unre- 
lated to present moral judgments. But even 
though social learning which was demon- 
strated after a single modeling session did not 
persist, it seems reasonable to conclude that 
changes after 1 year reflect both subsequent 
social learning outside the laboratory and 
development from within the child (Ahr, 
1971). 
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Causes and Consequences of Time Perception Differenc: 
in Overweight and Normal Weight People: 


Judith Rodin 
Yale University 


Three experiments considered whether there are differences between overweight 

and normal weight subjects in time perception which represent the obese sub- 

jects’ lack of internal responsiveness as well as heightened external reactivity 

in a noneating setting. In the first study, no time-relevant cues were provided, 

and overweight subjects were inaccurate in their temporal judgments and 

showed significantly higher group variability than did normals, In the second 

study, the effects of differential temporal information generated by interesting 

and boring cues was considered. The presentation of these time-relevant ex- y 
ternal cues reduced the judgment variability of the overweight subjects and 

influenced their perceived passage of time significantly more than normals. 

The third study examined the influence, on eating behavior, of differences in 

perceived passage of time as a consequence of manipulating cues for interest 

or boredom. When bored, overweight subjects perceived time to pass more 

slowly than did normals and thus ate sooner. Similarly, they delayed eating, 

judging time to have passed more quickly than it actually had, when they 

were attending to interesting cues. The implications of a generalized lack of 

internal sensitivity for a theory about the development and consequences of 


obesity are discussed. 


In a wide range of experimental settings, 
Schachter and his associates (Schachter, 
1971) have demonstrated that the obese eat 
in response to external food-relevant cues 
such as the taste and sight of food and are 
relatively insensitive to internal physiological 
signals such as gastric motility and changes 
in blood sugar level. Further investigations 
have shown that these various findings about 
eating behavior are simply a special case of 
a much broader phenomenon of generalized 
stimulus sensitivity. In contexts unrelated to 
eating, the obese were found to be far more 
reactive than normal weight individuals to 
prominent external cues which affected their 
pain perception and ideation, emotionality, 
and distractibility (Schachter & Rodin, 
1974). These experiments all suggested a 
heightened sensitivity to and processing of 
external stimuli on the part of the obese but 
did not question whether their relative un- 
responsiveness to internal signals also extends 
beyond eating behavior. 


This research was supported by National Science 
Foundation Grant GS-37953. 

Requests for reprints should be sent to Judith 
Rodin, Department of Psychology, Yale University, 
New Haven, Connecticut 06510. 


To consider this issue for a nonconsumn.a- 
tory behavior, differences in time perceptic: 
were examined, since the experience of time, 
like hunger, is frequently assumed to have 
both internal biological and cognitive com- 
ponents. Several studies of time have sug- 
gested a time base identified with some in- 
ternal periodic, biological clock (Dimond, 
1964; Hammer, 1966; Hoagland, 1933; 
Ochberg, Pollack, & Meyer, 1964), while 
others, notably Ornstein’s (1969) experi- 
ments, have led to a cognitive, information- 
processing model of time perception. It was 
predicted that if the obese are both unre- 
sponsive to internal events and rely heavily 
on external cues, the periodicity associated 
with internal biological rhythms such as me- 
tabolism (Francois, 1927; Hoagland, 1933) 
should provide relatively little time-relevant 
information for them, while external, cogni 
tive variables should be quite important. 

If this line of reasoning is correct, there 
should be differences between overweight and 
normal weight subjects in time perception 
when there are no relevant external cues at 
all. Without salient external information, the 
duration judgments of the obese should be 
highly variable and inaccurate relative to the 
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estimates of normals. The first experiment 
tested the prediction of increased variability 
and inaccuracy for the obese in the absence 
of time-relevant external cues; in other words, 
when subjects had to rely solely on internal 
cues. 


EXPERIMENT 1 
Meth od, 
Procedure 


In the context of an experiment investigating 
physiological responses to complex stimuli, electro- 
encephalogram electrodes were fitted to the wrists 
and ankle of each subject, and he was seated in a 
darkened room in silence for 15 minutes while pre- 
sumed baseline physiological measures were taken. 
He was instructed to think about anything he wished 
during this period and told that he would later be 
asked to report his thoughts in order to evaluate 
the physiological data. He was asked to remove all 
jewelry (watches, rings, etc.) whose metal might 
interfere with the electrophysiological equipment. 
At the end of 15 minutes, each subject was asked 
to complete a number of physiological and attention 
questions including the time-relevant measure. This 
item asked subjects to judge the duration of the 
taped interval on a 60-minute scale which ranged 
from 0 to 60. 


Subjects 


Overweight and normal weight males were re- 
cruited from introductory psychology classes at Yale 
University and paid $2.50 for their participation in 
this and two short studies which followed. Initial 
subject recruitment was done on the basis of self- 
reports of height and weight obtained at the be- 
ginning of the school year. Height and weight were 
then actually measured at the conclusion of the ex- 
periment, and percentage overweight was calculated 
from Metropolitan Life Insurance (1959) norms, 
which are rated according to age and height. Fol- 
lowing procedures outlined by Schachter, Goldman, 
and Gordon (1968), 10 overweight subjects were 
included, ranging from 14.7% to 38.9% overweight 
and 10 normals, who ranged from —8.6% to +9.1% 
overweight, completed the sample. Subjects weighing 
> between +10% and +15% overweight (n=4) and 
less than —10% (n = 6) were excluded. 


Results and Discussion 


While the mean time estimates of both 
overweight and normal treatment conditions 
were close to the actual 15 minutes, the 
groups differed greatly in variability. The 
means and standard deviation for each group 
are presented in Figure 1. As predicted, the 
responses of obese subjects were highly vari- 
able (for obese subjects, o= 1.42), while 
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Ficurr 1. Mean and standard deviation of tem- 
poral judgments of a 15-minute interval containing 
boring, interesting or no-time-relevant external cues. 


normal weight subjects showed much less 
variability (for normal subjects, e = 3.68), 
a significant difference, F(9, 9) = 4.07, p< 
025. 

These data suggest that when time-relevant 
external cues are absent, there is greater 
group variability and a higher degree of in- 
accuracy in the temporal judgments of several 
obese subjects in contrast to the nonobese. 
If our line of reasoning is correct, these re- 
sults were obtained because externally re- 
sponsive overweight subjects had to rely on 
internal cues for judgments of the passage 
of time. This suggests that providing ex- 
plicit external time cues should decrease the 
judgment variability of the obese group. 
Furthermore, the estimates of the obese 
should be more influenced than those of nor- 
mals by the nature of the available time-rele- 
vant cues. For example, boring and interest- 
ing events should slow down or speed up, 
respectively, perceived passage of time more 
for a heavy subject than for his normal 
weight counterpart. Thus, his judgmental ac- 
curacy will vary as a function of the time- 


900 


relevant information conveyed by the ex- 
ternal cues. The second experiment examined 
the effects of manipulating external cues—in 
this case, interest and boredom—on over- 
weight and normal weight subjects. 


EXPERIMENT 2 


Method 
Procedure 


Essentially the procedure was identical to that in 
the first experiment with the following changes: 
One group of heavy and one group of normal weight 
subjects listened to an interesting tape recording for 
15 minutes, and one group in each weight category 
heard a boring tape. The interesting sequence was 
taken from a recording of Richard Nixon, Superstar, 
a humorous monologue by the comedian, David 
Frye. The passage had been rated as entertaining 
and interesting by a group of 20 judges, 10 over- 
weight and 10 normal weight, The same group rated 
as highly boring a passage taken from a textbook 
entitled Geography of the Pacific, and this sequence 
was read aloud and taped for the boring stimulus, 
The passages were as similar as possible in number 
of separate ideas or codable units as well as in 
length. Following the 15-minute session, subjects 
answered several questions about their perceived 
physiological responses and judged the length of the 
interval on the 60-minute scale used in Experiment 1, 


Subjects 


Subjects were Vale undergraduate volunteers, paid 
$2.50 for their participation. On the basis of height 
and weight measures obtained at the conclusion of 
the experiment, 20 overweight males whose weights 
ranged from +17.9% to +36.3% overweight and 20 
normal weight males, ranging in weight from —6.6% 
to +8.7% overweight, were included as subjects, 
Six subjects whose weights fell outside the accept- 
able cutoffs outlined in Experiment 1 were elimi- 
nated. 


Results and Discussion 
Interest Ratings 


Subjects found the interesting tape signifi- 
cantly more interesting than the boring tape, 
F(1, 36)= 7.76, p < 01. However, there 
were no significant differences between over- 
weight and normal weight individuals in how 
interesting or boring each group found the 
two tapes (F < 1). Thus any differences in 
time estimation between weight groups should 
be due to differences in the way the interest- 
ing and boring cues influenced the perception 
of time rather than to differences in interest 
per se. 


Junita Rop 


speeding up a clock 


Effects of External Cues on 
Temporal Judgment 


The data presented in Figure 1 indicate 
that overweight subjects were far more TEDT 
fected than were normals by the manipulation 
of external time-relevant cues. They judged . 


the length of the interval to be longer than 
did normals when the tape was boring and, { 
somewhat shorter than did normals when they ` 


heard an interesting tape, Weight X Interest 
PASOS 05- The analysis of 
variance also indicates a significant main 
effect for the influence of external events on "y 
time perception, with the boring period rated 
as significantly longer than the interesting 
interval of the same actual duration, F (1, 36) 
= 5.84, p < .05. Scheffé post hoc comparison 
# tests indicate that these differential effects 
of boredom and interest on judgments of 
temporal duration were accounted for almost 
entirely by the overweight subjects, obese 
#(18) = 2.96, p < .01; normal £ Ki 
Providing external time-relevant informa- 
tion does appear to influence the accuracy of 
temporal judgments made by overweight sub- 
jects. To the extent that the obese group 
actually relied on such environmental cues, 
there should be far less variability among 
these subjects than among those in Experi- 
ment 1 who made their judgments in the 
absence of any external cues for the passage 
of time. The standard deviations given in 
Figure 1 clearly show this effect. The vari- 
ability among all treatment groups in the 
second experiment was small and nonsignifi- 
cant. When the variances of the two obese 
treatment groups of Experiment 2 are pooled 
and compared to that obtained in Experiment 
1, there is significantly greater variability for 
the obese group in the absence of external 


groups in Experiment 2 is Rot Significant] 
different from that ol netmals in the first ra 
periment (F = 1.46), 

In light of these findings, we next turned 
to a consideration of how these differences in 


and found that actually slowing down or 
influenced the amount ` 


A „consumed by overweight subjects more than 


the amount consumed by normals. Their 
study:was based on the assumption that the 
passage of time is an important external cue 
for eating. 

Combining the Schachter and Gross (1968) 


_ findings with the results of the present stud- 


ies, the effects on eating behavior of events 
which slow and speed perceived time may be 
readily predicted. If more time seems, to 
elapse for the obese when they are bored and 
if the passage of time is a powerful cue for 
eating, overweight people should eat sooner 
when bored than when they have been doing 
something absorbing. This prediction should 
hold not only for boredom and interest but 


l for any cognitive manipulation which affects 


perceived passage of time for the obese. When 
considering boredom, however, an additional 
io may be operative. Overweight indi- 
“viduals may simply be more likely to think 
about food when they are bored, and as Ross 
(1974) has shown, obese subjects instructed 
to think about food do eat considerably more 
than do normals. Experiment 3 examined how 


those variables which influence subjects’ per- 
`- ceived time in turn affect consummatory be- 


havior. 

“i EXPERIMENT 3 
Method 

Overview 


Amount eaten and latency to eat were 
measured after overweight and normal weight 
subjects listened to either boring or interest- 
ing tapes. These were simply 50-minute ex- 
cerpts from the identical sources of material 
used for Study 2: Geography of the Pacific 
and Richard Nixon, Superstar. Since thinking 


, about food may also be a response to boredom 


in the obese, a control group was tested after 
having explicitly focused on food-relevant 
thoughts in the preceding period. These sub- 
jects heard a 50-minute tape taken from M. 
F. K. Fishers With Bold Knife and Fork, 
which describes the social and gastronomic 
activities surrounding a banquet attended by 
renowned French gourmets. 


° 
Procedure 


Testing occurred during midafternoon, 
after lunch and well’ before dinner. As in the 
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previous two experiments, the study was ex- 
plained as an investigation of physiological 
responses to complex auditory stimuli, Each 
subject was told to listen to the tape for one 
half hour and then take a break during which 
time he could relax, drink some milkshake 
which was provided, leaf through magazines, 
or simply sit quietly. He was specifically in- 
structed, “It is critical that you do not take 
a break until you think that exactly one half 
hour has passed and then call the experi- 
menter as soon as you are ready to begin the 
next part.” Thus, a strong experimental de- 
mand was placed on being as veridical as 
possible in the judgment of elapsed time. The 
subject was further instructed to shut off the 
tape recorder when he began his break which, 
unknown to him, also simultaneously stopped 
a timer. ; 

The milkshake was a frozen prepared mix 
to which milk had been added. A full gallon 
in an opaque plastic container was available, 
and each subject could fill his glass as many 
times as he liked. The milkshake container 
was stored in a refrigerator in the room and 
was not visible while the tape was playing. 

Although each subject believed that he was 
controlling the length of the break, only 10 
minutes were actually allowed. If the subject 
signaled that he was ready before 10 minutes 
had elapsed, the experimenter explained that 
he had a few more things to do and would 
return shortly. If a subject had not yet termi- 
nated the break, he was stopped after 10 
minutes. 


Subjects 


Subjects were males recruited from South- 
ern Connecticut State College and the Uni- 
versity of New Haven on the basis of survey 
forms completed at registration which included 
height and weight data. They were paid $3.00 
for their participation. From height and 
weight measures obtained at the conclusion 
of the experiment, 30 overweight subjects 
were included in the data analysis, ranging 
from +15.4% to +69.2% overweight. There 
were 30 normals ranging from —9.7% to 
+8.5% overweight. There were 10 subjects in 
each of six treatment conditions. Six subjects 
were eliminated because their weight fell out- 
side the cutoffs designated in Experiment 1, 


902 


and 3 were not included because they were 
allergic to chocolate milkshakes. 


Results and Discussion 


Subjects were instructed to stop the tape 
recorder and begin their break when they be- 
lieved that they had been listening to the tape 
for 30 minutes. The data in Figure 2 show 
that for overweight subjects, this judgment 
was strongly influenced by the nature of the 
taped material, thus replicating, with a dif- 
ferent dependent variable, the temporal dura- 
tion judgment data obtained in the second 
study. Obese subjects hearing the boring tape 
appear to have judged time as passing slowly 
and therefore began their break sooner than 
those hearing the two interesting tapes. Nor- 
mal weight subjects, on the other hand, took 
a break at about the same time, no matter 
which tape they heard, Weight x Tape 
F(2, 54) = 3.31, p < .05. Post hoc orthog- 
onal comparisons indicate that while the 
obese judged more time to have elapsed when 
listening to the boring tape than to either of 
the interesting tapes, #(27) = 4.38, p < .01, 
their time perception was not significantly af- 
fected by whether the interesting tapes were 
relevant or irrelevant to food (ż < 1). 
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Ficure 2. Mean time actually elapsed when sub- 
jects initiated a rest period after exposure to food 
and nonfood-relevant external cues varying in in- 
terest. 
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FIGURE 3. Mean ounces of milkshake drunk during | j 
a 10-minute break after exposure to food and non- 


food-relevant external cues varying in interest, 


In contrast, the amount of milkshake actu- 
ally consumed by overweight subjects was 
greatly influenced by whether or not the tape 
was food relevant. The means for number of 
ounces drunk by each treatment group are 
presented in Figure 3. When the obese had 
listened to a description of gastronomic de- | 
lights, they subsequently drank almost twice 
as much milkshake as they did after listening ‘ 
to an interesting, humorous monologue, £(18) 
= 4.08, p < .002. Similarly, listening to the i 
food-relevant tape also had a significantly ; 
greater effect on the amount drunk by obese 
subjects than did listening to a boring tape, 
t(18) = 2.36, p <.02. None of these con- 
trasts were significant for normal weight sub- 
jects, although they tended to drink least 
after hearing the food-relevant tape. 

The analysis of variance shows a significant 
main effect for weight, with overweight sub- 
jects consuming more milkshake overall than 
did normals, F(1, 54) = 15.25, p < .01, an 
a significant Weight x Tape interaction, 
F (2,54) = 9.30, p < 01. 

These data suggest that overweight sub- 
jects may eat more frequently when bored be- 
cause initiation of eating is influenced by 
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perceived passage of time,’ and time passes 
more slowly for the obese than for normals 
when they are bored. Similarly, the obese may 
eat less frequently than normals when in- 
volved in something absorbing. That these 
results are not merely a function of the obese 
subject’s thinking more about food when 
bored is revealed by both the time and eating 
data, First, if thinking about food makes an 
overweight person more likely to eat sooner, 
subjects in the food-relevant-interesting tape 
condition should have begun their break first, 
thus having the shortest latency to eat. In- 
stead, they had the longest. Second, obese 
subjects hearing non-food-relevant tapes, 
whether boring or interesting, ate significantly 
less than subjects listening to the food-rele- 
vant material. This result is surprising in 
light of the common complaint of overweight 
people that they eat more when they are 
bored, The present data suggest that rather 
than eating more food during each eating 
instance, they may simply eat more fre- 
quently. 


GENERAL DISCUSSION 


Tt was assumed that studying time per- 
ception provides a means for generalizing to 


-nonconsummatory behavior the hypothesis 


that: the obese are relatively unresponsive to 
internal cues as well as highly sensitive to 
external stimuli. First, it seems clear that 
perceived passage of time is influenced by 
environmental stimuli for overweight people. 
The specific kinds of events which occurred 
during an interval affected the perceived 
length of that period. Consequently, it is 
suggested that their accuracy is a function of 
the veridicality of information conveyed by 
the relevant external time cues. Next, it ap- 
pears that without external criteria upon 
which to judge the passage of time, the obese 
group was highly variable, and many subjects 
were quite inaccurate in their temporal judg- 
ments, Normal weight individuals, on the 
other hand, were relatively unaffected by 


1 All subjects who drank any milkshake poured 
their first glass within the first minute after shutting 
off the tape recorder. It was striking, however, that 
only in the treatment condition where obese sub- 
jects listened to the boring tape, did they begin 
drinking within the first 20 sec. 


these various manipulations. Their judgments 
were fairly accurate and stable whether or 
not external signals were available and were 
not greatly influenced by the nature of the 
specific environmental events. 

These differences between weight groups 
indicate that they are responsive to different 
kinds of stimuli in judging time, just as they 
are in eating. Internal signals seem to provide 
little useful stimulus information for the 
obese, while external cues are quite likely to 
elicit a response. The effects of response style 
on temporal estimation are especially inter- 
esting because several investigators have 
found time perception correlated with a va- 
riety of personality variables such as intro- 
version-extroversion (Eysenck, 1959; Lynn, 
1961), anxiety (Cahoon, 1969; Ruiz & 
Krauss, 1968), and field dependence-inde- 
pendence (Friel, 1969). Since there are cer- 
tain commonalities between field dependence 
and our characterizations of external respon- 
siveness, Friel’s results are particularly rele- 
vant to the present study. He found that 
field-dependent individuals were more accu- 
rate than field-independent people and that 
field-independent individuals were generally 
more likely to overestimate temporal dura- 
tion. Since extremely short time intervals were 
used (9-46 sec) in Friel’s study, no direct 
comparison of his data and those from the 
present experiment are possible. However, it 
is at least clear that overweight subjects were 
not more accurate than were normals in this 
study. 

The findings of the three experiments re- 
ported here, in conjunction with our earlier 
work (Rodin & Slochower, 1974; Schachter 
& Rodin, 1974), suggest that the aberrant eat- 
ing behavior of the obese may be a special 
case of a much broader phenomenon of gen- 
eralized external sensitivity and internal in- 
sensitivity, These indications are especially 
intriguing, for, if correct, there are major 
implications for our conception of the nature 
and origin of obesity. In addition, the present 
study adds further weight to a model stressing 
the importance of external cues in controlling 
the behavior of the obese, since it shows that 
even in areas unrelated to food or eating, 
overweight individuals often do not or cannot 
use information from internal stimuli. This 


904 JupirH Ropin 


notion thus has significance for the study of 
the personality and social behavior of over- 
weight individuals, as well as for their eating 
behavior. One might ask, for example, 
whether the obese have less imagery than 
normals, whether they daydream less, and so 
forth. The consequences of a generalized lack 
of internal responsiveness remain to be more 
fully considered. 
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Resistance to Extinction as a Function of Self- or Externally 
Determined Schedules of Reinforcement 


Howard R. Weiner and Richard A, Dubanoski 
University of Hawaii 


The major purpose of this study was to investigate one component of the 
self-reinforcement process—that of standard setting. Specifically, this experi- 
ment assessed the effect of self-selected versus other-determined schedules of 
reinforcement on resistance to extinction. Children, from grades two through 
four, were randomly assigned to one of two conditions. One group (Self) was 
permitted to select one of three schedules of reinforcement before making re- 
sponses at a game-like apparatus, For the second group (External), the 
schedules were determined by a yoking procedure. Extinction procedures were 
instituted for all subjects after 20 contingently reinforced responses were made. 
The results showed that the Self Group persisted significantly longer and 
emitted more responses in extinction than did the External Group. A strong 


partial reinforcement effect was also found. 


The self was viewed, by many early psy- 
chologists, as a determinant of behavior rela- 
tively free of situational influences. Recently, 
learning theory proponents (Bandura, 1971; 
Homme, 1965; Skinner, 1953, 1971; Watson 
& Tharp, 1972) have suggested that what 
had been labeled intrapsychic, purposive 
characteristics of the self are actually learned 
behaviors which respond to the same learning 
principles that govern externally directed be- 
havior. It may be assumed that self-directed 
behaviors are established originally by ex- 
ternal contingencies, and these behaviors fol- 
low the same principles that govern the acqui- 
sition, maintenance, and extinction of those 
that are publicly accessible. There is experi- 
mental evidence that the course of self-rein- 
forced responses, from establishment through 
extinction, parallels that of externally rein- 

` forced responses. 

Recent research regarding the acquisition 
of self-reinforced responses (Bem, 1967; 
Hartig & Kanfer, 1973; Kanfer & Marston, 
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1963; Meichenbaum & Goodman, 1971) has 
demonstrated that self-control] mechanisms 
can be established experimentally by means 
of learning procedures similar to those used 
for the establishment of external stimulus 
control, In addition to direct training, there 
is evidence that modeling procedures play an 
important part in the transmission of patterns 
of self-reinforcement (Bandura & Kupers, 
1964; Hildebrandt, Feldman, & Ditrichs, 
1973; McMains & Liebert, 1968; Mischel & 
Liebert, 1966). Results of the studies above 
show that mechanisms of self-reinforcement 
can be transmitted either by direct training 
or modeling or a combination of both. 
Furthermore, consistency between what is 
taught and what is modeled facilitates the 
adoption of standards of reinforcement. 

Once established, patterns of self-reinforce- 
ment can be maintained through intermittent 
reinforcement (Kanfer & Karoly, 1972). 
Bandura and Perloff (1967) and Speidel 
(1972) have demonstrated in laboratory set- 
tings that contingent self-reward maintained 
behavior as well as external reward. Addi- 
tional evidence for the behavioral mainte- 
nance properties of self-reinforcement comes 
from classroom studies conducted by Lovitt 
and Curtiss (1969) and Glynn (1970), The 
results of these studies suggest that self- 
determined reinforcement is as effective as 
experimenter-determined reinforcement in 
maintaining academic performance, 
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There is some evidence to suggest that, 
under certain conditions, self-reinforced re- 
sponses may be more resistant to extinction 
than externally reinforced responses. Kanfer 
and Duerfeldt (1967) found that subjects 
who were permitted to self-reinforce on a dis- 
crimination task made more correct discrimi- 
nations in extinction compared to subjects 
who received experimenter-controlled rein- 
forcement. Johnson (1970), Johnson and 
Martin (1971), and Bolstad and Johnson 
(1972) all found some greater resistance to 
extinction for the groups trained in self-rein- 
forcement procedures as compared to the 
groups receiving  experimenter-determined 
reinforcement. All of the studies above in- 
volved discrimination tasks at which the self- 
reinforced groups were taught to evaluate 
their own performance. The pairing of pro- 
prioceptive and vocal responses during self- 
reinforcement training may have provided 
the subjects with secondary reinforcers which 
could be self-administered, and thus maintain 
responding, during extinction. 

Although there has been a lack of uni- 
formity in the definition and application of 
self-reinforcing events, the major implication 
to be derived from the literature is that self- 
reinforcement is analogous to external rein- 
forcement in the acquisition, maintenance, 
and extinction of behavior. There has been, 
however, little investigation into the sub- 
sidiary processes that comprise a self-rein- 
forcing event. It seems likely that there are 
two important components in the process of 
self-reinforcement (Bandura, 1971; Bandura 
& Perloff, 1967). They are the following: (a) 
the self-selection of a standard of behavior 
and (b) the contingent self-administration 
of rewards. Previous investigations of the 
self-reinforcement process have neglected to 
analyze the contributions of the subprocesses 
to the whole. In order to interpret meaning- 
fully the process of self-reinforcement, it is 
necessary to ascertain the independent effects 
of the elements of a self-reinforcing event. 
The main purpose of this study was to isolate 
and examine the effect of one of the com- 
ponents—that of self-selection of a standard 
of achievement on resistance to extinction. In 
addition to standard-setting behavior, the 
partial reinforcement effect was examined 
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under three schedules which were self- or 
externally determined. 


METHOD 
Subjects 


Initially, 188 children, in grades two through four, 
were used in this study. Of this total, 16 subjects 
were discarded. The remaining 172 children, 82 boys 
and 90 girls, were of multiethnic origins and from 
predominantly lower-middle-class backgrounds. The 
mean age of the group was 8.71 years, with a range 
of 7.16 to 11.75 years. 


Apparatus 


The apparatus was a wooden box, 30.4 X 58.4 X 
50.8 cm. Using this apparatus, the subject could 
make a response by dropping a hard rubber ball into 
an aperture at the top of the box. The ball returned, 
after a 3-second delay, into a catchbox at the lower 
right. Three toggle switches, arranged in a triangular 
pattern, enabled the selection of one of three fixed- 
ratio schedules of reinforcement, Each switch had a 
number above it (1, 2, or 4) indicating a schedule 
and an amber light which went on when the switch 
was activated. An M & M candy dispenser, mounted 
on the apparatus, delivered token reinforcers, after 
the appropriate number of responses had been made. 
The backup reinforcers were attractive, prepackaged 
toys such as jump ropes, yo-yos, racing cars, air- 
planes, jewelry sets, and checkers, The retail value 
of these toys ranged from 29¢ to 49¢ each. 


Procedure 


The children came individually to the testing 
room, ostensibly to test some new game equipment. 


The subjects were randomly assigned, by an al- - 


ternation procedure, to one of two groups. One 
group (Self) was permitted to select one of three 
schedules of reinforcement, For the second group 
(External), the schedule was determined by a pro- 
cedure which yoked subjects by sex, grade level, 
and schedule of reinforcement. If, for example, a 
subject was assigned to the Self Condition, and 
selected the FR-2 schedule, then another subject, 
from the same grade and of the same sex, would be 
assigned to the External Condition under the FR-2 
schedule. 

Upon entering the testing room, each subject was 
seated at a low table which held the apparatus. On 
another table, in full view, were displayed the backup 
reinforcers. Children in the Self Group were shown 
how the game was played at each of the three 
schedules. They were told that they would be per- 
mitted to choose one of the three ways to play, that 


10f the 16 subjects who were discarded, 4 were 
discarded for equipment malfunction, 1 for inability 
to follow directions, and 11 because they did not 
complete 20 responses in acquisition prior to ex- 
tinction procedure. Ten of these discarded subjects 
had been from the Self Group and 6 from the Ex- 
ternal Group. 
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RESISTANCE TO EXTINCTION 


the choice was entirely theirs, and that they could 
play as long as they liked in order to win tokens 
which would be exchanged later for a toy of their 
choice, The instructions for the External Group were 
shorter because only one of the schedules was demon- 
strated—that which had been selected by the yoked 
partner, When instructions were completed, the ex- 
perimenter went into the adjoining room and the 
subject was left alone to play the game. After the 
subject made 20 contingently reinforced responses, 
extinction procedures were instituted. The criterion 
for extinction was 60 seconds without a response, 
or if the responding continued without a 60-second 
pause, then the subject was terminated at the end 
of 300 seconds. The experimenter returned to the 
testing room at the end of the extinction session and 
exchanged the tokens for a toy chosen by the sub- 
ject. All children were requested not to disclose de- 
tails of the experiment to their classmates and re- 
turned to their respective classrooms. 

Two dependent measures were taken. They were 
the following: (a) time to a criterion of extinction 
and (b) number of responses emitted to the same 
criterion during the period of nonreinforcement. 


RESULTS 


A Pearson product-moment corrglation be- 
tween the scores for time to extinction and 
number of responses showed an extremely high 
relationship (r = .93). The means and stan- 
dard deviations, in seconds to extinction for 
the Self and External Groups, in each of the 
schedules are presented in Table 1. 

From this table it is evident that the Self 
Group was more resistant to extinction than 
was the External Group. An analysis of vari- 
ance revealed that the main effect of standard 
selection was significant, F(1, 160) = 5.66, 
p< .02. Within each schedule, the Self 
Group took longer to extinguish than did the 
External Group. However, a Newman-Keuls 
Test for multiple comparisons revealed that 
the difference between the Self and Exter- 
nal Groups was significant only in FR-4 
(p < 05). 

The partial reinforcement effect was strong, 
with subjects in FR-1 being the least resistant 
and those in FR-4 taking the longest time to 
extinguish. The analysis showed that this 
schedule effect was significant, F(2, 160) = 
22.80, p < .01, and a Newman Keuls Test 
showed that the means for each of the sched- 
ules was significantly different from the other 
in each case (p < .01). 

Overall, there was no significant difference 
between sexes in resistance to extinction. 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS: 
Time to EXTINCTION ` 
Schedule 

Condition FR-1 FR-2 FR-4 
M SD M SD M SD 
Self 63.93 | 77.98 | 127.72 | 115.68 | 196.86 | 107.10 
External | 44.78 | 57.79 | 96.38 | 102.67 | 147.27 | 105.04 


Note. FR-1 n = 28; FR-2 n = 29; FR-4 n = 29. 


However, the significant Sex X Schedule in- 
teraction, F(2, 160) = 3.49, p < .04, indicated 
that boys and girls did vary from each other, 
depending on the schedule. A Newman-Keuls 
Test showed that this difference occurred in 
FR-4, where boys (M = 201.60 sec) took 
significantly longer than girls (M = 148.00 
sec) to extinguish (p < .05). 

The mean number of responses emitted in 
extinction and the standard deviations for the 
Self and External Groups are presented in 
Table 2. Because the means and standard 
deviations for this dependent measure were 
positively correlated, a logarithmic transfor- 
mation was performed on the raw data prior 
to analysis. 

An examination of these data reveal that 
the Self Group made more responses in ex- 
tinction than did the External Group. An 
analysis of variance performed on this depen- 
dent measure showed that this difference 
was significant, 7(1, 160) = 4.75, p < .03. A 
strong partial reinforcement effect was again 
found, F(2,160) = 42.99, p< .01, and a 
Newman-Keuls Test revealed that the means 
of the schedules were significantly different 
from each other in each case (p < .01). The 


TABLE 2 


MEANS AND STANDARD DEVIATIONS: 
NUMBER oF RESPONSES 


Schedule 
Condition FR-1 FR-2 FR-4 
M SD M SD M SD 
Self 8.89 12.70 | 19.65 19.04 | 36.10 | 22.88 
External 6.82 11.92 | 16.45 | 19.84 | 26.41 | 22.22 


Note. FR-1 n = 28; FR-2 n = 29; FR-4 n = 29, 
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Sex X Schedule interaction was also signifi- 
cant, F(2, 160) = 3.80, p < .03, with a New- 
man-Keuls Test (p < .05) indicating that in 
FR-4 boys (M = 38.27) emitted significantly 
more responses than did girls (M = 25.56). 


Discussion 


Previous research has demonstrated that 
subjects taught to manage their own behavior 
were more resistant to extinction than sub- 
jects whose behavior was managed by an- 
other (Bolstad & Johnson, 1972; Johnson, 
1970; Johnson & Martin, 1971; Kanfer & 
Duerfeldt, 1967). This effect may have been 
due to one or both components of self rein- 
forcement, subjects setting their own stan- 
dards or subjects reinforcing their own behay- 
ior. The main purpose of the present study 
was to investigate the role of standard setting 
in the self-reinforcement process. The results 
revealed that subjects who were allowed to 
select their own standard for reinforcement 
exhibited greater resistance to extinction than 
did subjects whose standards were selected by 
the experimenter. 

These results may be accounted for in 
several ways. First, Rotter (1966) formulated 
a hypothesis which states, in part, that the 
effects of reinforcement on preceding behavior 
depend partially on whether the subject per- 
ceives the reward as dependent or indepen- 
dent of his own behavior. If the individual 
perceives the outcome of an event as con- 
tingent upon his own behavior, the locus of 
control is labeled as internal. When reinforce- 
ment is perceived as following but not con- 
tingent upon his behavior, then it is per- 
ceived as chance, fate, or the control of a 
powerful other. The locus of control is then 
said to be external. Rotter’s basic hypothesis 
is that reinforcement received under external 
control is less likely to strengthen behavior 
than is the occurrence of reinforcement under 
conditions of internal control. In this present 
study, it is possible that the Self Group, who 
were allowed to select their own schedules, 
perceived the received rewards as more de- 
pendent upon their own behavior than did 
the External Group even though in both 
groups the rewards were delivered contin- 
gently. Therefore, the Self Group had their 
responses strengthened to a greater degree 
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than the subjects in the External Group, as 
evidenced by the greater resistance to extinc- 
tion displayed by the Self Group. 

Second, in an earlier, but related concep- 
tion, White (1959) formulated the concept of 
competence as a supplement to theories of 
motivation built on primary drives. Compe- 
tence is a characteristic of all species to at- 
tempt to master the environment, which is 
evidenced in such behaviors as exploration, 
curiosity, manipulating the surroundings, and 
producing effective changes in the environ- 
ment. By means of these behaviors an orga- 
nism learns to interact effectively with its 
environment, Past research indicates that 
perceived control of the environment is re- 
inforcing (Chaikin, 1971; Kanfer & Seidner, 
1973; Kavanau, 1969). In the present study 
children who were permitted to select their 
own schedule were, in effect, manipulating 
their environment more than subjects who 
had the schedule imposed on them. The Self 
Group may have perceived their standard- 
setting behavior as a sense of control or 
mastery of the environment. These percep- 
tions may have contributed to their increased 
persistence during extinction as compared to 
the External Group for whom the sense of 
manipulation of the environment was less 
likely. 

A third explanation for the greater resist- 
ance to extinction of the Self Group as com- 
pared to the External Group might be formu- 
lated without any hypothetical concepts such 
as competence or locus of control. It is pos- 
sible that the subjects in this study had been 
differentially reinforced for setting standards 
of achievement in the past. That is, parents 
or other agents selected performance criteria 
for their children and rewarded them for at- 
taining these levels. Nonattainment was either 
not rewarded or was punished. Subsequently, 
the children were rewarded for the selection 
of similar standards. The process of standard 
selection, as well as the actual attainment, 
was connected with reinforcement, and thus 
became a conditioned reinforcer. Johnson and 
Martin (1971) found greater resistance to 
extinction for their self-reinforced group, 
which they attributed to the establishment 
of self-evaluation as a conditioned reinforcer. 
In the present study, standard selection, 
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itself, might well have been a conditioned 
reinforcer at work for the Self Group, 
which would account for the greater resist- 
ance of this group, as compared to the 
External Group. 

Furthermore, it is possible that since the 
subjects in the Self Group selected their own 
schedule, they picked one that was most re- 
inforcing to themselves, and thus most resist- 
ant to extinction. Finally, these results may 
be explained in terms of cognitive dissonance 
theory. Brehm and Cohen (1962) have 
found that a choice between attractive alter- 
natives creates dissonance which has motiva- 
tional properties. If it is assumed that the 
subjects in the Self Group experienced disso- 
nance arising from the “free choice” situation 
and those in the External Group, having no 
choice, experienced no dissonance arousal, 
then it might also be assumed that the moti- 
vational properties of the dissonance caused 
the Self Group to persist at the task longer 
than the External Group. 

The other major finding of this study was 
the partial reinforcement effect, that is, the 
greatest resistance to extinction exhibited by 
the subjects in the FR-4 schedule, the least 
by those in FR-1, and an intermediate 
amount with FR-2. This result corroborates 
findings of previous studies (Bijou, 1957; 
Cowan & Walters, 1963; Hops & Walters, 
1963; Kass, 1962; Meyers, 1960). In addi- 
tion, the partial reinforcement effect has been 
extended to include a contingency in which 
subjects select their own schedules. Typically, 
this effect has been demonstrated for schedules 
selected for the subject by the experimenter. 

The partial reinforcement effect has been 
widely used as evidence of the existence of 
the reinforcement phenomenon, that is, that 
the behavior in question was indeed a result 
of its consequences rather than some other 
factor. Because the subjects in the Self Group 
(as well as those in the External Group) did 
show this effect, there is some assurance that 
the results of this condition were due, in 
fact, to reinforcement rather than some other 
factor, 
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Ninety-six high school girls participated in an experiment designed to test 
hypotheses derived from the idea that previously observed differences in com- 
pliance rates to threats and promises are based on the inherent restrictiveness 
of threats and their curtailment of the recipient’s autonomy and’ freedom of 
choice. Subjects received an influence message, presumably written by a co- 
worker, which was a threat or a promise phrased in either a prescriptive or 
proscriptive form. In addition, the message indicated that the subject, would 
be able to choose among alternative ways of complying or that she would not 
have the opportunity to make such a choice. Results dramatically supported 
two of the hypotheses: (a) threats produced more compliance when the 
option to choose among alternative compliance modes was openly extended 
than when it was not; and (hb) differences in compliance rates to threats and 
promises were far greater when choice options were not made avgilable than 
when they were made available. Contrary to a third hypothesis, mo differences 
-in compliance emerged between prescriptive and proscriptive influence attempts 
—whether’ threat or“ promise. The results were discussed in terms of both 


Despite the widely shared assumption that 
threats are an effective means of influencing 
another’s*behavior, there is much evidence to 
the contrary. Tt has consistently been found 
that threats often boomerang, producing defi- 
ance rather than: compliance. Promises and 
$ other commonly used influence strategies fre- 
E: quently are far more successful in convincing 
ji a reluctant other to engage in nonpreferred 
behaviors (Heilman, 1974; Rubin & Lewicki, 
Note 1). But what is ‘it about a threat that 
‘triggers resistance? Under what conditions 
‘,\ ‘will resistance:be attenuated? 

: Implicit in the arguments of a number 
ka of authors (Brehm, 1966; Deutsch, 1973; 

«Deutsch & Krauss, 1960, 1962; Kelley, 1965; 
Steiner, 1970) is the premise that threats are 
inherently restrictive and are perceived by 
the recipient as undermining his or her free- 
‘dom and autonomy. Although the notion that 
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reactance theory and interpersonal dynamics. 


threats are more restrictive than other types 
of influence strategies has not been empiri- 
cally validated, it is nonetheless intuitively 
plausible. Since threats communicate to their 
recipients that failure to comply will result 
in an undesirable outcome, they alter the 
utilities of available alternatives, eliminating 
the attractive ones. The recipient is thus left 
with a choice between two evils; he must 
either engage in a nonpreferred activity to 
avoid the threatened punishment or risk in- 
curring costs by defying the threat. If the 
individual cannot escape from the situation, 
he is trapped. His autonomy and freedom of 
choice are curtailed. 

In contrast, promises, although in many 
ways parallel to threats, do not share their 
restrictive impact to the same degree. The use 
of reward as an incentive does not create the 
same type of dilemma as does using punish- 
ment as an incentive. Rather than feeling 
cornered, the recipient of a promise is con- 
fronted with a choice among competing alter- 
natives which are positive in nature. Action is 
based on considerations of preference rather 
than on considerations of defense. Feelings 
of autonomy are not similarly jeopardized. 

Thus, the reward and punishment contin- 
gencies, which are by definition central to 
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promises and threats, have distinctly different 
consequences for an individual’s sense of 
freedom. This difference between threats and 
promises may account for differences in their 
effectiveness in securing compliance. Threats, 
as compared to promises, may be prone to 
evoke defiance because of the recipient’s 
desire to preserve his integrity and freedom 
to do as he pleases. Needs derived from 
self-concept maintenance, fear of appearing 
weak and foolish before others, and concerns 
about the relationship with the threatener are 
all possible motivators for such behavior. 
Unless their inherent restrictiveness is tem- 
pered, or defying them entails substantial 
costs, threats are likely to fail. 

If it is true that resistance to threats is 
precipitated by the constraints they impose, 
then loosening these constraints by confirming 
the recipient’s freedom to make his own deci- 
sions should decrease noncompliance. Such 
confirmation, however, should not appreciably 
affect the success of promises. Since promises 
are not as inherently restrictive, and thus are 
not as likely to make as salient the issues 
of freedom and autonomy, reassurance of 
freedom of choice is likely to be of only 
peripheral importance, 

Even if defiance of threats can be lessened, 
can threats ever be as effective in securing 
compliance as promises? If our reasoning is 
correct, the answer is an affirmative one. 
Counteracting the restrictiveness of threats 
should remove the major source of resistance 
to them. Therefore, when the recipient's right 
to choose is insured, the method of influence 
used—be it threat or promise—is expected 
to be of little importance in determining 
behavioral responses. 

Factors that more subtly mitigate or exac- 
erbate the restrictiveness of influence at- 
tempts also are apt to result in differing rates 
of resistance to threats. One such factor is 
the wording of the influence attempt. Influ- 
ence attempts can be either prescriptive (pre- 
scribing the behavior the influencer wishes to 
encourage) or proscriptive (proscribing the 
behavior the influencer wishes to discourage). 
Such semantic factors may psychologically 
define the situation in different ways (see 
Langer & Abelson, 1972, for a discussion of 
semantic factors in appeals for help). The 


MADELINE E. HEILMAN AND KATHERINE A. GARNER 


wording of a prescriptive threat underscores , 
the desires of the influencer, while the word- 
ing of a proscriptive threat emphasizes the 
denial of personal preference. Although iden- 
tical in intent, these threats are likely to have 
different psychological consequences; being 
forbidden to act as one wishes is more likely 
to make salient the reduction of one’s auton- 
omy than is being bidden to act in the inter- 
ests of another. Thus prescriptive threats are 
expected to produce more compliance than 
proscriptive ones. Again, however, such vari- 
ations are anticipated to have only minimal 
impact on compliance to promises. 

In the following experiment, subjects were. 
confronted with a choice between a pleasant 
and an unpleasant alternative and received 
messages from their co-workers containing 
either a threat of punishment or a promise of 
reward urging the subject to choose the un- 
pleasant alternative. Half of the threats and 
promises were worded in a prescriptive form, 
half in a proscriptive form. In addition, the 
message included information about whether 
the subject would be given a choice among ` 
alternative compliance modes. Half were told 
that if they complied they would be able to 
determine kow they complied; the other half 
were told they would not have such a choice. 

Three hypotheses have been advanced. It 
was predicted that threats would produce 
more compliance when the option to choose 
was openly extended than when it was not. 
No such differences were anticipated when 
the method of influence was a promise. 
Second, the difference in frequency of com- 
pliance obtained by threats and by promises 
was expected to be far greater when choice 
options were not made available than when 
they were made available to recipients. Last, 
it was hypothesized that prescriptive threats 
would elicit more compliance than proscrip- 
tive threats but that this distinction would 
have little effect on reactions to promises. 


METHOD 
Subjects 


Ninety-six females were recruited in their New 
York high school to participate in an experiment 
about impression formation. They were told at the 
time of recruitment that they could earn from $2.00 
to $3.00, depending upon what happened during the 
research session. 
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} 
emer Procedure 


The experiment was conducted in five group 
sessions during 3 consecutive days with group size 
ranging from 15 to 26 subjects randomly assigned 
to each of eight experimental conditions. Scheduling 
was done in such a way that subjects believed an- 
other group of girls was scheduled at the same time 
but in a different room. In fact, there was only one 
experimental group at any given time. 

At the start of the session, subjects learned that 
the study was designed to simulate conditions in 
the consumer taste-testing industry to study how 
people form impressions of other people in particular 

+ types of work relationships. To make the setting 
realistic for the subjects, the experimental room 


Subjects were told that they would be working in 
pairs with a peer in another room and were further 
informed that they would not learn the identity 
of their pairmate nor she of them; all identification 
was done by a system of code numbers. 

It was explained that each member of the work 
pair would be doing a different task. They, the sub- 
jects, would be “tasters” and their pairmates were 
to be “tabulators.” As tasters, they would be tasting 
a number of samples of either orange juice or one 
of several types of vinegar concentrate. Whether 
they tasted vinegar concentrate or orange juice was 
to be entirely up to them.: Their pairmates’ task 
was described primarily as a clerical one. 

Subjects were then told that, to parallel conditions 
in the testing industry, they and their pairmates 
would be paid differently. For their tasting chores, 
subjects were to be given a basic pay rate of $2.50 
for the session. But, their pairmates’ earning would 
vary depending upon what they, the subjects, 
chose to taste. A pairmate would be paid more if 
the subject who was her co-worker tasted vinegar 
concentrate than if she tasted orange juice. 

Pairmates were described as having the power to 
fine or bonus the subjects if they wished to do so. 
By filling out the appropriate change of payment 
forms, they ostensibly could increase or decrease the 
subject’s earnings by 50¢. It was emphasized that 
the pairmate’s choice to fine or bonus the subject 
would not directly affect the pairmate’s earnings, 
that is, fines collected from subjects would not be 
added to the pairmate’s earnings, nor would bonuses 
be subtracted from the pairmate’s earnings. Pair- 
mates also were to have the opportunity to send a 
short message to the subjects, saying whatever they 
wanted, Subjects understood that they would not 
have a chance to respond to these notes. Both of 
these aspects of the experimental situation—the fine 
and bonus system and asymmetry with respect to 
communication—were described as typical in the 
consumer taste-test setting. 

Subjects then were told that while there was only 
one type of orange juice to taste, there were four 
different varieties of vinegar concentrate: red, white, 
thyme, and rice vinegar concentrates.* Pairmates 
were said to control whether a subject would be able 
to select which of the vinegar concentrates she 


f contained necessary equipment for consumer testing. 


would taste if, in fact, vinegar concentrate were her 
tasting choice. Whether the pairmate gave the sub- 
ject a choice of vinegar types was believed to be 
entirely up to her, 

The instructions completed, an assistant who had 
previously left the room reentered with the messages, 
presumably sent from the subjects’ pairmates. Each 
message was sealed in a separate envelope. After 
reading the messages, subjects completed a short 
questionnaire. Each then indicated her choice to 
taste either orange juice or vinegar concentrate on 
a separate choice form. At this point, subjects were 
debriefed and asked not to discuss the experiment 
with anyone until the end of the week. 


Design 


The experimental design was a 2 X 2 X 2 factorial 
varying (a) whether the pairmate threatened to 
fine or promised to bonus the subject; (b) the 
wording of the threat or promise in a prescriptive or 
proscriptive manner; and (c) whether the pairmate 
allowed the subject a choice of the type of vinegar 
concentrate. 


Experimental Manipulations 


The messages which subjects received from their 
pairmates contained two parts. Because of the group 
setting, each message was written in a different 
handwriting with one of many writing instruments. 
All messages were prewritten by females. 

Pairmate’s threat or promise, prescriptive or pro- 
scriptive. The first part of the message form con- 
tained a blank space where the pairmate could 
ostensibly write whatever she wanted to say to the 
subject. In all cases, the subject was urged to taste 
the vinegar concentrate. The prescriptive threats 
and promises were worded, respectively: ‘Taste 
vinegar concentrate or I’ll make sure you are fined 
50¢. Otherwise nothing will happen.” and “Taste 
vinegar concentrate and I'll make sure you are 
bonused 50¢. Otherwise nothing will happen.” The 


17To insure that the subject’s choice would not 
be confused with the wishes of the experimenter, the 
instructions stated; “We who are conducting the 
research don’t care which you choose because taste 
tests are needed for all the products.” 

*Pilot work was done to insure that the four 
vinegar concentrates were perceived to be equally 
unpleasant to taste. Independent judges were asked 
to rate “How pleasant or unpleasant do you think 
it would be to taste X?” on a 9-point scale where 
higher numbers indicated greater unpleasantness. 
Each tasting substance was rated by 10 judges. The 
ratings for each of the vinegar types were: red 
vinegar, 6.5; white vinegar, 7.0; thyme vinegar, 7.0; 
rice vinegar, 7,3. These means were not significantly 
different from each other. An additional 10 judges 
rated the pleasantness of tasting orange juice. The 
resulting mean, 3.8, indicated that orange juice was 
perceived as more pleasant to taste than each of the 
vinegar concentrates. 
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proscriptive threats and promises were worded, re- 
spectively: “Don’t taste orange juice or I'll make 
sure you are fined 50¢. Otherwise nothing will 
happen.” and “Don’t taste orange juice and PI 
make sure you are bonused 50¢. Otherwise nothing 
will happen.” 3 

Choice-no choice. Printed at the bottom of the 
message form was the statement: “If your pairmate 
chooses vinegar, who will decide which kind she 
tastes,” and a place for the pairmate to check either 
“Tabulator” (indicating that the pairmate herself 
would make the decision) or “Taster” (indicating 
that the subject would make the decision). In the 
choice condition, the “taster” option was checked 
and in addition, the pairmate had written: “The 
decision would be up to you. You can choose what- 
ever kind you want.” In the no-choice condition, the 
pairmate had checked “Tabulator” and had addi- 
tionally written: “I will choose the type of vinegar.” 


Dependent Measures 


The main dependent measure was the subject’s 
choice of vinegar concentrate or orange juice indi- 
cating compliance with or defiance of the influence 
strategy. Questionnaire items also elicited the sub- 
ject’s reactions to her pairmate’s message and her 
impressions of her pairmate on a series of 9-point 
bipolar adjective rating scales. 


RESULTS 


Questionnaire data verified that the experi- 
mental inductions had their intended effect. 
Responses indicated that all subjects appro- 
priately interpreted the influence notes they 
received to be threats or to be promises, and 
that virtually all of them believed that their 
pairmates meant to carry out their threats or 
promises. (The five subjects who did not find 
the influence attempt credible were all in 
different experimental conditions.)* All sub- 


*The phrase “Otherwise nothing will happen.” 
was included to insure that subjects realized that 
complying with a threat would not result in a bonus 
and that failure to comply with a promise would 
not result in a fine. Data from pilot work suggested 
that this clarification was needed. 

4To assess the credibility of the influence tech- 
nique, subjects were asked to indicate what would 
happen if they tasted vinegar concentrate and what 
would happen if they tasted orange juice by check- 
ing one of three responses for each question: “I will 
be fined 50¢”; “I will be bonused 50¢”; or, “I will 
be neither fined nor bonused.” The threat was con- 
sidered credible if subjects in threat conditions be- 
lieved they would be fined for tasting orange juice 
and neither fined nor bonused for tasting vinegar 
concentrate. The promise was considered credible if 
subjects in promise conditions believed they would 
be bonused for tasting vinegar concentrate and 
neither fined nor bonused for tasting orange juice. 
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TABLE 1 


NuMBER or Susjecrs WHo CHose TO Tas’ 
IN EACH EXPERIMENTAL CONDITION 


VINE 


| | 
| No choice | Choice 
Format | 
| Threat | Promise | Threat | Pro 
| | 
Prescriptive 4 | 10 11 11 
Proscriptive 5 | 9 8 9 


Note. Twelve subjects were included in each of the eigh 
experimental conditions, 


jects also understood the choice manipulation 
When asked, “If you taste vinegar concen- 
trate will you or your pairmate decide which 
of the four types you will taste?” those in 
choice conditions unanimously checked “I 
will” and those in no-choice conditions all 
checked “My pairmate will.” 


Compliance 


Table 1 presents the number of subjects 
who complied (chose to taste vinegar concen- 
trate) in each of the experimental conditions. 
These data were analyzed using a procedure 
developed by Dunn (1961) for examining 
predetermined contrasts among binomial pop- 
ulations. Three simultaneous confidence inter- 
vals were generated, corresponding to each 
of the experimental hypotheses. The overall 
confidence coefficient was set at .95. 

The hypothesized effect of the choice vari- 
able on compliance to threats was supported: 
Subjects were significantly more likely to 
comply to threats when given a choice than 
when not given a choice about how to com- 
ply ($ < .05). It is apparent from examina- 
tion of Table 1 that incorporation of a choice 
in the influence message had virtually no ef- 
fect on compliance to promises. The hypothe- 
sized interaction effect between choice and 
method of influence was also supported. The 
traditional difference between frequency of 
compliance to threats and promises all bu‘ 
disappeared in choice conditions, while it hel 
firmly in no-choice conditions (p < .05). The 
third hypothesis, predicting an interaction 
involving prescriptive versus proscriptive in- 
fluence attempts, was not supported. There 
was no significant difference in the rate of 
compliance elicited by the two different types 
of influence attempts in either threat or 
promise conditions. 
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TABLE 2 


MEans OF SUBJECTS’ RATINGS ON THE 
“PLEASED-ANGRY” SCALE 
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TABLE 4 


PERCEPTION OF PAIRMATE: MEAN FACTOR SCORES FOR 
THE EVALUATIVE FACTOR 


No choice Choice No choice Choice 
Format n— IIO Format 
Threat | Promise | Threat | Promise Threat | Promise | Threat | Promise 
Prescriptive 7.167 | 4.317 | 7.091 | 2.583 Prescriptive 8.211 | 5.846 | 7.472 | 4.027 
Proscriptive 7.083 | 4.167 | 6.545 | 3.231 Proscriptive 8.477 | 5.361 | 6.919 | 4.015 
Combined 7.125 | 4.242 | 6.818 | 2.907 Combined 8.344 | 5.604 | 7.196 | 4.021 
Note. Ratings are on a 9-point bipolar scale where 1 = pleased Note. Lower means signify aoe positive evaluations, # = 12 


and 9 = angry. » = 12 in each experimental condition. 


Affective and Attitudinal Reactions 


Affective responses to the influence strat- 
egies did not parallel the subjects’ compliance 
decisions. Differences were confined to two 
main effects. On all scales provided, threats 
elicited significantly more negative responses 
than promises. Subjects who were threatened 
characterized themselves as feeling signif- 
cantly angrier and more defiant, uncoopera- 
tive, insulted and put down than did those 
who were promised (p < .01 on each of these 
dimensions). A significant main effect for the 
choice variable on each of these scales indi- 
cated that when given a choice, subjects con- 
sistently reacted less negatively than when 
not given one (p < .05 on each dimension). 
However, in some cases, this tendency was 
much stronger in promise conditions than in 
„threat conditions. It should be noted that the 
threat-promise main effect accounted for a 
substantially greater percentage of the vari- 
ance than the choice main effect on all affec- 
tive dimensions, suggesting that the type of 
influence strategy was a more potent determi- 
nant of subjects’ affective responses than 


TABLE 3 


ANALYSIS OF VARIANCE OF SUBJECTS’ RATINGS ON THE 
“PLEASED-ANGRY” SCALE 


Percentage of 


Source df MS F total sum of 
squares 
Prescriptive- 
proscriptive (A) | 1 «436 | 121 07 
Threat-promise (B) | 1 | 240.413 | 66.558%* 40.80 
Choice-no choice (C)| 1| 20.046 | 5.550* 3.40 
AXB 1 744 | .206 «13 
AXE 1| 1.844] 510 31 
BXC 1| 8,946 | 2.477 1.52 
AXBXC 1| 6.136 | 1.699 1.04 
Error (Unit) 88 | 3.612 52.72 
*p <.05 


m each experimental condition, 


whether she was offered a choice. Means and 
analysis of variance of the subjects’ ratings 
on the most central affective scale (“pleased— 
angry”) can be found in Tables 2 and 3, 
respectively. 

Subjects’ perceptions of their pairmates 
were obtained by ratings on a series of 9-point 
bipolar adjective scales. When these ratings 
were factor analyzed using a Varimax rota- 
tion, two factors were found to account for 
approximately 70% of the variance: evalua- 
tion and potency.’ Weighted factor scores 
were computed for each factor and analyses 
of variance were conducted. No significant 
differences emerged among conditions in the 
analysis of the potency factor. All influencers 
were perceived as equally potent. However, 
two distinct main effects emerged for the 
evaluative factor (see Tables 4 and 5). More 
favorable evaluations were made of (a) 
promisers than threateners (p < .001) and 
(b) those pairmates believed to have pro- 
vided a choice than those pairmates believed 
to have not (p < .001). 


Discussion 


These findings support the notion that 
resistance to threats arises from their re- 
strictive nature. When restrictiveness was 
counteracted by adding a choice component 


5 The scales and their factor loadings included in 
the evaluative factor were the following: kind-nasty, 
.91; considerate-inconsiderate, .90; likable-not lik- 
able, .90; good-bad, .86; friendly—hostile, .84; co- 
operative-uncooperative, .83; treats as an equal—does 
not treat as an equal, .82; fair—unfair, .78; selfish- 
unselfish, .77. The scales and their factor loadings 
included in the potency factor were the following: 
weak-strong, .65; wishy-washy-determined, .58; 
active-passive, .51. 
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TABLE 5 


PERCEPTION OF PAIRMATE: ANALYSIS OF VARIANCE 
FOR THE EVALUATIVE Factor 


Percentage of 


Source df MS F total sum of 

squares 

Prescriptive- 
proscriptive (A) 1 1.757 -605 34 
Threat~promise (B) | 1| 199.395 | 6s.63s* 38.98 
Choice-no choice (C)] 1 49.815 | 17.147* 9.74 
AXB 1 .012 004 -00 
xC 1 -626 -215 «12 
BXC 1 484 166 | 209 
AXBXC 1 3.800 | 1:308 74 
Error (Unit) 88 2,90: 49.98 

*p < 001, 


to a threat, defiance was sharply diminished. 
In fact, when choice was incorporated into 
the threat presented in this situation, it was 
as effective at securing compliance as a 
promise. 

Contrary to predictions, prescriptive influ- 
ence messages did not differ appreciably from 
proscriptive ones, whether threats or prom- 
ises, in obtaining compliance. In fact, there 
is no evidence that subjects differentially per- 
ceived or responded to the subtle phrasing of 
the influence attempt in any way. Thus, fail- 
ure to support this compliance hypothesis 
does not necessarily undermine our central 
thesis that variations in degree of restrictive- 
ness affect the extent of resistance to threats, 
Rather, it raises the question of whether pre- 
scriptive and proscriptive threats are, as we 
had assumed, experienced differently. 

The critical question posed by these find- 
ings is why should including the opportunity 
to choose in a threat have such dramatic 
consequences? Pilot work insured that each 
of the vinegar alternatives was almost identi- 
cal in perceived valence. Therefore, it is not 
likely that heightened compliance to threats 
in the choice condition was a result of the 
subjects’ greater desire to taste one rather 
than another of the vinegar types. Nor is it 
plausible that differences in the credibility of 
the influence attempt account for these re- 
sults. Data collected from subjects indicated 
that virtually all of them believed the influ- 
encer would follow through on her threat 
or promise. 

One could argue that being offered a choice 
simply softens the blow of being threatened, 
thereby boosting positive feelings about the 


threatener and the interaction and 1 y 
them comparable to those experienced 1 
promised, From here, it is a simple ta o 
explain our compliance results. However 
data refute this argument. In fact, the lack f 
consistency found between these participa 
attitudes and feelings on the one hand ; 
the behavior they engaged in on the othe; 
very striking. This finding is a provocati, 
one, suggesting that voluntary behaviora 
compliance need not be accompanied by 
positive affect. 

It is quite clear from the data that threat: 
sparked distinctly more unfavorable reactions 
than did promises in choice as well as no- 
choice conditions, Threats were always re- 
ceived unenthusiastically. Negative feelings 
and attitudes were not substantially neutral- 
ized when threatened subjects received word 
of their privilege to choose. Why, then, did 
these subjects, angry according to their own 
reports, abide by the wishes of their pair- 
mates as often as those who were promised? 
All that differed in the choice and no-choice q 
situations was the prospect of engaging in the’ 
process of making a choice. Why is this so 
important? 

Several possibilities come to mind. It is 
conceivable that the opportunity to choose 
satisfied some personal need experienced by 
participants. One could argue, for instance, as 
does Brehm ( 1966), that the reason people 
defy threats is to protect or regain what they 
believe to be their right to choose freely, 
Their objective is to reduce the personal dis- 
comfort (reactance) which results from such 
infringements on autonomy. From this point 
of view, being extended a choice, even a 
trivial one, can nullify the constraints inher- 
ent in the threat, thereby dispelling the 
motivation to defy. In Brehm’s terms, 
reactance is eliminated. 

But why would negativism toward the 
threatener prevail? Because reactance is dis- 
pelled by the choice option, other conse- 
quences of the threat need not be. The effects 
of threats on subsequent attribution Processes 
and their salience even when confronted with 
new and contradictory data about the threat- 
ener has been documented (Heilman, 1974), 
The recent work of Kanouse and Hanson 
(1971) is relevant to these findings. They 
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argue that negative information about an- 
other is weighted more heavily than posi- 
tive information in forming impressions when 
both types of information are available. 
Consequently, in this study the offer of a 
choice (which in isolation might foster 
favorable attitudes) may have been over- 


shadowed by the negativity of the threat in ` 


determining impressions of the pairmate al- 
though, at the same time, it may have been 
very effective in minimizing reactance. 

Another plausible explanation of our com- 
pliance results focuses on interpersonal rather 
than intrapersonal needs and motivations. 
From this point of view, it is not the existence 
of a choice, per se, but that it is offered by 
the threatener which is critical. Several re- 
searchers (Deutsch & Krauss, 1960, 1962; 
Kelley, 1965) have argued that people resist 
threats because complying implies an accept- 
ance of a status difference imposed by the 
threatener and, as such, it signifies submitting 
or yielding. If this is so, the addition of a 
choice by the threatener may reaffirm the 
recipient’s status as an equal, making it pos- 
sible for him to cooperate publicly without 
feeling he appears weak or foolish. Again, 
however, this need not necessarily imply that 
the recipient will radically alter his evalua- 
tions of the threatener. In short, being of- 
_ fered such a choice by the threatener, himself, 
may enable the recipient to comply without 
“losing face.” 

The data collected do not lend any insights 
into which one of these explanations is the 
more accurate. In this study, all of those who 
were in a position to choose how to comply 
were well aware that the opportunity to do 
so was extended by their pairmates. Thus, it 
is impossible to determine whether the rather 
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striking attenuation in the incidence of resist- 
ance to threats incorporating choice options 
was mediated by personal or interactional 
concerns. Research is now in progress which 
has been designed to do so. 


REFERENCE NOTE 


1. Rubin, J., & Lewicki, R. J. A three-factor ex- 
perimental analysis of interpersonal influence. 
Unpublished manuscript, Tufts University, 1971. 
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Past research has found that people generally prefer tasks of intermediate 
difficulty to very easy or very hard tasks. Intermediate difficulty tasks can 
maximize both the motivational value of success and the informational value 
of the outcome and are thus predicted by both an incentive value and an 
informational value approach. The present study unconfounds the predictions 
of these two approaches by applying a Bayesian analysis to attributions of 
ability made from success or failure. If people are primarily interested in 
maximizing the information they gain about themselves, they should prefer 
easy or hard tasks to moderate difficulty ones when the easy or the hard 
tasks are made more diagnostic of their true ability. If people are primarily 
interested in maximizing the simple expected value of success, they should 
prefer intermediate difficulty tasks regardless of diagnosticity. Data from 120 
male undergraduates strongly support the idea that diagnosticity is the major 
determinant of preference among achievement tasks and suggests that it 
should be a basic parameter in all research on self- and social attribution. 


The theory of achievement motivation 
(Atkinson, 1975) predicts that people who 
are primarily concerned with seeking success 
(high in achievement motivation) should pre- 
fer tasks of intermediate difficulty rather than 
tasks that are very easy or very hard. This 
prediction is based on the assumption that 
the incentive value of an outcome in an 
achievement task is inversely proportional to 
the probability of that outcome. Thus if the 
probability of success is high, or the task is 
easy, the incentive value of success is low, 
while if the probability of success is low, or 
the task is hard, the incentive value of success 
is high. The expected value of an achieve- 
ment task is the product of the probability 
of success on that task times the incentive 
value of success. This expected value is maxi- 
mal when the probability of success and fail- 
ure are both equal to .50, thus implying that 
subjects motivated to seek success can maxi- 
mize their expected satisfaction by working 
on tasks of intermediate difficulty. A number 
of studies have confirmed this prediction both 
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for subjects high in achievement motivation 
and for general populations of college stu- 
dents, who are generally high in achievement 
motivation (see Atkinson & Feather, 1966; 
Atkinson & Litwin, 1960; Litwin, 1966; 
Moulton, 1965). 

An alternative approach emphasizes the 
informational value of success or failure in an 
achievement task (Weiner, Krieze, Kukla, 
Reed, Rest, & Rosenbaum, 1972). Informa- 
tion theory (see Garner, 1962) postulates 
that the informational value of an outcome 
is inversely proportional to the probability 
of that outcome. Thus, the expected informa- 
tional value of a task is also maximal when 
the probability of success and the probability 
of failure are both equal to .50. Weiner et al. 
argue that individuals high in achievement 
motivation prefer intermediate difficulty tasks 
not because these maximize the incentive 
value of success but because they maximize 
the informational value of the outcome. In 
particular, they argue, intermediate difficulty 
tasks maximize the extent to which people can 
draw inferences about their own ability level 
from their outcomes on the task, rather than 
attributing their success or failure to external 
causes, such as the ease or difficulty of the 
task in the first place. 
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DETERMINANTS OF CHOICE AMONG TASKS 


It is incorrect, however, merely to assume 
that a task where success and failure are 
equally likely is necessarily more informative 
about the level of ability a person possesses. 
This in turn depends upon additional param- 
eters, namely, (a) the conditional probability 
that persons with one level of ability will 
succeed at that difficulty level as compared 
to the conditional probability that persons 
with another level of ability will succeed at 
that difficulty level and (b) the conditional 
probability that persons with one ability level 
will fail as compared to the conditional prob- 
ability that persons with another level of 
ability will fail. Consider the simplest case 
in which we have two kinds of people, those 
with high ability (H) and those with low 
ability (L) performing a task with two pos- 
sible outcomes, success (S) and failure (F). 
Let us now consider a very easy task, in 
which the overall probability of success is .90, 
and an intermediate difficulty task, in which 
the overall probability of success is .50. Sup- 
pose further that on the very easy task, the 
conditional probability that a person with 
high ability succeeds, or P(S|/H), is .97, 
while the conditional probability that a per- 
son with low ability succeeds, or P(S|L), is 
.80; and that on the intermediate difficulty 
task, P(S|H) is .52, while P(S|L) is .48. 
It is clear then that success on the easy task 
distinguishes better between persons with dif- 
ferent ability levels than success at the inter- 


“ mediate difficulty task. Since the conditional 


probabilities of failure, P(F|H) and P(F|L), 
are the complements of P(S|H) and P(S|L), 
respectively, failure on the easy task also dis- 
tinguishes better between ability levels than 
failure on the intermediate difficulty task. 
Under these circumstances a person interested 
in learning what his ability level is should pre- 
fer the easy task to the intermediate difficulty 
task, because the easy task promises to be 
more diagnostic of his ability level. Diagnos- 
ticity is a term we will shortly define in a 
precise Bayesian sense. 

If intermediate difficulty tasks are gen- 
erally more informative about a person’s 
ability level than very easy or very hard 
tasks, this is because intermediate difficulty 
tasks are ordinarily more efficient at discrimi- 
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nating between people of different ability 
levels. Mathematically, very easy or very 
difficult tasks cannot be made as diagnostic 
as a maximally diagnostic moderately difficult 
task, since the necessity of maintaining the 
overall difficulty level or P(S) at a particular 
value constrains how far apart the conditional 
probabilities of success, P(S|H) and P(S|L), 
can be set. The following equation must hold: 


P(S) = P(S|H) P(H) + P(S|L) P(L). 


If we assume that people with high ability 
and people with low ability are equally 
common and are represented equally in the 
determination of the overall difficulty of the 
task, then the initial probabilities of high 
and low ability, P(H) and P(L), both 
equal .5. Under these circumstances the con- 
ditional probabilities of success must be 
symmetric about the overall difficulty level. 
It is easy to see that P(S|H) and P(S|L) 
can and usually would be farther apart when 
they must average out to .50 than when they 
must average out to .90 or .10. Clearly, the 
conditional probabilities of failure are also 
more constrained in easy and hard tasks than 
in intermediate difficulty tasks. 

Nevertheless, as we have seen, it is quite 
possible to arrange matters so that a very 
easy or a very hard task is more diagnostic 
than a task of intermediate difficulty. This 
gives us the important ability to unconfound 
the previously identical predictions made by 
the incentive value and informational value 
approaches to choice among tasks in achieve- 
ment situations. If in fact people choosing 
moderately difficult tasks in past research 
were doing so to maximize the information 
they gained about themselves, they should in 
turn choose the easy or the hard tasks and 
avoid the moderate difficulty ones when the 
easy or the hard tasks are made more diag- 
nostic. If subjects are seeking to maximize the 
simple expected value of success, they should 
continue to choose the intermediate difficulty 
tasks regardless of diagnosticity. 

The foregoing reasoning is in fact an appli- 
cation of the principles of Bayesian inference 
to the problem of attribution of ability in 
achievement situations. Bayes’s theorem has 
been used by the present authors in earlier 
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work as the basis for reformulating tradi- 
tional problems in social judgment and at- 
tribution (Brickman, 1972; Brickman & 
Pierce, 1972; Trope, 1974; Trope & Burn- 
stein, in press; Trope, Note 1). In the present 
situation, if a person can specify what his 
prior odds are that he has high ability, or 
P(H)/P(L), and also what the conditional 
probability of success is both given high abil- 
ity and given low ability, or P(S|H) and 
P(S|L), Bayes’s theorem specifies how these 
variables should be combined to determine his 
odds on having high ability after having per- 
formed the task and learned whether he suc- 
ceeded or failed. For the case of two ability 
levels and two outcomes, the model states 
that the posterior odds favoring high ability 
given success should be: 


The prior odds on high ability are thus re- 
vised following the observation of success by 
a quantity known as the likelihood ratio asso- 
ciated with the outcome of success, which is 
the ratio of the conditional probability of 
success given high ability and the conditional 
probability of success given low ability. This 
likelihood ratio represents the informational 
or diagnostic value of the outcome. 

Using the idea of likelihood ratios it is 
possible to derive an expected diagnostic 
value for a task. The expected diagnostic 
value or expected diagnosticity (ED) repre- 
sents how much information a person can 
expect to gain about his ability by perform- 
ing the task. This expected value is the sum 
of the diagnostic value of each possible out- 
come weighted by the probability of that 
outcome, or 


ED = Ð P(O)LRO)), 
i 


where Oj; refers to outcome j and LR(O;) 
refers to the likelihood ratio associated with 
outcome j. 

In this study the difficulty of tasks and 
their expected diagnostic values are simul- 
taneously manipulated to test the hypothesis 
that people prefer tasks of high expected 
diagnosticity regardless of their difficulty. 
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METHOD 
Subjects 


One hundred twenty male undergraduates enrolled 
in an introductory psychology course at the Univer- 
sity of Michigan participated in a study of “mental 
abilities.” Participation fulfilled part of a course 
requirement. 


Design 


Each experimental condition involved one easy 
task, P(S) =.70, one moderate task, P(S) = .515, 
and one hard task, P(S) =.305. Each of these tasks 
had high diagnosticity (ED = 2.8) in some condi- 
tions and low diagnosticity (ED = 1.1) in other 
conditions. The design included four experimental 
conditions involving combinations of task difficulty 
and diagnosticity. In Condition 1, only the moderate 
difficulty task has high diagnosticity. In Condition 2, 
the mirror image of Condition 1, only the moderate 
difficulty task has low diagnosticity. In Condition 3, 
only the easy task has high diagnosticity, and in 
Condition 4, the mirror image of Condition 3, only 
the difficult task has high diagnosticity. Thirty 
subjects were assigned to each of the four conditions. 


Procedure 


Subjects in groups ranging from five to seven were 
told by the experimenter that they were taking part 
in a study on individual differences in “integrative 
orientation” using a test called the “Minnesota 
Multiphasic Integrative Orientation Test.” The test 
booklet indicated that people high in integrative 
orientation were better able to generate new infor- 
mation from previously known information, to ab- 
stract and generalize, to find new solutions to 
problems, and that the test predicted many differ- 
ences in general intellectual accomplishment. It was 
emphasized that standard intelligence tests did not 
dependably measure integrative orientation. 

Subjects were repeatedly told that previous phases 
of the survey had established that about half of the 
students at the University of Michigan were high 
in integrative orientation and half were low. This 
was done in order to establish the prior P(H) and 
P(L) at approximately equal levels in all conditions. 

At this point subjects were introduced to the 
battery of three tests that comprised their experi- 
mental condition. The three tests were always labeled 
test H, test K, and test J. Subjects were told that 
each test consisted of 16 items involving verbal and 
nonverbal analogies, with the type of item differing 
from test to test. It was further explained that 
since previous research had found that “when exam- 
inees determine for themselves the type of items 
they work on, their performance reflects much better 
their real integrative orientation,” subjects would be 
free to decide for themselves how many items from 
each test they would work on. At this point sub- 
jects were given a second booklet describing the 
specific tests in their condition, after which they 
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: TABLE 1 
3 PERCENTAGES OF Success AMONG STUDENTS HIGH OR Low IN ABILITY FOR TESTS VARYING IN 
EXPECTED DIAGNOSTIC VALUE AND DIFFICULTY 
Difficulty 
Expected diagnostic 
value 
Moderate Difficult 
High P(S|H) = 75.16 P(S|H) = 50.85 
P(S|L) = 27.84 P(S|L) = 10.15 
Low P(S|H) = 54.98 P(S|H) = 33.44 
P(S|L) = 48.02 P(S|L) = 27.56 


made their choices and filled out the dependent 
measure items. 


Manipulation of Difficulty and Diagnosticity 


The major information about difficulty and diag- 
nosticity was presented to subjects in the form of a 
bar d am that depicted the percentages of stu- 
dents high and low in integrative orientation who 
succeeded and failed-on items from tests H, J, and 
K, respectively. Each test was represented by two 
bars, one displaying the percentages of success and 
failure for students high in integrative orientation, 
or P(S|H) and P(F|H), and one displaying the 

y percentages of success and failure for students low 


«in integrative orientation, or P(S|L) and P(F|L). 


The shaded portion of each bar corresponded to 
the percentage of success and the unshaded portion 
to the percentage of failure. Table 1 gives the re- 
ported percentages of success for high and low abil- 
ity students in each diagnosticity and difficulty 
condition. These percentages were combined to pro- 
duce the four conditions described above. It can 
be noted that for each task P(S|H) and P(S|L) 
are symmetric about the overall P(S). The precise 
values of P(S|H) and P(S|L) were in fact calcu- 
lated from an expanded version of our equation for 
expected diagnosticity in which P(H), P(L), and ED 
were specified. Formally, 


ED = P(H) [P(S|H) LR(S) + P(FJH) LR(F)] + 
P(L) [P(S|L) LR(S) + P(F|L) LR(F)]. 


The diagrams were accompanied by a brief expla- 
nation in which subjects were told, for high diag- 
` nosticity tests, that the diagram indicated that there 
was a relatively large difference between the per- 
formance of students high in ability and students 
low in ability. For low diagnosticity tests, subjects 
were told that this difference was relatively small. 
Furthermore, for each test it was indicated whether 
the overall percentage of success was high, moderate, 
or low, and the overall percentages of success and 
failure were specifically reported. 


Measurement of Dependent Variables 


Subjects were informed that they were to choose a 
total of 16 items to work on and that they them- 
selves could decide how many items would come 


from each test. Before stating how many items from 
each test they chose, subjects answered two ques- 
tions about their preferences among the three tests. 
The first question asked: “Which test out of three, 
test H, test K, and test J, do you prefer most to 
work on? If you had to work on all items of just 
one test, which test would you choose?” The second 
question asked: “How strong is your preference for 
each of the tests, compared to the others?” This 
question was followed by a 10-point bipolar scale 
ranging from 1 for “very weak preference” to 10 
for “very strong preference.” Subjects rated their 
preference for each of the tests by marking the 
letters H, J, and K at appropriate points on the 
scale. Subjects then indicated how many items from 
each test they would like to work on. 

In a subsequent booklet subjects were asked to 
state their expectations about each test by rating 
(a) how difficult they expected it would be, (b) 
how challenging they expected it would be, (c) how 
gratifying success on the test would be, (d) how 
distressing failure of the test would be, and (e) how 
good they expected the test to be for finding out 
whether they were high or low in integrative orien- 
tation. All questions were answered by placing the 
three tests at appropriate points on 10-point bipolar 
rating scales, 


RESULTS 


To test the effects of the diagnosticity and 
difficulty manipulations, each of the depen- 
dent measures involving ratings was subjected 
to two analyses of variance. One analysis of 
variance included diagnosticity as a within 
subjects factor and experimental conditions 
as a between subjects factor. The other analy- 
sis of variance included two levels of difficulty 
as a within subjects factor, where difficulty 
varied within equal levels of diagnosticity, 
and, as before, experimental condition as a 
between subjects factor. In these analyses, 
the main effects of diagnosticity and diffi- 
culty and the interactions of diagnosticity and 
of difficulty with experimental conditions are 
orthogonal and partition the variation in the 
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overall design due to tests and Tests X Ex- 
perimental Condition interaction. 


Manipulation Checks on Difficulty and 
Diagnosticity 

Subjects’ ratings in response to the ques- 
tions about the difficulty and the diagnosticity 
of the tests were used as manipulation 
checks. The mean ratings of test difficulty 
over the four conditions were 4.13, 5.36, and 
7.57 for the easy, moderate, and difficult 
tests, respectively. The analysis of variance 
with difficulty and condition as factors yielded 
a powerful effect of difficulty, F (1,116) = 
130.34, p < .005, and a significant Difficulty 
X Condition interaction, F (3,116) = 3.72, 
p< .05. The interaction reflected a larger 
difference between the two levels of difficulty 
in the first two conditions, where the extreme 
levels of difficulty were contrasted, than in 
the last two conditions where adjacent levels 
of difficulty were compared. The mean rat- 
ings of diagnosticity over all conditions com- 
bined were 3.97 and 6.78 for the low and 
high diagnosticity tests, respectively. The 
analysis disclosed a strong effect of diagnos- 
ticity, F (1,116) = 101.38, p < .005, and an 
insignificant Diagnosticity X Condition inter- 
action (F = 2.61). We can conclude, then, 
that subjects were able to judge the difficulty 
of tests from overall probabilities of success 
and failure and to combine the overall proba- 
bilities and conditional probabilities of out- 
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MEAN NUMBER OF Items SELECTED, MEAN OF PREFERENCE RATINGS, AND CHOICE FREQUENCY 
FOR EACH TEST IN Four EXPERIMENTAL CONDITIONS 
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comes into judgments about the diagnosticity EE 
z 
of tests. 


Item Selection, Test Preference, and Test 
Choice 


We turn now to the measures of preferenc 
among the tests, namely, the number of ite 
selected from each test, the ratings of degre 
of preference for each test, and the frequenc) 
with which each test was chosen as the most ` 
preferred test. The pertinent data are dis- 
played in Table 2. 

Table 2 shows that high diagnosticity tests 
were preferred to low diagnosticity tests on all y 
three measures of preference in every condi- 
tion. This conclusion was supported by the 
analyses of variance of the number of items 
chosen from each test and the preference rat- 
ings of each test. These analyses yielded 
highly significant F ratios for the main effect 
of diagnosticity (F = 19.28, p < .005 and F 
= 12.07, p < .005, for the number of items 
and preference ratings, respectively) and in- 
significant Diagnosticity X Condition interac- 
tions. The choices of the most preferred test 
were collapsed into a two-dimensional con- 
tingency table (diagnosticity by condition) 
As expected, the partitioning of the total chi- 
square (see Winer, 1962) disclosed a highly 
significant chi-square associated with 
nosticity, x° (1) = 16.59, p < .005, an: 
insignificant chi-square for the Diagnostic 
x Condition interaction. 


Measure 
Experimental Number of Items Preference Ratings Choice Frequency 
condition M 

Easy Moderate | Difficult Easy Moderate | Difficult Easy Moderate | Difficult 
Test Test Test Test Test Test Test Test Test 

1 L H L L H L L H L 
5.50 6.87 3.63 6.00 6.58 4.05 11 16 3 

2 H L H H L H H L H 
7.37 3.80 4.83 6.65 4,88 5.02 16 4 10 

3 H L L H L L H L L 
7.03 4.37 4.60 5.80 5.25 4.92 15 6 9 

4 L L H L L H L L H 
4.90 4,33 6.77 5.48 5.05 6.08 9 6 15 


Note. The data for Number of Items Tests and Preference Ratings Tests are mean values. The data for Choice Frequency Tests 
are the number of subjects who chose each test. L = low diagnosticity, H = high diagnosticity. 
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TABLE 3 
RATINGS OF EXPECTED CHALLENGE, GRATIFICATION FROM SUCCESS AND DISTRESS FROM FAILURE: 
MEANS For EACH TEST IN Four EXPERIMENTAL CONDITIONS 
Measure 
Experimental Challenge Gratification Distress 
condition La 
Easy | Moderate | Difficult Easy Moderate | Difficult Easy | Moderate | Difficult 
Test Test Test Test | Test Test Test Test Test 
1 L H L L H L L H L 
4.65 6.17 7.78 4.40 6.58 7.50 6.80 5.82 3.50 
2 H L H H L H H L H 
4.98 5.67 7.75 4.68 5.50 6.87 6.18 4.82 3.93 
3 H L L H L L H L L 
4.45 5.60 7.25 5.00 4.87 5.68 5.20 4.37 3.75 
4 L L H L L H L L H 
3.88 | 5.32 8.33 4.07 5.00 7.67 6.00 5.35, 4,62 


The data in Table 2 also indicate that 
among tests of equal diagnosticity easy tests 
were preferred to more difficult tests. Signifi- 
cant main effects of difficulty were obtained 
for both number of items and preference rat- 
ings (F = 5.65, p < .05 and F = 16.27, p< 
005, respectively). The Difficulty x Condi- 
tion interaction effect was insignificant for 
both measures. To determine the effect of 
difficulty on choices of the most preferred 
test, the choices were collapsed into a two- 
dimensional table (difficulty within equal di- 
agnosticity levels by condition). The parti- 
tioning of the total chi-square indicated a 
marginally significant effect of difficulty, 
x2 (1) = 2.8, p<.1, and an insignificant 
Difficulty X Condition interaction effect. 


Ratings of Challenge, Gratification from 
Success, and Distress from Failure 


We come now to the analysis of subjects’ 
expected affective reactions to the tests. Table 
3 presents the means of the ratings of each 
test with respect to how challenging it is 
expected to be, how gratifying is success ex- 
pected to be, and how distressing is failure 
expected to be. 

The effects of difficulty on these ratings 
are striking. With one minor exception (of 
the rating of gratification in the third condi- 
tion) the ratings on all three dimensions in 
all four conditions are linearly related to 
difficulty. Challenge and gratification are 
positively related to difficulty, while distress 


is inversely related. The analysis of variance 
with difficulty and condition as factors yielded 
powerful main effects of difficulty (F = 
122.58, p < .005; F = 73.13, p < .005; and 
F = 68.98, p < .005, for the measures of chal- 
lenge, gratification, and distress, respectively). 
The interaction between difficulty and condi- 
tion was also significant in these analyses 
(F = 4.20, p < .05; F = 6.00, p < .005; and 
F = 10.17, p < .005, respectively). These in- 
teractions merely reflect the fact that in the 
first two conditions, with diagnosticity held 
constant, easy and difficult tests were con- 
trasted, while in the last two conditions, 
with diagnosticity held constant, moderate and 
difficult or moderate and easy tests were con- 
trasted. Not surprisingly, the effects of diffi- 
culty were greater in the first two conditions 
than in the last two. 

Diagnosticity also affected affective ratings, 
although to a lesser extent. Ratings of chal- 
lenge and gratification were reliably higher 
for high diagnosticity tests than for low diag- 
nosticity tests (F = 8.48, p< .005; F= 
9.43, p < .005, respectively), although diag- 
nosticity did not affect ratings of distress. In 
those cases where difficulty and diagnosticity 
work against one another in the determination 
of challenge or gratification (the high diag- 
nosticity moderate test versus the low diag- 
nosticity difficult test in Condition 1, or the 
high diagnosticity easy test versus the low 
diagnosticity moderate test in Conditions 2 
and 3), it is the more difficult and less diag- 
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nostic test that is rated more challenging and 
gratifying to succeed at. 


Discussion 


The pattern of results for all measures in 
all four experimental conditions were in the 
expected direction with high diagnosticity 
tasks preferred to low diagnosticity tasks. 
The data definitely failed to confirm the hy- 
pothesis that moderately difficult tasks are 
preferred to easy and difficult tasks. The 
effect of difficulty, when it appeared, was 
manifested in a preference for the easier tasks 
over the more difficult ones. It is worth not- 
ing, however, that high diagnosticity tasks of 
greater difficulty were always preferred to low 
diagnosticity easier tasks. Hence, it seems 
reasonable to conclude that subjects were 
primarily oriented toward maximizing the ex- 
pected amount of information about their 
ability, rather than either the expected value 
of success in the motivational terms of the 
theory of achievement motivation or the sim- 
ple information value of the outcome (success 
or failure) in terms of information theory. To 
the extent that a cognitive and a motiva- 
tional approach are opposed, the present 
study clearly indicates that people seek infor- 
mation and not merely satisfaction. 

With the diagnostic value of the task held 
constant, however, an interest in satisfaction 
may be seen to emerge in subjects’ preference 
for easier tasks over harder ones. It may be 
that subjects merely wished to obtain the in- 
formation about their ability with minimum 
effort, which presumably would be associated 
with easier tasks. Or it may be that subjects 
desired to maximize their chances of the 
pleasures of success and felt free to do so 
once assured that the task with the higher 
probability of success would be just as in- 
formative as the one with the lower proba- 
bility of success. If people prefer more diffi- 
cult or challenging tasks, they may do so only 
when these tasks promise to be more informa- 
tive or diagnostic, and then in spite of, rather 
than because of their difficulty. 

While the data on task choice indicate that 
diagnosticity was a primary determinant of 
this decision, the data on expected affective 
reactions to success and failure indicate that 
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difñculty was the primary determinant of per- 
ceptions of. how challenging the task is, how 
gratifying success would be, or how distress- 
ing failure would be. These data confirm At- 
kinson’s assumption that the incentive value of 
success and failure would be inversely propor- 
tional to the probability of success and fail- 
ure, respectively. 

While the present approach and that of 
Weiner et al. (1972) both emphasize infor- 
mation seeking rather than the motivational 
value of success as determinants of choice 
among tasks, the differences between Wein- 
er’s analysis, based on an information theory 
approach, and the present analysis, based on 
a Bayesian inference approach, are vital to a 
number of other problems in self- and social 
attribution. The information theory analysis 
is inappropriate to the general interests of 
attribution theory, because it deals only with 
the probabilities that various events will 
occur, rather than the probabilities that de- 
termine what inferences can be drawn from 
these events. With regard to choice among 
tasks, the information theory analysis could 
not generate predictions different from a moti- 
vational analysis. Furthermore, the informa- 
tion theory analysis led to correct predictions 
(general preference for intermediate difficulty 
tasks) only because of implicit and unspeci- 
fied assumptions about the diagnostic value of 
various tasks. Similarly, in another area of 
attribution theory, it has been assumed that 
improbable, deviant, or out-of-role behaviors 
are generally more informative than equiva- 
lent bits of conforming or in-role behavior. 
Again, this is true only because out-of-role 
or noncompliant behavior is ordinarily more 
diagnostic of internal states than conforming 
behavior (Trope & Burnstein, in press). If, 
however, we designed an experiment in which 
in-role behavior were more diagnostic than 
out-of-role behavior, then we would expect the 
former to lead to more confident attributions, 
even though it was overall more probable. 
We would especially expect this if subjects 
were for any reason additionally reluctant to 
treat unexpected events or out-of-role be- 
havior as diagnostic (Brickman, 1972; Con- 
ner, 1972). 

The present study demonstrates that a 
basic parameter of Bayesian inference—the 
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diagnostic value of evidence—is psychologi- 
cally important to subjects in their choice of 
achievement tasks. It does not tell us to what 
extent subjects are rational (as compared to 
a normative Bayesian model) in the infer- 
ences they make about themselves from par- 
ticular outcomes of success or failure that are 
of high or low diagnosticity, Assessing this re- 
mains a problem for future research. It might 
be especially fruitful to explore the extent to 
which subjects who differed in their overall 
level of achievement motivation or self-esteem 
differed in the importance they placed on 
diagnosticity or the extent to which they were 
rational in making inferences about them- 
selves. The present approach would expect 
subjects high in achievement motivation to 
show a stronger preference for high diagnos- 
ticity tasks than those low in achievement 
motivation (who might actually prefer undi- 
agnostic tasks at various difficulty levels), 
and perhaps also to be more rational in their 
use of information gathered from these tasks. 


REFERENCE NOTE 


1, Trope, Y. The informational values of behaviors 
and social situations; A Bayesian analysis of at- 
tribution processes. Paper presented at the meet- 
ing of the Midwestern Psychological Association, 
Chicago, Illinois, May 1973. 
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The present study investigated the role of self-concept as a moderator of the 
similarity-attraction relationship described by Byrne. Subjects conducted face- 


to-face interviews with confederates who 


played roles as job applicants. Simi- 


larity-dissimilarity was manipulated by the confederate roles as well as by 
the information given the subjects in accordance with the procedures described 
by Byrne. The hypothesis was tested and confirmed that persons with favor- 
able self-concepts would be more attracted to others with similar attitudes 


than to others with dissimilar attitudes. 


Persons with negative self-concepts 


did not show a preference for similar others. It was suggested that earlier 
studies in this area suffered from methodological flaws and did not constitute 
an adequate test of the hypothesis. Suggestions for future work in the area 


were given. 


Recent research has focused on the rela- 
tionship between attitude similarity and inter- 
personal attraction (Bowditch, 1969; Byrne, 
1961, 1965, 1969; Byrne, Clore, & Worchel, 
1966; Byrne, London, & Reeves, 1968). 
These researchers have consistently found 
that the greater the degree of attitude simi- 
larity between two persons, the greater is 
their attraction for each other. 

It is important that effort be directed 
toward the identification of limiting or boun- 
dary conditions where this relationship does 
not hold, since there are a number of situa- 
tions where attraction or rejection due to at- 
titude similarity—dissimilarity could result in 
unfair bias. For example, Golightly, Huffman, 
and Byrne (1972) found that attitude simi- 
larity affects the amount of money one person 
will approve as a loan to another, even when 
relevant financial information (debts, income, 
etc.) is held constant. Personnel decisions to 
hire, fire, promote, and demote persons in the 
business would represent another example. 
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The present study seeks to establish the 
self-concept as a moderator of the similarity- 
attraction relationship. Previous research in 
this area is inconclusive and suffers from 
methodological limitations. Griffitt (1966) 
found that subjects are more attracted to 
others with similar self-concepts but found no 
effect on attraction for similarity of another’s 
self-concept to a subject’s ideal self-concept. 
Thus, Griffitt was concerned with personality 
similarity and did not actually test the inter- 
action of self-esteem and similarity as dis- 
tinct variables. This relationship was tested 
by Hendrick and Page (1970), who compared 
the responses of low-, moderate-, and high- 
self-esteem subjects to others representing 
varying degrees of attitude similarity. They 
found no effect for self-esteem on an attrac- 
tion measure; however, the Self-Esteem X 
Similarity interaction approached significance, 
a finding the authors attributed to the ten- 
dency for moderate-self-esteem subjects to 
respond less negatively to dissimilar others 
than either the low- or high-self-esteem sub- 
jects. One problem with the Hendrick and 
Page (1970) study is the way in which they 
chose to define the low-self-esteem group. 
This group had an average self-ideal correla- 
tion of —.05, which is essentially zero. The 
fact that one’s self-description bears no rela- 
tion to one’s ideal description (in contrast to 
a negative relationship, for example) may or 
may not mean that one is low in self-esteem. 
Also, the fact that the subjects in the Hen- 
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3 drick and Page study never saw or interacted 
with the others they evaluated but only saw 
their purported responses to an attitude ques- 
tionnaire may undermine the generalizability 
of the findings to real-world, face-to-face 
settings. 

It is apparent that the role of the self- 
concept in the similarity—attraction relation- 
ship has not been adequately tested. It is 
likely, as Griffitt (1966) suggested, that if a 
person A esteems himself highly, he will also 
esteem highly those similar to himself and 
think less of those dissimilar to himself. Ac- 

foie to Korman (1970), a negative self- 

concept develops as a result of a life history 
of failure to achieve goals. Thus, an individual 
with a negative self-concept may associate 
similar others with unpleasant outcomes and 
be less attracted to them than to dissimilar 
others, 


METHOD 
Subjects 


Subjects were 36 female and 28 male psychology 
students who participated to satisfy a course re- 
quirement. Four upperclass psychology majors served 
as confederates. 


Instruments 


Prior to the experiment, information was obtained 
about the subjects’ education, work experience, and 
demographic characteristics. Subjects also completed 
a 14-item attitude scale dealing with salient campus 
issues (e.g., Viet Nam), and the self-assurance scale 
of the Ghiselli Self-Descriptive Inventory (Ghiselli, 
1971). Scores on this instrument were used to mea- 
sure self-concept. Scores ranged from 17 to 37 or 
from the 5th to the 90th percentile on Ghiselli’s 
norm tables for the employed adult population 

(male and female), The median score was 26, which 
} corresponds to the 60th percentile. Those scores 
above the median were designated as favorable, those 
below as unfavorable. The median percentile for 
the low group was the 33rd, while the high group 
median was the 86th percentile. 

Since the study involved role-played employment 
interviews, two dependent measures were constructed. 
The first was a 25-point scale which measured 
overall attraction to the applicant as a prospective 
employee. The scale was divided into 5 blocks each 
containing 5 rating points. Thus, Points 1-5 were 
labeled poor, 6-10 below average, 11-15 average, 
16-20 above average, and 21-25 superior. The second 
was a 9-point, behaviorally based scale measuring 
expectation of effort on the job. The lowest point, 1, 


1 Copies of all forms and descriptions used in this 
study are available from the author upon request. 
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was anchored by the statement, “This person almost 
always fails to complete work on time.” The high- 
est rating, 9, was anchored by, “This person would 
take the initiative to see that a job was done right 
even if he was not personally responsible for it, and 
even if it meant changing his own immediate plans.” 
The final instrument was a check on the similarity 
manipulation. It consisted of a 5-point perceived 
similarity-to-self (of the confederates) scale which 
subjects filled out at the end of the experiment. 


Procedure 


Subjects returned in groups of eight for the actual 
experiment. Small sessions were necessary because of 
a limited number of confederates (four) to role 
play as job applicants. Subjects were told the 
purpose was to relate characteristics of themselves 
to ratings they would make about persons playing 
roles as job applicants and that they would each 
interview another person who would be playing a 
role as an applicant for the job of “Laboratory 
Assistant—Department of Psychology.” Subjects 
were given a description of this job, a list of ques- 
tions to ask the applicant, and a set of materials 
about the person they were to interview. Included 
were the applicant’s responses to the same attitude 
scale the subject had completed earlier. The sub- 
jects’ responses were used to make the applicant 
appear similar in half of the cases and dissimilar in 
the other half. Similarity was operationalized to 
mean that all responses on the 5-point scales were 
within 1 point of the subject’s original ones. Dis- 
similarity was defined as a discrepancy of 3 points 
or more on all items except those marked 3. For 
these a response of 1 or 5 was considered dissimilar. 

Also included was the applicant’s resume, which 
provided information concerning education, employ- 
ment, and demographic factors. This information 
was also manipulated so as to make the confederates 
appear similar to the subjects in half of the cases 
and dissimilar in the other half (similar or dissimi- 
lar college majors, work experience, etc.). 

After the subjects had studied the applicant ma- 
terials, they conducted 15-minute interviews with 
confederates in soundproof cubicles. Confederates 
were thoroughly trained in how to respond to the 
subjects’ questions so that the interviews would 
be homogeneous. In addition, they tried to play 
their roles so as to be as consistent with their 
responses to the attitude items as possible. Confed- 
erates were aware that the study was concerned with 
the degree of similarity—dissimilarity between them- 
selves and the subjects; however, they had no 
knowledge of either the self-esteem measures or the 
specific hypothesis being tested. It was felt that this 
interview procedure would sharpen the dissimilarity— 
similarity effect and provide a closer approximation 
to the real world than if subjects only read the 
confederates’ responses. When the interviews were 
over, each subject completed the rating scales and 
the manipulation check described above. 

To determine if the manipulation had been suc- 
cessful, the mean ratings of the applicants on the 
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perceived similarity-to-self scale were compared for 
the designated similar and dissimilar groups. These 
were 3.91 and 2.56 (1=very dissimilar, 2=dis- 
similar, 3=?, 4=similar, 5=very similar), re- 
spectively, F (1,62) = 35.08, p < .001. It was con- 
cluded that the manipulation had been successful. 

The design was a 2 X 2 factorial with two levels 
of subject self-concept (favorable and unfavorable) 
and two levels of “applicant” similarity (similar and 
dissimilar). 


Resutts 


Table 1 shows the mean ratings of overall 
attraction toward the applicant and expecta- 
tion of effort for the four cells. Analyses of 
variance of these data revealed that the only 
significant interactions were those which 
clearly establish the self-concept as a mod- 
erator of the attraction relationship. The 
effect of similarity on attraction holds only 
among high self-concept subjects; it is re- 
versed among low self-concept subjects. The 
same moderating effect occurs for the sub- 
jects’ expectations of the applicants’ expen- 
diture of effort on the job. The reversal effect 
for low-self-concept subjects represents a dis- 
tinct boundary condition for Byrne’s (1971) 
“law of attraction.” 

To examine the interactions more carefully, 
comparisons were made among the cells using 
the Scheffé procedure described by Winer 
(1971). For the attraction variable, the high- 
self-concept-dissimilar applicant cell was sig- 
nificantly different from the high-similar cell, 
F (1,60) = 7.91, p < .01, and the low-dis- 
similar cell, F (1,60) = 7.91, p < .01. The 
same cells were different for the expenditure 
of effort ratings, F (1,60) = 12.12, p < .01 
and F (1,60) = 6.8, p<.05, respectively. 


TABLE 1 


MEAN RATING OF ATTRACTION TO THE APPLICANT 
AND EXPECTATION OF EFFORT 


Similarity 
Self-concept Low High 
Soas | Egoe | Attac | kort 
Unfavorable | 16.88 7.63 14,94 7.13 
Favorable 12.50 5.88 17.00 7.63 


* Self-Concept X Similarity F (1, 60) = 8.09, p < .01, 
b Self-Concept X Similarity F (1, 60) = 10.04, b < .01. 


RUSSELL L. LEONARD, JR. 


Thus, a large portion of the interaction ap- 
pears to be due to two factors: the tendency 
of subjects with high self-concepts to reject 
attitudinally dissimilar others and the failure 
of low self-concept subjects to prefer similar 
others. These results occurred despite the fact 
that the overall self-concept level of the sub- 
jects was higher than that for the general 
population of employed adults (Mdn = 60th 
percentile). 


DISCUSSION 


These results underscore the importance of 
the self-concept as a limiting factor or boun- 
dary condition to the similarity-attraction 
relationship. It seems clear that the improved 
procedures in this study allowed for a more 
adequate test of the moderating effect of the 
self-concept than was possible in previous 
studies (Griffitt, 1966; Hendrick & Page 
1970). The previous studies relied on “paper 
others, while the present study involved face- 
to-face interaction between the participants, a 
fact which probably served to heighten the 
saliency of the situation for the subjects. The 
present study also employed a more subtle, 
5-point attitude response format rather than 
the 2-point agree—disagree scales used earlier. 
Also, the self-assurance scale of the Ghiselli 
Self-Descriptive Inventory (Ghiselli, 1971) is 
probably a better measure of self-esteem than 
the Q-sort self-ideal discrepancy measure. 
Finally, the present study involved a double 
manipulation of similarity, attitudes and lif 
history. 

More research is needed to establish addi- 
tional boundary conditions to the similarity- 
attraction relationship and to find ways of 
eliminating its potential biasing effect in criti- 
cal interpersonal decision making. It has been 
shown that similarity can influence loan deci- 
sions (Golightly, et al., 1972) and hiring 
decisions (Baskett, 1973). On the positive 
side, Leonard (Note 1) showed that the simi- 
larity bias does not occur in an interview set- 
ting when interviewers are given complete and 
detailed information concerning the relation- 
ship between interview-elicited information 
and future job performance. Such results are 
encouraging and suggest that the “law of at- 
traction” (Byrne, 1971) is lawful only under 
certain conditions. 
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The Vicarious Partial Reinforcement Effect: 
An Empirical and Theoretical Analysis 


Paul B. Paulus and John J. Seta 
University of Texas at Arlington 


Past research has shown that subjects persist 
positive reinforcement if they have previously observed a 


receive no obvious 


model who was successful 25% of the time 
successful model. This effect is termed vicari- 
The present study examined the role of 
the VPR effect by varying the degree 


to subjects who observed a 75% 
ous partial reinforcement (VPR). 


different types of opinion similarity on 


longer on a task in which they 


(on the same task) as opposed 


of similarity (similar or dissimilar) of opinion about social issues, the degree 


of similarity (similar or dissimilar) of an opinion 
(25% or 75%) independently 


model success 


relevant to the task, and 


ina 2X2X2 factorial design. 


The VPR effect was demonstrated in all conditions except when the model 


and subject had a dissimilar task-relevant opinion. 


The experiment was also 


designed to test various hypotheses about the mediating psychological processes 


of the VPR effect. 


The expectancy-frustration, 


differential-attraction, and 


suspicion interpretations of the VPR effect were not supported. An exchange 


theory interpretation of the 
the results. 


Most of the research involving partial rein- 
forcement in psychology has been concerned 
with individual learning. However, much of 
this individual learning occurs while others 
are observing. Consequently, it is of interest 
to ascertain the effects of such observation on 
the subsequent behavior of an observer. This 
situation has been investigated in a number 
of studies (Berger, 1971; Berger & Johans- 
son, 1968; Braun, 1972; Lewis & Duncan, 
1958; Rosenbaum & Bruning, 1966). The 
situation employed in the Berger studies con- 
sists of an initial observation phase in which 
an observer watches a model’s task perform- 


The research was supported by funds from the 
Organized Research Committee of the University of 
Texas at Arlington. Requests for reprints should be 
sent to Paul B. Paulus, Department of Psychology, 
University of Texas at Arlington, Arlington, Texas 
76019. We would like to thank the model, Ron 
Turner, for the conscientious acting and helpful atti- 
tude which persisted throughout the experiment. 

1 Since an individual’s trials to quitting in extinc- 
tion is being measured, a more exact term would 
be the vicarious partial reinforcement extinction 
effect (VPREE). The VPR terminology seems ap- 
propriate in the present context, since extinction 
measures have been most generally applied in the 
present paradigm. There would then be no reason 
for confusion about whether acquisition or extinction 
is being measured. 


VPR effect was proposed as an explanation of 


ance. The model is either relatively success- 
ful or unsuccessful in his performance. Dur- 
ing a subsequent extinction phase, the ob- 
server performs the task while the model is 
excused. The observer is never successful in 
the extinction phase due to experimenter in- 
tervention and is given the option of quitting 
at any time. The subjects in the unsuccessful 
model condition generally complete more 
trials before quitting than those in the suc- 
cessful model condition (Berger, 1971; Ber- 
ger & Johansson, 1968). This effect will be 
termed the vicarious partial reinforcement 
(VPR) effect.* Though the VPR results may 
be due to processes analogous to those in- 
volved in the partial reinforcement extinction 
effect found in individual learning, the theo- 
retical implications of the VPR effect may 
not be strictly comparable to those of the 
partial reinforcement effect. Consequently, as 
suggested by Berger (1971), the term success 
is used instead of reward or reinforcement, 
and perseverance is substituted for the term 
resistance to extinction to avoid any impli- 
cation of analogous processes. 

Although the VPR effect has been reliably 
demonstrated by Berger and Johansson 
(1968) and Berger (1971), the necessary and 
sufficient conditions for the effect and the na- 
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ture of the mediating psychological processes 
remain unspecified. Berger proposed that the 
VPR effect will occur only if one uses models 
who are similar to the observer in terms of 
ability or beliefs. This proposal is based on 
Festinger’s (1954) social comparison theory. 
This theory suggests that individuals desire 
to have valid opinions about the world and 
themselves. In the absence of objective in- 
formation, individuals tend to compare their 
opinions or abilities with others of generally 
similar opinions or abilities. Berger extended 
this theory to the situation in which one is 
performing an unfamiliar task. The observers 
in the VPR situation may be uncertain about 
the standards of performance on the task and 
may employ the model’s performance as a 
standard of performance in absence of their 
own task experience. The observers viewing 
a successful model may then have higher 
expectations about their own future per- 
formance than observers of an unsuccessful 
one. Since all the observers subsequently 
perform equally poorly, the observers view- 
ing a successful model may have a greater 
discrepancy between their expectations and 
subsequent performance than the observers 
viewing an unsuccessful one. According to an 
expectancy—frustration hypothesis, since the 
observers of a successful model may have a 
greater discrepancy between their expecta- 
tions and subsequent performance than ob- 
servers of an unsuccessful model, these ob- 
servers would become more frustrated and 
quit sooner than the observers of an unsuc- 
cessful model.” However, since one should 
compare one’s task performance only with 
that of similar others, one should find the 
VPR effect only when an observer views a 
similar model. Berger’s experiments supported 
the social comparison interpretation in that 
the VPR effect was obtained only when the 
observer and model were similar in opinions 
or ability. 

However, Berger’s (1971) manipulation of 
similarity of opinion presents a problem. 


2 Anthony G. Greenwald (personal communica- 
tion) has pointed out that frustration could be a 
cue for increased persistence, since individuals may 
have been reinforced for persevering in the face of 
frustration. However, we are explicitly interested in 
testing the version of the frustration theory dis- 
cussed by Berger. 
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Similarity of opinion is conventionally ma- 
nipulated by variation in degree of agreement 
about a number of social issues (cf. Byrne, 
1971; Castore & DeNinno, 1972), Berger’s 
model and observer were similar only to the 
extent that they either believed or disbelieved 
in the existence of extrasensory perception 
(ESP). Thus, the model and subject in 
Berger’s experiment varied in similarity on a 
task-relevant opinion, their belief in ESP. 
Festinger’s (1954) social comparison theory 
implies that if the observer either “knows” 
or assumes that the model has dissimilar 
task-relevant opinions and/or abilities, he 
will not compare himself to this model, while 
the observer will compare himself to the 
model if he either “knows” or assumes that 
the model has appropriate task-relevant opin- 
ions and/or abilities (cf. Castore & DeNinno, 
1972). Therefore, conventional manipulation 
of similarity of opinion may not affect the 
occurrence of the VPR effect. In an ambigu- 
ous situation, individuals may assume that 
another individual is an appropriate basis for 
comparison as long as this individual has not 
expressed dissimilarity on a task-relevant 
opinion or is not evidently different in ability. 

The present experiment was designed to 
examine the role of different types of opinion 
similarity by varying similarity on a number 
of social issues, task-relevant similarity as 
used by Berger, and the success of the model 
independently in a 2 X 2 x 2 factorial design. 
General similarity on a number of social 
issues. was varied by informing the observer 
that the model either held similar or dissimi- 
lar beliefs on six social issues. Task-relevant 
similarity was varied by having the model 
either express a similar or dissimilar belief 
about the existence of ESP. Finally, success 
was manipulated by having the model per- 
form correctly on either 25% or 75% of the 
observation phase trials. 

A second interest of the present study was 
to test various hypotheses about the medi- 
ating psychological processes of the VPR ef- 
fect. Berger (1971) felt that his studies 
eliminated a number of hypotheses. One 
major hypothesis that has been eliminated is 
a discrimination hypothesis which suggests 
that subjects will discriminate the initiation 
of the extinction phase more quickly after 
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observing a successful than an unsuccessful 
model. Since the extinction period should be 
equally discriminable in both the similar and 
dissimilar model conditions, the VPR effect 
should have occurred in both conditions. Ac- 
cording to Berger, two hypotheses are still 
tenable, an expectancy—frustration and a dif- 
ferential-attraction hypothesis. He has fa- 
vored the expectancy-frustration hypothesis. 
As pointed out earlier, this hypothesis sug- 
gests that the greater the expectation of suc- 
cess produced during the observation phase, 
the greater the frustration produced by the 
observer’s subsequent failure during extinc- 
tion, An observer who views a model who is 
very successful supposedly expects much more 
success in his subsequent performance than 
the observer who views a less successful 
model. The difierential-attraction hypothesis 
of Festinger (1961) suggests that goals be- 
come more attractive when they are harder 
to attain or when they require more effort to 
achieve. Observers who view a model who is 
not very successful may consider the task to 
be more difficult and hence more attractive 
than the observer who views a highly success- 
ful model. This greater attraction may lead 
to greater task perseverance. 

The present experiment was designed to 
differentiate whether the mediating psycho- 
logical process is an expectancy—frustration 
or a differential—attraction one. First, a con- 
trol group was employed in which subjects 
did not observe a model’s performance prior 
to the extinction phase. These subjects only 
experienced the extinction phase. This condi- 
tion made possible a determination of the 
degree to which the various experimental 
conditions increased or decreased the amount 
of persistence. On the basis of the expect- 
ancy—frustration hypothesis, one would expect 
that those observers who view a model who 
is 75% successful should have stronger expec- 
tations of success than the control subjects 
and hence become more frustrated and quit 
sooner than the control subjects. The expect- 
ancy—frustration hypothesis also suggests that 
since the control group will have at least some 
minimal expectations of success, the 25% 
success group may persevere somewhat longer 
than the control group, although this effect 
may be relatively weak. On the other hand, 
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the differential-attraction hypothesis suggests 
that the observer who views a 25% successful 
model will consider the task more attractive 
and hence persevere longer than a contro! 
subject. This hypothesis also suggests that 
since the control subjects will have no ps 

ticular degree of attraction to the task whil 
the 75% condition subjects have some mini- 
mal amount of attraction (due to the minimal 
level of difficulty), the 75% success subjects 
may persist somewhat longer than the control. 
Again, this effect will be relatively weak and 


probably not statistically significant for the 


sample size used in the present study. 

The Multiple Affect Adjective Check List 
(Zuckerman & Lubin, 1965) was adminis 
tered to all subjects immediately after task 
completion to provide a measure of frustra- 
tion (negative affect) independent of task 
performance. A within-cell correlation analy- 
sis provides additional information as to the 
degree to which frustration mediates task per- 
formance. The expectancy-frustration hy- 
pothesis predicts that faster quitting will be 
positively correlated with higher degrees of 
frustration. 

Finally, a questionnaire was administered 
to the subjects immediately after the observa- 
tion phase to determine their degree of 
attraction to the task and their expectation 
of success. This procedure provided checks 
on the effectiveness of the model success 
manipulations in producing differential de- 
grees of attraction and expectation and, 
coincidentally, on the validity of the expect- 
ancy-frustration and differential—attraction 
hypotheses. 


METHOD 
Subjects 


Subjects were 117 undergraduate students (60 
females and 57 males) from the introductory psy- 
chology classes. The subjects were run individually 
and were randomly assigned to one of the nine 
experimental treatments with the restriction that 
subjects were assigned to each of the nine conditions 
before the n+ 1th subject was assigned to a condi- 
tion. An approximately equal number of males and 
females were assigned to each condition. The data 
of 9 subjects were not used in the analysis: Five 
subjects saw the model leaving the experimental 
room and were disqualified before participating, 2 
subjects were deleted due to the experimenter’s error 
during the experiment, and the other 2 subjects were 
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eliminated because one looked into the experimenter’s 
desk and another was interrupted by an intruder 
opening the lab door. 


Procedure 


An ESP task identical to Berger’s (1971) was 
employed. Immediately upon arriving for the ex- 
periment, the model (a confederate of the experi- 
menter) was instructed to sit at a table. A screen 
on the table separated the model from the experi- 
menter, Similarity of general opinion was varied by 
having the observer and model fill out a short atti- 
tude questionnaire on six social issues. The experi- 
menter explained that he was running a survey not 
related to the present experiment and was trying to 
accomplish two things at one time, After briefly 
reviewing the questionnaire, the experimenter casu- 
ally told the model and observer that their atti- 
tudes were either very similar or dissimilar, depend- 
ing on the experimental condition, When task- 
relevant similarity was being varied, the experimenter 
asked the subject if he believed in ESP, After ob- 
taining an answer, the experimenter then repeated 
the question for the model. Depending upon the 
experimental condition, the model either gave a 
similar or dissimilar answer. 

The ESP instructions were similar to those em- 
ployed by Berger (1971). The experimenter stated 
that the experiment was a study of ESP. The sub- 
ject was asked to guess which of the four suits of 
cards the experimenter held in his hand. He was 
instructed to observe the other subject (model), 
since he would be asked to continue the same task af- 
ter the model was finished. After asking the observer 
to shuffle the deck of cards, the experimenter picked 
up the top card of the deck and held it so that 
the observer could see it. The model was not able 
to see the card because of the screen separating the 
model from the experimenter. The experimenter said 
“Now” as a signal to the model to give the experi- 
menter an answer. After about 5 sec had elapsed, 
the model was asked to make another guess in 
response to the next card. The experimenter’s feet 
were visible to the model and were used by the 
experimenter as a signal of which suit he had in his 
hand. The model participated for a total of eight 
trials, The model was successful on all but the 
second and eighth trials in the 75% model success 
condition and on only the second and eighth trials 
in the 25% model success condition. After the mode] 
completed the eighth trial, he was excused and his 
seat was occupied by the observer. The observer was 
told that he would participate in the same task as 
the model. However, it was explained that there 
was no set number of trials for his part of the 
experiment and that he could continue in the experi- 
ment until he indicated to the experimenter that he 
wished to stop. Once again, the observer was asked 
to shuffle the deck of cards. The observer was led 
to believe that his choice of suits was always incor- 
rect, since the experimenter always held up to the 
observer a suit different from the one the observer 
had called out. The number of trials until the 
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observer decided to stop was recorded as the pri- 
mary dependent measure. In the control condition, 
the subjects did not observe a model’s performance 
prior to the extinction phase. 

Immediately after the observation phase, a ques- 
tionnaire assessing the degree of attraction to the 
task and the expectation of success was administered 
to one half of the subjects, while the other half of 
the subjects did not receive this questionnaire. This 
procedure enables one to determine if asking subjects 
these questions before task completion affects task 
performance. The Multiple Affect Adjective Check 
List was administered immediately after task com- 
pletion to determine the subject’s degree of anxiety, 
hostility, and depression, High scores on the anxiety, 
hostility, and depression measures were assumed to 
be indicative of high levels of frustration, Finally, a 
questionnaire was given to the subject to assess his 
degree of suspicion concerning the model and the 
experiment in general, how similar he perceived him- 
self and the model to be in opinion and ability, and 
to what degree he liked the task. 

In summary, the expectancy-frustration and differ- 
ential-attraction hypotheses of the VPR effect will 
be investigated by use of a control group and verbal 
questionnaires. Furthermore, on the basis of 
Festinger’s (1954) social comparison theory, it was 
predicted that (a) the VPR effect will occur whether 
or not the model is similar or dissimilar to the 
observer on the six social issues (general opinions) 
and (b) the VPR effect will be demonstrated when 
the model and observer have similar task-relevant 
opinions but not when their task-relevant opinions 
differ. 


RESULTS AND DISCUSSION 


To check on the effectiveness of the experi- 
mental manipulations, the similar conditions 
were compared to the nonsimilar ones, The 
subjects in both of the similar conditions felt 
significantly more similar to the model than 
those in the nonsimilar conditions, for task- 
relevant similarity conditions, t(46) = 3.13, 
$ <.01, two-tailed; for general similarity 
conditions, #(33) = 2.86, p < .01, two-tailed.* 
Therefore it can be concluded that the ex- 
perimental manipulations of similarity were 
effective. 

The mean and median trials to quitting for 
the eight experimental conditions and the 
control condition are contained in Table 1. 
Tt can be seen that the VPR effect occurred 
in all of the similarity conditions except for 
the task-relevant dissimilar condition, 


®The questionnaire was administered randomly to 
one half of the subjects in the 25% and 75% 
model success conditions without regard to the simi- 
larity condition to which the subject was assigned, 
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TABLE 1 


MEAN AND MEDIAN NUMBER OF TRIALS TO 


QUITTING AS A FUNCTION OF TASK-RELEVANT 


SIMILARITY, GENERAL SIMILARITY, AND THE Success OF THE MODEL 


Task-relevant similarity 


General similarity 


Model gecene Similar Dissimilar Similar Dissimilar 
M Mdn M Mdn M Man M Mdn 
25 71.07 48 40.69 31 68.76 49 60.84 39 
75 38.92 15 63.84 42 37.92 23 22.38 15 


Note, The control subject's mean and median were 55.54 and 46, respectively. 


Since the data were characterized by 
heterogeneity of variance and skewness of 
distribution, a log transformation was ap- 
plied to the data in order to allow an analy- 
sis of variance. The analysis revealed a sig- 
nificant effect of degree of model success, 
F(i, 96) = 8.77, P< 005, and significant 
Model Success X Similarity X Similarity 
Type interaction, F(1, 96) = 4.40, p< 05. 
Specific comparisons revealed generally reli- 
able differences in all except the task-relevant 
dissimilar condition, for the task-relevant 
similar condition, F(1,96) = 5.63, $ < .025; 
for the task-relevant dissimilar condition, 
F < 1.00; for the general similar condition, 
F(1,96) = 3.64, p < .07; for the general dis- 
similar condition, F(1,96) = 7.20, p< .01.* 

Although the results in the general similar 
condition are somewhat weak, one can gen- 
erally conclude from the above results that 
individuals will quit more quickly after ob- 
serving a 75% successful model than a 25% 
successful one, except when the model and 
observer differ on a task-relevant opinion. In 
fact, in the task-relevant dissimilar condition 
there was a slight reversal of the VPR effect. 

To check on whether there were any un- 
intended reactive effects in the conditions in 
which the VPR effect was obtained, the sub- 
jects who had filled out a questionnaire before 
entering the extinction phase were compared 
to those who were not given a questionnaire. 
The 25% model success subjects who were 
given the questionnaire prior to the extinction 
phase did not differ significantly in the num- 


4 Berger analyzed his data by Mann-Whitney U 
tests. Analysis of the present data in this fashion 
does not alter conclusions. 


ber of trials to quitting from the 25% model 
success subjects not given the questionnaire 
at that time (F < 1). A similar comparison 
of the 75% model success subjects was also 
not significant (F < 1). 

The performance of the 25% and 75% 
model success subjects relative to the control 
group is counter to the differential-attraction 
hypothesis. For those conditions in which the 
VPR effect occurred, the degree of persever- 
ance of the combined 25% success conditions 
and the combined 75% success conditions 
were each compared with that of the control 
condition. Although the perseverance of the 
25% model success subjects and the con- 
trol subjects was not significantly different 
(F < 1), the perseverance of the 75% model 
success subjects was significantly less than 
that of the control subjects, F (1, 108) = 4.18, 
p< 05. Also, these same 25% and 75% 
model success subjects did not differ signifi- 
cantly in attraction to the task (F <1). 

The above results appear to support the 
expectancy—frustration hypothesis. This hy- 
pothesis predicts that those observers who 
view a model who is 75% successful will have 
higher expectations than control subjects and 
hence will become more frustrated and quit 
sooner. However, for the three conditions in 
which the VPR effect occurred, the subjects 
in the 25% model success condition did not 
differ significantly in their expectations from 
those in the 75% model success condition 
(F <1), nor were the Multiple Affect Adjec- 
tive Check List scores of these two groups 
significantly different (F < 1). Consequently, 
the expectancy—frustration hypothesis does 
not appear to be a viable explanation for the 
VPR effect. Braun (1972) also reported that 


Vicarrous PARTIAL REINFORCEMENT Errect 


perceived expectancies were not re- 
sk perseverance in a VPR situation, 
rnative explanation for the VPR 
uned in this experiment could be 
ubjects who observed a 75% suc- 
odel became more suspicious about 
del and subsequently about the experi- 
ent than their 25% counterparts, Due to 
their higher suspicion, they may have quit 
sooner this explanation were a viable 
alte then for the conditions where the 
was obtained the 75% model 
bjects should be more suspicious 
5% ones and the more suspicious 
hould quit sooner than the less sus- 
subjects. In these conditions it was 
eed found that the 75% model success 
subjects were slightly more suspicious than 
the 25% model success subjects, F(1, 76) 
= 3.20, p< 10. However, the 75% model 
Success subjects were also slightly more sus- 
picious than the 25% model success subjects 
in the task-relevant dissimilar condition, 
(1,76) = 2.62, p< .11. For the suspicion 
hypothesis to be a viable explanation of the 
VPR effect, a slight trend toward a VPR 
‘ould also have been demonstrated in 
‘vant dissimilar condition, Instead 

ersal was obtained, 
uai ihen accounts for the difference in 
between the 25% and 75% 
ss subjects? An extension of 
‘ednar’s (1967) exchange theory 
a ichievement behavior to the VPR 
situs ppears to be one reasonable alter- 
native. They argued that individuals weigh 
the potential gains and losses in attempting 
and performing various tasks. More specifi- 
cally, a subject’s comparison level, which is 
his perceived desirability of anticipated future 
mes, is presumably governed by the sub- 
! history of success in achievement 
si Subjects with a past history of 
vave a low comparison level. That 
associate a low cost to continued 
fa ı high reward to possible success. 
In con subjects with a past history of 
success wiil have a high comparison level. 
They will associate a high cost to failure and 
will have relatively little to gain by continued 

success. 

In the case of the VPR situation, when 
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someone observes another individual perform- 
ing correctly 75% of the time, he may 
develop a high comparison level with its asso- 
ciated high cost of failure and low reward 
value of success. That is, he has relatively 
little to gain by performing well and a great 
deal to lose by performing poorly. In con- 
trast, for the subject who observes a 25% 
successful model, success may have a high 
reward value, while failure may have a rela- 
tively low cost. Thus, the subject observing 
the 25% successful model would then be 
expected to persevere longer on the extinction 
phase than those in the 75% condition, since 
they associated less cost to their continued 
failure and a higher reward to any possible 
future success than did their 75% counter- 
parts. It is assumed that the individual will 
have to consider the model to be a relevant 
basis for social comparison before this indi- 
vidual could vicariously develop a compari- 
son level in an ambiguous task situation and 
associate any definite cost to failure or 
reward to success, 

One possible way to test the exchange 
theory interpretation would involve giving 
subjects a choice to perform either the ESP 
task or another task after observing the 
model perform the ESP task. The exchange 
theory interpretation would predict that sub- 
jects observing a 75% successful model will 
demonstrate a greater preference for the 
alternative task than those who observe a 
25% successful model, since the 75% sub- 
jects have little to gain but much to lose by 
performing the ESP task. Some type of con- 
trol group similar to that employed in the 
present experiment would be required to 
demonstrate that the potential differences in 
preference of the 25% and 75% conditions is 
not due to increased attraction to the ESP 
task by the 259% subjects. This choice para- 
digm has been found useful in analyzing the 
adequacy of various theories in accounting 
for persistence behavior in a somewhat dif- 
ferent task setting (Paulus & Konicki, 1973). 
We are presently initiating a study to test 
the above predictions. 

It is desirable in future studies to assess 
the generality of the VPR effect across task 
situations. It is of course possible that the 
VPR effect will occur in only certain types 
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of task situations. However, a study by 
Paulus and Dorsett (Note 1) found some 
evidence for a VPR effect with a concept 
identification task. This study and several 
others (Lewis & Duncan, 1958; Rosenbaum 
& Bruning, 1966) suggest that the VPR 
effect is certainly not limited to the ESP task 
situation. 
REFERENCE NOTE 


1. Paulus, P. B., & Dorsett, R- E. The effect of a 
models error-making behavior on observer per- 
sistence. Unpublished manuscript, University of 
Texas at Arlington, 1971. 
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Altruistic Behavior and the Parent-Child Relationship 


Martin L. Hoffman 
University of Michigan 


The hypothesis that altruistic behavior 


relates positively to having parents who 


(a) are altruistic, (b) frequently use victim-centered discipline techniques, 
especially those suggesting reparation and apology, and (c) frequently express 
affection was tested. Subjects were 40 male and 40 female fifth graders—all 
middle class, white, first-borns of above-average IQ. Their altruistic behavior 
was assessed in terms of peer reputation, Parental data were obtained from 
separate interviews with mothers and fathers, Altruistic behavior in girls re- 


lated significantly to altruistic values 


(p<.10), and to frequent use of vic 
.01). Altruism in boys related significa: 


of their mothers (p<.05) and fathers 
tim-centered discipline by fathers (p< 
ntly to altruistic values of fathers (p< 


-05), to affection (p < .05), and to use of victim-centered discipline (p <.01) 
by mothers. The results suggest altruistic children have at least one parent 
(usually the same sex) who communicates altruistic values, thus serving as 
model, and one who uses victim-centered discipline (usually the opposite sex). 
Affection appears influential only in the mother-son dyad. 


After decades of neglect, research on altru- 
ism has burgeoned, perhaps as part of the 


„ shocked reaction to incidents like the murder 


of Kitty Genovese 10 years ago which was 
witnessed by over 35 people, none of whom 
helped or even called the police. The magni- 
tude of the research effort is indicated by 
the recent appearance of four reviews (Bryan 
& London, 1970; Krebs, 1970; Macaulay & 
Berkowitz, 1970; Staub, 1974). 

Altruism has been defined implicitly in the 
research as purposive action on behalf of 
someone else that involves a net cost to the 
actor. Types of altruism studied include 
helping another in distress and sharing or 
making a donation to someone in need. The 
main focus has been on factors in the situa- 
tion or in the person that momentarily gov- 
ern the altruistic response. The research on 
children has been concerned mainly with age 
differences and creating various types of 
altruistic behavior in the laboratory. With 
few exceptions (Hoffman, 1963; Hoffman & 
Saltzstein, 1967; Mussen, Harris, Ruther- 
ford, & Keasey, 1970; Rutherford & Mussen, 

This paper was prepared in conjunction with 
Grant HD-02258 from the National Institute of 
Child Health and Human Development. 

Requests for reprints should be sent to Martin L. 
Hoffman, Department of Psychology, University of 
Michigan, Ann Arbor, Michigan 48104. 


1968) the contribution of the parent to the 
child’s altruism has not been a primary focus 
of research interest, 

The role of the parent as socialization 
agent has three major facets—model, disci- 
plinarian, and supplier of the child’s affec- 
tional needs—each of which may contribute 
to the child’s altruistic behavior. First, it 
seems likely that parents who express altru- 
istic values should help make altruism salient 
in the child’s life and provide guidelines for 
altruistic behavior, There is some evidence 
for this. Fully committed civil rights activists 
in the early 1960s were found by Rosenhan 
(1969) to have at least one parent who had 
been fully commmitted to an altruistic cause 
during some extended period of the respon- 
dent’s formative years. Hoffman (1970b) 
found that preadolescent boys with a human- 
istic, as opposed to a conventional, moral 
orientation had fathers whose moral judg- 
ment responses indicated a tendency to empa- 
thize openly with someone in distress; this 
finding was not obtained for girls. And, the 
many experiments showing that children will 
imitate a model who shares (Bryan & London, 
1970) are also suggestive. 

The contribution of discipline to altruism 
is less obvious. The argument here is that 
since altruism often involves subordinating 
one’s own needs for the sake of another per- 
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son in distress, the child’s prior experiences 
in situations in which his needs led to acts 
that contributed to another’s distress must 
play a significant role. Such acts are ordi- 
narily viewed negatively by the parent, whose 
customary discipline pattern in these situa- 
tions should therefore play a role in the 
development of altruism. Of particular im- 
portance are victim-centered techniques— 
those which direct the child’s attention to the 
other person’s plight. Techniques that point 
up the harmful consequences of the child’s 
behavior for the victim or encourage him to 
imagine himself in the other person’s place 
may help put the feelings and thoughts of 
the victim into the child’s consciousness and 
thus help guide his future actions in an altru- 
istic direction. Techniques that suggest con- 
crete acts of reparation should have special 
value since the message in this case has a 
direct, immediate connection with action: 
When you hurt someone, do something to 
make it right again. They are thus especially 
likely to give the child the actual experience 
of helping another, of subordinating his own 
needs in favor of another person with whom 
he was moments earlier in conflict. The help- 
ful act may, in addition, be reinforced by a 
positive response from the parent or the 
recipient, as well as by the reduction of any 
guilt that may have been aroused in the situ- 
ation. Techniques that suggest the child 
apologize—an overt act, but not one that 
actually helps the other—should be some- 
where between the two in effectiveness. All 
three types of victim-centered techniques 
stand in contrast to power assertion and love 
withdrawal, for example, which are apt to 
(a) arouse resentment and anxiety and focus 
the child’s attention on the negative conse- 
quences to himself of conflict with others; 
(b) contribute nothing to his appreciation of 
the victim’s plight, nor provide him with the 
experience of helping others; and (c) expose 
him to a punitive rather than an altruistic 
model. 

Parental affection should help foster altru- 
ism in several ways. First, it is essential for 
emotional security which may in turn con- 
tribute to altruism by lessening the child’s 
preoccupation with his own emotional needs, 
thus opening him up to the needs of others. 
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This is suggested by the findings: (a) tha 
helping another child in distress relates posi- 
tively to emotional security (Murphy, 1937) 
(b) helping another child in distres 
creased by an experimentally induced expe 

ence of success (Isen, Horne, & Rosenhan 
1973) and by prior interaction with a frienc 

adult (Staub, 1974); and (c) that charitab! 
behavior correlates negatively with a need fo 

social approval (Staub & Sherk, 1970). Af- a 
fection may also foster a positive orientation 

to the parent which makes the child more 
receptive to victim-centered discipline and 
increases the likelihood of his emulating any” 
tively related to the mother’s generous use of 
praise and reward (though not to a more 
direct index of affection). The experimental 
1971a; Yarrow, 1973), although in the only 
study in which the nurturant-model condition 

was extended (the adult interacted with the 


altruistic parental model. 

The contribution of affection to altruism is 
indicated by several correlational studies, al- 
though there are some inconsistencies and the 
reason for the relation is unclear. Rutherford d 
and Mussen (1968) found that preschool boys 
who shared candy with their peers were more 
likely than nonsharers to view their fathers 
as nurturant. Hoffman and Saltzstein (1967), 
on the other hand, found no relation between 
altruism and the father’s affection in seventh 
grade children, but they did find one between 
altruism and affection by the mother, though 
research on the relative effectiveness of nur- 
child in 5 half-hour sessions, on 5 days, spaced 


only for boys. Mussen, Harris, Rutherford, 
and Keasey (1970) also found altruism in 
seventh grade boys, but not girls, to be posi- 
turant and nonnurturant adult models is in- 
conclusive (Grusec, 1971; Grusec & Skubiski, 
1970; Rosenhan & White, 1967; Staub, 
over 2 weeks), the children did acquire a 
relatively enduring altruistic response pattern. 

Following this line of reasoning, it was hy- 
pothesized that altruistic behavior in the child 
would relate positively to having parents who 
(a) are themselves altruistic, (b) frequently 
use victim-centered discipline techniques, es- 
pecially those which suggest reparation and 
apology, and (c) frequently express affection 


to the child. It was expected that Hypothesis 
a would be primarily true with respect to 


the same-sex parent, with whom children 
typically identify. Hypotheses b and c were 
expected to be supported mainly with respect 
to mothers, since they are typically the main 
disciplinarians, and past research indicates 
that it is primarily maternal discipline and 
affection that have consistent effects on the 
child (Hoffman, 1970a). 

Though the previous research provides 
scattered support for these hypotheses, as 
noted earlier, it is limited. In one study 
(Rutherford & Mussen, 1968), for example, 
the parent data were based on doll-play re- 
five si by the child, which may signify how 


the child wishes his parent behaved rather 
than his actual behavior. In another, data 
were obtained only for mothers (Mussen et 
al., 1970). In no case were data obtained on 
all three hypothesized parental attributes. 
This makes it impossible to determine which 
attribute has the major influence on altruism 
or whether all three attributes operate in 
concert, which parent is more important, or 

~ which combinations of mother and father 
attributes are most effective. The present 
study was designed to enable such assess- 
ments to be made; independent data were 
obtained on all three hypothesized attributes 
for both parents. 

Another innovation was the control on birth 
order which has been found to relate to al- 
truism—older children showing more helping 

+ behavior than younger children (Staub, 197 1a, 
1971b)—and to both discipline and affection 

(Clausen, 1966; Sears, Maccoby, & Levin, 

1957). Older children, being bigger and 

stronger, may also be expected to exert con- 

trol over their younger siblings which might 
interfere with parental effects. Older siblings 
have been found to serve as models (Brim, 

1958; Rosenberg & Sutton-Smith, 1964), for 

example, and while there is no evidence that 

sibling models reduce the effectiveness of 
parent models, the experimental finding by 

Bandura, Grusec, and Menlove (1967) sug- 

gests that this may often happen: Children 

were found to reject an adult model’s stringent 
standards of self-reward in favor of a peer 
model’s more lenient standards. 

1 Also, in an unpublished finding, Hoffman and 


Saltzstein (1967) obtained a relation between birth 
order and both discipline and affection. 


ALTRUISTIC BEHAVIOR AND THE PARENT-CHILD RELATIONSHIP 
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Birth order was controlled by using first- 
born children from small families, for several 
reasons. First, there is more parent-child 
interaction in small families (Nisbet, 1961) 
and the intensity of these interactions is less 
diluted by child-child interactions (Clausen, 
1966). Second, the parent-child relation is an 
especially salient part of the lives of first- 
borns since they have all had the experience 
of being the only child. Parents also typically 
pitch their conversation at the level of the 
first-born child (Bossard & Boll, 1956; Koch, 
1954), and fathers are more likely to be in- 
volved in their socialization (Clausen, 1966; 
Henry, 1957; Sears et al., 1957). Finally, the 
“ripple effect” (Gump & Kounin, 1960) — 
the effect of witnessing discipline adminis- 
tered to another child—should be less pro- 
nounced, hence less of a confounding factor 
for first-born children, since their initial ex- 
periences with parental discipline involve 
discipline directed at them, rather than at an 
older sibling. Parental influence should thus 
be highlighted in first-born children, although 
it is also possible that the homogeneity of 
such a sample might preclude obtaining evi- 
dence for this influence. 


METHOD 
Sample 


The subjects were 40 male and 40 female fifth 
graders in an affluent middle-class school district in 
suburban Detroit. All were white, first-born children 
from intact homes, and none was rated as a be- 
havior problem by the teachers. All the parents had 
completed high school and at least one in each family 
had gone to college. 


Assessment of Altruistic Behavior 


Our aim was an index of the child’s altruistic 
behavior in a natural setting. Because of the birth- 
order and IQ requirements, the subjects had to be 
selected from a larger pool and were thus distributed 
throughout the city, making direct observations im- 
practical. The most obvious alternative was to ob- 
tain the data from their classmates, who were in 
constant contact with them and who, in contrast to 
adult informants, could be counted on to discount 
behaviors clearly designed to gain adult approval, 
To assure their having had enough contact with 
the subjects to make valid judgments, the data were 
collected near the end of the school year, 

The procedure was the same as that used by 
Hoffman and Saltzstein (1967) and Mussen et al. 
(1970). The subject’s classmates gave responses to a 
sociometric questionnaire consisting of the following 
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items: (a) Which three boys (girls) in your class 
are most likely to follow the rules even when the 
teacher is not around? (b) . - . to care about how 
other kids feel and try not to hurt their feelings? 
(c) . . . to stick up for some kid that the other 
kids are making fun of or calling names? (d) .. + 
to blame themselves a lot if they do something 
wrong? The subject’s altruism score was the total 
number of nominations he received for items b 
and c divided by the number of possible nominators, 
that is, the number of children in his class. 

Although not used, teacher ratings of the extent to 
which the children were “considerate of their peers 
and will often go out of their way to make some 
sacrifice for them” were also obtained. The correla- 
tion (gamma) between these ratings and the socio- 
metric index was .34 for boys (p<.05) and .47 
for girls (p < 01). 


Parent Measures 


Parental data were obtained from individual 
interviews administered by trained female interview- 
ers. The fathers and mothers were interviewed sepa- 
rately, on different days. 

Personal values as index of altruistic parental 
model. It was not possible to obtain actual observa- 
tional data on the parents, and the next best thing 
seemed to be an index of the salience that helping 
other people in need had in their value hierarchies. 
The more salient the parent’s altruistic values, the 
more frequently he would be expected to express 
them in words and in action, The measure consisted 
of a list of 18 values which were described as being 
“important to men (women) your age.” Included 
were such items as respecting one’s parents, having a 
good sense of humor, showing consideration of other 
people's feelings, putting work before play, going 
out of one’s way to help other people, trying one’s 
best in everything one does, following rules and 
regulations, having confidence in oneself, and attend- 
ing church regularly. The respondent was first asked 
to rate how important each value was for him (her) 
on a 4-point scale, and then to rank order the three 
most important ones. Scoring weights of 3, 2, and 1 
were assigned to the ranked items. Unranked items 
were not scored (the initial ratings were included 
primarily to make sure the respondent thought about 
all the items before ranking them). The weights for 
the two altruism items (italicized in the above list) 
were summed to obtain the parent’s score on altru- 
istic values. 

This measure has been used by the writer with 
preadolescent subjects and found to relate positively 
and significantly to peer ratings of overt altruistic 
behavior. There is also some evidence that it relates 
to overt altruism for mothers in this sample. A 
mail-in questionnaire, returned by 56 mothers, in- 
cluded an item asking them to describe instances in 
which they used themselves as a model of good 
behavior for the child. The number of altruistic 
behaviors mentioned by the mothers was found to 
relate positively and significantly to their altruistic 
value scores. More general support for the assumed 
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positive relation between values and behavior in 
adults comes from Henshel’s (1971) finding that the 
correlation between placing a positive value on 
honesty and overt honesty increases dramatically 
with age, reaching a rather high level (.78) by 
early adolescence. 

Victim-centered discipline. The discipline measure, 
a modification of that used by Hoffman and Saltz- 
stein (1967), consisted of situations in which a child 
does harm to another person under three conditions 
—with provocation, without provocation but with 
social support, and with neither provocation, social 
support, nor malicious intent. The items were: 

1. The child and a friend are building a house of 
blocks. The friend says, “My house is nicer than 
yours. Yours doesn’t even look like a real house.” 
The child gets mad and kicks the friend’s house 
apart, whereupon the friend starts to cry. 

2. The child and his friends are making fun of 
another child. 

3. While in a restaurant a crippled man is sitting 
at the next table, and as he leaves, the child says in 
a voice loud enough for the man to hear, “Why 
does that man walk so funny?” 

The parent was asked to imagine each situation 
occurring when his child was 5 or 6 years of age, 
and to describe what he would have done or said 
in each case. The interviewers were specially trained 
to probe when necessary to obtain a uniformly ~ 
high degree of specificity of response, and every 
effort was made to obtain the actual words and ac- 
tions the parents reported using in the situations. 

These scoring weights were assigned: 3—if the 
parent explicitly encouraged an act of reparation 
such as rebuilding the friend’s house or offering to 
play with the child who had been made fun of; 2— 
if the parent encouraged the child to apologize to 
the victim; and 1—if the parent showed concern 
for the victim’s feelings. When two or more indi- 
cators were present, the weights were added. Thus, 
the maximum score possible for each item was 6 
(few 6s were obtained, none for the third item in 
which reparation and apology would be awkward). 
The weights for the three items were summed to 
obtain the score for victim-centered discipline. Two 
coders were used and they agreed 97% of the time. 
Although the validity of parental reports of past 
behavior is never certain, the parents were confident 
of their responses, and several mentioned that the 
interactions with their oldest child stood out in 
memory. The parents’ recall may also have been 
aided because they had only one or two other 
children to keep in mind and because the discipline 
situations in question did not often occur in the 
intervening years. 

Parental affection. An index of affection expressed 
toward the child when he was quite young was 
desired, since current affection might be partly a 
response to altruism and related behaviors in the 
child rather than a contributing factor. The affec- 
tion data thus had to be obtained from parental 
reports. For reasons already mentioned, it seemed 
safe to trust their recall of interactions with the 
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oldest child. To minimize “social desirability” (a 
possible problem since parents presumably know 
that showing affection is good), the measure used 
by Hoffman and Saltzstein (1967) was chosen be- 
cause it permitted the parent to indicate strong 
interest and concern for the child without neces- 
sarily having to claim that he often expressed af- 
fection directly. The parent was asked to indicate 
how frequently he engaged in each of 16 behaviors 
“when the child was about five or six years of age.” 
Included were items like praising the child for some- 
thing he did, preventing him from doing something 
he wanted to do, hugging or kissing him, playing 
games with him, joking around with him, punishing 
or scolding him, calling him by a Special pet name 
that he likes, giving him personal advice, smiling at 
him, explaining about things that go on in the world, 
talking with him about school work. The frequency 
scale used was: twice or more a day, about once a 
day, about once a week, about once a month, about 
once a year, and never, These responses received 
weights of 5, 4, 3, 2, 1, and 0, respectively, and the 
affection score was obtained by summing the weights 
for the five direct affection items (italicized in the 
above list). 


Controls and Statistical Analysis 


Birth order and number of siblings were controlled 
by using subjects who were all the oldest of two or 
three, and at least 2 years older than the second-born 
child. The control on IQ, deemed necessary because 
IQ has been found to relate to altruistic behavior 
(Hoffman & Saltzstein, 1967; Mussen et al., 1970), 
was accomplished by selecting only children who 
scored above the 65th percentile on the SRA Pri- 
mary Mental Abilities Test the previous year. In 
the final sample there was no relationship between 
1Q and any of the parent or child variables. 

The nonparametric correlational Statistic, gamma, 
was used because some of the distributions were 
skewed, 


RESULTS AND DISCUSSION 


The correlations obtained were all positive, 
in accord with the hypotheses, and almost 
half of them were statistically significant 
(see Table 1). Significant correlations were 
obtained between altruistic behavior in the 
children and altruistic values in the parents 
of the same sex; a correlation of borderline 
significance was also obtained between the 
girls’ altruistic behavior and their fathers? 
altruistic values. The children’s altruism was 
also correlated significantly with the use of 
victim-centered discipline, though unexpect- 
edly this relation was obtained only with 
respect to parents of the opposite sex. Fi- 
nally, a positive relation was obtained be- 
tween maternal affection and altruism in boys. 
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TABLE 1 


RELATIONSHIP (GAMMA) BETWEEN Cuinp’s ALTRUISTIC 
BEHAVIOR AND PARENTAL CHARACTERISTICS 


Parental characteristic enol E 

Mother 

Altruistic values shi 05 

Victim-centered discipline OL Se iaia 

Affection 18 HA is 
Father 

Altruistic values .34* 35" 

Victim-centered discipline PR he 12 


Affection | 12 -08 


*p <10 {retailed 
** p <05 (two-tailed). 
“+p < 101 (two-tailed), 


The findings thus provide general support 
for the proposition that all three parental 
attributes—values, discipline, and affection— 
can help foster the development of altruism 
in the child.* There is no evidence that all 
three are necessary in any of the parent-child 
dyads. It does appear, however, that altruism 
in boys is associated with all three, though 
not as manifested in the same parent, and in 
girls it is associated with two of the three, 

The most influential parental attribute ap- 
pears to be the espousal of altruistic values, 
which presumably reflects the parent’s role as 
an altruistic model. Its importance in this 
study is perhaps due partly to the first-born 
status of all of the subjects, since this means 
that none of them had older siblings who 
could serve as models intervening between 
them and their parents. The fact that the 
parent’s altruistic values are associated with 
altruism in the same-sex child fits well with 
the general assumption in the literature that 
identification with the same-sex parent is a 
significant factor in socialization. ‘The rela- 
tion obtained between the girl’s altruistic be- 
havior and the father’s altruistic values, and 
the absence of such a relation for boys and 
mothers, may be accounted for by the fact 
that identification with the mother is rare in 
boys. Girls, on the other hand, though their 
primary identification is typically with the 
mother, will also often identify with the 
father. The fact that fathers tend to be more 

2 The logic of inferring causality in correlational 
research of this kind is discussed elsewhere (Hoff- 
man, in press). 
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involved in the socialization of the oldest 
child, as compared to the youngest (Clausen, 
1966; Henry, 1957; Sears et al., 1957), may 
also contribute to the influence of the father’s 
values, as well as discipline, on altruism in 
girls. 

The finding that affection relates to altru- 
ism only in boys is consistent with the previ- 
ous research (Hoffman & Saltzstein, 1967; 
Mussen et al., 1970). A possible explanation 
is that the range of affection scores was re- 
stricted for the girls, This was tested and 
found not to be the case: There was no sig- 
nificant sex difference in the range of affection 
scores. The mean difference in amount of 
affection expressed, however, was significant. 
Both parents reported expressing more affec- 
tion to girls than boys. This difference, which 
has also been found previously (e.g., Armen- 
trout & Burger, 1972), suggests, in keeping 
with our initial assumptions about the role of 
affection, that a certain minimum amount of 
affection may be necessary for the child to 
become receptive to the parent’s victim-cen- 
tered discipline, or secure enough to be open 
to the needs of others. Individual differences 
above the minimum may have no measurable 
effect. If most girls are above the minimum, 
this could account for the lack of relation be- 
tween affection and altruism in girls. The 
affection given boys may often be below this 
minimum, however, with the result that vari- 
ations in amount of affection do make a dif- 
ference. The importance of affection in boys 
may thus be accounted for, although this ex- 
planation requires further investigation. 


CONCLUSIONS 


The findings suggest that altruistic chil- 
dren have at least one parent who communi- 
cates altruistic values and thus serves as an 
altruistic model, and at least one who fre- 
quently uses victim-centered discipline. The 
parent of the same sex as the child appears 
to be influential as a model (although fathers 
may also serve this function for girls), where- 
as discipline appears to be effective when 
used by the parent of the opposite sex. The 
importance of affection seems confined to 
boys. Further research is needed to determine 
whether the particular pattern obtained here 
is idiosyncratic to the present sample, is appli- 
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cable to other groups of first-born children, or 
can be generalized more widely. 
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Devaluation of Innocent Victims: 
An Attribution Analysis Within the Just World Paradigm 


Bruce W. Godfrey and Charles A. Lowe 
University of Connecticut 


Previous explanations for devaluation of victims have focused on observers’ 
need to believe in a just world. An attribution analysis suggested that nega- 
tive characterological information inadvertently conveyed by experimental 
manipulations could account for previously observed devaluation. This analysis 
led to the prediction that relative to victims who appear extrinsically moti- 
vated by experimental pressure, similar victims who appear intrinsically moti- 
vated by the worth of the study should not be devalued. A comparison be- 
tween two extrinsic motivation conditions conceptually replicating Lerner and 
Simmons and two intrinsic motivation conditions revealed that only extrinsi- 
cally motivated victims were devalued. Implications of these results for attri- 
bution versus just world explanations of devaluation were discussed. 


Although awareness of misfortune besetting 
others usually elicits sympathy and compas- 
sion, a number of studies have shown that 
victims of misfortune are sometimes devalued 
(Lerner, 1971; Lerner & Simmons, 1966; 
Simons & Piliavin, 1972). To explain these 
instances, Lerner formulated the just world 
hypothesis which postulates a need to believe 
in a fair, orderly world where people get 
what they deserve and deserve what they get. 
Just world motives are aroused, for example, 
when observers are confronted with an unde- 
niable misfortune befalling a victim. If this 
situation precludes compensating the victim, 
observers should restore justice by seeing the 
victim as deserving his fate, specifically, by 
perceiving the victim as either behaviorally 
deserving (e.g., he did something sufficiently 
bad) or personally deserving (e.g., he is a 
bad person). Although observers would prefer 
to employ the former tactic (Lerner, 1970), 
they will devalue the victim’s personal worth 
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when his behavior cannot be faulted. The 
greater the degree of perceived injustice, the 
greater the subsequent devaluation. 

Early confirmation of this prediction was 
provided by Lerner and Simmons (1966). 
They had observers view a videotape of a 
female student receiving a series of electric 
shocks during the course of a paired-associate 
learning experiment and then obtained ob- 
servers’ reactions to this innocent victim. 
Relative to observers’ self-evaluations, the 
victim was devalued whenever the degree of 
injustice was increased either by having the 
suffering continue or by portraying the suffer- 
ing as an altruistic act. These results and 
those of subsequent studies employing essen- 
tially the same paradigm (Lerner, 1971; Si- 
mons & Piliavin, 1972) have been interpreted 
as support for the just world explanation of 
devaluation. 

An analysis of the experimental procedures 
employed in these studies suggests the possi- 
bility that the observed devaluation was based 
on information contained in the experimental 
format but extraneous to the intended manip- 
ulations. If so, this devaluation could be ac- 
counted for by an alternative information- 
processing explanation which rests on the 
attribution theory assumption that impres- 
sions of others are based primarily on an 
objective consideration of behavior and its 
surrounding circumstances (Jones & Davis, 
1965; Kelley, 1971). When confronted with 
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ian a person who engages in the unusual behavior 
of letting himself receive a series of noxious 
and painful shocks, observers’ subsequent 
impressions would depend heavily on their 
i inferences regarding the probable cause of 
i this behavior. 
Extending Nisbett and Valins’ (1971) at- 
tribution analysis to the devaluation situation, 
the relevant question is whether the victim’s 
behavior is seen to be caused by forces ex- 
trinsic or intrinsic to the stimulus (e.g., the 
ongoing study described to subjects). When 
the victim is coerced into participating, either 
| y overt pressure from the experimenter or 
jis covert social pressure from peers, observers 
should see these extrinsic forces as sufficient 
reason for the victim’s behavior. In these 
instances, the victim would be seen as extrin- 
sically motivated, would appear weak and 
yielding, and hence should be devalued. In 
other instances, properties intrinsic to the 
stimulus could be seen as causing the victim’s 
behavior, for example, when strong extrinsic 
= pressures are absent or when the victim ex- 
plicitly states his belief in the study’s worth. 
In these instances, a belief inference—that 
the victim believed in the worth of the study 
—is likely to be made, and the victim would 
be seen as intrinsically motivated (Nisbett & 
Valins, 1971, p. 66). Since receiving the 
shocks with the intention of furthering a 
worthwhile study is an acceptable motive, 
subsequent devaluation should be precluded. 

In short, the above attribution explanation 
suggests that devaluation depends on observ- 
ers’ inferences regarding the cause of the 
victim’s behavior. Intrinsically motivated vic- 
tims who do not appear to yield to illicit 
= environmental demands should not be de- 
valued; extrinsically motivated victims who 
appear to suffer for these “wrong” reasons 
should be devalued. 

An inspection of the prototypical just 
world study (Lerner & Simmons, 1966) re- 
veals that those experimental conditions which 
produced significant devaluation could also 
have fostered the perception of extrinsic 
motivation. In two conditions (“reward deci- 
sion” and “midpoint”), observers believed 
that the victim would allow herself to receive 
a second series of shocks. After watching the 
victim suffer rather severely in the first series, 


a” 
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subjects could easily have assumed that the 
victim was excessively weak and yielding 
when she allowed herself to be subjected to 
this second series. In a third condition which 
produced the most devaluation (“martyr”), 
the victim initially refused to participate but 
after coaxing from the experimenter agreed 
to receive the shocks. Subjects could easily 
have believed that this agreement was caused 
by illicit pressure from the experimenter and 
that this victim was weak and yielding.’ 
Devaluation conditions in the replication 
studies (Lerner, 1971; Simons & Piliavin, 
1972) may be similarly interpreted. 

This analysis suggests that the attribution 
explanation may adequately account for de- 
valuation of innocent victims without recourse 
to a hypothesized drive to perceive a just 
world.* Moreover, in contrast to just world 
theory (Lerner, 1970, p. 227), attribution 
theory would predict that victims who have 
good motives should not be devalued when 
they are observed to suffer unjustly. To test 
this assertion, two experimental conditions 
which met just world criteria for devalua- 
tion but which portrayed the victim as in- 
trinsically motivated were compared with two 


1In an unpublished study reported in Lerner 
(1970), this criticism, that the devaluation of the 
martyr resulted from her yielding to external pres- 
sure, was investigated. After observing a victim in 
a simple digit-span learning situation, observers were 
told that a second task would either involve electric 
shocks or no shock. The victim was portrayed as 
agreeing to go through with this second task for the 
sake of the observers, although she really wanted to 
avoid doing it. The results indicated that a martyr 
who yielded to external pressure to participate was 
devalued only when the second series would involve 
shocks. We interpret this result to indicate that 
changing one’s mind and agreeing to perform a 
behavior does not make one appear weak and 
yielding and should not lead to devaluation when the 
behavior appears reasonable to observers (i.e, no- 
shock situation). However, when one is talked into 
agreeing to receive a series of shocks, a behavior 
which most observers would find quite noxious, one 
does appear weak and yielding and should be nega- 
tively evaluated. 

2It should be noted that the above attribution 
analysis as presented applies to rather specific in- 
stances of devaluation. This analysis should not be 
interpreted to suggest that devaluation will always be 
based on objective considerations or that the percep- 
tion of extrinsic motivation is the only condition 
which could produce devaluation. 
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experimental conditions which conceptually 
replicated the devaluation conditions in the 
original Lerner and Simmons (1966) study. 
Intrinsic motivation was manipulated by vary- 
ing the victim’s statements regarding the 
worth of the study and her willingness to 
participate. In the volunteer condition the 
victim was portrayed as intrinsically moti- 
vated by having her actually volunteer, stat- 
ing that although she did not like the idea of 
being shocked she did believe in the worth of 
the study and therefore would participate. 
Since this act of volunteering could lead to 
the perception that the victim brought about 
her own suffering and if so would reduce 
subsequent derogation (Lerner & Matthews, 
1967), a second intrinsic motivation condition 
(good reasons) was employed. In this condi- 
tion the victim was purportedly chosen at 
random by the experimenter, thus precluding 
the possibility that she would be seen as re- 
sponsible for her own fate. This victim was 
portrayed as intrinsically motivated by hav- 
ing her state the same reasons for her par- 
ticipation as those given by the volunteer 
victim. 

In the random condition, replicating the 
“midpoint” condition in Lerner and Simmons 
(1966), the victim was randomly chosen and 
when asked to proceed to the shock labora- 
tory left the room without comment. Since 
this victim did not state her feelings, her 
motivation had to be inferred by observers, 
presumably on the basis of their own feelings 
regarding the experiment. Finally, in the un- 
willing condition, replicating the “willing 
martyr” condition in Lerner and Simmons 
(1966), a randomly chosen victim initially 
refused to participate when she learned that 
she would be shocked. The experimenter 
then briefly persuaded her to participate em- 
ploying the same reasons given by the victim 
in the intrinsic motivation conditions. Thus, 
the appearance of extrinsic motivation was 
made clear. 

It is important to note that all of the 
above victims would obviously suffer, would 
not be compensated, and would do nothing 
wrong to merit their suffering. These condi- 
tions meet the just world criteria for devalu- 
ation. Thus, the just world prediction would 
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be devaluation in all four experimental condi- \, 
tions, especially when the victim suffers for 
altruistic motives (volunteer and good rea- 
sons conditions), since such suffering would 
be highly threatening to just world beliefs 
(Lerner & Simmons, 1966, p. 205).° 

The attribution prediction would be that 
devaluation should occur only in the random 
and unwilling conditions. This devaluation 
would result not from just world motives 
aroused by the observation of unjust suffer- 
ing but from the observers’ cognitive inter- 
pretation of the situation and their causal 
attributions for the victim’s behavior. In these” 
conditions the victim would appear to be ex- 
trinsically motivated to agree to engage ina 
self-sacrificing behavior. Victims in the re- 
maining conditions (good reasons and volun- 
teer) should not be devalued. Although these 
victims also suffer, they do not perform the 
unreasonable behavior of allowing themselves 
to suffer for that which they do not believe 
worthy. Thus, the good reasons and volunteer 
victims should be rated more favorably than s: 
the devalued random and unwilling victims. 
This pattern of results would lend support 
to the attribution interpretation for the de- 
valuation obtained in previous just world 
studies. 


METHOD 
Subjects 


The subjects were 66 female students from the C 
introductory psychology subject pool. Subjects were 
run in groups of from 4 to 6 including an experi- 
mental confederate. Each group was randomly 
assigned to one of four experimental conditions. 
The data from 10 subjects were not analyzed: 1 
subject failed to respond to all questions, and 9 
subjects were judged to be suspicious; this suspicion 
resulted largely from prior knowledge of false shock 
experiments. In order to be consistent with the 
Lerner and Simmons’ (1966) study, only females 
were employed as subjects. 


3 As mentioned in the text, the volunteer victim 
could be seen as responsible for her own fate, as 
behaviorally deserving, and hence need not be 
devalued (Lerner & Matthews, 1967). However, the 
good reasons victim personifies Lerner’s (1970, p. 
227) definition of a “true martyr.” She is not the 
direct cause of her own misfortune but rather is 
someone whose good motives make her vulnerable 
to others who inflict the suffering on her, a clear 
case of injustice. 


Procedure 


Subjects and a confederate arrived at the experi- 
mental room and were seated in a semicircle facing a 
television monitor. The experimenter introduced the 
study as involving the perception of persons in 
stress. He emphasized the worth of the study by 
explaining that the information thus acquired would 
be employed to assist those who have to deal with 
people undergoing stress, such as accident or disaster 
victims. The experimenter revealed that since stress 
was difficult to study, a decision had been made to 
choose one subject from each session to be put ina 
stressful condition. To produce this condition, the 
chosen subject would receive a series of mildly severe 
shocks that were painful but not harmful. The re- 
7 ygmaining subjects would observe, by closed-circuit 
television, the reactions of this subject receiving 
the shocks." At this point, all subjects were asked to 
indicate their estimates of the unpleasantness of the 
shock to be administered (1=very unpleasant, 9 = 
not at all unpleasant), their perceived worth of the 
study (1=very worthwhile, 9=not at all worth- 
while), and their own willingness to volunteer to be 
shocked (1 = very willing, 9=not at all willing). 

The experimental manipulations were introduced 
at this point and consisted of selecting the victim 
and her statements regarding her participation. 

1, Random condition (n =12): The experimenter 
explained that the person to be shocked would be 
chosen by a ssat number randomly selected prior to 
the session. The person sitting in the seat matching 
that number would receive the shock. The experi- 
menter then gave the number matching the confed- 
erate’s seat. Thus chosen, the confederate victim 
was asked to go to room 436, where Dr. Lowe 
would be waiting, and left the room without com- 
ment, 

2. Unwilling condition (n=14): The procedure 
was identical to the above until the point where the 
confederate had been chosen. At that point, the 
experimenter asked the confederate if she were 
willing to participate in this role. She replied that 
she did not want to take part if she had to be 
shocked. Although the worth of the study had 
previously been stressed by the experimenter, the 


*The rationale used by Lerner and Simmons 
(1966) relied on assisting military officers and busi- 
ness administrators and was not copied because it 
was felt to be outdated. Further, the paired-asso- 
ciate learning task employed in that study may have 
3 contributed to the observed devaluation. Having 
i shocks contingent upon mistakes possibly made the 
— victim appear inept. Although most psychologists 

a might expect mistakes in a paired-associate learning 
task, we do not know if introductory psychology 
students share this expectation, Lerner’s subjects 
may not have expected a participant in any psy- 
chology experiment to be shocked as frequently and 
as severely as the tape depicted and may have con- 
cluded that the victim was unusually stupid or prone 
to make mistakes. This belief could in part account 
for the overall strong rejection of the victim. 
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unwilling victim had apparently not been motivated. 
An exchange followed in which the experimenter 
mentioned that the decision to participate was up 
to the subject. The exchange was designed so that 
the experimenter appeared to persuade the confed- 
erate to participate by reminding her that the 
experiment was worthwhile and that her participa- 
tion would be helpful. The confederate then agreed 
to participate and left for the experimental room, 

3. Good reasons condition (n=15): The pro- 
cedure here was identical to that of the unwilling 
condition up until the point where the confederate 
was asked whether she were willing to participate, 
In this condition, the confederate demonstrated 
intrinsic motivation by employing the same reasons 
supplied by the experimenter in the unwilling condi- 
tion as the confederate’s own reasons for participat- 
ing. She replied that the area of research sounded 
worthwhile and that although she did not like the 
idea of being shocked, she would participate if it 
would be helpful. She then left for the experimental 
room. 

4. Volunteer condition (n = 15): In order to por- 
tray a true volunteer, the procedure was changed 
to eliminate the selection by seat number. Instead, 
the experimenter asked if anyone would be willing 
to volunteer. The confederate then volunteered and 
made a statement identical to the victim in the 
good reasons condition, that although she did not 
like the idea of being shocked, she thought the 
area of research sounded worthwhile and would 
participate if it would be helpful. The confederate 
then left for the experimental room. 

Once the confederate had left the room, subjects 
were asked to rate the hypothetical average female 
taking introductory psychology on 15 bipolar adjec- 
tives employing 11-point scales. A second question- 
naire was then administered to obtain Subjects’ esti- 
mates of how the confederate had answered the 
three questions asked initially regarding the un- 
Pleasantness of the shock, the worth of the study, 
and willingness to participate. These responses 
served to determine the extent to which observers 
saw the victim as intrinsically motivated. 

On completion of this questionnaire, the experi- 
menter explained that the victim would read from a 
dictionary while receiving shocks for approximately 
5 minutes. A rest period would then be taken in 
which subjects’ responses would be obtained. After 
this, the girl would again read and receive 5 minutes 
of shocks, The experimenter then casually said that 
they should be ready shortly upstairs. This served 
as a cue to the confederate listening by a hidden 
microphone to start a videotape, which shortly 
appeared on the monitor. The same videotape was 
used in all conditions. A man wearing a white coat 
was seen to make a final adjustment of an electrode 
and then flip a series of switches on two shock 
generators. The confederate was told to begin and 
was then seen giving a convincing performance of 
receiving what were supposed to be intermittent 
painful shocks for a period of 5 minutes while 
attempting to read. 
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After viewing the videotape, subjects were asked 
to rate the confederate on the same 15 bipolar 
adjective scales employed earlier. They also indi- 
cated their perceptions of the unpleasantness of the 
victim’s experience (1 = very unpleasant, 9=not at 
all unpleasant), the similarity between themselves 
and the victim (1=similar, 9= dissimilar), and 
their assessment of the worth of the study (1= 
very worthwhile, 9=not at all worthwhile). The 
experimenter then thoroughly debriefed the sub- 
jects, checking for suspicion and explaining the 
nature and purpose of the study. 


RESULTS AND DISCUSSION 


An important procedure in any inference 
experiment is to examine subjects’ perceptions 
of the causal properties of the experimental 
situation. Three such assesments were ob- 
tained prior to selecting the victim, Respond- 
ing on 9-point scales where low scores indi- 
cated very worthwhile, very unpleasant, and 
very willing, subjects indicated that they 
thought the study was worthwhile (M = 
3.09), saw the shocks as fairly unpleasant (M 
= 4.61), and were rather unwilling to volun- 
teer to be shocked (M = 6.96). 

After the victim had either volunteered or 
been randomly selected, subjects were asked 
to indicate how they thought the victim had 
responded to these same three questions. The 
results of analyses of variance performed on 
these scores indicated that the experimental 
groups differed significantly, F (3,52) = 
24.46, p < .001, for worth of study; F (3, 
52) = 27.58, p< .001, for willingness to 
participate; and F (3,52) =4.86, p< 01, 
for unpleasantness of shock. The means for 


TABLE 1 


MEANS For PERCEPTIONS OF THE VICTIM’S FEELINGS 
REGARDING SHOCK UNPLEASANTNESS, WILLINGNESS 
TO PARTICIPATE, AND STUDY WORTH PRIOR 
TO RECEIVING SHOCK 


RG Shock ae ; 
Experimental ck _ | Willingness Study 
condition unpleasant- | to participate| worth 
Volunteer* 4.60. 2.53a 1.20, 
Good reasons* 3.479 5.53, 1 40a 
Random? 3.58ab 6.425 5.00, 
Unwilling® 2.00; 7.64. 4.50s 


Nole. Low scores indicate that subjects saw the victim as 
believing the study to be very worthwhile, expecting the shock 
to be very unpleasant, and being very willing to volunteer. As 
indicated by Newman-Keuls tests, column means not having 
the same subscript differ significantly ($ < .05). 

an = 15. 
by = 12, 
en = 14, 
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these measures and the results of Newman- $ 
Keuls tests performed on these means are 
presented in Table 1. 

An inspection of Table 1 reveals that all 
victims were seen as having believed that the 
shock would be fairly unpleasant. However, 
the unwilling victim was rated as believing 
the shock would be more unpleasant (p < 
05) than was either the good reasons or 
volunteer victim. This rating was reasonable 
in that this victim openly stated her reluc- 
tance to be shocked as her reason for non- 
participation. While these undesired differ- 
ences were a source of concern in that they% 
could have differentially influenced perceived 
suffering, subjects’ perceptions of shock un- 
pleasantness obtained after the videotape 
shock session showed no differences among 
the four experimental groups, F (3,52) < 1, 
ns. 

An inspection of Table 1 suggests that the 
experimental manipulations were effective in 
producing the desired differences in subjects’ 
inferences of the victim’s intrinsic motivation ~ 
to participate in the shock role. The volunteer 
victim was seen as more willing (p < .05), 
and the unwilling victim was seen as less 
willing (p < .05) than was each of the other 
victims. The difference between the means for 
good reasons and random victims, although 
in the appropriate direction, did not reach an 
acceptable level of significance. However, 
subjects did make the predicted belief infer- » 
ences regarding the victim’s perception of the 
study’s worth. Both the volunteer and good 
reasons victims were thought to believe the 
study to be more worthwhile (p < .05) than 
were either the random or unwilling victims. 
Neither the volunteer and good reasons vic- 
tims nor the random and unwilling victims 
were seen to differ from each other in their 
perceptions of worth. 

Finally, subjects’ perceptions of the un- 
pleasantness of the shock and the worth of * 
the study obtained after the videotape shock 
session showed no differences among the four 
experimental groups, F(3,52) <1, for both 
measures. However, relative to their initial 
perceptions, all subjects now thought that the 
shocks were more unpleasant, 4.61 versus 
1.79; #(110) =9.07, p<.001. This dif- 
ference suggests that the victim’s expectation 


had been violated and that she probably got 
more than she had bargained for in receiving 
the shocks. Further, all subjects indicated 
that they now believed the study was less 
worthwhile, 3.09 versus 4.27; #(110) = 
3.20, p< .01. Thus, if the victim initially 
expected that the shock would be compen- 
sated for by intrinsic gratification, this was 
probably not the case. 


Evaluation of the Victim 


Subjects rated both the victim and the 
average girl taking introductory psychology 
on 15 bipolar adjectives. Total ratings were 
obtained correcting for direction, and the 
evaluation score was defined as the victim 
total minus the average girl total. An analy- 
sis of variance conducted on these difference 
scores indicated that the experimental ma- 
nipulations had influenced subjects’ feelings 
toward the victim, F (3, 52) = 6.31, p < .001. 
Mean evaluation scores and the results of 
a Newman-Keuls test performed on these 
. means are presented in Table 2. 

As expected, both the volunteer and good 
reasons victims were rated more positively 
(p < .01) than either the random or unwill- 
ing victim. Differences between the volunteer 
and good reasons victim and between the 
random and unwilling victims did not ap- 
proach significance. Statistical analyses per- 
formed on the total victim ratings produced 
equivalent results, F(3, 52) = 5.44, p < .01, 
and showed the same pattern of significant 
differences among means as that obtained 
using difference scores. An analysis of vari- 
ance conducted on perceived similarity scores 
revealed no differences among experimental 
conditions, F (3, 52) = 1.13, ns. 

These results reveal that the perception of 
unjust suffering did not always lead to de- 
valuation. When the victim allowed herself 
to be abused for something in which she did 
not believe, she was devalued relative to 
equally suffering victims who had not yielded, 
either silently or with mild persuasion, to 
extrinsic pressures. This manifestation of 
intrinsic motivation precluded the inference 
of coercion and coercibility, which in turn 
would have led to devaluation. 

Correlations were obtained to assess the 
relationship between perceived intrinsic moti- 
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TABLE 2 
MEAN EVALUATIONS GIVEN TO VICTIM 


Experimental condition Evaluation 
Volunteer* 12.00, 
Good reasons* 9.67.4 
Random» —2.67, 
Unwilling? —3.29, 


Nolte. Higher scores indicate more favorable evaluations, As 
indicated by a Newman-Keuls test, column means not having 
the same subscript differ significantly (p < .01). 


vation and subsequent evaluations. Although 
the correlations for willingness to participate 
(r = —.08) and for shock unpleasantness 
(r= —.09) did not approach significance, 
the correlation between study worth and 
evaluation was significant (r = —.26; CR = 
1.94, p < .05). The more subjects saw the 
victim as believing the study was worthwhile, 
the less they devalued her after watching her 
suffer. 

Thus, the controlling component of per- 
ceived intrinsic motivation appears to have 
been the belief inference that the victim saw 
the study as worthwhile. Perceptions of vic- 
tims’ expectations regarding shock unpleas- 
antness did not affect devaluation or per- 
ceived suffering. Although the random, good 
reasons, and volunteer victims were each seen 
to expect less unpleasant shocks than was the 
unwilling victim, both the unwilling and the 
random victims were devalued more than 
the good reasons or volunteer victims, and all 
victims were seen to suffer equally. Apart 
from considerations of worth, perceived will- 
ingness also had little effect on subsequent 
devaluation. Victims who were perceived to 
be equally willing to participate (e.g., good 
reasons and random victims) were differen- 
tially devalued in line with perceived differ- 
ences in their belief in the worth of the 
study. Conversely, victims who believed the 
study was not worthwhile (e.g., random 
and unwilling victims) were devalued equally, 
even though perceptions of their willingness 
differed. 

The above results taken together indicate 
that devaluations of innocent victims may be 
accounted for by a cognitive inference process 
in which observers analyze the available in- 
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formation regarding a victim’s behavior and 
its controlling determinants. This proposed 
process represents an alternative to the just 
world motivational explanation for devalua- 
tion and suggests that experimental ma- 
nipulations employed in previous devaluation 
studies (Lerner, 1971; Lerner & Simmons, 
1972; Simons & Piliavin, 1972) may have 
had unintended effects. For example, a victim 
who yielded to experimental coercion to be 
shocked solely so that observers could obtain 
lab credits (Lerner & Simmons, 1966, p. 207) 
may have appeared more a fool than an 
“altruist.” Similarly, while leading subjects to 
believe that a victim had apparently agreed 
to undergo a second series of painful shocks 
was intended to communicate that the suffer- 
ing was not “a single relatively isolated 
event” (Lerner & Simmons, 1966, p. 204), 
this manipulation could also have portrayed 
the victim as weak and yielding. This re- 
interpretation of previous devaluation studies 
together with the present results suggests that 
it may be possible to account for devaluation 
of innocent victims without recourse to an 
hypothesized need to restore justice. 

It should be noted that the present study 
has shown relative differences in victim de- 
valuation. Since no control or minimal suffer- 
ing condition was employed, we cannot be 
certain that the extrinsically motivated vic- 
tims were actually devalued in any absolute 
sense, especially since the average level of 
evaluation was considerably higher than that 
obtained by Lerner and Simmons (1966). 
However, there were a number of differences 
between the present study and Lerner and 
Simmons. In addition to employing a different 
confederate and different adjective scales, the 
present study also had subjects meet the 
confederate face to face, compared victim 
ratings to “average” rather than to “self- 
ratings,” enabled the victim’s behavior to have 
positive consequences for subjects, and found 
that subjects disapproved of the experiment, 
as reflected in decreased perceptions of the 
study’s worth (see Lerner, 1970, p. 211). 
While these features were constant across 
experimental conditions and thus could not 
have differentially influenced victim ratings, 
they probably did raise the average level of 
victim ratings. We can conclude that the ex- 
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trinsically motivated victims were ceitsin! 
devalued relative to intrinsically motivate: 
victims and probably devalued in an absol: 
sense. 

One feature of the present study deserv: 
closer attention, that the behavior of the vic- 
tim benefited subjects by saving them from 
the potentially threatening shock. Since this 
feature probably caused some positive affect 
toward the victim in all conditions, it would 
be of interest to determine if neutral observ- 
ers who were not “saved” by the victim’s 
behavior would produce the same level and 
pattern of victim devaluation. Further, it 
might be argued that having subjects benefit 
from the victim’s misfortune violated the just 
world devaluation paradigm. This would be 
the case if subjects had earned their fate 
thus making the different fates not unjust, 
as in a “parallel competition” situation (see 
Lerner, 1974). However, the present study 
employed a lottery situation where dif- 
ferential fates were not earned but assigned 
(with the possible exception of the volunteer 
victim). In a lottery or “similarity” situa- 
tion (see Lerner, 1974), any departure 
from equal fates is unjust, and the just world 
devaluation paradigm remains intact. 

The present study is by no means a com- 
plete test of attribution versus justice theory 
but does represent one instance within the 
just world paradigm where the two theories 
make competing predictions. Since both the 
good reasons and yolunteer victims obviously 
suffered, were not compensated, and had 
done nothing wrong to merit their suffering, 
the just world explanation would predict de- 
valuation, especially for the good reasons 
victim who did not bring on her fate by 
volunteering. This victim would seem to 
characterize a true martyr, whose good 
motives (i.e. a belief that the study was 
worthwhile) made her vulnerable to the ex- 
perimenter who inflicted suffering on her (cf. 
Lerner, 1972, p. 227). As predicted by the 
attribution explanation, however, neither vic- 
tim was devalued, in sharp contrast to the 
provocative just world assertion that martyrs 
are rejected. 

The present study has demonstrated that, 
in some instances, devaluation of victims may 
be adequately explained by an information- 
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processing model. This demonstration is con- 
sistent with recent results reported by Stokols 
and Schopler (1973) showing similar objec- 
tive appraisal of victims. Although beyond 
the scope of the present article, an extension 
of the attribution model proposed here to 
account for other instances of just world 
phenomena (e.g., Lerner, 1970; Lerner & 
Lichtman, 1968) and to uncover situations 
where the two explanations would offer op- 
posing predictions would be fruitful. Such an 
attempt would represent one manifestation of 
the recent shift in psychological theorizing 
away from motivational/drive models toward 
information-processing/attribution models (cf., 
Bem, 1972). 
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Boys and Girls Together: Sexual Composition and the 
Effect of Density and Group Size on Cohesiveness 


Joan E. Marshall and Richard Heslin 
Purdue University 


This study examined (a) sexual composition as a moderator of the effect of 
density and group size on members’ attraction toward a group and (b) the 
relative influence of group size compared to density on group cohesiveness 
and member feelings. The usual “small groups are more cohesive” relation 
was obtained for same-sex groups but was reversed for mixed-sex groups. An 
extension of deindividuation theory was proposed to explain the fact that 
women when in mixed-sex groups preferred large groups to small, and crowded 
groups to uncrowded, but the opposite held when in same-sex groups. 


Animal research has implicated the number 
of animals (rather than density per se) as the 
cause of the social pathology that emerged 
under crowded conditions (Calhoun, 1962, 
1966; Christian, 1963; Myers, Hale, My- 
kytowycz, & Hughes, 1971; Theissen, 1966). 
Much research on human crowding, on the 
other hand, has ignored group size as a pos- 
sible effect of reactions to crowding (cf. 
Griffitt & Veitch, 1971; Hutt & Vaizey, 
1966). The purpose of this study was to cor- 
rect this oversight by examining the inde- 
pendent effects of density and group size on 
members’ reactions. It was hypothesized that 
the number of people in a group, regardless 
of how crowded the group is, influences the 
members’ feelings and their desires to remain 
within the group. 

This hypothesis about the effects of group 
size is suggested by earlier research which 
confounded group size and density, the ex- 
perimenters in each case increasing the num- 
ber of people in the same size room (Griffitt 
& Veitch, 1971; Hutt & Vaizey, 1966; Ittle- 
son, Proshansky, & Rivlin, 1970; Rohles, 
1967). Each of these four studies reported 
negative social responses from increased 
“density” (confounded with increased number 
of group members). Hutt and Vaizey found 
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that children were more aggressive and less 
social as density increased. Griffitt and Veitch 
observed that a crowded group evaluated an 
anonymous stranger more negatively than an 
uncrowded group. Rohles (1967) reported an 
incidental occurrence of fighting in a crowded 
group with no similar aggressive incidents 
happening in a less crowded group. Finally, 
patients in a psychiatric hospital were found 
to behave more asocially in more crowded 
wards (Ittleson et al., 1970). It cannot be 
specified whether the amount of space, the 
group size, or the combination of the two 
factors was responsible for the decrement in 
social behavior in these studies, yet most of 
the authors seemed to imply that density 
was the major cause of the antisocial behav- 
ior. The experiment presented here confronted 
this problem by including independent ma- 
nipulations of density and group size in its 
design. 

Some recent investigations on crowding 
have eliminated the confounding of group 
size and density by varying the room size 
for same-sized groups (Freedman, Levy, 
Buchanan, & Price, 1972; Ross, Layton, 
Ericson, & Schopler, 1973; Stokols, Rall, 
Pinner, & Schopler, 1973). The effects of 
density on evaluative responses in these 
studies did not show the consistent negative 
pattern that emerged in the experiments 
where the confounding of size and density 
was present. In the Stokols et al. study, where 
the subjects played a cooperative or competi- 
tive quiz game, no differences were found 
between the high-density and the low-density 
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conditions in subjects? feelings toward the 
group and feelings toward other individuals. 
Furthermore, in an experiment where subjects 
discussed various topics and played a laby- 
rinth game (Freedman et al., 1972), those 
in the high-density condition liked the mem- 
bers of their group more than did those in 
the low-density conditions. 

In two of these studies, sex was found to 
interact with crowdedness. When the group 
task was to listen to jury tapes and arrive 
at various judgments (Freedman et al., 
1972), females in the crowded condition had 
more positive feelings in general and toward 

he group than did females in the uncrowded 
ondition (same-sex groups). Males re- 
Sponded in just the opposite way, being more 
negative when crowded than when not 
cyowded. Density did not affect the dependent 
variables in mixed-sex groups. Similarly, in 
the study by Ross et al. (1973) in which 
subjects had the task of reaching consensus 
on some choice-dilemmas questions, crowded 
females judged both themselves and others 
More positively than did uncrowded females, 
while crowded males judged themselves and 
others more negatively than did uncrowded 
males. The findings in these two studies lead 
to the expectation that females would be 
more attracted to fellow members when 
crowded and males more attracted when 
uncrowded. 

This study also took a fresh look at the 
classic group size-cohesiveness relation. It 
has been consistently found that large groups 
are less cohesive than small groups (Cart- 
wright, 1968; Hackman & Vidmar, 1970; 
Heslin & Dunphy, 1964; Willems, 1964), 
Members of large groups are less attracted 
to fellow members, feel less satisfaction and 
personal security, leave the group more often, 
and evaluate the group more negatively than 
do members of small groups. The strategy of 
the study presented here was to examine the 
effects of density and size on group cohesive- 
ness—the “desire to remain in the group” 
(Cartwright, 1968). An important facet of 
this experiment was a behavioral measure- 
ment of this desire to remain in the group. 
It was expected that more members of small 
groups would want to remain than members 


of large groups. 
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The primary factor considered was the 
sexual composition of the group. As noted 
even a decade ago research on group size 
(Thomas & Fink, 1963) and small groups in 
general (Borgatta, 1963) had tended to 
center on same-sex groups (usually male), A 
look through an updated bibliography of 
small group research (Hare, 1972) indicated 
no change in this situation. Therefore, there 
are scant research findings concerning the 
effect of same-sex versus mixed-sex groups. 
Intuitively, members of mixed-sex groups 
ought to have more positive evaluative re- 
sponses than members of same-sex groups. It 
is proposed that what is unpleasant in same- 
sex groups might be pleasant in mixed-sex 
groups, that the earlier negative responses to 
large group size may have been due to the use 
of same-sex groups. 

In summary, this investigation examined 
the effects of density (4 square feet vs. 18 
square feet, or 372 m? vs. 1,62 m°), group 
size (4 persons vs. 16 persons), composi- 
tion (same sex vs. mixed sex) and sex 
(male vs. female) ina 2x 2x22 design. 
Four dependent variables were included: feel- 
ings toward individuals, feelings toward the 
group, feelings in general, and decision to 
remain within the group. It was hoped that 
new insights into the effect of sexual composi- 
tion on group size and crowding would be 
obtained by using all of the following in one 
study: (a) both males and females, (b) both 
same-sex and mixed-sex groups, (c) indepen- 
dent manipulations of number and density, 
(d) a task that ensured interaction among 
the members, (e) group cohesiveness theory as 
a source of hypotheses, and (f) a behavioral 
measure of attraction toward the group. 


METHOD 
Subjects 


The subjects were 142 female and 142 male stu- 
dents who participated to fulfill an experimental 
requirement for an introductory psychology course. 


Experimental Situation 


The rooms used in this experiment were all in 
the same laboratory complex and were equivalent 
as to interior design, wall color, ceiling height, and 
wall-to-wall carpeting. They differed in size to pro- 
vide the need variations in amount of space per 
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person for large and small, crowded and uncrowded 
groups. These varying floor spaces were as follows: 


small, crowded: 4 persons in 4 X 4 feet = 16 square 
feet =4 square feet per person (1.22 X 122 m= 
1.44 m? = .37 m? per person). 

large, crowded: 16 persons in 7 feet 6 inches X 9 feet 
2 inches = 68.7 square feet =4.3 square feet per 
person (2.25 m X 2.75 m= 6.18 m? = .39 m? per 
person). 

small, not crowded: 4 persons in 7 feet 6 inches X 
9 feet 2 inches = 68.7 square feet= 17.2 square 
feet per person (2.25 m X 2.75 m = 6.18 m?= 1.55 
m? per person). 

large, not crowded: 16 persons in 18X 16 feet = 
288 square feet =18 square feet per person (5.4 
m X 4.8 m= 25.92 m*=1.62 m? per person). 


The crowded conditions were really crowded—it 
was impossible to fit any more chairs in the room. 

These rooms were furnished with a lightly padded 
armless contour chair for each subject. The chairs 
were arranged with an eye to maximizing ease of 
group participation. In the large groups, 12 chairs 
were in an oval with 4 swivel chairs in the center. 
In small uncrowded groups, the 4 chairs were in an 
oval. In the small crowded groups, two chairs faced 
the small writing shelf, and two chairs were immedi- 
ately behind these. It was noted that the subjects 
in this condition tended to initiate a circular ar- 
rangement by turning in their seats and in some cases 
sitting on the writing shelf. 


Procedure 


Upon arrival in the experimental room, the sub- 
jects were instructed to seat themselves in the pre- 
arranged chairs. On each chair were a pencil and 
legal sized pad of paper which served as a lapboard 
and a container for all the material needed for the 
experiment. On the top of the pad there was a 
subject identification letter. In the mixed-sex groups, 
males and females were told to sit in every other 
chair. After clipping on the identifying letters, the 
subjects read the instructions which outlined the 
main task and procedure: 


There are several parts to this study of group 
problem solving. Your main task will be to com- 


1 Dabbs (personal communication, October, 1973) 
has pointed out the interesting problem with ma- 
nipulating size and density independently. If we can 
consider each member in an uncrowded condition 
to be contributing, say, 12 square feet (1.08 m?) 
of his space to the “common space,” then the 
small uncrowded group has 48 square feet (4.32 m?) 
of common space, but the large uncrowded group 
has 192 square feet (17.28 m?) of common space. 
However, if this had made a difference in our study, 
we would have obtained a group size by density 
interaction. The Group Size X Density interaction 
was not obtained (see Table 1), so we cannot con- 
clude that “uncrowded” meant different things 
between small and large groups. 
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bine phrases into paragraphs. Full instructions for 
this follow in the booklet underneath this sheet. 

You are to work together as a group on the: 
paragraphs until you are interrupted by the experi- 
menter, who will signal an end to this izsk by 
knocking on the door. Do not open the dvor, but 
please indicate that the message has been r ved 
by knocking back. 

At this time, put your paragraphs 
the back of your pads take out a qu 
(Do not remove or look at the items b: 
The questionnaire is to be filled out inc 
with no discussion or interaction. Work 
own speed and be sure to answer ever 
When you have finished the questionnaire 
it in the back of your pads. 

When all of the members of the group havis 
completed the individual task, Member is to 
remove an additional sheet from the back oi his or 
her pad. The sheet has a group task whi you 
are all to work on together, until you are told to 
stop. 


From 


After all questions about the procedure were an- 
swered, the door to the room was shut, and the 
group proceeded to form paragraphs. (The rath 
complex instructions described above were 
faithfully.) After 14 hours the group wa 
to stop what they were doing and to sta: 
out the questionnaire that contained the dc «tf 
variables. í 
The decision tò measure the dependen! 
after only 14 hours of the scheduled 24-hoi 
was based on two factors noted in a pilot study. 
First, as the 24-hour mark was reached, there was 
a noticeable lightening of the subject’s mood, and 
it was feared that dependent variables i 
at this time would measure feelings associ ith 
the termination of the experiment rather ‘ e 
overall group experience. Second, subjects were fa- 
miliar with both the task and the situation afte: ` 14 
hours, and involvement appeared to be ma: i 
then. 
When all of the members of the group 
pleted the questionnaire, they proceed 
general instructions, to the second filler t ke 


ing of uses for common objects. After ap; rately 
15 minutes of this task, the experimenter asked 
those subjects who had indicated on thei question- 


naire that they would like to do a task alone rather 
than with the group to leave the room. These sub- 
jects did one more common usage item in individual 
cubicles, while the remaining members worked on 
the same item as a group. Then, the experiment was N 
terminated, and after debriefing, the subjects were 
dismissed. Thus, the sequence of events was as 
follows: the subjects spent 90 minutes on a para- 
graph construction task (see below), responded to 
the questions that measured the dependent variables, 
and spent 15 minutes doing an object usage task, 
those who had indicated a preference for working 
alone were separated, all subjects did one more 
object usage task, and then they were debriefed. 


Main Task 


The main task for the groups was to reconstruct 
paragraphs from their component phrases. A number 
of paragraphs had been previously divided into 16 
phrases. These phrases had been randomly ordered 
and put into booklets for groups of 4 and groups of 
16. Each subject received a booklet containing 
either one phrase (if he were in a large group) or 
four phrases (if he were in a small group) for each 
paragraph, answer sheets, and the following instruc- 
tions: 


This is an experiment on group problem solving. 
Your task will be to combine phrases into logical 
paragraphs. Each member of the group will have 
four (or one) phrases from each paragraph. There 

are sixteen phrases in each paragraph. You are to 

gather all the phrases from the other members 

and, working together, order these phrases into a 

logical paragraph. These phrases were taken from 
actual articles and when they are in the correct 
order they make sense. When the group as a 
whole has agreed on a correct ordering write the 
complete paragraph on the first answer sheet and 
go on to the next paragraph. Continue forming 
paragraphs in this way until you are signaled to 
stop. This is a group task so you should help each 
other and interact in any way that makes the 
task solving easier. 


Typical phrases were “moved it from East Hamp- 
ton’s main street” and “fireplaces of ancient brick, 
handhewn beams.” This group task combined two 
features: It could be repeated with new paragraphs 
for an indefinite period of time, and it required 
participation of all the members. The task accomp- 
lished its main purpose: It stimulated interaction 
and got the members involved with each other. 


Dependent Variables 


The dependent variables in the questionnaire in- 
cluded the following: 


1. Interpersonal attraction: Each member rated 
every other member on “degree of liking” on a 
7-point scale from 1 (like very much) to 7 (dislike 
very much) and on “degree of enjoying working 
with” on a 7-point scale from 1 (very much dislike 
working with) to 7 (very much enjoy working 
with). These ratings were summed (after the first 
one was reversed) and averaged, giving an index of 
interpersonal attraction from each subject. These 
two measures are similar to the attraction items on 
Byrne’s (1969) Interpersonal Judgment Scale and 
are well validated as attraction indicators, 

2. Evaluation of the group as a whole: Each sub- 
ject rated their liking for the group as a whole on a 
7-point scale. 

3. Feeling in general: Six scales used to assess 
positiveness of feeling were bad-good, comfort-dis- 
comfort, high-low, sad-happy, pleasant-unpleasant, 
and positive-negative. Replicating the procedure used 
by Griffitt and Veitch (1971), these scales were 
summed to yield an overall evaluative measure. 
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4. Desire to remain a member of the group: Each 
subject indicated whether he would like to do an 
additional task alone or with the group.? Variables 
1, 2, and 4 have been used to measure group cohe- 
siveness in past studies (Cartwright, 1968). 


RESULTS 


The design for this study was a 2 X 2 x 
2 2 factorial analysis of variance (Winer, 
1972, pp. 417-418). The unit of analysis was 
the individual response.* All four factors 
group size, density, group composition, and 
sex) were analyzed as between-groups ef- 
fects. The effect of these factors on each of 
the four dependent variables was analyzed 
separately (see Table 1). The results will be 
described by factors looking at each signifi- 
cant effect across all the dependent variables 
affected by it. 


Effects of Group Size 


As predicted, there was a main effect of 
group size on attraction toward the other 
members: Members were more attracted to 
other members in small groups than in large 
groups. This was modified by a significant 
Group Size X Sex interaction which showed 
that the tendency to like other members more 
in small than in large groups was more pro- 
nounced for males (small = 106.2, large = 
94.9) than for females (small = 103.5, large 
= 102.8).* 


* Researchers often fail to do a more informative 
analysis because of fear of violating some assump- 
tion of the analysis. In this case we have a dichoto- 
mous score on a variable with an underlying con- 
tinuous distribution (desire to remain with the 
group). Box (1953, 1954), Young and Veldman 
(1963), and others have shown that the analysis of 
variance is robust enough that wide deviations from 
these assumptions have little material effect on the 
significance tests, and the slight effects that may 
occur are in the direction of reducing F values (ie, 
they are conservative errors). 

3 We are aware that some inflation of significance 
levels is possible because of reduced variability of 
response within groups. However, separate from the 
prohibitive increase in sample size required and 
problems with comparing means based on different 
size groups, the fact that the crucial interactions 
between sex of respondent and sexual mixture of 
the group could not be tested dictated that the 
analysis be made using the individual subject as 
the unit of analysis rather than the group. 

* Attraction ratings toward other members were 
reversed and standardized (M = 50; SD=10), 
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TABLE 1 
ANALYSIS OF VARIANCE OF DEPENDENT VARIABLES 
Feel i 
Feel Remain 
Source toward | toward | temes? | with 
didual | 800P group 
Group size (A) 7.85 | <1 3.52" <1 
Density (B) 2.26 <1 5.61% <i 
Composition (C) | 4.28% | 4.78% | 18.8011 3.43* 
Sex (D) 1.41 <1 1.30 <1 
AXB <1 <i <1 <1 
AXC <i | 7.068%" | 1.81 4.55 
AXD 5.96** <1 <1 1.27 
BXC <1 <1 <1 <1 
BXD 5.49%* | 1.31 1 <1 
CXD <i | 7.11" | 3.62* <i 
AXBXC <1 <1 <1 <i 
A XB XD 1.61 1.45 2.63 1.15 
AXxCXD 1.80 3.75* 7.08" | 1.27 
BXCXD 4.99% <1 1.59 <1 
AXBXCXD <1 <i <1 1,37 


Note. Degrees of freedom are 1, 268. 
*p < 10. 
wD < 05. 


wD <0L, 
sk p < 001. 


Group size interacted with composition on 
two variables. First of all, in same-sex groups 
the expected greater desire to stay with the 
group appeared on the part of members of 
small groups as compared to large groups. 
However, there was a surprising reversal in 
mixed-sex groups, with more members of 
large groups wanting to stay than members 
of small groups. The same effect occurred in 
the subjects’ feelings toward the group as a 
whole. This intriguing reversal in mixed-sex 
groups can be examined in more detail by 
looking at the significant Group Size X 
Composition X Sex interaction. As Figure 1 
indicates, females liked the group when it 
was small if it was of the same sex. But if it 
was of mixed sex they liked the group if it 
was large. Males liked small and large groups 
about equally. 

The Group Size Composition X Sex in- 
teraction with feelings in general (see Figure 
2) indicates that they are very similar to 
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Ficure 2. Feelings in general as a function of group 
size, composition, and sex. 


feelings toward the group. Basically, general 
feelings were affected by size differences for 
females in mixed-sex groups. A 

In summary, group size appeared to have 
a moderate impact on the participants espe- 
cially as modifed by whether the group had 
only one sex or mixed the two sexes. 


Effects of Density 


Regarding feelings toward individuals, a 
Density X Sex interaction was qualified by 
the Density X Composition X Sex interaction; 


therefore we shall just consider this three, ~ 


way interaction (see Figure 3). When persons 
were in mixed-sex groups, both males and 
females tended to like the other members 
more when crowded than when not crowded. 
But when they were with only their own sex, 
males and females reacted very differently 
from each other and in opposition to findings 
from previous research on density. With their 
own sex, women did not like the other mem- 
bers as much when they were crowded as 
when they had more space. But men with 
their own sex liked each other more when 
they were crowded than when they had more 
space. 

The only other dependent variable that was 
affected was the general positiveness of the 
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Ficure 1. Feelings toward the group as a function 
of group size, composition, and sex. 


Ficure 3. Feelings toward individuals as a function 


of density, composition, and sex. 
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members’ feelings (main effect). It was found 
that, as predicted, general feelings were more 
negative in the crowded rooms than in those 
where people had space. In summary, density 
seemed to have a relatively anemic effect on 
participants’ feelings and reactions in this 
study. 


Effects of Composition of the Group and 
Sex of Subject 


Whether groups were of the same sex or 
mixed sex affected participants’ reactions in 


-the same direction on all four dependent 


variables. In mixed-sex groups members were 


. more attracted to the others there, felt more 


positively toward the group, had more posi- 
tive feelings in general, and tended to remain 
with the group (p< .06) more than did 
members of same-sex groups. As already re- 
ported, more subjects decided to remain with 
the large groups if the groups were of mixed 
sex rather than same sex (Group Size X 
Composition). In small groups there were no 


` differences between same-sex and mixed-sex 


groups. 

Composition of the group was involved in 
all three of the interactions we have already 
discussed (see Figures 1, 2, and 3). The main 
cause of the interactions was due to the fact 
that the sexes reacted differentially to sexual 
composition. Males show a clear main effect 
response to composition: They prefer mixed- 
sex groups. Females have a complex reaction 
to composition depending on other factors. 
Regarding the effect of group size (see Fig- 
ures 1 and 2), females have positive responses 
in mixed-sex groups if they are large and they 
have positive responses to same sex groups if 
they are small. Regarding density (see Figure 
3), females have positive responses in mixed- 
sex groups if they are crowded and to same- 
sex groups if they are uncrowded. 

Thus, sexual composition was clearly an 
important factor in the evaluative reactions 
of the members, and sex was involved in a 
number of interactions even though it had 
no main effect on the participants. 


Discussion 


The purpose of this study was to examine 
the independent effects of density and group 
size on men and women as modified by the 
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sexual composition of the group. Density, 
regardless of group size, depressed general 
feelings of members of all groups. However, 
density did not affect feelings toward the 
group nor motivate behavior (the percentages 
leaving the crowded and uncrowded groups 
were 22% and 23%). If high density was a 
stressor, one would think that more people in 
crowded situations than those in uncrowded 
situations would have decided to leave the 
group and do the additional tasks alone. 
Therefore, it is concluded that amount of 
space per se was not perceived as a stressor, 
This conclusion is in line with our predictions 
and the results of other experiments (Freed- 
man, Klevansky, & Ehrlich, 1971; Ross et 
al., 1973; Stokols et al., 1973). 

We did not replicate the findings of Freed- 
man et al., Ross et al., and their colleagues 
that in same-sex groups females in crowded 
groups were more attracted to fellow group 
members, while males showed the opposite 
reaction. In fact, the results in our study were 
just the reverse, with females liking their co- 
members less when crowded and males mak- 
ing a more positive evaluation when crowded, 
Our findings are reminiscent of the absence 
of a Sex X Density effect in the article by 
Stokols et al. (1973), They also failed to 
find males being more negative toward their 
fellow group members in high-density than in 
low-density conditions, and they failed to find 
females more positive in high-density condi- 
tions, Stokols et al. used an experimental 
task that was similar to ours in that both 
were highly involving, novel, guaranteed par- 
ticipation for all, provided ample feedback 
to the members on the progress of the group 
in performing its task, and lasted for 14 
hours. If we compare these two studies where 
males were positive and females negative in 
high-density conditions with the Freedman 
and Ross studies in which males were nega- 
tively affected by density and females were 
not, we may be able to understand something 
about men and women, about being close to 
other people, and perhaps the effects of dif- 
ferent kinds of situations and tasks on reac- 
tions to being close. Stokols® has suggested 
the following line of reasoning to explain our 


5 Personal communication, October, 1973. 
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findings compared to those found by others. 
It is possible that early in interactions, fe- 
males are more amenable to proximity with 
same-sex strangers, which is in keeping with 
the greater freedom given to women to be 
more intimate and dependent than males. 
Males, on the other hand, could be expected 
to be more concerned with maintaining their 
territory, asserting their independence, and 
keeping a safe distance between themselves 
and others. However it is proposed that dur- 
ing the later stages of interaction, after the 
males have worked on a common task, their 
initial feeling of competitiveness and hostility 
gives way to a “we” feeling and a team spirit 
that could well make the density positive (as 
in a bar sing or on an athletic team). The 
present study went much longer than the one 
by Ross et al. (1973) (14 hours vs. 5 or 20 
minutes), and it used a task that was more 
involved and required more cooperation than 
that used by Freedman et al. (1972). It is 
proposed that the combination of the longer 
time plus the team-type task that required 
members to organize themselves for effective 
action made achievement-related performance 
the salient dimension for evaluation of co- 
members. Horner (1972a, 1972b) has dem- 
onstrated that women are ambivalent about 
their own achievement, success, and leader- 
ship/dominance; they are also ambivalent (at 
best) about the ability of other women 
(Goldberg, 1968). So the kind of behavior 
required by our task—combining phrases as 
a group—over time both drew the dense con- 
dition males together by restructuring the 
close proximity to mean “team” and drove 
the females apart by depriving them of a 
warm, intimate setting and substituting a 
structure that required a focus on leadership 
and achievement, making the close proximity 
uncomfortable for them. 

Turning to group size, we found it to be a 
more potent factor than density. In same-sex 
groups this study replicated the classical size— 
cohesiveness finding and supported our gen- 
eral hypothesis that small size groups are more 
cohesive. Members of small groups had more 
positive feelings toward the group and the 
other members than members of large groups. 
Further, among same-sex groups, more people 
in small groups (80%) chose to remain with 
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the group than in large groups (69%). How- 
ever, when we put two sexes in the same 
group, this “usual,” “classic,” “traditional” 
relationship between size and cohesiveness 
disappeared with males and was reversed for 
the female members. When both sexes were 
together, more (92%) stayed in the large 
groups compared to only 72% staying in the 
small groups. One possible explanation of this 
effect is that if we assume that some people 
enjoy being with people of their own sex and 
other people enjoy being with the opposite 
sex, then we are likely to have fewer discon- 
tented people in mixed-sex than same-sex 
groups because mixed-sex groups (especially 
large ones) provide a broad selection of both 
same-sex and opposite-sex people. The repli- 
cation of the classic size-cohesiveness rela- 
tionship with same-sex groups coupled with 
the reversal of the relationship with mixed- 
sex groups is another reminder that research 
on one-sex groups should be generalized at the 
investigator’s peril. . 
A consistent pattern regarding sex of sub-“ 
ject emerged. As can be seen in Figures 1, 2, 
and 3, men appear to be sensitive primarily to 
group composition, whereas women seem to be 
sensitive to all combinations of the indepen- 
dent variables. This could be seen as an ex- 
ample of the greater sensitivity of women to 
environmental cues than men (Silverman, 
1970). Specifically, males responded more 


positively in mixed-sex groups than in same- ) 


sex groups, while females varied in response 
to composition depending on group size. Males 
felt the same in large and small mixed-sex 
groups and liked the groups equally. It ap- 
peared as if the males felt that “There is noth- 
ing like a dame, nothing in the world,” 
whereas females in mixed-sex groups made 
sharp distinctions between large and small 
groups. In the large groups, they too experi- 
enced the “high” produced by the presence of 
both sexes, but in small mixed-sex groups 
their feelings in general and feelings toward 
the group were not nearly as positive. ‘This 
drop in feelings in small mixed-sex groups 
may be interpreted in the light of Kuethe and 
Stricker’s (1963) social schemata. Using the 
felt board technique with two male and two 
female figures, they found that females tended 
to form separate male-female subgroups, 
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while males placed the figures all together in 
one group. If this common female social 
schemata was employed by the females in 
their perception of these small groups (which 
were made up of two males and two females), 
it is possible that the situation was too inti- 
mate for comfort because the range of choices 
for a pairing partner were low and vulnera- 
bility to unwanted advances high. 

It should be noted that if we had used just 
same-sex groups and had not analyzed for sex 
differences (not uncommon in psychological 
research) (Dan & Beekman, 1972; Holmes & 
Jorgensen, 1971), we would have received full 
support for our general hypothesis that large 
group size, regardless of space, would cause 
people to be unhappy and want to leave the 
group. The mixing of the sexes altered males’ 
evaluations and behavior; females were af- 
fected by combinations of the factors. 


A Posteriori Integration Based on an 
Extension of Deindividuation Theory 


The findings in this study and the situa- 
tion used to generate them indicate the use- 
fulness of deindividuation as an explanation 
of the data. 

The theory. When people are in large or 
crowded groups, they tend to lose individual 
visibility and responsibility for events. This 
experience has been termed deindividuation 
and has been demonstrated in a number of 
ways such as putting hoods over people’s 
faces, putting them in darkened rooms, lab 
coats, or shapeless gray coats, having them 
be anonymous, and various other techniques 
that reduce the extent to which a person 
stands out in a group. The basic finding has 
been that under these conditions there is a 
greater manifestation of usually censored 
words, thoughts, and behavior (Cannavale, 
Scarr, & Pepitone, 1970; Festinger, Pepitone, 
& Newcomb, 1952; Gergen, Gergen, & Barton, 
1973; Singer, Brush, & Lublin, 1965; Zim- 
bardo, 1969, Note 1). The basic idea, going 
back to LeBon (1896), is that people lose 
their restraints and do things that they would 
not do otherwise partly as a result of the 
relative anonymity of a crowd. Zimbardo 
(Note 1) depicts the determinants of deindi- 
yviduation as follows. There are a number of 
kinds of input variables that can lead to de- 
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individuation; the most important (for our 
purposes) are anonymity, shared responsibil- 
ity, group size, arousal, and sensory input 
overload. These events cause a weakening of 
controls based upon guilt, fear, etc., which in 
turn leads to output behaviors characterized 
by hyperresponsiveness—‘contagious plastic- 
ity” to the behavior of proximal, active oth- 
ers and low responsivity to distal reference 
groups. The person also experiences a greater 
liking for the group or situation associated 
with the “released” behavior. 

An extension of deindividuation theory. The 
vast majority of research and discussion of 
deindividuation has focused on the hostile, 
antisocial, destructive motives. Zimbardo 
(1969) describes everything from shocking 
victims, destroying cars abandoned on the 
streets of New York City and Palo Alto, 
lynching people, beating up people, looting, 
and so on. However, the theory also applies 
to the more positive, tender, loving, joyful, 
playful, foolish motives and behaviors. People 
cheer more enthusiastically when in large 
crowds; encounter groups are able to make a 
person be more loving, supportive, and tactile; 
when people are in the dark, they touch and 
hug more (Gergen et al., 1973); and a per- 
son can dance with more energy, enthusiasm, 
and abandon with a group than if he were 
alone (Radcliffe-Brown, 1948). 

One necessary condition is that a motive 
must be inhibited or restrained in the first 
place if it is to manifest the effect of deindi- 
viduation. It follows from this second point 
that manifestation of the effects of deindi- 
viduation are not found, or are much less 
marked, with aggressive people, with naturally 
exuberant people, with spontaneous people, in 
short, with people who have a low level of 
self-consciousness and self-restraint. 

Different deindividuation settings act as 
constraint releasers for different motives. A 
group of men outside mumbling about some 
violation of fair treatment or sacred norm is 
not likely to release affiliative behavior in its 
members; men and women milling around at 
a party in a darkened room are not likely to 
release loud and exuberant or aggressive be- 
havior. Thus “settings deindividuate selec- 
tively,” and they indicate what is an appro- 
priate use of the freedom they foster. 
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Different kinds of groups have different 
kinds of attractions: Large groups have the 
attractive characteristics of releasing con- 
straints, providing freedom, and reducing inhi- 
bitions; small groups facilitate the partici- 
pant meeting his needs for active participa- 
tion, visibility, uniqueness, and leadership. 
These different characteristics of groups some- 
times work at cross-purposes; that is, a per- 
son may find that lack of opportunity to par- 
ticipate reduces his satisfaction with the 
group (Heslin & Dunphy, 1964) to such a 
low point that any pleasant feelings from 
deindividuation (Festinger et al., 1952) are 
not able to compensate him. 

Application to this study. It follows from 
the extension of deindividuation theory out- 
lined above that if women have less social 
permission than men to express affection in 
public toward people they do not know well, 
then they should find the experience of being 
in mixed-sex groups to be both arousing and 
conflict producing. Therefore, when they are 
in mixed-sex groups, that are large and/or 
dense, they should feel aroused, positive, and 
they should feel attracted toward the other 
members. 

We found that for women in 
groups, large groups were more attractive 
than small groups, supporting the idea that 
deindividuation was occurring when the sexes 
were together. This notion is further sup- 
ported by the fact that when there were no 
men present, women preferred small groups 
to larger groups. It is as if the opportunity to 
be deindividuated was appreciated when the 
situation aroused denied desires or conflicted 
needs, but when such motives were not 
aroused (same-sex groups), deindividuation 
served no positive function that could counter- 
act the loss of individuality, visibility, and 
greater opportunity to participate that are 
afforded by smaller groups. 

The liking of both males and females for 
co-members when in crowded mixed-sex 
groups compared to more spacious grouping 
also supports the notion of people in mixed- 
sex groups finding them more attractive be- 
cause of deindividuation. When people are 
crowded together, intimacy is increased 
whether they like it or not (Hall, 1966), and 
our data indicate that when both sexes are 


mixed-sex 
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together they like each other better if they 
are really close together. ; 

The trend to integrate the sexes in work 
forces that were previously all male, as evi- 
denced in Navy recruitment and telephone 
company advertisements, points to the real 
need to examine the effects of sexual compo- 
sition of groups. In this study the mixed-sex 
groups were composed of equal numbers of 
each sex. It is very probable that different 
proportions of males and females would affect 
group members in altogether different ways: 
More research is needed to examine the effects 
of varying proportions of men and women & 
well as to test the extension of deindividua 
tion theory proposed here. 

In summary, the main findings were that 
crowdedness makes people less happy, con- 
tent, etc., but not necessarily more ready to 
leave the group. The most striking results 
involved the sexual composition of the 
groups. In general, everybody liked mixed- 
sex groups, especially males. Females liked 


those groups that had both males and females + 


in them if the groups were big or crowded. 


REFERENCE NOTE 


1. Zimbardo, P. G. Symposium on social and devel- 
opmental issues in moral research. Paper pre- 
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(low responsibility) conditions, and subjects expected either future interaction ae 


(high importance) or no future interaction (low importance) with the con- 
federate. Subsequent ratings of the confederate revealed that subjects became 
more positive toward her only in the no-choice-future-interaction condition; 
ratings in the other experimental conditions did not differ from control group 
ratings. Additionally, subjects in both no-choice conditions desired to meet the 
confederate more than subjects in the choice conditions. The results are inter- 
preted in terms of a process of rationalization designed to increase personal 
responsibility for important beneficial actions. 
theory and self-perception theory are discussed. 


Explaining the processes underlying atti- 
tude change following counterattitudinal be- 
havior has emerged as an intriguing puzzle 
with important practical applications. Recent 
work has clearly delineated two clusters of 
variables which appear crucial to an under- 
standing of the dynamics of forced compli- 
ance attitude change—variables relating to 
the attribution of responsiblity for a counter- 
attitudinal action (Bramel, 1968; Calder, 
Ross, & Insko, 1973; Collins & Hoyt, 1972; 
Cooper, 1971; Holt, Henley, & Collins, 1972; 
Schlenker, 1972; Tedeschi, Schlenker, & 
Bonoma, 1971) and variables relating to the 
personal and social consequences of the action 
(Aronson, 1968; Bramel, 1968; Calder et al., 
1973; Collins, 1969; Collins & Hoyt, 1972; 
Cooper & Worchel, 1970; Pepitone, 1966; 
Schlenker, 1972; Tedeschi et al., 1971). 
Reviews of the existing counterattitudinal be- 
havior literature have concluded that subse- 
quent attitude change occurs only when an 
individual feels personally responsible for the 
consequences of his actions and when the 

The data analyses were supported by the Univer- 
sity of Florida Computing Center. Requests for 
reprints should be sent to Barry R. Schlenker, 
Department of Psychology, University of Florida, 
Gainesville, Florida 32611. 
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Implications for dissonance 


actions produce negative consequences eithery 
by harming another person or by making the 
actor himself appear immoral, incompetent, 
or unattractive (Bramel, 1968; Calder et al., 
1973; Cooper & Worchel, 1970; Collins & 
Hoyt, 1972; Pepitone, 1966; Schlenker, 
1972). For example, subjects paid a small 
amount of money to convince a persuadable 
audience to hold an opinion which the sub- 
jects themselves believe is harmful, change 
their attitudes toward the advocated position 
(Nel, Helmreich, & Aronson, 1969). If sub- 
jects either receive a large amount of money 
for the counterattitudinal behavior or if their 
action will not harm the audience because the 
latter is described as nonpersuadable, no atti- 
tude change occurs. Similarly, subjects who 
have told an experimental confederate that 
an actually boring experimental task is quite 
enjoyable become more favorable toward the 
task when (a) they have been offered mini- 
mal compensation for lying and the choice 
about whether or not to do it and (b) the 
confederate believes the lie and is harmed 
(Calder et al., 1973; Cooper & Worchel, 1970). 
If subjects are given a large incentive or no 
choice about performing the behavior or if 
the confederate is not convinced and not 
harmed, no attitude change occurs. It has 
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been assumed that subjects feel responsible 
for the consequences of their actions when 
they agree to perform them for minimal 
compensation or are given decision freedom. 

The above data suggest that consistency 
pressures per se are not a powerful motive 
force for attitude changes. Inconsistent actions 
undertaken without personal responsibility 
appear to be particularly nontroublesome. 
Consequently, recent views of the counter- 
attitudinal advocacy literature have begun to 
stress factors other than mere consistency 
motives which could account for observed 
attitude changes. Pepitone (1966) suggests 
that such attitude changes are protective ac- 
tions designed to reduce the socially unpleas- 
ant consequences of subjects’ behaviors. He 
questions not only the existence of a “motive 
force for consistency,” but its theoretical 
value as well, since the concept is superfluous 
and also is a throwback to earlier tension- 
reduction principles which have outlived their 
usefulness in other areas of psychology. 
Zimbardo (1969) has viewed such attitude 
change as a type of face-saving in which 
individuals justify their behaviors to them- 
selves and others in order to maintain the 
perception of their own competence. Tedeschi, 
Schlenker, and Bonoma (1971) have sug- 
gested that such attitude change is the result 
of impression management tactics employed 
by subjects to convey the image of the sub- 
ject as a rational decision maker, Bramel 
(1968, p. 365) has stated that 


dissonance is a feeling of personal unworthiness (a 
type of anxiety) traceable to rejection of oneself by 
other people either in the present or in the past. 
Any information which implies that one is incompe- 
tent or immoral arouses dissonance. The reason dis- 
sonance is greatest when the person feels personally 
responsible for his behavior is that rejection by 
other people is usually greatest when they believe the 
person voluntarily acted in an inappropriate way. 


These converging lines of thought stress the 
functional nature of attitude changes which 
follow counterattitudinal behavior and mini- 
mize or disregard the position that such atti- 
tude change is primarily or even in part 
motivated by a need for cognitive consistency. 

From a functional perspective, attitude 
change following counterattitudinal behavior 
represents a rationalization technique de- 
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signed to protect or bolster the individual’s 
self-image and public image (Schlenker, 
1973). The greater the degree of personal re- 
sponsibility an individual perceives for an 
action which generates negative consequences, 
the greater his degree of personal dissatisfac- 
tion, anxiety, and attendant social punish- 
ment. Guilt and anxiety are concomitants of 
immoral behavior, while embarrassment and 
anxiety typically accompany behaviors which 
violate social norms or signify that the actor 
is incompetent. Also, observers tend to attrib- 
ute negative personality traits to an actor 
who behaves inappropriately, and these attri- 
butions can affect the future behavior of the 
observers toward the actor. These intrinsic 
and extrinsic punishments associated with pos- 
sessing responsibility for negative actions 
should lead to rationalization attempts de- 
signed to protect the individual’s self-image 
and public image and reduce the anxiety, 
embarrassment, and social punishment. 
Several rationalization tactics are possible. 
A person could alter his attitude toward the 
injured party to justify the inflicted harm, 
However, more socially desirable alternatives 
would include either (a) reducing personal 
responsibility or (b) minimizing the amount 
or existence of harm. The person could do 
the former by pointing to external pressures 
which constrained the behavior (e.g., no 
decision freedom) or by showing that the 
consequences were accidental or completely 
unforeseeable. Minimizing harm could be ac- 
complished by changing his opinion of the 
consequences, for example, “I didn’t lie about 
the task, since I really did believe that it 
was interesting and enjoyable as I said be- 
fore” (rather than boring and uninteresting as 
was the case). The greater the personal re- 
sponsibility which could be attributed to the 
individual through situational cues and the 
more negative the consequences of the action, 
the greater the amount of rationalization re- 
quired. Thus, if the subject was told by an 
experimenter that he must make a counter- 
attitudinal speech which would persuade some 
unsuspecting individual, he has a viable es- 
cape from his dilemma simply by pointing to 
the experimenter’s pressure. However, if free- 
dom of choice were emphasized by the experi- 
menter, an alternative rationalization tactic 
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(i.e., attitude change in the direction of the 
speech) would be necessitated. 

According to the self-image maintenance 
and enhancement explanation briefly outlined 
above, attitude changes which follow counter- 
attitudinal behaviors are not the consequence 
of cognitive inconsistencies (Festinger, 1957) 
or the passive process of attributing one’s 
attitude from one’s behavior on the basis of 
situational cues (Bem, 1972). Rather, the ex- 
perimental paradigms designed to assess re- 
actions to counterattitudinal behaviors have 
placed individuals in positions where their 
self-image and public image is jeopardized, 
and subsequent changes represent rationaliza- 
tion techniques designed to reduce respon- 
sibility for the negative actions, justify the 
actions, or change perceptions of the action. 
From this perspective, attitudinal discrepancy 
is not a necessary antecedent to attitude 
change. The same rationalization techniques 
will occur whenever a person behaves in a 
socially undesirable fashion and can be held 
accountable. If attitudinal discrepancy plays 
any role in the process, it is probable that it 
makes the socially undesirable consequences 
of a counterattitudinal action even more 
salient to the individual, since it means that 
he has been taught to avoid them, and hence 
makes rationalization even more necessary. 

The focus upon the negative versus neutral 
consequences which result from counteratti- 
tudinal behaviors has obscured a potentially 
parallel process which could affect rationaliza- 
tion and attitude change. While negative 
actions generate anxiety and dissatisfaction, 
positive actions produce personal satisfaction, 
pride, and social rewards. Positive conse- 
quences will be defined as those which are 
considered socially desirable, which help 
others, and/or which make the actor appear 
competent, sagacious, attractive, or altruistic. 
Following actions which generate positive 
consequences, the actor typically experiences 
satisfaction and pride as well as the potential 
for present and future rewards mediated by 
impressed or grateful observers. It is hypothe- 
sized that an individual will attempt to in- 
crease his perceived responsibility for such 
positive actions. He will attempt to make it 
clear that his positive actions were not acci- 
dental, were clearly intended and freely 
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chosen, were not under the control of extern 
forces, and contained few negative compo- 
nents. The more important the positive co 
sequences are, the more concerned the in 
vidual will be to insure that he or she i 
perceived as being responsible for the action 
and that the action is perceived as positive 
When situational pressures which could be 
perceived as external, nonpersonal causes for 
a positive action are absent or minimal, an 
individual can assign responsibility to himself 
and reap the associated rewards. However, 
when situational constraints are present which 
decrease personal responsibility, the individ- 
ual may act to minimize the situational fac- 
tors and rationalize his actions to increase 
personal responsibility. It is therefore hy- 
pothesized that rationalization and self- 
presentational tactics designed to secure re- 
sponsibility for the beneficial consequences of 
actions are directly related to the presence of 
situational constraints and external justifica- 
tions which minimize personal responsibility 
or detract from the positivity of the action./ 
Subjects in the present experiment per- 
formed a counterattitudinal yet largely posi- 
tive action. Each subject delivered an ex- 
tremely favorable interpersonal evaluation to 
a confederate who actually was rather ordi- 
nary and dull. Subjects were given either a 
choice or no choice about reading the con- 
trived evaluation and hence possessed some 
or no personal responsibility for their bene- 
ficial action. Additionally, half the subjects 
believed that they would never meet the other 
person and that the latter would not discover 
that the evaluation was bogus. The other half 
of the subjects expected a meeting in which 
the other person would discover that the 
evaluation was bogus and delivered under 
choice or no-choice conditions. Following 
delivery of the evaluation, subjects rated the 
attractiveness of the confederate and how 
much they wanted to meet the confederate. 
It was hypothesized that subjects in the 
no-choice conditions, whose beneficial action 
could not be attributed to them but rather to 
the demands of the experimenter, would de- 
sire to increase their personality responsibil- 
ity and hence obtain personal credit for the 
beneficial action. One method of increasing 
their perceived responsibility and making 
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their prior coerced evaluation appear credible 
would be to go overboard in their subsequent 
evaluations of the other person. If they pri- 
vately evaluated the other person quite fa- 
vorably, then their actions become a reason- 
able manifestation of their private beliefs, 
and their personal responsibility for the ac- 
ion would increase. Subjects in the choice 
conditions, on the other hand, already pos- 
sessed some responsibility for delivering the 
avorable evaluation; their freedom of choice 
made it relatively clear that they were per- 
sonally responsible. Knowledge of respon- 
sibility would have made it unnecessary for 
them to have to significantly distort their 
eelings toward the other person in order to 
increase their own responsibility, and they 
could remain content with themselves without 
having to employ rationalization. 

Whether or not subjects expected to meet 
their beneficiary should have affected both 
the importance of the evaluation and the pos- 
sibility of obtaining personal credit for the 
evaluation. When no meeting was anticipated, 
anonymity was assured, and the evaluation 
was not revealed to be bogus, no personal 
credit was possible from the beneficiary and 
the importance of feeling personally respon- 
sible would have been minimized. When a 
meeting was anticipated, the importance of 
the action and the potential for personal 
credit and responsibility should have been 
maximized. It was therefore hypothesized that 
subjects would increase their private evalua- 
tions of the other person more when they had 
no choice rather than a choice about deliver- 
ing the evaluation, and that the effect would 
be heightened when future interaction was 
anticipated, and minimized or eliminated 
when no future interaction was anticipated. 

Different hypotheses would be derived from 
either dissonance theory or self-perception 
theory. Consideration of these alternatives 
will be postponed until the discussion of the 
results. 


METHOD 
Subjects 


Fifty-two ‘female introductory psychology students 
partially fulfilled a course requirement through par- 
ticipation. Two subjects (one in the control condi- 
tion and one in the choice-future-interaction condi- 


tion) didn’t believe they were actually participating 
with another student and their data were discarded 
prior to the analyses. The remaining 50 subjects 
were randomly assigned to the five conditions of the 
experiment prior to their arrival at the laboratory. 


Procedure 


Each subject was seated at a table next to a 
microphone and loud speaker which comprised part 
of an intercom system, The female experimenter 
described the study as an investigation of “how 
people form impressions of others and how people 
react when they learn exactly what kind of impres- 
sion they have made on others.” Each subject was 
informed that she was participating with another girl 
of about the same age who was seated in an adjoin- 
ing similar room, The experimenter explained that 
to avoid confounding physical appearances with 
other types of information about others, precautions 
were taken to insure that each participant arrived 
at a separate room, would not see the other during 
the experiment, and could communicate only via an 
intercom system hooked up between the rooms. Sub- 
jects were told that each person would have a sepa- 
rate role in the experiment. One person would be in- 
terviewed by the experimenter while the second per- 
son listened. The second person would form an 
impression of the first person based upon what she 
heard during the interview and then deliver a per- 
sonal evaluation based upon that impression. ‘Then 
the reactions of the interviewee to the evaluation 
would be assessed, The experimenter explained that 
she had flipped a coin prior to the subject’s arrival 
and that the subject was assigned to the position of 
the person who formed an impression while the 
other girl was assigned to the position of the person 
who received the evaluation. The experimenter then 
described the following sequence of ensuing events, 
First, the experimenter would go over to the other 
girl’s room and explain the procedures to her. Next, 
she would ask the other girl some standardized ques- 
tions about the latter’s attitudes, interests, and activ- 
ities. The questions were described as very general 
ones which would ask for the type of information 
which is often brought out at parties or other social 
gatherings. Subjects were instructed to listen carefully 
to the other girl’s answers and to form an impression 
of her based upon what transpired during the inter- 
view. At the conclusion of the interview, the other 
girl would be asked to fill out some questionnaires 
and the experimenter would return to get the sub- 
ject’s impressions and to have the subject deliver 
an evaluation. Finally, the experimenter would assess 
the other girl’s reactions to the evaluation. 

After explaining the rationale for the study, the 
experimenter left the room and turned on a tape- 
recorded dialogue ostensibly between the experi- 
menter and the other girl. The tape recording was 
designed to portray the simulated subject as a dull 
college student who made average grades in school 
and who had few interests, goals, or hobbies. The 
simulated subject paused incessantly and punctuated 
her speech with numerous “uhs” and “ahs,” 
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Following the simulated interview, the experi- 
enter returned to the subject’s room, turned off 
ie intercom system, and explained that while some 
eople would receive the actual evaluations written 
y the evaluator, others would receive one of two 
tandard evaluations which were prepared in advance 
yy the experimenter. The procedure was justified by 
tating that we all know that some individuals re- 
celve very favorable or very unfavorable evalua- 
Hons from others in their everyday interactions. 
Consequently, the investigators wanted to insure 
that some extreme evaluations were included in the 
study so that reactions to these could be assessed. 
This would make the results of the study more 
generalizable to real interactions. The subject and 
the other girl were said to have been assigned to 
one of the conditions which included such extreme 
evaluations. Each subject was informed that her 
actual impression and evaluation naturally was 
important and would be obtained later. 

Subjects in the no-choice conditions were told: 


Because we are requiring all the people who 
happen to be assigned to the extreme evaluations 
conditions to read one or the other evaluation, 
we felt that the only fair way to decide whether 
they read the favorable or unfavorable one would 
be on the basis of chance rather than giving them 
their choice. So we decided in advance that all 
even-numbered subjects would be required to read 
the favorable evaluation and all odd-numbered 
subjects would be required to read the unfavor- 
able evaluation. Since you two are the sixth pair 
in these conditions, you'll be reading the favorable 
evaluation. 


Subjects in the choice conditions were told: 


Because we are requiring all the people who 
happen to be assigned to the extreme evaluations 
conditions to read one or the other evaluation, 
we felt that the only fair way to decide whether 
they read the favorable or unfavorable evaluation 
would be to give them their choice. You may 
choose whether you would like to read the favor- 
able evaluation or the unfavorable evaluation to 
her. There is one thing, though; we already have 
several unfavorable evaluations and we need a few 
more favorable ones. If you would, Pd appreciate 
it if you read the favorable one. However, you 
do have your choice. Which one do you choose? 


All subjects in the choice conditions stated that they 
would read the favorable evaluation. 


The future-interaction manipulation immediately 
followed the choice—no-choice manipulation. In the 
future-interaction conditions, subjects were told: 


After we have completed the study, TIl introduce 
you both and I'll explain the whole procedure to 
her. She'll be informed that you heard her answers 
and formed an impression of her; however, the 
evaluation that you read was actually one which 
we prepared in advance. She’ll be told that you 
(could choose) (were assigned) the evaluation you 


read, since we had both extremely favorable and 
unfavorable ones available. Then we can talk 
about both of your reactions to the experiment, 
about the actual impression you formed, and 
about the way she reacted to the one you read 


In the no-future-interaction conditions, the experi- 
menter explained that: 


Although we’d like to introduce you both at the 
end of this study and explain all the details to 
her, we have found that we can’t do that and still 
continue to do this kind of research. We have to 
protect the jdentities of the people who are par- 
ticipating, and we've therefore taken every pre- 
caution to insure that neither of you will learn 
the other’s identity and she won’t know that this 
is a previously prepared evaluation which was not 
based upon her answers. 


While the subject looked over the favorable 
evaluation so that it could be delivered smoothly, 
the experimenter left the room to see if the other 
girl was “finished with the questionnaire.” The 
experimenter returned within a minute or two, stated 
that the other girl was finished, turned on the inter- 
com system, and had the subject read the favorable 
evaluation. The favorable evaluation consisted of 
the following: 


As I understand it, my job is to give you my 
honest impression of you on the basis of what 
I heard you talking about before. Overall, I must 
say that I was very favorably impressed. You 
gave some excellent answers to the questions, and 
you appeared quite calm in a situation which 
would have made most people very nervous. You 
handled yourself extremely well. You came across 
as intelligent, well-rounded, sincere, and likeable. 
You really struck me as one of the most interest- 
ing people whom I’ve met since I came to the 
University here. 

I would like to be able to throw in some nega- 
tive reactions to keep my evaluation balanced. 
However, I frankly wasn’t able to come up with 
anything. All of the adjectives that I can think 
of to describe you are pretty positive. I guess 
I could go on with this sort of thing, but I'll 
stop here. 


The subject then was handed a questionnaire 
which asked for her actual impressions of the other 
girl. The experimenter left the room, ostensibly for 
the purpose of giving a questionnaire to the other 
girl. The questionnaire asked subjects to rate the 
other girl on 12 15-point bipolar adjective scales. 
The adjective pairs included: good-bad, competent- 
incompetent, respected-not respected, warm-cold, 
sincere-insincere, not conceited-conceited, intelligent- 
unintelligent, well adjusted-not well adjusted, inter- 
esting—uninteresting, sociable-unsociable, popular~ 
unpopular, and likeable-not likeable. After the 
subject had completed the rating scales, the experi- 
menter picked up the first questionnaire and handed 
the subject two additional questionnaires. One asked 
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the subject to rate herself on the same bipolar 
adjective scales she had just employed to rate the 
other girl, and the other contained questions which 
asked the subject to rate on 15-point scales: (a) 
“Overall, how attractive as a person do you feel the 
other girls is?” (b) “How much would you enjoy 
meeting the other girl?” and (c) “How much choice 
did you feel you had in deciding which evaluation 
to deliver?” The self-ratings were included to deter- 
mine if subjects would differentially perceive them- 
selves negatively for having told an untruth or 
positively for having performed a “good deed.” 

After completing the questionnaires, subjects were 
thoroughly debriefed and informed of the actual 
purpose of the experiment and the reasons for the 
deceptions. 

Subjects in the control condition also heard the 
tape as part of an impression formation task 
ostensibly investigating how people form judgments 


- and how accurately these judgments correspond to 
' the self-ratings of the person being evaluated. Sub- 


jects were assigned to the role of the person who 
formed the impression while the other girl was said 
to be the evaluatee. Each subject then listened to 
the tape recording of the bogus interview and was 
asked to complete the same questionnaires filled out 
by the experimental groups (the choice-in-reading- 
evaluations question naturally was deleted). No men- 
tion was made of reading an evaluation to the other 
girl or meeting or not meeting the other girl at 
the conclusion of the study. Hence, the experiment 
was a 2X2 factorial design with a control group. 


RESULTS 
Manipulation Checks 


The independent variable manipulations 
produced the desired effects. Subjects in the 
choice conditions (Ẹ = 11.1) perceived a 
great deal of choice in determining which 
evaluation they wished to read, while subjects 
in the no-choice condition ( = 1.1) per- 
ceived no choice, F(1,36) = 137.20, p< 
.001. During debriefing, all subjects correctly 
recalled whether or not they would meet the 
other girl. 

As shown in Table 1, subjects in the con- 
trol condition perceived the other girl to be 
slightly above neutral on the adjective evalu- 
ations, neutral in overall attractiveness as a 
person, and would prefer not to meet the 
other girl. Hence, the depiction of an uninter- 
esting, average to dull student was achieved 
during the interview. 


Evaluations of the Other Girl 


The major dependent variable was the 
average of ratings of the other girl on the 
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TABLE 1 
RATINGS OF THE OTHER GIRL 
Dependent variable 
Condition 
Average | attrac | Would 
Mating | tiveness | mestaa 
Future interaction 
Choice 9.6 94 8.1 
(1.8) (2.1) (2.2) 
No choice 12.2 10,3 12.2 
(1.2) (1.1) (2.0) 
No future interaction 
Choice 10.6 8.2 6.8 
(1.7) (2.5) (2.7) 
No choice 10.0 9.0 10.9 
(2.6) (3.0) (3.0) 
Control 9.3 78 6.9 
(2.0) (1.9) (4.6) 


Note, Standard deviations are given in parentheses. 


12 different 15-point bipolar adjective scales. 
Means and standard deviations for each con- 
dition are presented in Table 1. Planned 
multiple comparison ¢ tests indicated that the 
only condition which differed reliably from 
the control group was the no-choice-future- 
interaction condition, ¢(45) = 2.89, p < .01. 
Subjects in the no-choice-future-interaction 
condition also rated the other girl more posi- 
tively than subjects in the no-choice—no- 
future-interaction condition, #(45) = 2.57, p 
< .02; the choice-future-interaction condi- 
tion, (45) = 3.02, p < .01; and the choice- 
no-future condition, £(45) = 1.82, p< .10. 
None of the other conditions differed reliably 
from one another (ps > .10).* 

Subjects also were asked to rate on a 15- 
point scale how attractive overall the simu- 
lated subject was as a person. Means and 
standard deviations for each condition are 
presented in Table 1. Planned ¢ tests again 
revealed that the only condition which differed 
significantly from the control group was the 
no-choice-future-interaction condition, £(45) 
= 3.73, p < 001. The no-choice-future-inter- 
action condition also differed significantly 
from the choice-no-future-interaction condi- 


1 Planned ¢ tests (two-tailed) were deemed more 
appropriate for testing the major hypotheses, For 
those interested in an overall test of the adjective 
ratings, a 2 X 2 analysis of variance revealed a sig- 
nificant interaction between choice and anticipated 
interaction, F(1, 36) =7.09, p<.02; neither of the 
main effects approached significance (ps > .10). 
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tion, ¢(45) = 3.33, p<.01. None of the 
other comparisons revealed significant differ- 
ences (ps > .10).? 


Enjoy Meeting 


Table 1 presents the means and standard 
deviations for each condition in response to 
the question, “How much would you enjoy 
meeting the other girl?” A 2 x 2 analysis of 
variance revealed that subjects in the no- 
choice conditions stated that they would en- 
joy meeting the other girl more than subjects 
in the choice conditions, F(1, 36) = 26.66, 
p < .001. A marginally significant effect of 
anticipated interaction also was obtained, 
F (1,36) = 2.68, p<.10. Subjects in the 
future-interaction conditions stated that they 
would enjoy meeting the other girl more than 
subjects in the no-future-interaction condi- 
tions. The interaction term did not approach 
significance (F < 1). Subjects in the control 
group felt that they would not enjoy meeting 
the other girl, and comparisons between 
means indicated that subjects in both no- 
choice conditions would have enjoyed meeting 
the other girl more than subjects in the con- 
trol group or in either of the choice condi- 
tions (ps < 01). No other comparisons were 
significant (ps > .30). 


Self-Ratings 


The mean self-ratings of subjects averaged 
across the 12 bipolar adjective scales were 
unaffected by either of the independent varia- 
ble manipulations or their interaction (ps > 
.20). However, 2 X 2 analyses of variance 
performed on each of the individual adjective 
pairs revealed between-conditions differences 
on the items respected — not respected and 
likeable- not likeable. Naturally, these dif- 
ferences should be interpreted with caution 
since they could represent chance occurrences 
on these post hoc analyses, 

The analysis of variance performed on the 
respected — not respected item revealed both 
a main effect of the choice variable, F(1, 36) 
= 4.69, p < .05, with subjects in the no- 


2 An overall test of the attractiveness ratings re- 
vealed a marginal main effect of choice, F(1, 36) 
=2.99, p< .10; neither the anticipated interaction 
main effect nor the interaction term approached 
significance (ps > 10). 
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TABLE 2 
SELF-RaATINGS 
Adjective 
Condition 
Respected Likable 
Future interaction 
Choice 10.5 10.8 
(1.8) (2.3) 
No choice 13.3 12.8 
(1.5) (1.8) 
No future interaction 
Choice 12.3 12.1 
(1.7) (2.0) 
No choice 11.7 11.4 
(1.4) (1.3) 
Control 11.6 12.1 
(2.9) (1.7) 


Note, Standard deviations are given in parentheses, 


choice conditions perceiving themselves as 
more respected than subjects in the choice 
conditions, and an interaction between the 
choice variable and the future-interaction 
variable, F(1,36) = 11.21, p< .005. The 
means and standard deviations for each con- 
dition are presented in Table 2. A Newman- 
Keuls test revealed that subjects in the no- 
choice-future-interaction condition perceived 
themselves as more respected than subjects 
in the choice—future-interaction condition (p 
< .05); no other contrasts were significant 
(ps > .10), and no conditions differed reli- 
ably from the control group (ps > .10). 

The 2 X 2 analysis of variance performed 
on the likeable—not likeable item revealed 
an interaction between the independent varia- 
bles, F(1,36) = 5.27, $ < .05, and no main 
effects (ps >.20). A Newman-Keuls test 
performed on the four experimental means 
and the control mean revealed only a marginal 
difference, as subjects in the no-choice-future- 
interaction condition tended to feel more like- 
able than subjects in the choice-future- 
interaction condition ($ < .10); no other 
comparisons between means were significant 
and none of the experimental groups differed 
from the control group (ps > .10). 


Discussion 


The results indicated that attitude changes 
will occur following counterattitudinal be- 
havior which produces important positive con- 
Sequences and that these changes are greatest 
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when situational pressures minimize the per- 
son’s responsibility for the beneficial behav- 
ior. As predicted, subjects in the no-choice— 
future-interaction condition subsequently eval- 
uated the other girl much more positively 
than did subjects in any of the other condi- 
tions, including the control group. The other 
experimental conditions did not show substan- 
tial evaluative increases over those exhibited 
by the control group. Additionally, subjects 
in the no-choice-future-interaction condition 
thought that they would enjoy meeting the 
other girl and that they themselves were 
yrather respected and likeable. The pattern of 
results for these subjects presents the picture 
of a group which went overboard to convince 
themselves that they did indeed mean what 
they had said in their coerced evaluation and 
were quite content with themselves for being 
given the opportunity to obtain responsibility 
and credit. 

Subjects in the no-choice conditions stated 
that they would enjoy meeting the other girl 
more than subjects in the choice or control 
conditions. It is reasonable to suppose that 
subjects in the no-choice conditions desired 
to meet the other girl so that they could in- 
crease personal responsibility and credit for 
their positive actions. In the absence of such 
an interaction, subjects in the no-choice con- 
ditions would not have been in a position to 
obtain credit for their beneficial action, and 

«achieving personal responsibility for the be- 
havior would have been relatively unimpor- 
tant. It was considered doubtful that sub- 
jects could have looked only to the experi- 
menter as a source of recognition, approval, 
and credit if they changed their attitudes 
toward the other girl: An extremely favorable 
private evaluation might have been interpreted 
as demonstrating the subjects’ unperceptive- 
ness or indiscriminateness since the other girl 
was portrayed as rather average. Hence, a 

t. meeting with the other girl provided subjects 
in the no-choice conditions their only oppor- 
tunity to receive external approval. Subjects 
in the choice conditions possessed responsi- 
bility for their behavior and could have re- 
mained satisfied with their actions in the 
absence of a face-to-face meeting. 

As an alternative explanation of the re- 
sults, it could be argued that the future inter- 
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action posed a threat to subjects in the no- 
choice conditions, These subjects might have 
anticipated an unpleasant and embarrassing 
confrontation with the other girl if they rated 
her unfavorably. Since subjects in the choice- 
future-interaction condition believed that the 
other girl would be told they chose their eval- 
uation, a comparable threat would be non- 
existent. Thus, subjects in the no-choice-fu- 
ture-interaction condition might have elevated 
their ratings of the other girl to avoid a po- 
tential threat rather than to secure responsi- 
bility for a good deed. While the interpersonal 
evaluation data is explained through this al- 
ternative, the data on how much subjects 
would have enjoyed meeting the other girl 
appear to contradict it. If perceptions of a 
threatening interaction existed, it would sug- 
gest that subjects in the no-choice conditions 
should have desired to meet the confederate 
less than subjects in the choice or control 
conditions. Since the reverse was true, it 
would seem to indicate a desire to get some- 
thing (i.e., credit) rather than an attempt to 
avoid something (i.e., an unpleasant confron- 
tation). . 

The results failed to support self-perception 
theory (Bem, 1967, 1972). According to Bem, 
individuals infer their own attitudes from 
their behaviors when external cues minimize 
the possibility of external causes of the be- 
havior. In the absence of external constraints, 
the individual attributes to himself an atti- 
tude which is consistent with the behavior. 
However, when external constraints are pres- 
ent, the action cannot be considered a mani- 
festation of an underlying attitude and the 
action becomes largely irrelevant to attitude 
perception. Thus, self-perception theory 
would have predicted that subjects should 
have inferred more favorable attitudes toward 
the other girl in the choice conditions than in 
the no-choice conditions. ‘The prediction 
clearly was disconfirmed. 

At first glance the data also might appear 
to disconfirm predictions derived from disso- 
nance theory, since subjects in the choice con- 
ditions had less justification for performing 
the counterattitudinal behavior, yet did not 
change their attitudes toward the other girl, 
However, Aronson (1968) has hypothesized 
that “white lies” which unduly praise an- 
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other are a common occurrence and are per- 
ceived as justified by the positive conse- 
quences they entail. According to this inter- 
pretation, little or no dissonance should have 
been present in any of the experimental con- 
ditions. An explanation of the results has to 
be sought outside the realm of dissonance 
theory. The possible inapplicability of the 
dissonance concept would seem to weaken the 
theory’s generalizability. The most striking 
successes of dissonance theory have occurred 
within the area of attitude change following 
counterattitudinal behavior. It would be ex- 
tremely unparsimonious to suggest that disso- 
nance theory is applicable when such behav- 
ior produces largely negative consequences 
while an alternative theoretical approach is 
necessary when it produces largely positive 
consequences. 

Since Aronson’s (1968, 1972) revision of 
dissonance theory is in some respects similar 
to the present approach, a brief contrast is in 
order. Aronson has proposed that cognitive 
inconsistencies will be most salient, and hence 
most susceptible to empirical analysis, when 
an action is inconsistent with an individual’s 
self-concept. Dissonance is not produced by 
the cognitions “I said X” and “I believe ‘not 
X’.” Rather, dissonance is generated by the 
cognitions “I have said something I don’t 
believe and it could have bad consequences 
for people” and “I am a decent, reasonable, 
truthful person” (Aronson, 1972, p. 115). 
While Aronson emphasizes rationalization 
designed to protect the self-concept, the ra- 
tionalization is motivated by a need for cog- 
nitive consistency. The theory is still an in- 
traindividual theory which focuses upon cog- 
nitive tensions and cognitive fit, as does the 
original version of dissonance theory. It would 
be difficult to find inconsistent cognitions 
which would threaten the self-concept in the 
present study, and thus it appears that Aron- 
son’s revision of dissonance is inapplicable to 
the present results. A revision of dissonance 
theory might be applied if it is assumed that 
the cognitions “I am a good person” and “I 
don’t appear to be responsible for this good 
deed” are considered dissonant. In this case, 
it might be predicted that the less responsi- 
bility an individual has, the greater would be 
the resultant dissonance. However, such rea- 
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soning appears to be straining the credulity 
of the concept of dissonance. In view of the 
present results and those reviewed earlier, the 
most fruitful tactic for future research and 
theory would be to focus attention upon the 
consequences of an individual’s action and 
the ways in which the consequences reflect 
upon the person’s self-image and public image. 
Continued focus upon inconsistencies per se 
appears to obfuscate the relevant variables 
which produce self-persuasion. 

The present experiment was suggested in 
part by Davis and Jones’s (1960) finding that 
subjects derogated a person to whom they“ 
had earlier delivered a negative evaluation 
only when they had been given a choice about 
delivering it and when no future interaction 
with the victim was anticipated. Commenting 
on these findings, Walster, Berscheid, and 
Walster (1973) concluded that “distortion 
becomes more difficult and less likely to oc- 
cur when future interaction is anticipated” 
(p. 160). The present results indicate that 


while this conclusion may hold true for situs" 


ations in which aversive consequences have 
occurred, it is inapplicable to situations where 
positive consequences are involved. Following 
an aversive behavior, interaction with the vic- 
tim provides the opportunity to rationalize the 
behavior by taking back the evaluation, point- 
ing to external factors which might have gen- 
erated the evaluation, and apologizing for 
any temporary harm. Distorted perceptions,—¢ 
of the other person would emerge as a last 
resort should some of these easier and more 
acceptable alternatives fail. Folowing a bene- 
ficial behavior, interaction provides a valuable 
opportunity for insuring personal responsi- 
bility and obtaining the appropriate credit. 
The consequences of the favorable evalua- 
tion delivered in the present study were 
largely positive—an average to dull person 
was told that she was an extremely attractive 
person. While the behavior would be consid- ~~ 
ered a lie, few people would consider it rep- 
tehensible conduct. It could be expected that 
as the concomitant negative consequences of 
such a counterattitudinal act are increased, 
the desire to obtain credit and responsibility 
for the act would decrease to the point where 
the effects obtained in the present study 
would be minimized and ultimately reversed. 
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For example, had the confederate been por- 
trayed as an extremely obnoxious character, 
the anxiety associated with the flagrantly 
“immoral” behavior of telling such a lie would 
mitigate any tendency to obtain responsibility 
for the action and might lead to attempts to 
deny such responsibility if the behavior is 
perceived as extremely immoral. 

In conclusion, the present study indicates 
that attitude changes will occur following a 
counterattitudinal action which produces posi- 
tive consequences, a conclusion which is not 
Suggested by dissonance theory or any of its 
derivatives. It is clear from the present data 
that the choice variable, presumably mediat- 
ing personal responsibility, is an important 
variable for predicting attitude change. How- 
ever, the effects of responsibility seem to de- 
pend upon whether the important conse- 
quences of an action are positive or negative. 
When consequences are negative, rationaliza- 
tion attempts are directly related to situa- 
tional manipulations of personal responsibil- 
“ity. When consequences are positive, ration- 
alization attempts seem to be inversely re- 
lated to situational manipulations of personal 
responsibility. 
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Belief in a Just World and Altruistic Behavior 


Miron Zuckerman 
University of Rochester 


Three experiments tested the hypothesis that in a time of need those who 
believe in a just world tend to behave in an altruistic way to make them- 
selves more deserving. Using a paper-and-pencil test college undergraduates 
were divided at the median of their Belief in a Just World scores. The results 
of the three experiments showed that before exams students who believed more 
in a just world (a) volunteered more to serve as subjects in experiments, (b) 
were more willing to serve as readers for a blind student, and (c) agreed more 
to participate in a 1-hour study after having already completed an experiment 
requirement in an introductory psychology course. These results indicate that 
in a time of a need, those who believed in a just world behaved more de- 
servingly even though the deserving behavior (i.e., helping others) did not 
lead in any obvious way to the satisfaction of the need (i.e, doing well on 


the exam). 


In the past few years, the construct of a 
“belief in a just world” has been often used 
to explain experimental results. In particular, 
several studies (Lerner, 1971a; Lerner & Sim- 
mons, 1966) interpreted the rejection of an 
innocent victim on the basis of an observer’s 
desire to believe that he lives in a just world 
—a world in which we can get what we de- 
serve and deserve what we get” (Lerner, 
1971b, p. 51). The derogation of the victim 
helps the observer to see the former as de- 
serving his fate. Underlying such an explana- 
tion is the assumption that all of us share, to 
a lesser or greater degree, the belief in a just 
world. Only recently a paper-and-pencil scale 
has been developed to assess individual differ- 
ences on such a belief (Rubin & Peplau, 
1973). Subjects with high scores on a Just 
World Scale were relatively unsympathetic 
toward their fellows who were unlucky in the 
draft lottery. This finding fits with the notion 
that people may derogate a victim in order to 
maintain their belief in a just world. 

At this stage it seemed promising to explore 
other behavioral characteristics which may be 
related to the belief in a just world. The 
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present study focuses on altruistic behavior 
under conditions of need versus lack of ne 
on the part of those who are asked 
help. 

Lerner’s studies focused on the way | 
infer inputs from outcomes. Given the 
ing of a victim (outputs), subjects inferred 
that he or she was responsible for it or was 
an undesirable person (inputs). It may be in- 
teresting to explore if the opposite is also 
true. Since the belief in a just world i 
a two-way relationship between what 
and what we deserve, it should also enable a 
person to infer outcomes from input. A world” 
in which people get what ‘they deserve is 
world where “deserving” inputs are rey 
and “undeserving” inputs are punished. “9c 
Servingness” has two components, by 
and personal worth, both of which m 
to reward or punishment. This study's hy- 
Pothesis is that in time of need those who 
believe in a just world behave more desery- 
ingly than those who do not believe in a just 
world. The state of need serves as a motivat- 
ing force urging the individual to look for 
ways in which he can satisfy his needs. The 
belief in a just world serves as a guiding pri 
ciple suggesting that people who behave de- 
servingly are rewarded. It follows that when 
rewards are needed, a person who believes in 
a just world will try to be deserving in order 
to get these rewards. The person who does 
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not believe in a just world does not believe 
that deserving behavior is rewarded and will 
look for other ways to satisfy his needs. 

It is important to realize that the belief in 
a just world does not imply a clear instru- 
mental relationship between outputs (reward 
or punishment) and deservingness. For ex- 
ample, Lerner (1971a) suggested that the de- 
valuation of innocent victims—subjects who 
were shocked in a learning experiment— 
helped the observers to maintain their belief 
in a just world. However, such devaluation 


„does not explain how the victims came to be 


shocked in the first place. It is possible that 
those who believe in a just world see a rela- 
tionship between outputs and deservingness 
because of divine intervention or other super- 
natural forces. Indeed, Rubin and Peplau 
found that Just World scores were “correlated 
with the belief in an active God—a Being 
beyond ourselves who takes an active part in 
the affairs of man’ (r = .31, p < .001), with 
reported frequency of church attendance (r 


` = 45, p < .001), and with self-ratings of re- 


ligiosity (r= .42, p<.001)”; (Rubin & 
Peplau, 1973, p. 89). In this same line, the 
present study’s hypothesis is that in times of 
need people who believe in a just world will 
behave more altruistically than people who 
do not believe in a just world, even if such 
behavior has no “practical” implications re- 
gartling their needs. 


EXPERIMENT 1 


Using undergraduate students, this study 
compared the willingness of high and low be- 
lievers in a just world (HBJWs and LBJWs, 
respectively) to help graduate students by 
serving as subjects in their experiments. It 
was hypothesized that shortly before the mid- 
term exam, HBJWs would be more willing to 
help graduate students than LBJWs because 
helping would make HBJWs more deserving 
in their time of need. 


Method 


A 16-item Just World Scale (Rubin & Peplau, 
1973) was administered to Introductory Social Psy- 
chology students, 31 females and 21 males, at the 
beginning of the semester. The students were told 
that the scale was a recently developed personality 
measure and that the instructor was interested in 
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obtaining normative data from different subject pop- 
ulations. The exact nature of the test was not dis- 
closed. Potential scores on the scale ran from 16 to 
96 with higher scores indicating stronger belief in 
a just world. Analysis of the subjects’ scores did not 
show any sex differences. All subjects were divided 
into those above (HBJWs, »=26) and those below 
(LBJWs, n= 26) the median of the scale (Mdn = 
58). Blocking for sex, each group was randomly 
divided into two subgroups of 13 subjects each. One 
subgroup of HBJWs and one subgroup of LBJWs 
were asked to volunteer for experiments 5 weeks be- 
fore the midterm exam, The remaining two sub- 
groups were asked to volunteer 2 days before the 
exam, In all cases the experimenter, a female, con- 
tacted a subject by phone and delivered the follow- 
ing message: 


Hi, my name is —— and I am a graduate student 
in social psychology. I got the names of all the 
students in the social psychology course from your 
instructor with the intention of asking them to 
participate in experiments that the graduate stu- 
dents conduct here. I am compiling a list in order 
to know how many subjects will be available for 
such experiments, At the moment five 1-hour ex- 
periments are being planned in the department and 
I would like to know if you are willing to partic- 
ipate in any of them, Let me add that by volun- 
teering you'll help us a great deal because at the 
moment the number of subjects available is far 
below what is needed, However, this is not related 
in any way to the social psychology course. Also, 
if you tell me now how many experiments you are 
willing to participate in, experimenters will call 
you later about the time and place. 


Questions about the content and nature of the 
experiments were discouraged. Subjects were told 
that more information would be given with the in- 
vitation for an actual experiment, After a subject 
gave a specific number of hours, the experimenter 
thanked him and terminated the conversation. When 
calling, the experimenter did not know whether a 
particular subject was a HBJW or a LBJW (the 
list of potential subjects was actually used by the 
graduate students of the department for projects’ 
they were carrying out at that time). 


Results 


No sex-linked differences were found and 
the following results are based on the com- 
bined population of males and females. Of the 
26 subjects that the experimenter attempted 
to call 5 weeks before the exam, 23 were con- 
tacted, 12 HBJWs and 11 LBJWs. There was 
no significant difference between the mean 
number of experiments volunteered for by 
HBJWs and the mean number of experiments 
volunteered for by LBJWs (M = 2.83, M 
= 3.33 respectively; ¢ = .47, df = 21), 
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Of the 26 subjects that the experimenter 
attempted to call 2 days before the exam, 23 
were contacted, 10 HBJWs and 13 LBJWs. 
It was found that HBJWs volunteered for 
significantly more experiments than LBJWs 
(M = 3.17, M = 2.17 respectively; ¢(21) = 
1.96, p < .05, one-tailed). The results of a 
simple ¢ test comparing the two mean differ- 
ences (Walker & Lev, 1953) indicated a sig- 
nificant interaction between time of volun- 
teering (5 weeks vs. 2 days before the exam) 
and level of belief in a just world, ¢(42) = 
1.83, p < .05, one-tailed. The results support 
the hypothesis that in time of need HBJWs 
manifest more altruistic behavior than LBJ Ws. 


EXPERIMENT 2 
Method 


Experiment 2 repeated the procedure outlined in 
the initial study with some minor changes. The week 
of final exams served as a time of need and subjects 
were asked to read for a blind student rather than 
to volunteer for experiments. To exclude the possi- 
bility that the results of Experiment 1 were due to 
the particular nature of the groups under investiga- 
tion, the order of calls was changed. Subjects who in 
the first study had been called 5 weeks before the 
exam were called shortly before the final exam pe- 
riod; subjects who in the first study had been called 
2 days before the midterm were called 4 weeks be- 
fore the final exams.1 In addition, to avoid suspicion, 
a different experimenter, also a female, was used to 
carry out the experimental manipulation. As in the 
first study, the experimenter did not know whether 
a particular subject was a HBJW or a LBJW. The 
experimenter contacted all subjects by phone and 
delivered the following message: 


Hi, my name is and I am a reader for —— 
who is a blind student in the evening school. She 
is taking a social psychology course by the same 
instructor that you have. Unfortunately, I am not 
familiar with this stuff and I am having difficulty 
reading and explaining the material to her. I am 
calling a bunch of people from the morning course 
to see if anyone would care to read to her during 
the next week. I am trying to call a large number 
of people so that the load will not be too heavy 
for anyone. Right now I am just compiling a list 
of people who are willing to give some of their 
time, no more than 5 hours each. Later on PI 


1 Because of the change in the order of calls, the 
comparison between HBJWs and LBJWs in time of 
need, that is, a short time before their exams, used 
different subjects in Experiment 1 and Experiment 2, 
thus permitting a combination of the differences be- 
tween HBJWs and LBJWs in time of need as if the 
two studies were independent. 
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decide whom to call back to arrange for specific 
time and place. Are you willing to give any of 
your time to do this? 


The experimenter answered any questions th 
subjects had, recorded the number of volunteered 
hours, thanked the subject, and terminated the con 
versation. 


Results 


Again there were no sex-linked differences 
and the following results are based on males 
and females combined. Of the 26 subjects that 
the experimenter attempted to call 4 week 
before the exam, 21 were contacted, 11 


HBJWs and 10 LBJWs. There was no sig- 


nificant difference between the mean number 
of hours volunteered by HBJWs and the 
mean number of hours volunteered by the 
LBJWs (M = 2.70, M = 2.32 respectively; 
t= 39, df = 19). 

Of the 26 subjects that the experimenter 
attempted to call at the beginning of the final 
exam period, 16 were contacted, 8 HBJWs 
and 8 LBJWs. The high loss of subjects prob- 


ably reflects the fact that most students were” 


studying in the library and more difficult to 
contact. The results showed that HBJWs vol- 
unteered significantly more hours than LBJWs 
(M = 1.35, M=.25 respectively; ¢(14) = 
1.77, p= <.05, one-tailed). However, the 
interaction between time of volunteering (4 
weeks before the finals vs. the beginning of 
the finals) and level of belief in a just world 
was not significant (t = .59, df = 33), 


EXPERIMENT 3 
Method 


This study compared again HBJWs to LBJWs on 
their willingness to participate in an experiment for 
no apparent reward. The week before the final exams 
of the spring semester served as a time of need. Sub- 
jects were 165 students, 93 females and 72 males, 
who had taken an introduction to psychology course 
in the fall semester. In partial fulfillment of a re- 
quirement of this course, all 165 students partic- 
ipated in a correlational study of personality mea- 
sures conducted by the author. The fall study used 
several personality questionnaires including the Just 
World Scale. The present experiment examined the 
willingness of these students to return for an addi- 
tional hour of psychological testing for which they 
could not receive credit or monetary compensation. 
Unlike the two former studies, all subjects were 
asked to come for the experiment in a time of need, 
that is, a week before final exams started. The ex- 
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perimenter, a female who was not involved in the 
fall study, contacted all subjects by phone and de- 
livered the following message: 


Hi, my name is —— and I am a graduate student 
in social psychology. I am working in the area of 
motivation trying to relate achievement motivation 
to other personality measure. I found out from Dr. 
Zuckerman, who also works in this area, that he 
used an achievement motivation questionnaire in a 
study he conducted during the fall—Experiment 
No. —. He gave me the names and telephone 
numbers of students who took part in that experi- 
ment and I found yours among them. Do you 
remember taking part in that experiment? [If sub- 
jects said no, the experimenter reminded them of 
the location of the experiment, the content of the 
achievement motivation questionnaire, etc. How- 
ever, the Just World Scale was not mentioned. All 
subjects remembered eventually having partic- 
ipated in the fall study.] The reason I am calling 
is that I would like to ask you a favor. Would it 
be possible for you to come to an experiment that 
I am conducting. It will take 1 hour of your time 
and all you have to do is fill out some more per- 
sonality questionnaires. It is important for me that 
you come because I would be able to relate your 
scores from my study to the scores from Dr. Zuck- 
erman’s study. Would you agree to come? 


The experimenter answered any questions that 
subjects had and recorded whether they agreed to 
come. She then explained the real purpose of the 
telephone call and inquired whether subjects had sus- 
pected anything, None of the subjects thought that 
the intention of the experimenter was to relate altru- 
istic behavior to the belief in a just world. In fact, 
most subjects had forgotten about the Just World 
Scales which made the debriefing more difficult. After 
the debriefing, the experimenter thanked the subject 
and terminated the conversation. 


Results 


As in the first two studies there were no 
sex-linked differences. Of the 165 subjects 
that the experimenter attempted to call, 128 
were contacted. Subjects were divided into 
those above (HBJWs; = 64) and those be- 
low (LBJWs, n = 64) the median of their 
Just World scores from the fall semester. The 
results showed that 28 HBJWs (43.7%) but 
only 16 LBJWs (25%) agreed to participate 
in the 1-hour experiment. The difference be- 
tween the two groups reaches the .02 level of 
significance (Z = 2.23, one-tailed). 


GENERAL DISCUSSION 


The second half of Experiment 1 and 2 and 
Experiment 3 compared HBJWs to LBJWs 
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on their willingness to behave in an altruistic 
way in a time of need. Since the three studies 
used different populations of subjects (see 
Footnote 1), it is possible to combine the 
results to obtain the overall likelihood of the 
findings considered as a set. This can be done 
by translating the p values obtained to the 
equivalent standard normal deviates (Z = 
1.82 in Experiment 1; Z = 1.65 in Experi- 
ment 2; Z = 2.23 in Experiment 3) and di- 
viding the sum of Zs by the square root of 
their number to obtain an overall Z that gives 
the overall level of significance of the ob- 
tained results (Mosteller & Bush, 1954). The 
procedure shows that in time of need the dif- 
ference between HBJWs and LBJWs in help- 
ing behavior reaches the .001 level of sig- 
nificance: 


z _ 1.82 + 1.65 + 2.23 


=3.26 
A 5 3.29. 


According to these results there are certain 
conditions under which those who believe in 
a just world tend to behave more altruisti- 
cally than “nonbelievers.” At the same time, 
the belief in a just world should not be 
equated with altruism. First, HBJWs seem to 
be trading one favor (i.e., altruistic behavior) 
for another (i.e., success in exams), rather 
than being involved in “true” altruism. It is 
only in times of need that HBJWs went out 
of their way to be good. When there was no 
need, that is, when no specific rewards were 
desired, HBJWs did not need to be deserving. 
Second, Lerner’s studies indicated that the be- 
lief in a just world enables a person to per- 
ceive victims as deserving their lot and thus 
perpetuates social injustice. Lerner (1971) 
also suggested that, depending on the situa- 
tion, either the rejection of victims or helping 
them may be related to the same psychologi- 
cal construct. In view of such evidence it is 
possible to conclude that in and of itself the 
belief in a just world is devoid of any moral 
implications. It may be conceived rather as 
an image, that a person has of the world, 
which may require a derogation of a victim 
that cannot be helped otherwise. This image 
may also increase altruistic behavior so as to 
make the person more deserving when he is 
in a state of need. 
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It should be noted that the Lerner studies 
and the present experiment differ from each 
other. In the Lerner studies, the suffering of 
the victim created a need to maintain the 
belief in a just world. In the present study 
the belief in a just world was not threatened. 
Instead, it suggested to some individuals (i.e., 
HBJWs) a way to satisfy their need to suc- 
ceed in their exams. The fact that different 
behaviors are linked to the same theoretical 
construct may help to explain some of the 
inconsistencies that have been observed in 
human behavior and makes the construct a 
potentially useful tool in the investigation of 
such behavior. No doubt the door is open for 
more research on other behavioral character- 
istics and the conditions under which they are 
related to a belief in a just world. 


Miron ZUCKERMAN 
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A Subjective Probability Interpretation of the Risky Shift 


James P. Kahan 
University of Southern California 


Vinokur’s expected utility approach to the 
SU) model. In this SEU model, the ri: 


jective expected utility (S 


as the result of a change in the association of subjective 
objective statements of probability of success. Thirty 


y shift is generalized to a sub- 


risk levels with various 
ix female subjects parti 


pated in a slightly modified risky shift experiment designed to differentiate be- 


tween a “true” 


risky shift model and a revised SEU function model. 


The data 


supported the latter model. Implications for the various theoretical interpreta- 


tions of risky shift are discussed. 


The risky shift phenomenon has been the 
focus of a major part of recent experimental 
social psychology. Reviews by Dion, Baron, 
and Miller (1970), Clark (1971), Vinokur 
(1971b), and a special issue of the Journal 
of Personality and Social Psychology (Pruitt, 
1971a, 1971b) attest to the interest in risky 
shift phenomena, Explanations of the risky 
shift have been many and varied; the reader 
is referred to any of the reviews above for 
critical appraisals. Among those various ap- 
proaches are the decision-making model of 
Vinokur (1971a, 1971b) and the social com- 
parison of abilities interpretation of Jellison 
and Riskind (1970, 1971; Jellison, Riskind, 
& Broll, 1972). The present article extends 
the decision-making model along lines dictated 
by consideration of social comparison of 
abilities. 

Vinokur (1971a, 1971b) has suggested that 
the shifts in the lowest acceptable risk level 
could be- accounted for as the result of a 
rational decision-making process. The model 
he employs may be characterized as an ex- 
pected utility (EU) model (Edwards, 1954) 
and is one of a class of such models that 
have been proved fruitful in the field of 
behavioral decision theory (Edwards & Tver- 
sky, 1967), In an expected utility model, 
a decision maker considers, for each of the 
alternatives open to him, all of the possible 
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outcomes of those alternatives and assigns 
a personal value (his utility) to each outcome. 
Then, for each decision alternative, the proba- 
bility of each possible outcome is determined, 
The expected utility of a given decision 
alternative is found by multiplying the utility 
of each outcome by its probability of occur- 
rence, The alternative actually chosen is the 
one which has the highest expected utility. 

In the context of the risky shift, the deci- 
sion maker is faced with the choice between 
two decisions. One, the conservative alterna- 
tive, leaves him with the sure (proba- 
bility = 1) outcome which has utility U. to 
him. The other alternative is the risky choice, 
which may result in an outcome having U, 
to him if the gamble is successful, but, if 
the gamble fails, results in an outcome 
having Uy, Clearly, the condition 


> U.> Uy, 


must hold, or else there is no conflict over 
which decision to make. If the probability 
of the outcome of the risky decision being 
the highest valued one is taken to be p then 
the expected utility of the risky decisi 
by definition 


U, = pU, + (1 — p)U,. 


If U, > Us then the risky decision is chosen; 
if not, then the conservative choice is pre- 
scribed. The common form of the risky shift 
paradigm (Kogan & Wallach, 1967) asks for 
the minimum acceptable odds of success before 
the risky alternative would be advocated. 
The various odds of success are presented as 
relative frequencies (i.e, 3 chances in 10) 
in the typical presentation. This is, in effect, 
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asking for what values of p the expected 
value of U, is greater than U.. The point 
of indifference, f, is easily shown (Vinokur, 
1971b) to be 


Vinokur’s subsequent development shows that 
the risky shift, which produces different levels 
of f before and after group discussion, could 
result from changes in the values of Us, Ue, 
and Uy; as a result of that discussion. If the 
positive aspects of the successful result of 
the risky decision are emphasized, then the 
individual’s utility for that outcome, U., 
would be increased. Similarly, if the conserva- 
tive choice or the negative outcome of the 
risky choice is derogated, the values U. or Uy 
would decline. If either or both of these 
processes occurs, as has often been suggested, 
then the net results are a lowering of the 
value of f and the observed risky shift. 
Taken in this context, the risky shift is not 
a shift in advocated risk, but rather the same 
degree of risk but with shifted utilities. 

In Vinokur’s and other standard risky shift 
experiments, the task of the subjects is to 
choose the minimum acceptable risk from a 
set of relative frequencies presented to them. 
The assumption is made that this set of 
relative frequencies are accurate representa- 
tions of the individual’s own subjective proba- 
bilities of success. In general, however, ob- 
jectively presented relative frequencies are 
not equivalent to personal or subjective 
probabilities (Davidson, Suppes, & Siegel, 
1957; Erlick, 1964; Kahneman & Tversky, 
1973; Mosteller & Nogee, 1951; Pitz, 1965), 
Instead of the EU model employed by Vi- 
nokur, a more appropriate model would be a 
subjective expected utility (SEU) model 
(Edwards, 1954). The SEU modification to 
Vinokur’s model defines a subjective proba- 
bility of success ¢ which determines the 
individual’s minimum acceptable risk accord- 
ing to the equation 


cn 
‘) 
{ 


where s, the subjective probability of a suc- 
cessful outcome if the risky choice is taken, 
has replaced p, the relative frequency of that 
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outcome, The subjective probability function 
that relates the private $ to the publicly 
reported Ê is monotonic in the sense that 
the greater the value of p, the greater the 
value of s; however, the function is not neces- 
sarily linear, much less an identity, and may 
be affected by different environmental con- 
texts. In an SEU interpretation of the risky 
shift, the observed shift in objective relative 
frequencies could arise from either a change 
in the utility of the various outcomes, as 
before, or from a change in the subjective 
probability function relating the values of f 
to the values of $. In this latter situation, 
the identical subjective probability of a suc- 
cessful outcome might be reflected in differing 
objective relative frequency preferences at 
different stages in the risky shift experiment. 
It is important to distinguish here between 
the shift in the subjective probability function, 
in which the same risk is reflected in different 
objective relative frequencies, and a shift in 
preferred subjective probability, which would 
be a true shift in risk preference. The SEU 
model is predicated on the former of these 
two alternatives, 

One possible factor affecting the relation- 
ship of objective odds and subjective proba- 
bility in the context of the risky shift is the 
ability of the actor in the risky shift story. 
In general, objective odds are defined in terms 
that are general, either across actors or across 
situations, The meaning of an objective odds 
statement applied to an individual actor in 
an individual situation, then, might be dif- 
ferentially interpreted, depending on the iden- 
tity of the actor and the situation in which 
he is placed. Consider, for example, one 
situation reliably showing a risky shift, that 
of a football game. The choice is between 
a sure field goal or a risky play for a touch- 
down. Suppose that the objective odds are 
given as 5 in 10 that the risky play would 
succeed. In such a situation, the subjective 
probabilities of success could well depend on 
the ability of the players involved. A Bart 
Starr or a John Unitas would be considered 
to have a subjective probability of success 
considerably above .50 whereas a third-string 
quarterback in the game only because the 
others have been injured might be assigned 
subjective probabilities well below .50. The 
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decision to try for the touchdown at the 
identical objective odds of 5 out of 10 would 
then represent quite different levels of risk 
for the two types of quarterback. In order 
to present equal levels of risk, the respondent 
would be required to give different levels of 
objective odds for the two types of quarter- 
back, say 3 in 10 for Bart Starr and 7 in 10 
for Ferdinand Fumblefingers. 

In the usual risky shift stories (Kogan & 
Wallach, 1967), the abilities of the actors are 
left vague and undefined. If one of the effects 
of the group discussion were to make the 
ability of the actor more salient, and if the 
general perception of ability were increased 
as a result of the group discussion, then the 
group consensus decision and the individual 
postdiscussion decision would show a change 
in the direction of more risk, This would 
occur even in the absence of a shift in the 
individual risk-taking propensity of the sub- 
jects, but instead a change in the perceptions 
of ability and therefore in the relation of the 
subjective probability to the objective odds. 

Although the question of abilities of the 
actors in the risky shift situation has not 
been approached directly, there is enticing 
tangential evidence to show that perceived 
ability is a relevant variable. It is interesting 
to note that the 2 stories in Kogan and 
Wallach’s original set of 12 that show a con- 
servative rather than a risky shift are stories 
in which the actor has no real control over 
his own outcomes, and, hence, his abilities 
are irrelevant. Rabow, Fowler, Bradford, 
Hofeller, and Shibuya (1966) showed that 
when specific persons such as “your father” 
or “your brother” were substituted for the 
more traditional “Mr. A” and “Mr, B,” there 
was no shift. If these familiar individuals are 
already well-defined with respect to their 
general abilities, then the discussion would 
not have the effect of modifying the sub- 
jective probability function. Any shift would 
be a real risky shift or a shift in utilities, 
but could not be a subjective probability 
function shift. That no shift occurred may 
be interpreted as evidence favoring the present 
model. 

Jellison & Riskind (1970, 1971; Jellison 
et al., 1972) have approached the problem 
of ability directly, although from the point 
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of view of the subject in the risky shift ex- 
periment rather than the actor in the story. 
Their position is that one major component 
of how much risk an individual takes is his 
perception of his abilities in the situation; 
that is, the greater his ability, the greater 
the level of risk the individual can reasonably 
attempt. Persons of high ability are justified 
in taking more risky and difficult actions 
than a person who is less able. This position 
is entirely consonant with the SEU inter- 
pretation presented above, although it rises 
out of an orthogonal theoretical position. 
Jellison and Riskind (1970) showed that the 
greater the level of risk a person chooses, the 
more able that person is perceived. Further- 
more, the greater an individual’s ability, the 
more risk he is seen as likely to advocate. 
Jellison and Riskind (1971) extended this line 
of reasoning to show that people attribute 
greater riskiness to another as a function of 
the other’s perceived ability. Jellison et al. 
(1972) found that, in tasks involving skill, 
persons taking high risks were seen as more 
able than persons taking low risks, but that 
this effect was not present when the task 
involved chance. 

The risky shift paradigm as normally used 
confounds the two possibilities of a shift in 
advocated risk and a shift in the relation of 
subjective probability to objective relative 
frequency probability. It is impossible to say 
whether a shift in the advocated risk is an 
actual shift in the risk that should be taken 
or a new perception of the meaning of the 
objective relative frequencies presented, The 
present experiment attempts to separate these 
factors by adding a questionnaire to the 
normal risky shift procedure. This question- 
naire ascertains the extent of risk that the 
individual is advocating when he states a 
preference for a certain objective relative 
frequency of success. In this way, the ob- 
jective minimum acceptable odds of a suc- 
cessful outcome and the riskiness of those 
acceptable odds can be separated, 

Consider the situation where an individual 
has shown a “risky shift.” Let fı be his 
original minimum acceptable odds at time 4, 
and fy be his revised minimum acceptable 
odds at time l» where fi > ps. If a true 
shift in acceptable risk has occurred, the 
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subjective riskiness of the objective odds fı 
should not change over measurements at tı 
and fy. If, however, the advocacy of the new 
level fə is based on a shift in the relation 
of subjective probability to objective odds, 
then the subjective riskiness of the p; decision 
at t should be less than its subjective riski- 
ness at fg, Furthermore, if an individual is 
asked to assess the subjective riskiness of fı 
at time fı and the subjective riskiness of f: 
at time ts, then the two different odds should 
be seen as equivalent in risk. 


METHOD 
Subjects 


Subjects were 36 female undergraduate students at 
the University of Southern California who participated 
in the experiment as part of a requirement of the 
introductory psychology course. The subjects partici- 
pated in 12 groups of 3 students each. The subjects 
within any group were not close friends, but in most 
instances they were acquaintances Or had seen each 
other in classes. 


Procedure 


Subjects were seated around a table and were given 
the normal introduction to the risky shift experiment 
(Kogan & Wallach, 1967), and then were given a set 
of six of the original choice dilemmas to work on 
individually. The six dilemmas correspond to items 
4, 6, 7, 8, 10, and 11 of the original set of 12, and 
were chosen as ones where the outcomes of the risky 
choice are to a major extent determined by the per- 
formance (and hence the ability) of the central char- 
acter. The stories involved (a) the football captain 
who had to choose between playing for a win or a tie; 
(b) the college senior who must choose between two 
qualities of graduate school; (c) the chess player who 
must choose to play for a win or a draw; (d) the 
college senior who must choose between careers of 
music or medicine; (e) the businessman who must 
choose whether to run for Congress; and (f) the 
scientist who must choose between long-term and 
short-term research projects. 

Following the first questionnaire booklet, in which 
the minimum acceptable odds for each situation were 
measured, subjects were given a second booklet, 
hereafter referred to as the subjective risk questionnaire, 
which was designed to measure subjects’ estimates 
of how risky their own minimum acceptable odds 
were. These questionnaires again presented the cover 
stories for each dilemma, but instead of the usual 
request for minimum acceptable odds the question- 
naire asked how risky a decision to attempt to gain 
the most desitable outcome would be. The text used 
for the football player’s dilemma provides an 
illustration : 


Imagine that the chances are — in 10 that the 
more complicated play will work. Now, please in- 
dicate by placing an X on the dotted line below 
the degree of risk you feel that Mr. B would take 
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if he decided to pursue the more desirable alter- 
native; that is, play for the victory. 


no risk at all 


The experimenter inserted the subject’s own minimum 
acceptable risk in the blank space. Subjects were 
fully aware that the experimenter’s entry was their 
own previously stated minimum acceptable odds. 
Thus, a subjective measure of the risk, scored as a 
number from 0 to 50 based on the subject’s located 
point on the dotted line, that the subject was willing 
to take was determined for each dilemma. 

After completion of the subjective risk question- 
naire, subjects returned to the normal risky shift 
paradigm, discussing each item in turn to a group 
consensus and then indicating their own revised 
personal minimum acceptable odds. 

The final step in the experiment was a second 
administration of the subjective risk questionnaire. 
This time, each of the three subjects in a group had 
the same questionnaire except that the number in 
the blank space was determined differently for each 
of them. The nature of this determination constitutes 
the independent variable of the study. In Condition A, 
the subject’s own original minimum acceptable odds 
were entered. For this condition, the second adminis- 
tration of the subjective risk questionnaire was an 
exact replication of the first administration. In Condi- 
tion B, the group consensus minimum acceptable 
odds was entered in the blank space. These subjects 
evaluated the riskiness of the group’s decision, In 
Condition C, the subject’s own revised personal mini- 
mum acceptable odds provided the entries for the 
subjective risk questionnaire. Although the experi- 
mental treatments were identical for all subjects up 
to the second subjective risk questionnaire, their roles 
were randomly assigned (by alphabetical order of 
first name) at the very beginning of the experiment. 
Following completion of the last subjective risk 
questionnaire, the nature of the hypotheses under- 
lying the experiment and the reason for the various 
experimental questionnaires were explained to the 
subjects. 


RESULTS 


Five measures were taken from each sub- 
ject. For each of these measures, data were 
averaged over all six choice dilemma situ- 
ations. The first three measures were the 
subject’s minimum acceptable risk, the amount 
of shift toward risk between this first measure 
and the group consensus, and the amount of 
shift toward risk between the first measure 
and the revised minimum acceptable risk. 
These three measures are those typically 
employed in risky shift studies; the first has 
a range of from 0 to 10, while the latter two, 
being difference scores, may take values be- 
tween plus and minus 10. The fourth measure 
was the subjective riskiness of the original 
minimum acceptable odds, and the fifth mea- 
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Sure was the difference between the subjective 
tisk items on the two administrations of the 
questionnaire. For both of these measures, 
the raw score of number of dots from the 
left was adjusted by a multiplicative con- 
stant to yield scores with ranges comparable 
to the previous three variables. For the fifth, 
and critical, measure, a positive number in- 
dicates a shift in the direction of less sub- 
jective risk being advocated. All five measures 
are presented in Table 1. For statistical pur- 
poses, the experimental group of three sub- 
jects was taken as the experimental unit, 
with the three independent variables being 
repeated measures on this unit. This tech- 
nique controls for between-group differences 
in degree of shift. 


Risky Shift 

Prior to examination of subjective risk, it 
must be ascertained that the intrusion of the 
subjective risk questionnaire did not impair 
the risky shift phenomenon. To test this, the 
grand means over conditions of both the 
group shift and the personal shift were tested 
against a null hypothesis of zero shift. Both 
shifts were significantly different from zero 
by analysis of variance, F(1, 35) = 14.77 for 
group shift; F(1, 35) = 16.16 for personal 
shift; p < .001 for both. A second analysis 
showed that the differences among experi- 
mental conditions were not significant 
(a = .10), Thus, the risky shift did occur 
for all three conditions and did not occur 
differentially among them, It should be noted 
that, as usual, the magnitude of the personal 
shifts were less than the group shifts, in- 
dicating that the players were in some cases 
agreeing to a consensus to which they them- 
selves did not subscribe. Even so, there was 
a reliable and significant shift in personal risk. 

It has been argued (e.g., Collins & Guetz- 
kow, 1964; Teger & Pruitt, 1967) that the 
risky shift is a result of persons who originally 
have conservative feelings shifting more than 
persons who have original gambling feelings. 
To test this possibility, the individual’s origi- 
nal minimum acceptable risk was covaried in 
a reanalysis of the group and personal shifts, 
For both the group and the personal shift, 
the regression was significant, indicating that 
the more conservative the original position, 
the greater the shift. However, even with 
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TABLE 1 


MEANS OVER THE Six CHOICE DILEMMAS OF THE 


DEPENDENT VARIABLES 
Condition Over 
— ~ — condi- 
Measure A B c 


tion 


minimum ac- 


ceptable odds* 4.85 5.57 5.35 
Group shift AT 1.360 115 
Personal shift» 56 1.04 86 
Original subject 474 422 4°36 
Shift in subject 93 —1.20 —.04 


he higher the number, the more conservative the m 
b The higher the number, the more risky the measure 


this effect extracted by covariance, the tests 
for group shift and personal shift were still 
statistically different from zero, F(1, 34) 
= 18.11 for group shift, p < .001; F(1, 34) 
= 6.87 for personal shift, p < .013. 


Shift in Subjective Riskiness 

The shifts in subjective riskiness for the 
three conditions are presented in row 5 of 
Table 1, Recall that the SEU model dictated 
that if the risky shift occurred, subjects in 
Condition A, evaluating the subjective risk 
of a decision to opt for the most desirable, 
but chancy, outcome, would see their original 
minimum acceptable risk as more conservative, 
or as less of a risk than it formerly was. 
A subject in Condition C, in which he evalu- 
ates the riskiness of his revised opinion, should 
show no difference in riskiness between the 
later judgment and the earlier one, even 
though that earlier one was based on more 
conservative objective odds, Finally, a sub- 
ject in Condition B might rate the riskiness 
of the group decision to be greater than the 
original condition, because he has not fully 
subscribed to the risk level advocated. Because 
the group minimum acceptable odds are 
greater than his own personally advocated 
odds, the group choice is riskier than his 
stated risk preference. 

The results bear out the dictates of the 
SEU model, with the mean shift for Condi- 
tion A being positive, the shift for Condi- 
tion C near zero, and the shift for Condition B 
being negative. The differences among the 
three conditions are significant by a repeated 
measures analysis of variance, F (2, 22) = 8.57, 
p < .002. Post hoc Duncan tests show all 
three conditions to be distinct at a confidence 
level of a = .05, 
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Discussion 


The data from the present study show that 
there is not necessarily a change in the amount 
of subjective risk advocated when an indi- 
vidual shows what has come to be called a 
“risky shift.” Instead, evidence is presented 
that there is a change in the amount of 
subjective risk that is represented by a certain 
level of objective probability. In this context, 
as in Vinokur’s context of shifting utilities, 
the “risky shift” may be viewed as the result 
of a rational process of reevaluation by the 
subject of parameters involved in a maximi- 
zation of subjective expected utility, and not 
as the advocacy of greater risk. The present 
study does not firmly establish the presence 
of a shift in subjective probability to the 
exclusion of a shift in the utilities of the 
outcomes as posited by Vinokur (1971a). 
Both or either of the processes could account 
for the results in either experiment. However, 
the weight of evidence of the studies in 
decision making (Edwards & Tversky, 1967) 
points in the direction of at least some role 
for subjective probability. 

The present study does suggest, however, 
that the risky shift is not a shift in advocated 
risk, but rather the result of a rational process 
of attitude change. Once this concept of an 
actual shift in risk is discarded, the various 
competing explanations of a risky shift be- 
come technically moot. However, these ex- 
planations can serve as bases from which to 
attempt to answer the revised question of 
why a group causes a person to revise his 
utility of certain outcomes or his subjective 
probability of their occurrence. In particular, 
the risk-as-a-value hypothesis (Brown, 1965), 
as extended through Jellison and Riskind’s 
(1970) social comparison of abilities approach, 
suggests that ability may be a relevant factor 
in the answer to this question. The social 
comparison of abilities model does lend itself 
more readily to a subjective probability shift 
than a utility shift, because it appears more 
reasonable to presume that ability would 
affect the probability of success more than 
it would the evaluation of the outcomes asso- 
ciated with success and failure. 
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Effects of Schedule and Amount of Observed Reinforcement 
on Response Persistence 
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Replication of the usual findings concerning resistance to extinction after vari- 


able ratio (VR) and continuously rein 
low amounts of positive reinforcement 
control group design was used with 9 
peer served as the model. 


forced (CRF) responding with high and 
was attempted. A 2 X 2 factorial, single 
O male children, A videotaped, same-sex 


A free operant response, marble dropping, measured 


unreinforced observer imitative behavior following observation of the model’s 


performance. A significant main effect 
was found for total mean number cf 


for schedule (VR 4) and amount (high) 
responses. Also, a significant main effect 


for schedule (VR 4) was obtained for mean persistence time. A discrimination 
explanation was applied to the present results. Berger’s analysis of model rele- 
vance for social comparison was considered in interpreting the results. 


Although children frequently observe 
others receive positive reinforcement, the 
parameters that influence persistence of these 
observed responses have not received ade- 
quate empirical investigation. In most of the 
previously related research the term resis- 
tance to extinction has been used to describe 
unreinforced behavioral patterns of observers 
subsequent to viewing a reinforced model 
(Braun, 1972; Hamilton, 1970). In these 
studies, observing subjects are never directly 
reinforced, and therefore, removal of directly 
applied positive reinforcement does not occur. 
Employing the term resistance to extinction 
to describe the previously unreinforced imita- 
tive behavior is inaccurate and contradicts 
the accepted operant conditioning definition 
of this term (Reynolds, 1968). A more ac- 
curate term to describe unreinforced imitative 
observer behavior following the observation 
of a positively reinforced model is persistence. 
Persistence is used in the present study to 
describe this behavior. 

Variables which have been found to in- 
fluence resistance to extinction of directly 
reinforced responses such as schedule and 
magnitude (Koteskey, 1972; Lewis, 1960) 
have rarely been manipulated in studies in- 
volving observed or vicarious reinforcement. 
The schedule of observed reinforcement has 

Requests for reprints should be sent to Charles D. 
Kerns, 14160 Redhill Avenue, #123, Tustin, Cali- 
fornia 92680. 


been the most frequently manipulated vari- 
able in this research, whereas the empirical 
examination of amount of observed reinforce- 
ment has been sparse. Results concerning an 
“observed” partial reinforcement effect 
(PRE) have been contradictory to date. 
Studies associated with these inconsistent 
findings have varied greatly with respect to 
dependent measures employed to measure 
persistence. 

Rosenbaum and Bruning (1966) found 
that observed continuously reinforced (CRF) 
discrete bar movement responses produced an 
increase in the observer’s subsequent response 
speeds. This reversal of the typical response 
pattern following direct reinforcement was 
attributed to social demand characteristics; 
that is, subjects were not told they could stop 
responding when they wished. Their depen- 
dent measure of response speed was obtained 
by having second-grade male and female sub- 
jects move an upright handle from left to 
right on an 18-inch (45-cm) opening, Bruning 
(1964) obtained similar results using the 
same response and dependent measure follow- 
ing suspension of directly programmed be- 
havioral consequences, Bisese (1966) used 
imitative judgements of the length of lines 
that were displayed on a screen as his de- 
pendent variable in examining the effect of 
vicarious or observed reinforcement on these 
judgments. During observed reinforcement 
trials, subjects viewed the model receive three 
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treatment conditions. Treatment procedures in this 
study can be conceptually divided into two major 
procedural phases: Phase 1 involved the observation 
of the appropriate videotaped condition by each 
subject, Phase 2 involved the unreinforced perform- 
ance of subjects immediately following their obser- 
vation of the appropriate film. A detailed description 
of each procedure follows, 

Observation procedures. Subjects were brought in- 
dividually to the experimental room by the experi- 
menter and asked to sit in a chair next to the ex- 
perimenter in front of the videotape monitor. The 
experimenter stated, “You are going to see a movie. 
You will see a school boy named John and me in 
this movie. Watch closely.” The experimenter then 
requested the subject to place a set of headphones 
on his head while he and the experimenter watched 
the videotape. 

There were five different videotaped conditions. 
Each subject viewed the taped performance to which 
he was randomly assigned. The content in all five of 
the tapes was identical except for the instructions 
given to the model by the experimenter concerning 
the amounts and schedules of reinforcement that the 
model received for playing the marble game. Also, 
the actual reinforcement delivered to the model dif- 
fered between the five groups and conformed to the 
experimenter’s previously given instructions. The 
manner in which the two independent variables 
(amount and schedule of reinforcement delivered to 
the model) were displayed in films and viewed indi- 
vidually by subjects according to their assignment is 
described as follows. 

In Group 1, 18 subjects individually observed the 
model responding on a CRF schedule, the model re- 
ceiving four nickels for each response. Thus, subjects 
observed the model receive a total of 102 nickels for 
emitting 28 marble-dropping responses. 

In Group 2, 18 subjects individually observed the 
model responding on a CRF schedule, the model re- 
ceiving one nickel for each response, Thus, subjects 
observed the model receive a total of 28 nickels for 
emitting 28 responses. 

In Group 3, 18 subjects individually observed the 
model responding on a VR 4 schedule, the model 
receiving four nickels seven times, after Responses 1, 
6, 9, 17, 18, 24, and 28. Subjects thus observed the 
model receive a total of 28 nickels for emitting 28 
responses. 

In Group 4, 18 subjects individually observed the 
model responding on a VR 4 schedule, the model 
receiving one nickel seven times, after Responses 1, 
6, 9, 17, 18, 24, and 28. Subjects thus observed the 
model receive a total of seven nickels for emitting 
28 responses. 

In Group 5, the control condition, 18 subjects 
individually observed the model emit 28 marble- 
dropping responses without receiving reinforcement. 

Performance procedures. Immediately following 
observation of their specific videotaped condition, 
each subject was told by the experimenter, 


You have just seen a school boy like you play 
the marble game. Now it is your turn. Come over 
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here with me and I'll show you the game. R: 
member, you pick up one marble at a time w 
one hand from here and drop it in the white h 
here. You can play the game as long as you wan 
When you want to stop, you can. I will be oui 
side the room by the door, so just come get n 
if you want to stop, Go ahead, you can start 


Prior to giving these instructions, the experimenter 
removed a sheet that was draped over the front of 
the apparatus to conceal it during the observation 
phase. This sheet was replaced by an assistant be- 
tween the experimental sessions. 

After the word start, in the previous instructions, 
the experimenter left the experimental room leaving 
the door to the room ajar. There was no other 
person present in the room while the subject per- 
formed. During each subject's performance the de- 
pendent measures which operationally defined per- 
sistence in this study were recorded by an adult 
‘observer, This observer was seated outside the ex- 
perimental room by the door. 

The persistence or performance period was termi- 
nated when the subject informed the experimenter 
that he was finished or after an 8 minute time period 
had elapsed. Upon completion of their marble-drop- 
ping performance, each subject was praised for per- 
forming. In addition, the experimenter stated, “I 
want you to keep a secret with me and not tell any 
of your friends or classmates what you did, Let's 
shake on it.” Following this statement, the subject 
was returned to his room by the experimenter. 


Dependent Measures 


Persistence in emitting unreinforced responses fol- 
lowing the observation of a videotaped condition was 
measured by two dependent variables. First, the total 
number of marble-dropping responses emitted by 
each subject during the performance or persistence 
period was measured in 1-minute time blocks. Sec- 
ond, the total amount of time that each subject 
persisted in the operant task was recorded. This 
measure was recorded in minutes and seconds but 
converted for analysis purposes into minutes and 
fractions of minutes. 

The two dependent measures were recorded by the 
adult observer seated outside the experimental room 
by the door. The observer was not visible to the 
subject. The observer was instructed to count each 
marble dropped by the subject in 1-minute time 
blocks. Each marble, as it dropped, could be clearly 
heard from outside the room. The observer used a 
stopwatch to record the number of marbles dropped 
per 1-minute interval as well as to register the total 
persistence time for each subject. 

The performances of 15 randomly selected subjects 
on these two dependent measures were also recorded 
by another independent observer. The interobserver 
reliability for each dependent variables was 100%. 
This reliability data was identical to that found in 
an earlier pilot study that employed the same ex- 
perimental procedures. 
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ResuLts 
Total Mean Number of Responses 


The results of analysis of variance of the 
total mean number of marble-dropping re- 
sponses using the method of orthogonal con- 
trasts are summarized in Table 1, These data, 
along with appropriate pairwise contrasts 
using Duncan’s test, resulted in the following 
decisions concerning the three relevant null 
hypotheses. 


1. Null Hypothesis 1, that there is no 
significant mean difference in total number of 
marbles dropped between the observed VR 4 
schedule and observed CRF schedule, is re- 
jected. The critical value for F to exceed at 
the .05 level of significance is 3.95. 

Duncan’s test showed the mean for the 
VR 4 high-amount group (60.39) to be sig- 
nificantly different from the mean of the CRF 
high-amount group (32.83) at p < .01. Also, 
the mean of the VR 4 low condition (45.44) 
was significantly higher than the mean of the 
CRE low condition (26.33) at p < .01. These 
findings indicate that both VR 4 conditions 
contributed to the significant F value for the 
main effect of schedule of observed reinforce- 
ment. It can be concluded that observation 
of the VR 4 schedule subsequently produced 
significantly more persistence than observa- 
tion of the CRF schedule in terms of the total 
mean number of marbles dropped. 

2. Null Hypothesis 2, that there is no sig- 
nificant mean difference in total number of 
marbles dropped between groups observing 
the model receive the high amount of rein- 
forcement and those observing the model re- 
ceive the low amount of reinforcement, is 
rejected. 

Duncan’s test showed that the mean differ- 
ence between the VR 4 high-amount group 
and VR 4 low-amount group was significant 
at p<.05. Also, the CRF high-amount 
group’s mean was greater than that for the 
CRF low-amount, but statistical significance 
was not reached using Duncan’s comparison 
technique at p < .05. Results from these two 
mean comparisons indicate that the VR 4 
high-amount group was mainly responsible 
for the significant F value for the main effect 
of amount of observed reinforcement. 


TABLE 1 


ANALYSIS OF VARIANCE OF TOTAL RESPONSES 


Source | af MS F 
Schedule (A) 1 | 9800.00 | 24.59** 
Amount (B) 1 | 2069.39 | 5.19* 
AXB | 1| 320.89 “80 
Control X All Others | 1 | 5244.10 | 13.16** 

Error | 85 | 398.61 
a Additional contrast of control against treatments, 
*p <05. 


** p < 001. 


3. Null Hypothesis 3, that there is no sig- 
niñcant interaction between schedule and 
amount of observed reinforcement on the 
total mean number of responses, is accepted. 
The effects of schedule of reinforcement are 
constant across the low- and high-amount 
conditions. 

Duncan’s test was also used to compare the 
control group’s mean performance with that 
of each experimental group. The results 
showed that means for both VR 4 conditions 
were significantly different from the control 
group’s mean (22.17) on the total number of 
responses emitted during the performance pe- 
riod at p < .01. However, separate compar- 
isons of the CRF high-amount group and the 
CRF low-amount group with the control 
group both produced nonsignificant results at 
p < .05. This finding indicates that the CRF 
groups did not persist beyond chance levels. 

Another mean comparison of particular in- 
terest was between the VR 4 high-amount 
condition and the CRF low-amount condi- 
tion, since each group observed a total of 
28 reinforcement deliveries to the model. This 
comparison resulted in the VR 4 high-amount 
condition’s mean being significantly greater 
than that of the CRF low-amount condition 
at p<.01, This seems to show that the 
scheduling of observed reinforcement was 
more important on subsequent observers’ per- 
sistence than was the absolute magnitude of 
observed positive reinforcement, 

Computation of Hartley’s Fmax test for 
homogeneity of variance resulted in a value 
of 4,02. Since this value does not exceed the 
critical value of 5.16 at p < .01, the homoge- 
neity of variance assumption was accepted for 
this dependent measure. 
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- Total Mean Persistence Time 


The results of the analysis of variance on 
the mean persistence times in minutes using 
the method of orthogonal contrasts are sum- 
marized in Table 2. These data, along with 
appropriate mean contrasts using Duncan’s 
test, resulted in the following decisions con- 
cerning the three relevant null hypotheses. 


1. Null Hypothesis 1, that there is no sig- 
nificant mean difference in persistence time 
between the observed VR 4 schedule and the 
observed CRF schedule, is rejected. The crit- 
ical value for F to exceed at the .05 level of 
significance is 3.95. 

Duncan’s test showed the mean of the 
VR 4 high-amount group (5.30) to be signifi- 
cantly different from the mean of the CRF 
high-amount group (2.95) at p < .01. Also, 
the mean of the VR 4 low-amount group 
(4.34) was significantly higher than the mean 
of the CRF low-amount group (2.29) at 
p< .05. These findings indicate that both 
VR 4 conditions contributed to the significant 
F value for the main effect of schedule of 
observed reinforcement. It can be concluded 
that observation of the VR 4 schedule sub- 
sequently produced significantly more per- 
sistence than did observation of the CRF 
schedule in terms of time spent in the per- 
formance period. This same finding was ob- 
tained on the total mean number of marbles 
dropped. 

2. Null Hypothesis 2, that there is no sig- 
nificant mean difference in persistence time 
between groups observing the model receive 
a high amount of reinforcement and those 
observing the model receive a low amount of 
reinforcement, is accepted. The obtained F 
value was, however, close to reaching signifi- 


TABLE 2 
ANALYSIS OF VARIANCE OF PERSISTENCE TIMES 


Source | df MS F 
Schedule (A) 1 87.16 25.08** 
Amount (B) 1 11.92 3.43 
AXB 1 .39 ll 
Control X All Others* 1 29.56 8.51* 

Error 5 3.47 


8 
| 


* Additional contrast of control against treatments. 
*p < 005. 
KD < 001. 
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cance at p < .05, where the critical value for 
F to exceed is 3.95. The high-amount group’s 
effects were greater than the low-amount 
group’s for both schedule conditions. 

3. Null Hypothesis 3, that there is no sig- 
nificant interaction between schedule and 
amount of observed reinforcement on per- 
sistence time, is accepted. The effects of 
schedules are constant across the amount var- 
iable, with the VR 4 condition being signifi- 
cantly superior to the CRF condition. 

The results from mean comparisons of each 
experimental group with the control group for 
this dependent measure were identical to 
those found for mean total number of re- 
sponses. In addition, the mean persistence 
time between the VR 4 high-amount group 
and the CRF low-amount proved to be sig= 
nificant. The VR 4 high-amount group’s mean 
was higher, which seems to indicate that the 
scheduling of observed reinforcement is more 
critical in determining the observer persist- 
ence times than the absolute value of rein- 
forcement dispensed to the model. 

It was also observed that six subjects in the 
VR 4 high-amount group and four subjects in 
the VR 4 low-amount group persisted for the 
entire 8-minute performance period. 

Hartley’s Fmax test for homogeneity of var- 
iance resulted in a value of 4.53. This ob- 
tained value does not exceed the critical value 
of 5.16 at p < .01. The homogeneity of var- 
iance assumption was accepted, 


Supplementary Results 


In addition to the analyses of the two de- 
pendent measures, the mean number of re- 
sponses within each minute was calculated for 
each group. Figures 1 and 2 graphically com- 
pare these means per minute. 

In these figures, the means per minute of 
the four treatment conditions are shown to be 
greater than or equal to those of the control 
condition. In Figure 1, after the first minute, 
the VR 4 high-amount condition is shown to 
be superior to the CRF high-amount group in 
mean number of responses for each minute. 
In Figure 2, a comparison of the VR 4 low- 
amount group and the CRF low-amount 
group exhibits the same pattern except for 
the first minute. 
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When comparing both Figures 1 and 2, the 
mean responses of the VR 4 high-amount 
roup are greater than those of the VR 4 low- 
mount group for every minute. Likewise, 
ean responses of the CRF high-amount 
oup are higher than the CRF low-amount 
roup in every minute, With the exception of 
(wo intervals (minute 1: VR 4 high-amount 
group = CRF high-amount group; minute 1: 
CRF low-amount group > VR 4 low-amount 
oup), the order of greatest mean number of 
esponses per minute for all eight 1-minute 
‘tervals was the following: VR 4 high- 
mount, VR’ 4 low-amount, CRF high- 
mount, and CRF low-amount. This same 
yrdering of groups was found for the mean 
total number of responses in the first analysis. 
Also, the mean number of marbles dropped 
per minute for each group was analyzed by 
use of orthogonal contrasts. This analysis 
showed no significant main effect at the .05 
level of significance for either of the main or 
interaction effects. Results on the rate mea- 
sure were inconclusive and are discussed later. 
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Ficure 1. Comparison of VR 4 and CRF high-amount 
groups with the control condition. 
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Ficure 2, Comparison of VR 4 and CRF low-amount 
groups with the control condition, 


Discussion 


This study showed a VR 4 schedule to sig- 
nificantly increase unreinforced responding, 
while the CRF schedule did not facilitate per- 
sistence beyond chance levels. The present 
data are supportive of other recent studies in 
this area which have found that partial ob- 
served reinforcement produces more persist- 
ence than CRF (Hamilton, 1970). Most of 
the previous research studies have not, how- 
ever, used control groups to compare effects 
of observed reinforcement conditions, 

As mentioned earlier, the use of control 
groups in this area of research seems val- 
uable. Aside from measuring subjects’ tend- 
encies to imitate without viewing the ap- 
plication of external consequences, control 
conditions could indicate the reinforcing 
value of the specific response measure used. 
In the present study, control subjects showed 
considerable persistence during the earlier 
part of the performance period. In fact, sub- 
jects in the CRF conditions did not show 
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significantly more persistence than the control 
subjects. Had a control condition not been 
employed, false conclusions about the effec- 
tiveness of observed reinforcement may have 
been drawn. For example, one may have 
falsely concluded that a CRF schedule is 
better than no reinforcement in sustaining 
observer persistence. 

A limitation in the present study due to the 
lack of additional subjects was the exclusion 
of a no-model-exposure control condition. 
Subjects in this control condition would have 
only performed the experimental task. Data 
concerning the reinforcing value of the mar- 
ble-dropping game would have been provided. 
Instead, the lack of this control condition led, 
in the control group, to a confounding of the 
reinforcing value of the response measure and 
the control subjects’ initial tendencies to 
imitate. 

The present results concerning the effects 
of schedule of observed reinforcement on per- 
sistence replicate the usual findings in the 
area of resistance to extinction following di- 
rectly programmed behavioral consequences 
(Lewis, 1960; Koteskey, 1972). Lewis’ ob- 
servation that a 100% and 0% reinforcement 
schedule produce the least resistance to ex- 
tinction seems to apply to the present find- 
ings concerning schedule of observed rein- 
forcement. The VR 4 groups produced more 
persistence in terms of time and total re- 
sponses, while the CRF conditions (100% 
observed reinforcement) and control (0% ob- 
served reinforcement) did not persist beyond 
chance levels. 

It seems plausible that the above findings 
concerning persistence subsequent to observed 
reinforcement can be explained by processes 
which also clarify the PRE following suspen- 
sion of directly programmed behavioral con- 
sequences. Specifically, an extension of the 
Mowrer and Jones (1945) discrimination 
theory seems appropriate in explaining the 
present data, 

During observation of the VR 4 reinforced 
models, subjects witnessed several unrein- 
forced responses and were subsequently not 
reinforced for emitting the same response. 
The observed nonreinforced responses may 
have increased generalization from the ob- 
servation situation to the performance condi- 
tion and consequently reduced discrimination. 
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This may have produced greater subsequent 
persistence in observers, In contrast, subjects 
observing a CRF reinforced model did not 
witness nonreinforced responses, and discrim- 
ination of the stimulus conditions between the 
observation and performance periods were 
therefore easier to make. 

Also, the independent variable of amount 
of observed reinforcement was found to be 
significant for the mean total number of re- 
sponses and near significance at the .05 level 
for the mean persistence time measure., This 
increase in persistence due to larger amounts 
of reinforcement is also typically found to in- 
crease resistance to extinction following direct 
reinforcement (Pumroy & Pumroy, 1961; 
Reynolds, 1968). 

The high-amount variab!e had its greatest 
effect when used in combination with the 
VR 4 schedule. It appears that the VR 4 
schedules reduced discrimination during the 
performance period, while the high amount of 
reinforcement increased subjects’ motivation. 
However, different magnitudes of observed 
reinforcement may differentially affect the 
schedule to produce varying degrees of per- 
sistence. Further research is needed to exam- 
ine these effects using more variations in 
amounts of observed reinforcement than were 
used in the present study. 

While the discrimination hypothesis seems 
appropriate in interpreting the present find- 
ings, Berger’s (1971) analysis should be con- 
sidered. Berger found that when the condi- 
tions of reinforcement were the same for both 
relevant and nonrelevant models, PRE oc- 
curred only in observers who viewed the rel- 
evant model. In the present study, the model 
could be considered relevant for social com- 
parison by the observers. In fact, a relevant 
peer model was intentionally used in the pres- 
ent design. Subjects were told at the begin- 
ning of the performance phase, “You have 
just seen a school boy like you play the 
marble game.” Model relevance may there- 
fore have been an influential variable in pro- 
ducing PRE. 

It is interesting to note that the present 
findings generally support the notion that be- 
havioral patterns of persistence following ob- 
served positive reinforcement are similar to 
response patterns following the withdrawal of 
direct consequences. A pure discriminant op- 
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erant explanation appears plausible. However, 
when viewed in conjunction with Berger's 
findings, a straightforward discriminant oper- 
ant analysis of the present results may need 
to be qualified by including the relevance of 
the model for social comparison. Social com- 
parison variables may complicate a simple 
discriminant operant analysis of behavioral 
persistence following observation of the model. 
Future research is needed to test the main 
and interaction effects of model relevance and 
other variables associated with observed re- 
inforcement such as amount, schedule, and 
kind of reinforcement. 

The results concerning the mean rate of 
responding were inconclusive. One possible 
explanation is that observers imitated the rate 
of the model. This seems plausible, since the 
models response rates were not deliberately 
manipulated but instead were practically 
identical in all videotape conditions. Further 
research is needed to offer a more definitive 
analysis. 

On the basis of the present findings, it 
would seem important for persons working in 
classrooms, clinical groups, and other social 
settings to be aware that directly dispensed 
reinforcement to a specific subject may have 
similar and possibly unintended affects on un- 
reinforced members. Bolstad and Johnson 
(1972) found that children observing others 
receive reinforcement persisted in emitting 
low rates of disruptive behavior even though 
these observing children never received re- 
inforcement. 

The present research also suggests that not 
only is observed reinforcement important, but 
its scheduling, as well as magnitude, influence 
observer behavior. Most of the previous re- 
search has not systematically manipulated 
these variables. More attention should, how- 
ever, be given to these variables when they 
are applied in order to facilitate desired be- 
havioral outcomes. 


REFERENCE NOTE 


1, Gewirtz, J. L. Personal communication, Spring 
1972. 
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Conceptions of Harmful Groups: Some Correlates 
of Group Descriptions in Three Cultures 


William A. Scott 
James Cook University, 
Townsville, Australia 


John Rohrbaugh 
University of Colorado 


Adult residents in three communities—Boulder, Colorado, United States: 
Wellington, New Zealand; and Kyoto, Japan—were interviewed to ascertain 
their ideas about groups harmful to the respective societies. The three samples 
of respondents tended to identify similar kinds of groups—(a) establish- 
ment, (b) ideological extremists, (c) legal offenders, and (d) antiestablishment 
groups—and to attribute similar characteristics to them—(a) rigid, (b) sub- 
versive, (c) destructive, and (d) slovenly and immature, respectively; religious 
and ethnic minorities were rarely mentioned. Respondents who considered a 
particular type of group harmful tended to exaggerate the same distinctive 
negative characteristics assigned by respondents who did not regard it as 
harmful. To some extent, a respondent's designation of a group as harmful 
depended on its apparent possession of characteristics antithetical to his own 


liberal or conservative values. 


The purpose of this research was to study 
conceptions of harmful groups held by citizens 
in three societies. The three communities 
chosen—Boulder, Colorado, United States; 
Wellington, New Zealand; and Kyoto, Japan 
—differed substantially in specific groups 
available as targets of prejudice; nevertheless, 
it was expected that comparable types of 
groups would be identified as harmful in the 
three communities, The focus of inquiry was 
not limited to ethnic stereotypes, for it 
seemed quite conceivable that people with no 
pronounced ethnic prejudices might neverthe- 
less maintain rather definite notions about 
other outgroups—hippies, communists, or 
businessmen, for example—and that these be- 

These investigations were supported by the Uni- 
versity of Colorado Council on Research and Cre- 
ative Work. We are especially indebted to Shigeo 
Imamura for translating the interview schedule and 
facilitating the study in Japan. The following per- 
sons provided helpful consultation and assistance on 
various aspects of the project: Ruth Scott, Kay 
Blight, Laurence Brown, Satoru Inomata, Okichi 
Endo, J. R. Cole, Harry Orsman, Alan Robinson, 
G. A. Vignaux, Bruce Payne, Valerie Smeaton, and 
Hazel Smith. In addition, dozens of students con- 
ducted excellent interviews of an unusual nature, 
often under trying circumstances. Harry Triandis 
provided a helpful critique of an earlier draft. 

Requests for reprints should be sent to William A. 
Scott, Department of Behavioral Sciences, James 
Cook University, Townsville, 4811, Australia. 


liefs about groups named as harmful by the 
respondent might be of greater personal and 
social significance than beliefs about particu- 
lar groups identified a priori by the investi- 
gators. 


THEORETICAL BACKGROUND 


Previous theory and empirical findings en- 
courage two predictions concerning people’s 
conceptions of groups they consider harmful. 
First, the considerable body of empirical 
work on national and ethnic stereotypes (see, 
eg., Buchanan & Cantril, 1953; Harding, 
Proshansky, Kutner, & Chein, 1969; Kelman, 
1965) rather consistently demonstrates that 
negative characteristics tend to be attributed 
to dislike groups. Second, the particular con- 
tent of a negative conception is predicted, to 
some extent, from the theoretical formulation 
of Campbell and LeVine (1972) that stereo- 
types of outgroups include exaggerations of 
distinctive group characteristics that are nega- 
tively valued by the ingroup. These exaggera- 
tions are believed to follow a general percep- 
tual principle, enhancement of contrast 
(Campbell, 1956, 1967). It is therefore ex- 
pected that conceptions of outgroups will 
include, in part, plausible negative character- 
istics, that is, characteristics that are also 
attributed to the same group by unprejudiced 
external observers. 
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The designation of a characteristic as “neg- 
ative” depends on the values of the respon- 
dent. When ingroup members share common 
values, they are likely to agree on the choice 
of a scapegoat and to hold similar concep- 
tions of it. When ingroup members value dif- 
ferent traits, differing conceptions may arise 
from two sources: (a) Each person tends to 
identify as harmful the groups which appear 
to threaten his own values, and (b) having 
identified a group as harmful, the person is 
inclined to attribute to it all characteristics 
he dislikes. 

The foregoing reasoning stresses the joint 
dependence of outgroup conceptions on char- 
acteristics of the target groups and on values 
of the host. The formulation would apply 
equally well to any conception, favorable or 
unfavorable, individual or shared. In order to 
account for the development of shared preju- 
dices, focused on particular target groups, one 
may postulate the existence of widely shared 
values that appear to be conspicuously vio- 
lated by certain groups. This need not imply 
universal values or veridical judgment about 
the target group but only sufficient consensus 
on both value and judgment to yield some 
common target of hostility. 

Within modern, complex societies, the tar- 
gets are likely to be various because citizens’ 
values are diverse. Yet the bases for intra- 
societal cleavage are sufficiently fundamental 
that certain types of scapegoat are likely to 
be found in all communities, even though they 
provide targets of prejudice for quite different 
persons. For instance, differences in status 
might tend to generate differences in the 
degree to which manners, deference, and 
property are valued; differences in age may 
underlie differences in the value placed on 
wisdom and physical prowess. By reason of 
their status and conventional upbringing, some 
members of society will have a vested interest 
in stability; other members, socialized to 
alternative norms, will be committed to 
change in the status quo. Each interest group 
will tend to identify as dangerous those out- 
siders who appear to threaten cherished val- 
ues, To the extent that parallel conflicts of 
interest appear in different societies, compa- 
rable targets of hostility and prejudice will be 
found. To the extent that corresponding in- 


terest groups in different societies hold simi- 
lar values, the groups they identify as harm- 
ful will be attributed the same sorts of value- 
alien characteristics. 

Four main hypotheses emerge from these 
considerations: (a) Hypothesis 1: Similar 
kinds of groups will be identified as harmful 
in different societies. (b) Hypothesis 2: 
Similar characteristics will be attributed in 
different societies to a particular kind of 
group identified as harmful. (c) Hypothesis 
3: The characteristics attributed to a particu- 
lar group identified as harmful will be, in 
part, an exaggeration of undesirable char- 
acteristics consensually attributed to the 
group by people who do not consider it harm- 
ful. (d) Hypothesis 4: The characteristics 
attributed to any group identified as harmful 
will be, in part, those which contrast with 
the respondent’s values. 

The first two hypotheses could be tested 
only in a limited way, since they imply the 
culture as the unit of analysis, and only 
three cultures were available for systematic 
study. The third and fourth hypotheses were 
tested independently within each of the three 
samples, the individual respondent being the 
unit of analysis. From all of the values that 
might be considered under Hypothesis 4, only 
the bipolar value of liberalism versus con- 
servatism was assessed in this study, « 


METHOD 
Samples 


Respondents were adult residents of their respec- 
tive communities, Those in Boulder, Colorado, were 
obtained by a two-stage probability sample of 
dwelling units (see Kish, 1949, 1965), Those in 
Wellington, New Zealand, were selected systemati- 
cally, after a random start, from recent electoral 
rolls from two of the city’s four districts. Respond- 
ents in Japan were selected systematically after a 
random start from files listing all residences and 
inhabitants kept in the city office of Otsu and in 
two ward offices of Kyoto. 

Interviews were conducted during 1970 and 1971 
by local residents who were university students or 
staff trained by the first author.’ Attrition within 
the Boulder and Wellington samples fell within 
acceptable limits (22% and 24%, respectively), 
yielding samples of 205 in Boulder and 200 in Wel- 

1We are grateful to Okichi Endo of Doshisha 
University and to Satoru Inomata of Shiga Univer- 
sity for supervising the sample selection and inter- 
viewing in Japan. 
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lington, These were augmented by 39 training inter- 
Views from Boulder and 72 from Wellington, after 
preliminary analyses had indicated that the proba- 
bility samples and the training interviews produced 
essentially the same results on the variables mea- 
sured here. The attrition rate in Kyoto/Otsu was 
much higher (approximately 50%) mainly due to 
refusals from women and older respondents and 
erroneous listings in the ward offices, Though the 
resulting sample of 137 can hardly be considered an 
accurate representation of the entire community, it 
was sufficiently heterogeneous to permit an ade- 
quate test of the present hypotheses. 


Interview Schedule 


Assessment of the major variables, namely, char- 
acteristics attributed to harmful groups, depended 
mainly on descriptive checklists containing several 
adjectives representing each characteristic that might 
be attributed to a group. It was recognized, how- 
ever, that any standard checklist could influence the 
respondent's description, so that results from this 
procedure should be compared with those obtained 
from more natural, free descriptions of the same 
groups. Therefore, the interview schedule was con- 
structed to elicit both kinds of descriptions—free 
and checklist—of groups named as harmful by the 
respondent, 

In addition, Hypothesis 3 required that each of 
several specified groups be described both by re- 
spondents who judged it harmful and by respondents 
who did not. This implied a checklist of groups 
developed for each community, from which respon- 
dents could select those they regarded as harmful; it 
also implied the prior anticipation of groups likely 
to be controversial within the community so that 
sufficient descriptions of them could be obtained 
both from respondents who had, and from those who 
had not, checked each group as harmful. 

The construction of appropriate checklists in 
Boulder was facilitated by a pilot study, conducted 
in 1969, in which a random sample of adult residents 
(selected by multistage procedures as specified in 
Kish, 1965) nominated and described in their own 
words groups they considered harmful. The most 
frequently mentioned groups were included in the 
Boulder checklist for the present study, and the 
free descriptions of them were coded, yielding several 
major descriptive categories, which were subse- 
quently expressed as related words on a standard 
checklist containing 60 adjectives and descriptive 
phrases in alphabetical order. 

This descriptive checklist was used for all groups 
nominated as harmful in all three samples, Although 
a preferable procedure would have been to construct 
a common checklist from pilot studies in all three 
communities, the resources and time available for 
the study did not permit this. A standardized list 
was necessary to compare descriptions among the 
three cultures, so it was constructed on the basis of 
the best information available at the beginning of 
the study. 
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The checklists of groups for Wellington and Kyoto 
samples were constructed from two sources: (a) by 
showing social psychologist collaborators from those 
cultures the list developed for Boulder and asking 
whether groups of those types were to be found in 
New Zealand or Japan and (b) by asking addi- 
tional informants (students and university instruc- 
tors) in both communities what groups were likely 
to be regarded as harmful by some (but not all) 
citizens. The Boulder checklist contained 60 groups, 
of which 33 essentially identical groups appeared on 
the 50-group Wellington checklist and of which 30 
essentially identical groups appeared on the 52-group 
Kyoto checklist. The Wellington and Kyoto/Otsu 
checklists had 38 groups in common, (The Ameri- 
can Students for a Democratic Society, the New 
Zealand Progressive Youth Movement, and the 
Japanese Zengakuren were regarded as essentially 
identical, as were the Republican Party, the National 
Party, and the Democratic-Liberal Party of the 
three respective countries.) 

The interview schedule was constructed so as to 
elicit free responses first. Following two warm-up 
questions, respondents were asked, “Are there any 
groups in [country] that are harmful to the society? 
Any others?” Up to five groups were recorded, al- 
though fewer than one sixth of the respondents 
named that many. For each group mentioned, the 
respondent was asked, “What are these people like? 
Anything else?” A maximum of five characteristics 
were coded per group. s 

These free nominations and descriptions were 
followed by the standard checklist descriptions of 
the groups nominated as harmful and then by the 
checklist of groups from which respondents were 
invited to select those they considered harmful. Next 
appeared a set of questions designed to tap various 
aspects of conservative versus liberal ideology and 
representing one kind of (bipolar) value considered 
in Hypothesis 4, The items were based on Wilson 
and Patterson’s (1968) scale developed for use in 
Great Britain, the Netherlands, and New Zealand 
and asked the respondent, “Do you generally be- 
lieve in or not?” An effort was made to 
adapt specific items to each culture as appropriate 
(eg, “shrine visiting” in Japan instead of “sabbath 
observance”), 

All respondents were then shown the standard 
adjective checklists, this time for describing six or 
seven specific groups selected in advance, to test 
Hypothesis 3, under the expectation that the groups 
would offend a range of values likely to appear in 
each community (see Note to Table 6). The inter- 
view ended with several questions assessing age, edu- 
cation, religious affiliation, and political preference, 
variables intended for inclusion in the liberalism— 
conservatism scale, 


Analyses 


Classification of groups. Since Hypotheses 1 and 
2 refer not to specific groups but to “kinds of 
groups,” it was important to test them by pro- 
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TABLE 1 
VoLUNTEERED HARMFUL GROUPS AND CLUSTERS 


Boulder | Wellington | Kyoto/Otsu 
sample sample sample 
(n = 244) | (n = 272) | (m = 137) 
Group code ae Se >| PA Sees | Us. 
T = | 
Clus- | Clus- | Clus- 
Big business 20} 
Government officials 09 
Politicians |. 21 
Military | -07 
Police 05 
Merchants 00 
Rate paces 04 
ight-wing groups, =) 
miscellaneous 4 | EX 
Labor wi 04| EX 
Nazis, F 01 | s 
Left-wing groups, mise. 10) EX 
Militant minorities | 

{black militants, 

Maori agitators) 00 
Communists 06 fe 
Criminals Ja 132 | OFF 
Drug users and EA, PE 

peddlers | .09 | OFF 01 | OFF 
Militant agitators j 14 | .12 | OFF 
Deviant religions | 02 | OFF 07 
Juvenile delinquents | OL ‘ 
Protesters and 

demonstraters 403 | ANT 
People who don't | £ “7 

contribute 04 | ANT |06| ANT 
Hippies jaa ANT | 04 

j 


Note, p is the proportion of Keapandents who 
one group in the code category. “ategories for wl 
in all three itted from the table, Abbreviations 
are as follow: blishment, EX = extremists, OFF 
= legal offenders, and / = antiestablishment groups. 


cedures that allowed classification of similar groups 
into a common category, Such classification might 
be done immediately, on an a priori basis, at the 
most abstract level, but this procedure runs a 
serious risk of relying too heavily on the investi- 
gator’s preconceptions concerning what groups are 
functionally equivalent within and between cultures. 
Accordingly, a procedure was chosen which com- 
bined a priori judgment at a low level of abstrac- 
tion with empirical clustering at a higher level. 

Groups nominated as either helpful (Q. 2) or 
harmful (Q. 3) were coded into 70 categories, de- 
fined at a low level of abstraction, such as police, 
mass communications media, welfare recipients, and 
criminals? Those most frequently used for coding 
harmful groups are reported in Table 1, Categories 
at this level of classification were used for tests of 
Hypothesis 1. However, they were too refined for 
testing Hypothesis 2, which required a comparison 
of descriptions between different types of groups. 
Most of the low-level categories contained no more 
than a dozen entries, which made comparisons 
between group descriptions quite unreliable. 

In order to produce larger categories at a higher 
level of abstraction, it was assumed that functionally 
equivalent groups could be identified as those called 
“harmful” by the same people, From this perspective, 
the functional significance of a group within a 
particular society is appraised from the kinds of 
other groups to which it is seen as similary and the 


basis of similarity judgments employed here is the 
nomination of the group as harmful, Two groups 
which are named harmful by the same set of re- 
spondents, and not named by another common 
group of respondents, are treated as functionally 
equivalent and therefore appropriately assignable to 
the same abstract category. 

Only 38 of the 70 low-level categories were re- 
quired for coding all of the harmful groups nomi- 
nated. These were converted into dichotomous 
variables to which each respondent could be assigned 
as having named or not named that group harmful. 
The dichotomous variables were intercorrelated over 
all respondents, and the intercorrelation matrix was 
subjected to a cluster analysis. (see Table 2)” The 
clusters formed in this way served as higher level 
categories for classifying the groups nominated as 
harmful and thus yielded increased frequencies for 
comparing descriptions of group types in a test of 
Hypothesis 2. The same correlation matrices were 
used to provide one test of Hypothesis 1 because the 
degree to which a particular group is functionally 
similar in two different societies may be inferred 
from the similarity in pattern of correlations with 
other code categories. By extension, the degree to 
which two intercorrelation matrices are similar be- 
tween cultures may be interpreted as reflecting the 
degree to which the groups are functionally equiva- 
lent in the present sense. 

Scoring oj group descriptions. The checklist de- 
scriptions were intended to yield more reliable 
measures than the free descriptions, being less sub- 
ject to response errors (i.e„ omission by respondent 
or interviewer of critical characteristics) and be- 
cause a number of related adjectives could be com- 


2 Coding was performed by the authors with the 
assistance of Stephen M. Sasaki and Takuji Tsuka- 
tani. The same set of categories was used for coding 
both helpful and harmful groups in order to permit 
direct comparisons between the frequencies with 
which any particular type of group was named in 
either context. The categories were developed ini- 
tially from responses to a pilot study in Boulder 
and then generalized to apply to the New Zealand 
and Japanese data, Percentages for intercoder agree~ 
ment on independently coded interviews in Boulder 
and Kyoto/Otsu were .77 and .58, respectively, 

“The criteria for cluster formation were precisely 
specified, but they were also set very low, because 
of exceedingly low communalities due to response 
errors inherent in free-response data and because 
the variables were negatively intercorrelated, on the 
average, due to the restriction of respondents’ nomi- 
nations to a maximum of five groups. A particular 
category was included in a cluster if it had a mean 
correlation of at least .0S with all other categories 
in the cluster and if this mean correlation was at 
least 05 higher than its mean correlation with the 
categories of any other cluster. Thus, mutually ex- 
clusive clusters were formed, with small-to-moderate 
reliabilities and generally negative intercorrelations 
between clusters. 
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TABLE 2 
HARMFUL Group CLUSTER FREQUENCIES 
AND RELIABILITIES 
Kyoto/ 
Boulder Wellington Otsu 
sample sample sample 
Cina (n =244) (n = 272) (n = 137) 
pa rb pe re pe re 
Establishment 360 «SI AZ, aes 3462 
Extremists 34 38 46 132 AST RIS 
Legal offenders 27 40 ADAR ALNAS 
Antiestablish- 
ment groups Capea? 30.24 


Note, Clusters for which pe <.10 in all three samples are 
omitted. 

a pe is the proportion of respondents who named at least one 
group in the cluster. 

ra is Cronbach's (1951) alpha, an estimate of cluster 
reliability. 


bined into each of several composite scales. Ac- 
cordingly, checklist descriptions were used as the 
main variables for testing the last three hypotheses. 
However, it was possible to include in these scales 
only characteristics explicitly anticipated in con- 
structing the checklist; hence there might be a sub- 
stantial limitation in group descriptions, especially 
in Wellington and Kyoto/Otsu. Accordingly, paral- 
lel analyses were performed on the free descrip- 
tions. These generally yielded results corresponding 
to the checklist analyses, although at substantially 
reduced magnitudes. That is, similar clusters of 
characteristics were found, for the most part, and 
they were generally applied to the same group 
types, as was true for the checklist descriptions. 
Accordingly, only results from the more reliable 
checklist descriptions will be treated here. 

The use or nonuse of each adjective was treated 
as a dichotomous variable, and the 60 variables were 
intercorrelated over all groups nominated as harm- 
ful for each sample separately (678 groups in 
Boulder, 552 in Wellington, and 301 in Kyoto/Otsu). 
Clusters of adjectives were formed according to 
the following criteria: (a) the mean correlation of 
an item with all other items in the cluster was at 
least .20, and (b) the mean intracluster correlation 
of every item was at least 05 higher than its mean 
correlation with items of any other cluster. Items 
that correlated equally well with two or more clus- 
ters were removed in order to reduce intercluster 
overlap. A score for each descriptive cluster applied 
to each group was constructed for every respondent 
by counting the number of adjectives from the 
cluster which he had selected to describe the par- 
ticular group. Reliabilities of the resulting scores 
ranged from .61 to .80 (see Table 3). 

It is desirable to use scores that can be compared 
from ‘one descriptive cluster to another in order to 
ascertain the most distinctive group characteristics. 
Accordingly, each raw cluster score was converted to 
a standard (z) score, based on the mean and stan- 
dard deviation of raw scores for that descriptive 
cluster, computed within each sample separately 
over all harmful groups nominated. 
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Measure of Conservatism-Liberalism. A cross- 
culturally comparable index of conservatism—liberal- 
ism was developed by item analysis within each 
sample separately. Items were retained if they had 
positive correlations with the total scale in all three 
samples and a significant item-total correlation in at 
least one sample. The six demographic indicators of 
conservatism retained by these criteria were age, sex 
(female), traditional religious affiliation, frequent 
religious attendance, education level (low), and 
political party (conservative). The nine attitudinal 
indicators were belief in church authority, sabbath 
observance, censorship, and the death penalty; 
rejection of premarital sex, abortion, mixed mar- 
riage, and women judges; and profession of con- 
servative political views. The reliabilities of the 
total scales (estimated from Cronbach’s coefficient 
alpha) were .81, .67, and .75 in Boulder, Wellington, 
and Kyoto/Otsu, respectively. 


RESULTS 


Intercultural Similarity in Groups 
Seen as Harmful 


Similar rates of nomination. Hypothesis 1 
states that similar kinds of groups will be 
identified as harmful in the three communi- 
ties. Two methods of operationalizing the 
term similar can be employed to test the hy- 
pothesis. The first is to compare the relative 
frequencies with which the 70 coding cate- 
gories for groups were used by correlating 
the p values of Table 1 over all 70 codes (in- 
cluding many not reported in Table 1 due to 
small frequencies) between each pair of sam- 
ples. The resulting correlations are .30 be- 
tween Boulder and Wellington percentages, 
.65 between Boulder and Kyoto/Otsu, and 
.22 between Wellington and Kyoto/Otsu. 
(Assuming independence of the 70 categories, 
a correlation of .20 is significant at p < .05, 
one-tailed test.) A comparable analysis was 


TABLE 3 


RELIABILITIES OF CHECKLIST SCALES FOR 
DESCRIBING HARMFUL GROUPS 


Sample 
Scale Boulder Wellington Kyoto 
Selfish -76 .66 17 
Rigid -80 -62 79, 
Slovenly 78 70 61 
Immature 79 -12 -18 
Destructive -10 61 -19 
Subversive -70 -68 -68 
Favorable -74 -10 -62 


Note. Table entires are Cronbach's (1951) coefficient alpha. 
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made for corresponding groups on the check- 
lists from which respondents selected the 
groups they judged to be harmful. The inter- 
sample correlations between percentages are 
substantially higher for the checklist groups 
than for the volunteered groups: .74 between 
Boulder and Wellington percentages, .75 be- 
tween Boulder and Kyoto/Otsu, and .55 be- 
tween Wellington and Kyoto/Otsu (all ps < 
05, assuming independence of the listed 
groups). These results indicate that the most 
frequently named harmful groups in one com- 
munity tend to be the most frequently named 
in the other two communities as well. 

Similar intercorrelation matrices. A second 
way of operationalizing “similarity” between 
nominated groups in two communities is by 
means of the correlation matrices constructed 
when the mention of each group is correlated 
(by phi coefficient) with the mention of every 
other group. To the extent that the two cor- 
relation matrices are similar, the groups on 
which they are based are deemed to have simi- 
lar psychological significance for the two sam- 
ples of respondents and therefore common 
meanings within the two cultures. This is the 
condition required by Hypothesis 1, and com- 
parison of correlation matrices provides an 
essential additional test in order to ascertain 
that group types are functionally equivalent 
acrosscultures. Indices of intermatrix simi- 
larity weré provided by the correlations co- 
efficient computed over corresponding ele- 
ments. In order for these elements (correla- 
tion coefficients) to be stable, only groups 
nominated by at least 10 respondents in both 
samples were considered, which thereby 
yielded 91 pairs of coefficients for the Boul- 
der—Wellington comparison and 66 pairs for 
comparisons involving Kyoto/Otsu. The re- 
sulting similarity indices are .31, .66, and .55 
for the three pairs of communities, Boulder- 
Wellington, Boulder—Kyoto/Otsu, and Wel- 
lington—Kyoto/Otsu, respectively (all ps < 
.05, assuming independence of the correlations 
within each matrix). 

Hypothesis 1 was therefore supported. Simi- 
lar kinds of groups tend to be regarded as 
harmful in the three communities. The four 
common clusters identified in Table 2 may be 
regarded as defining the core of similarity. 
Together the clustered groups (including 
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numerous low-frequency categories not re- 
ported in Table 1) account for 70% of all 
harmful groups nominated in Boulder and 
68% of those nominated both in Wellington 
and in Kyoto/Otsu. 

The establishment cluster, consisting mainly 
of big business, politicians, and bureaucrats, 
is rather prominent in Boulder and Kyoto/ 
Otsu, with about one third of the respondents 
nominating one or more groups within it; it 
is considerably less prominent in Wellington. 
Ideological extremists are the most commonly 
mentioned harmful groups in Wellington; 
they were also nominated by at least one 
quarter of the respondents in the other two 
samples. It is noteworthy that, in all three 
communities, both left- and right-wing groups 
appear in this single cluster, which indicates 
that extreme views, regardless of direction, 
are seen as generally threatening by many 
respondents. In Boulder and Wellington, anti- 
establishment groups fall into two clusters, 
legal offenders and the less serious disruptors. 
The Japanese respondents did not distinguish 
between these two levels of severity, instead 
tending to mention a variety of antiestablish- 
ment groups ranging from criminals to mili- 
tant agitators. 


Intercultural Similarity in Descriptions of 
Harmful Groups 


Hypothesis 2 was tested with multiple- 
item scales constructed from the descriptive 
checklists. The equivalence of these check- 
lists in the three communities may be ascer- 
tained from the similarity of clusters formed 
independently within each sample.* Table 4 
identifies each cluster by the items contained 
in it. 

Given that the variables have similar 
meanings in the three samples and that group 


4Two clusters may be regarded as similar be- 
tween samples, if the adjectives contained in them 
account for at least 50% of their total frequencies; 
by this criterion, all identically named clusters are 
similar, except for “rigid” in Wellington and “sub- 
versive” in Kyoto/Otsu, The two exceptions are 
attributable to differing cluster sizes, resulting mainly 
from elimination of items that correlated equally well 
in two different clusters. In Table 4, items are ar- 
ranged to clarify the clusters; their actual order on 
the checklist was (English) alphabetical, except for 
“all alike” and “all different,” which appeared first. 
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TABLE 4 
ITEMS IN CHECKLIST SCALES FOR DESCRIBING HARMFUL Groups 
Sample | Sample 
Welling- Kyoto/ || Welling- Kyoto, 
Item Boulder ton Otsu | Item Boulder ton Otsu 
Dishonest SEL SEL SEL Maladjusted IMA IMA IMA 
Selfish SEL SEL SEL Unhappy IMA IMA IMA 
Out for themselves SEL SEL Easily led IMA IMA 
Inconsiderate SEL SEL Afraid IMA IMA 
Deceitful SEL SEL Uninformed IMA 
Insincere SEL SEL Neurotic IMA IMA 
Lying SEL SEL || Cruel DES DES 
They don’t give a damn SLO SEL Destructive DES DES 
Authoritarian RIG RIG RIG || Violent DES DES 
Restrictive RIG RIG RIG Antisocial DES 
Rigid RIG RIG RIG || Mean DES 
Power seeking RIG RIG || Trouble-makers DES DES 
Conservative RIG RIG || Disloyal SUB DES 
Oppressive RIG RIG Unpatriotic SUB 
Oppose progress RIG RIG || Radical SUB 
Prejudiced RIG SUB RIG || Subversive SUB 
Closed minded RIG DES || Leftist SUB 
Ambitious FAV RIG || Undemocratic SUB 
Dirty SLO SLO SLO || Idealistic 
Impolite SLO SLO SLO Intelligent FAV 
Sloppy SLO SLO SLO || Well dressed z, 
Parasitic SLO SLO | Hardworking a 
Poor appearance SLO SLO || Well mannered t 
Immoral SLO Informed 
Lazy SLO SLO IMA || Helpful 
Drop-outs SLO SLO IMA || Good intentions 
Ignorant IMA IMA IMA | Hypocritical 
Childish IMA IMA IMA || Liberal 
Immature IMA IMA IMA || All alike 
Insecure IMA IMA IMA | All different 
| 


Note. Abbreviations are as 
= subversive, and FAV = favorable. 


descriptions are measured with reasonably 
high reliability, one may test the second hy- 
pothesis, that similar types of groups are simi- 
larly described in the three communities by 
people who regard them as harmful. Mean 


selfish, RIG = rigid, 


SLO = slovenly, IMA = immature, DES = destructive, SUB 


scores for the common group types on each 
descriptive scale are reported in Table 5. The 
overall similarity between typical descriptions 
from two communities may be represented by 
a product-moment correlation coefficient com- 


TABLE 5 
MEAN CHECKLIST SCALE Scores FOR DESCRIBING Four COMMON TYPES or HARMFUL Groups 


Boulder sample 


Kyoto/Otsu sample 


Wellington sample = 
Descriptive pi 
scale pais 
OFF ANT | EST EX OFF ANT 2 ANT 
=a 
Selfish 63, —.41y 1245 24 —.09 34 | 
Rigid 2la Aly 455 84, 15p = .23h0 —.51e 
Slovenly Kii i 380 -105 —.5le —.14b. 04a» 45a 
Immature —.4h alda  —.10ab —.3ó —.21n Sta 38a 
Destructive | —.47» 42a 39a —.64, 15b 45a 09a, 
Subversive | —.41p 40a 30, —.1l, | —.27 46. —40, —.34, 
Favorable Sls 14 —1085 m5 195 00; —.29, —.23, 


Note. Scores are standard (2) scores based on the mean and standard dev: 
a given sample. Within any sample and descriptive cluster, a common lett 


that the group means are not sign ntly different 
group types are as follows: EST = establishment, E> 


t p <.05) 


= extreme, O 


ion of a part icular descriptive cluster computed from 
ripts) indicates 


b 


= legal oender and’ AN’ y atlestablishinent groups. 
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de 
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TABLE 6 


CORRELATIONS OVER SPECIFIED GROUPS BETWEEN MEAN DESCRIPTIVE SCALE SCORES FROM 
RESPONDENTS WHo Dip anp Dip Nor Cueck Eac Group HARMFUL 


Ta" 


Tay* 
Descriptive scale 
Boulder Wellington Kyoto/Otsu Boulder Wellington Kyoto/Otsu 
Selfish 90 99 97 -56 .96 .95 
Rigid .99 95 63 87 65 04 
Slovenly 98 96 95 94 80 04 
Immature 86 95 87 21 68 43 
Destructive OT 82 96 -68 38 66 
Subversive 98 89 99 .83 — 36 -10 
Favorable 75 Eyi 62 —.50 —.76 —.26 


Note. The specified groups in Boulder were: 


(b) hippies, (c) Zengakuren, (d) large indi 
groups regarded as equivalent between soci 
a y is the mean score assigned by “hostil 


ies.) 


a) John Birch Society, (b) hippi: 
communists, (f) black militants, and (g) criminals. Those in Wellington were (a) N 


sive Youth Movement, (d) large industry, (e) communists, and (h) trade unions. 
y, (i) radical democrats, and (j) collective action groups. (Identical letters signify 


(c) student protesters, (d) establishment (e) 
w Zealand Nazi Party, (b) hippies, (c) Progres- 
hose in Kyoto/Otsu were (a) ultranationalists, 


espondents (those who checked the group harmful), x is the mean score assigned by 


“unbiased” respondents (those who did not check the group), and z = y — x. (Hostile and unbiased respondents differ from one 


group to another.) 


puted over the corresponding mean scores. 
Between Boulder and Wellington (28 pairs of 
means) this index of similarity is .70, be- 
tween Boulder and Kyoto (21 pairs) it is .77, 
and between Wellington and Kyoto (21 
pairs) it is .65, (all ps < 05) Thus, Hy- 
pothesis 2 is supported at a general level. 

In order to ascertain more exactly which 
descriptions were distinctively applied to 
which types of groups, F ratios were com- 
puted across the three or four common group 
types in a given community for each descrip- 
tive scale.” The mean descriptions differed 
significantly in 19 tests out of 21, and Tuk- 
ey’s (a) test (Winer, 1971) was applied to 
identify the significantly different pairs of 
groups. The results, shown in Table 5, indi- 
cate that in all three communities establish- 
ment groups were likely to be seen as rigid, 
extremists as subversive, criminal offenders as 
destructive, and antiestablishment groups as 
slovenly and immature. 


51f two or more groups of a given type were 
described by the same respondent, his mean de- 
scriptive cluster scores were entered into the analy- 
ses of variance. The columns are treated as inde- 
pendent, thus ignoring any correlation introduced 
through some respondents’ contributing group de- 
scriptions to two or more clusters; this assumption 
produces a conservative bias in the F test and 
Tukey’s (a) tests, even though the alpha levels were 
formally set at .05. 


Similar Descriptions of Specified Groups 


Some of the groups designated in the inter- 
view schedule for checklist description by all 
respondents were intended to fill comparable 
functions in the three societies (see note to 
Table 6). Thus it was possible to test Hy- 
pothesis 2 again, this time using specific 
groups, and considering the descriptions pro- 
vided by all respondents, not just those who 
regarded the group as harmful. 

The mean descriptive scale scores (z scores) 
assigned to corresponding groups were corre- 
lated for each pair of samples. The indices of 
similarity are .69 between Boulder and Wel- 
lington, .62 between Boulder and Kyoto/Otsu, 
and .82 between Wellington and Kyoto/Otsu 
(all ps <.05). These results provide addi- 
tional support for and clarification of Hy- 
pothesis 2. Whether corresponding groups are 
identified a priori or on the basis of similar 
clusters constructed from the respondents’ 
nominations of harmful groups, whether de- 
scriptions are provided by all respondents or 
only by those who name the group as harm- 
ful, similar groups tend to be similarly de- 
scribed in the three cultures. 

Analyses of variance and Tukey’s (a) tests 
indicated that right-wing extremists (desig- 
nated a in the Note to Table 6) were gen- 
erally seen as rigid, hippies as slovenly and 
immature, the establishment (or large indus- 
try) as selfish and rigid, and communists as 
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subversive. Student protesters (designated c 
in the Note to Table 6) is the only common 
specified group that was not similarly char- 
acterized in all three samples.® 


Exaggeration of Salient Group Characteristics 
by Hostile Respondents 


Hypothesis 3 proposes that people who 
regard a group as harmful will tend to exag- 
gerate its distinctive liabilities. Although no 
method was available for assessing “true” 
characteristics of the groups nominated, a 
useful substitute was found in the descrip- 
tions offered by respondents who did not 
regard them as harmful. 

Table 6 is based on mean descriptive scale 
scores from hostile respondents (those who 
had checked a particular group harmful) and 
from relatively unbiased respondents (those 
who had not checked the group harmful)." 
“Unbiased” respondents, of course, differ 
from one group to another. Although of gen- 
erally lower elevation on all six unfavorable 
epithets, the pattern of means from the unbi- 
ased respondents closely followed the pattern 
of mean scores from respondents hostile 
toward the several groups. Correlations be- 
tween means over the 6 or 7 specified groups 
range from .62 to .99 with a mean of .88. 

The exaggeration of an unfavorable char- 
acteristic by hostile respondents may be rep- 
resented by the degree to which their mean 
descriptive scale score exceeds the mean from 
unbiased respondents. For every descriptive 
scale applied to every group, a discrepancy 
score was constructed as the mean scores from 
hostile respondents minus the mean score 
from unbiased respondents. This discrepancy 
score was correlated over the six or seven 
groups within each descriptive scale. The 
correlations (for unfavorable descriptions 
only) range from —.36 to .96; 17 of the 18 
correlations are positive, and their mean is 
59, 

These results confirm Hypothesis 3, indi- 
cating that the negative characteristics most 
likely to be exaggerated in descriptions by 
people hostile to a particular group are those 
most salient in the descriptions provided by 
unbiased respondents, For example, the most 
exaggerated characteristic of hippies, as seen 
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by people who regard them as harmful, is 
their slovenliness. This is precisely the trait 
that most clearly distinguishes hippies from 
other groups in the view of people who do not 
regard them as harmful.® 


Conservative-Liberal Ideology and Group 
Descriptions 


Hypothesis 4 proposes a general tendency 
for people to describe harmful groups in 
terms antagonistic to their own values. Only 
one bipolar value was systematically assessed 
in the present studies—the ideological dispo- 
sition toward conservatism versus liberalism 
—so that only limited testing of the hypothe- 
sis is possible. Conservatism is assumed to 
represent the value attached to social stabil- 
ity, while its opposite, liberalism, is assumed 
to reflect the value placed on social change. 
Some of the descriptive clusters identified in 


6In the F tests and Tukey’s (a) tests, all groups 
were treated as independent, even though the scores 
come from the same respondents. This produces a 
conservative bias in all tests, and the actual p levels 
may be considered lower than the .05 chosen for 
inferring a “significant” difference. 

7 These are standard (z) scores based on means 
and standard deviations computed over all specified 
groups within each sample. 

SIn Boulder, mean cluster scores provided by re- 
spondents who checked the group harmful were sig- 
nificantly lower (p<.05) by £ test (Winer, 1971, 
p. 42) than those from respondents who did not 
check the group harmful in 34 out of 42 compari- 
sons (excluding the “favorable” descriptive cluster). 
In Wellington, 24 out of 36 such ¢ tests were sig- 
nificant, and in Kyoto/Otsu, 19 out of 36 were sig- 
nificant. The exceptions, in nearly all cases, were 
nonsignificant differences in the expected direction for 
groups designated harmful by very few respondents 
(student protesters in Boulder, large industry in 
Wellington, and radical democrats and collective 
action groups in Kyoto/Otsu). Thus, the frequently 
reported finding that undesirable characteristics are 
more often attributed to disliked groups than to 
liked groups was replicated. 

"This result is not likely due to a tendency of 
“generally hostile” respondents to check more items 
on the descriptive checklists than “generally tolerant” 
respondents, since the same direction of correlation 
between unbiased descriptions and difference scores 
does not appear for favorable descriptions. Addi- 
tionally, the correlations with discrepancy scores for 
negative descriptions are generally in the opposite 
direction to that expected from regression effects, 
that is, y— x is positively rather than negatively 
correlated with x. 
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TABLE 7 
. RVATISM AND CHECKLIST DESCRIPTIONS OF GROUPS 
| Volunteered harmful groups | Specified groups checked “Harmful” 
Descriptive scale | ay N Pg F i Lt ORY 
7 | Boulder | | Kyoto | Boulder Wellington Kyoto 
A (n = 223) (n = 112) (n = 216) (n = 171) (n = 87) 
2 ye St = a a sees é $ ariy 
Subversive —.0%2 AS 06 14 
Stovenly .14* —.11 10 07 
Destructive .13* —.14 07 —.06 
Selfish -11 = 26" 00 —,28* 
Immature —.09 =15 04 —.05 
Rigid —.16" —.42* 03 
Favorable —.06 —.18* 01 


*p <05 (two-tailed test). 


the three communities could be roughly classi- 
fied according to the degree to which they 
appear, a priori, to threaten either of these 
polar values: (a) subversive and trouble mak- 
ing threaten conservatives most; (b) immoral, 
parasitic, slovenly, destructive, and mean 
threaten conservatives more than liberals; (c) 
selfish and incompetent show no difference 
between ideological groups; (d) ignorant and 
immature threaten liberals more than conser- 
vatives; (e) rigid threatens liberals most. 

The reason for considering subversion most 
antagonistic to conservative values is that the 
very concept implies inviolability of present 
social forms. By contrast, rigidity should be 
most offensive to those who want continuing 
change in the social system. Ignorance and 
immaturity may threaten liberals more than 
conservatives because they imply the absence 
of understanding that would facilitate the 
acceptance of change. Immorality, slovenli- 
ness, parasitism, destructiveness, and related 
traits should be more offensive to conserva- 
tives than to liberals in the three communi- 
ties studied because they violate traditional 
social norms in those societies. 

These particular instances of the general 
hypothesis received varying degrees of sup- 
port from the two sets of results presented in 
Table 7. Conservatives were more likely than 
liberals to describe harmful groups as sub- 
yersive (p < .05 in two out of six tests 
slovenly (p <..05 in three out of six tes 
and destructive (p< .05 in two out of six 
tests). Liberals were more likely than con- 
servatives to describe harmful groups as 
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Note. ns are less than total sample sizes because some respondents did not check or describe harmful groups, 


rigid (p < .05 in four out of six tests) and 
immature or ignorant (p < .05 in one out of 
six tests). These results are in general ac- 
cord with a priori expectations. In addition, 
liberals were more likely than conservatives 
to describe harmful groups as selfish (p < .05 
in two out of six tests) and more likely to 
mention something favorable about them (p 
< .05 in three out of six tests). 

The reasoning behind this hypothesis im- 
plies two bases for the relationship: (a) The 
difference in descriptions by liberals and con- 
servatives results from their application to 
rather different groups; characteristics of a 
particular group offend the values of liberals 
or conservatives more, and therefore the 
group is identified as harmful. (b) The de- 
scription is a product of the respondent’s 
ideology rather than an apt characterization 
of the group; any group identified as harm- 
ful is attributed characteristics antagonistic 
to the person’s values, regardless of their 
appropriateness to the group. 

The first interpretation found some sup- 
port in the data. Liberals are more likely to 
identify establishment groups as harmful (r 
—.32, —.13, and —.40 in Boulder, Wel- 
lington, and Kyoto/Otsu, respectively), while 
conservatives are more likely to name legal 
offenders (r = .26, .13, and .07, respectively ) 
and antiestablishment groups (r= .36 and 
.07 in Boulder and Wellington samples, re- 
spectively). Since the descriptive cluster 
“rigid” was commonly applied to the’ estab- 
lishment, “slovenly” to antiestablishment 
groups, and “destructive” to legal offenders 
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(see Table 5), it seems that in these cases 
identification of a group as harmful resulted 
from its apparent threat to the respondent’s 
values. 

The second possible reason for the use of 
different descriptions by conservatives and 
liberals—indiscriminate attribution of unde- 
sirable traits to any threatening group—was 
investigated by correlating conservatism with 
scores on the descriptive clusters applied to 
each group nominated or checked as harmful 
by at least 20 respondents. There are 18 
such groups in Boulder: 6 of them were 
called rigid significantly more often by lib- 
erals than by conservatives; 2 of them were 
called subversive significantly more often by 
conservatives than by liberals. Neither of 
these trends was replicated in Wellington or 
Kyoto/Otsu. Thus, the main thrust of the 
data supports the inference that different 
groups are judged harmful by liberals and 
conservatives largely because they appear to 
threaten different sets of values. Evidence for 
differing characterizations of the same group 
due to respondents’ ideological biases was 
found only in Boulder. 


Discussion 


The relatively low salience of ethnic and 
religious groups is noteworthy. In Boulder, 
black militants and Chicanos were named as 
harmful by 11% and 2% of the sample, re- 
spectively; in Wellington, Islanders were men- 
tioned by 5% of the respondents and Maoris 
were nominated by no one; Japanese re- 
spondents named no ethnic groups. Foreigners 
as a general category were mentioned by 1%, 
4%, and 1% of the respondents in the three 
samples, respectively; religious sects were 
named by 2%, 5%, and 7%. These propor- 
tions are quite small in comparison to the 
major group clusters identified in each sample. 
Perhaps the predominant focus of respondents 
on other targets is due to the limitation of 
samples to three complex industrial societies. 
Also, naming a group as harmful is only one 
of many ways in which prejudicial attitudes 
might be manifest. Nevertheless, it seems 
clear that to understand the bases of stereo- 
types and prejudicial conceptions, some ex- 
pansion is required beyond ethnic and re- 
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ligious targets considered in most investiga- 
tions. 

Two main bases for the contents of nega- 
tive group descriptions were sought in this 
investigation. The respondents ideology 
(conservative or liberal) was shown to be re- 
lated mainly to his selection of certain kinds 
of groups for which particular descriptions are 
distinctly appropriate. The fact that Boulder 
residents displayed this relationship more 
clearly than those in the other two communi- 
ties may be due to the unusual composition 
of the university town in which educational 
level and student status both constitute im- 
portant bases for intergroup segregation. The 
relative weakness of this ideological polari- 
zation in Wellington is evidenced by the low 
reliability of the conservatism scale there, as 
well as by Boshier’s (1972) demonstration of 
at least 10 factors in the Wilson and Patter- 
son (1968) scale when administered in New 
Zealand. Perhaps if more relevant values had 
been identified in Wellington and Kyoto/ 
Otsu, they might have played a greater role 
in image formation than was found in the 
present studies. 

The second psychological basis for concep- 
tions of harmful groups, exaggeration of plau- 
sible group characteristics, was found in all 
three communities. These results accord both 
with Vinacke’s (1949) finding that ethnic 
group stereotypes held by outsiders tend to 
correspond with group members’ own views 
of themselves (see Scott, 1965) and with 
Campbell’s (1967) proposition that outgroup 
stereotypes tend to exaggerate actual differ- 
ences of small magnitude. 

Substantial similarity was found among the 
three cultures in the kinds of groups identi- 
fied as harmful to the society and in the 
ways of describing such groups. Ideological 
extremists, members of the establishment, 
and legal offenders constitute coherent clus- 
ters of groups considered harmful in all three 
communities, compromising substantial pro- 
portions of the total numbers of groups nomi- 
nated. Moreover, each of these group types 
was likely te be described similarly in all 
three communities—ideological extremists as 
subversive, establishment*groups as selfish and 
rigid, and criminal offenders as desttuctive. 
The general proposition to which these find- 
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ings are relevant is that all societies contain 
certain kinds of groups regarded as harmful 
by substantial proportions of the citizenry. 
The identification of purportedly universal 
types has been approached only inductively 
in the present investigation; it remains to be 
seen whether other communities will yield 
similar clusters, Clearly, the Boulder and 
Wellington samples differ to some extent from 
the Kyoto/Otsu sample in discriminating an 
antiestablishment cluster distinct from legal 
offenders. The reason for this finding can 
only be surmised at present. Perhaps in the 
Segment of Japanese society supporting nor- 
mative behavior, deviance such as delinquency 
and protest is so repugnant that it tends to 
be assimilated into the “criminal” category, 
whereas in both the United States and New 
Zealand supporters of the normative order 
see these forms of deviance as age related 
and transient, more as an object of disdain 
than as a source of physical threat. 
Differences among the cultures could be 
explored endlessly in the present data, but 
there are several reasons for not doing so. The 
samples were selected, not from the entire 
societies, but from very restricted communi- 
ties that do not reflect national characteristics 
in comparable ways. More importantly, inter- 
cultural differences are significant primarily 
as illustrations of nomothetic principles which 
apply across all cultures, With a sample of 
just three communities, it would be impos- 
sible to establish such nomothetic bases, Any 
intersample differences would have meaning 
only to people interested in these particular 
Societies for their own sake, independent of 
whatever cross-cultural variables might be 
represented. The present analyses have there- 
fore focused on similarities rather than dif- 
ferences among the three communities. Such 
similarities appear to be rather substantial; 


whether they exist in an even wider range 
of cultures can only be ascertained by 
replicative studies elsewhere. x be) 
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Effects of Constituent Trust and Representative Bargaining 
Orientation on Intergroup Bargaining 


James A. Wall, Jr. 
Department of Personnel and Organizational Behavior, 
Indiana University 


This study investigated the effects of a constituent’s trust and his representa- 

tive’s bargaining orientation upon (a) the representative's bargaining behavior, X. 
(b) the opposing representative’s (outsider’s) bargaining behavior, and (c) the 

number of agreements reached by the representative and the outsider. The 

effect of the representative’s bargaining upon the constituent’s trust was also 

investigated. Subjects formed two-man labor and management teams and 

bargained over four separate wages. In the negotiations, two of the subjects 

serving as the representative and the outsider bargained face-to-face with each 

other. The other two, the constituents, directed and monitored their repre- 

sentatives’ bargaining. Initial constituent distrust (vs. trust) was found’ to 

foster more competitive representative and outsider bargaining and fewer agree~ 

ments, A competitive representative bargaining orientation (vs. a cooperative 

orientation) fosters more competitive representative bargaining but, ironically, 

more cooperative outsider bargaining and more agreements. A tentative ex- ` 
planation of this paradox is offered. Because it permitted unconstrained feed- 

back, the study is posited to be more relevant to intergroup bargaining than 

previous bargaining studies. It also provides support for and suggests revisions 

of previously developed models of intragroup-intergroup bargaining. 


This study investigates the effect of a whom he must bargain (e.g., Baranowski & 
group member’s (constituent’s) trust and his Sumers, 1972; Deutsch & Krauss, 1960; Har- 
representative’s bargaining orientation upon ford & Solomon, 1967; Horai & Tedeschi, 
the process and outcomes of intergroup bar- 1969; McNeel, McClintock, & Nuttin, 1972; 
gaining. A representative engaged in inter- Pilisuk & Skolnick, 1968; Scodel, 1962; ` 
group negotiation finds himself in a unique Siegel & Fouraker, 1960; Swingle, 1967, 
position. Not only is he subject to the expec- 1970). 
tations and demands of his constituents, but Several authors have taken cognizance of 
he also is subject to the influence of the the representative’s position (Iklé, 1964; 
opposing representative (outsider) with whom Kahn, Wolfe, Quinn, Snoek, & Rosenthal, 
he is negotiating. There is a large body of 1964; Stogdill, 1962; Vidmar & McGrath, 
literature indicating that representatives are 1970; Walton & McKersie, 1965) and have 
under the aforementioned dual set of forces. discussed how these forces mutually affect 
There is ample evidence that the representa- the representative’s behavior. For example, 
tive’s behavior is influenced by his constitu- Walton and McKersie (1965) have main- 
ents (e.g., Druckman, 1968; Klimoski, 1972; tained that representatives must bargain with 
Kogan, Lamm, & Trommsdorff, 1972; Lamm the members of their own group just as they 
& Kogan, 1970; Vidmar, 1971; Julian & Mc- must bargain with outsiders. In this dual 
Grath, Note 1) and by the outsiders with negotiation, the outcome of the negotiation 

with one group many times determines the 

The author wishes to thank J. Stacy Adams, J. representative’s options in dealing with the 
Ogden Hamilton, Dennis Organ, and Benson Rosen Other group. 
for their critical readings of an earlier draft of this Studies by Holmes (1971) and Frey and 
Lena Pee erie his eed Adams (1972) have investigated the effects 
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Wall, Jr, Department of Personnel and Organizi:  —tative’s behavior. Both EET daha 


tional Behavior, Graduate School of Business, Indi- oa £ 
ana University, Bloomington, Indiana 47401. oft-conflicting internal and external demands 
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“upon the representative significantly affected 
the representative’s behavior; however, the 
main effects of the internal (external) varia- 
bles were at times mitigated or superseded by 
interactions with external (internal) varia- 
bles. Specifically, Holmes (1971) found that 
representatives tended to follow their constit- 
uents’ dictates as to how to bargain—coop- 
eratively or competitively—whenever they 

d faced a cooperative outsider. However, they 
did not follow a cooperative dictate when the 
outsider was perceived as competitive. Frey 
and Adams (1972) found that constituent 
trust (distrust) in the representative had no 
effect on the representative’s competitiveness. 

* However, it interacted with outsider behavior: 
Distrusted representatives bargained more 

~ competitively with cooperative outsiders than 
did trusted representatives. There was no 
difference in their competitiveness when the 
outsider was competitive. These two studies 
demonstrated that the representative is not 
simply a man in the middle, influenced by 
both his constituency and his counterpart. 
That is, rather than having simple additive 
effects, the factors influencing the representa- 
tive’s bargaining behavior interact in a com- 
plex manner, 

These and other studies, however, do not 
—Mequately simulate the complexity of the ac- 
tual constituent-representative-outsider bar- 
gaining process. They are static and thereby 
do not permit the dependent variable under 
study to affect the manipulated independent 
variable. However, McGuire (1973) pointed 
out that as a rule, the effects of independent 
variables do influence the variables them- 
selves. Consequently, he suggested that re- 
searchers think more complexly, using models 
and studies which include realistic feedback 
loops and bidirectional causality. In so doing, 
he contended, they would be better able to 
build theories which are more applicable to 
individual and social reality. To facilitate the 
establishment of a more relevant intergroup 
bargaining theory, the present study examines 
intergroup bargaining as a process, allowing 
sequential interactions not present in previous 
studies: (a) The constituent-representative 
relationship is allowed to affect both the rep- 
resentative’s and the outsider’s bargaining, 
(b) the representative-outsider negotiation is 


permitted to influence the constituent—repre- 
sentative relationship, and (c) the resulting 
constituent-representative relationship is al- 
lowed to alter the future representative—out- 
sider bargaining. This study determines the 
effect that constituent trust and his repre- 
sentative’s bargaining orientation have upon 
this process, 


Dynamic Effects of Constituent Trust 


Representatives are subject to the dictates 
of outsiders as well as to those of their con- 
stituents. Therefore, to reach an agreement 
with the outsider, the representative may 
have to present the outsider’s case to the 
constituent, yield more than his constituent 
wishes, or violate some constituent norms. 
Consequently, the constituent often feels he 
has objective reasons for distrusting the rep- 
resentative and as a result either intentionally 
or inadvertently exhibits his distrust. 

The representative’s reaction to the distrust 
is complex. One might expect a distrusted 
representative as opposed to a trusted repre- 
sentative to bargain more toughly with any 
outsider—demanding larger concessions from 
him and using tougher language in the nego- 
tiation—in an attempt to prove his loyalty 
to the constituent. (However, Frey and Ad- 
ams, 1972, found that distrust produces such 
an effect only when the representative faces a 
cooperative outsider.) The distrusted repre- 
sentative’s competitiveness, while intended to 
impress the constituent, inadvertently por- 
trays him as a competitive opponent to the 
outsider. The initially cooperative outsider 
upon perceiving that he faces a competitive 
representative begins to bargain competitively 
(Holmes, 1971; Kelley & Stahelski, 1970). 
His retaliation reinforces the representative’s 
competitiveness, encouraging him to continue 
his competitiveness in subsequent negotia- 
tions. Therefore, the representative and out- 
sider become engaged in a mutually reinforc- 
ing competitive encounter, and consequently 
the two reach few agreements. 

Conversely, a trusted representative, when 
encountering a cooperative outsider, cooper- 
ates and thereby reinforces the outsider’s 
cooperativeness. As a result, a substantial 
number of agreements are reached. 
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The question remaining is how the dis- 
trusted as opposed to the trusted representa- 
tive’s bargaining will affect his relationship 
with his constituent. His competitive bargain- 
ing could prove his trustworthiness. In this 
case, the representative would no longer need 
to prove himself and subsequently would bar- 
gain more cooperatively. However, the repre- 
sentative’s failure in the negotiation, perhaps 
resulting from his tough bargaining, could 
enhance the constituent’s distrust (Wall & 
Adams, 1974). In this case, the repre- 
sentative would continue to bargain competi- 
tively in order to overcome the constituent’s 
distrust. 


Representative’s Bargaining Orientation 


While the representative's behavior is a 
function of both constituent and outsider 
pressures, it is also affected by the representa- 
tive’s personal attributes (Kahn et al., 1964; 
Kelley & Stahelski, 1970). In the latter study 
it was shown that a bargainer’s personality 
does affect his bargaining behavior. Specifi- 
cally, a competitive bargainer was found to 
have a low rate of cooperative behavior, re- 
gardless of the type of opponent he encoun- 
tered. A cooperative bargainer’s cooperation 
rate was generally higher, but more coopera- 
tion was given to a cooperative opponent than 
to a competitive one. 

Since internal factors do influence a bar- 
gainer’s behavior, it seemed fruitful to inves- 
tigate the effect of one such factor, the rep- 
resentative’s bargaining orientation, upon the 
bargaining process. It was predicted that a 
representative’s competitive orientation has 
effects similar to those of constituent dis- 
trust: (a) A competitively oriented as op- 
posed to a cooperatively oriented representa- 
tive bargains more toughly with an initially 
cooperative outsider, sending stronger mes- 
sages and demanding larger concessions. (b) 
The initially cooperative outsider perceives 
that he faces a competitive representative and 
bargains competitively (Holmes, 1971; Kel- 
ley & Stahelski, 1970). (c) The outsider’s 
competitive bargaining reinforces the repre- 
sentative’s competitiveness, encouraging him 
to continue his competitiveness in subsequent 
negotiations. (d) Therefore, the representa- 
tive and outsider become engaged in a mu- 
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tually reinforcing competitive encounter and 
consequently reach few agreements. 

Conversely, a cooperatively oriented repre- 
sentative, when encountering a cooperative 
outsider, bargains more cooperatively and re- 
inforces the outsider’s cooperative bargaining. 
As a result, a substantial number of agree- 
ments are reached. 


HYPOTHESES 


In summary, it is hypothesized that the 
two factors—constituent trust and representa- 
tive bargaining orientation—have the follow- 
ing effects: 


Constituent Trust 


1. When facing an initially cooperative out- 
sider, a distrusted representative bargains 
more competitively than does a trusted rep- 
resentative. He (a) demands more initially 
from the outsider, (b) makes higher final 
demands, and (c) accompanies his demands 
with stronger bargaining messages. 

2. The initially cooperative outsider facing 
a distrusted representative, as opposed to the 
initially cooperative outsider facing a trusted 
representative, (a) perceives the representa- 
tive to be more competitive, (b) concedes 
less initially to the representative, (c) gives 
lower final concessions to the representative, 
and (d) accompanies his concessions with 
stronger bargaining messages. 

3. The distrusted representative as com- 
pared to the trusted representative reaches 
fewer agreements with the initially coopera- 
tive outsider. 


Representative Bargaining Orientation 


4. When facing an initially cooperative out- 
sider, the competitively oriented representa- 
tive bargains more competitively than does 
the cooperatively oriented representative. He 
(a) demands more initially from the out- 
sider, (b) makes higher final demands, and 
(c) accompanies his demands with stronger 
bargaining messages. 

5. The initially cooperative outsider facing 
a competitively oriented representative, as 
opposed to the initially cooperative outsider 
facing a cooperatively oriented representa- 
tive, (a) perceives the representative to be 
more competitive, (b) concedes less initially 
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to the representative, (c) gives lower final 
concessions to the representative, and (d) 
accompanies his concessions with stronger 
bargaining messages. 

6. The competitively oriented representa- 
tive as compared to the cooperatively oriented 
representative reaches fewer agreements with 
the initially cooperative outsider. 


METHOD 
Subjects 


The 120 subjects who participated in the experi- 
ment were undergraduate, male volunteers enrolled 
in business courses at Indiana University. For serv- 
ing as a subject, each student received 2 hours credit 
for experimental participation. 


Procedure 


At the beginning of the experiment, four subjects 
met together in a large room. They were informed 
that they would form two-man labor and manage- 
ment teams that would bargain over four wages. In 
the negotiations, two of the subjects would serve as 
their team’s representative and would bargain face- 
to-face with the opposing representative. The other 
two, the constituents, would direct and oversee their 
representative’s bargaining; the constituents would be 
located in cubicles at the opposite corners of the 
same room. (The union representative was the rep- 
resentative under study, the union constituent was 
the constituent, and the management representative, 

e outsider.) The following negotiation sequence 

described by the experimenter: The representa- 
tive, outsider, and constituents would take their 
seats at the bargaining table and in the cubicles and 
would read the description of the bargaining issues. 
Having read the description, the representative and 
the outsider would write a note to their respective 
constituents. In his note each could say anything he 
wished, but he should keep the note short (15-20 
words). Upon receiving the note, each constituent 
would fill out an evaluation of his representative as 
a negotiator, prepare a bargaining directive for him, 
and send both of these to his representative. After 
both the representative and the outsider each had 
received his evaluation and directive, the representa- 
tive would hand the outsider a bargaining sheet on 
which he had made his opening demand and checked 
one of five bargaining messages. To allow his con- 
stituent to oversee his bargaining, the representative 
would send: a carbon copy of this bargaining sheet 
to his constituent. The outsider would respond to 
the demand with a bargaining sheet containing a 
counteroffer (or acceptance) and his message, send- 
ing a carbon copy of the sheet to his constituent. If 
the representative and outsider could not reach an 
agreement after both had made four offers, the ex- 
perimenter would terminate the bargaining and both 
teams would be fined $250. As soon as an agree- 
ment was reached or as soon as the eight offers were 
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tendered, both constituents would be informed of 
the results of Wage Negotiation 1. The constituents 
would then send their respective representatives a 
second evaluation and a directive for Wage Negoti- 
ation 2. When both had received these, the repre- 
sentative would make his first demand and send his 
first message in Wage Negotiation 2. This sequence 
would be repeated until Wage Negotiations 2, 3, and 
4 were complete, 

After the introduction, the subjects were ran- 
domly assigned to the four roles. The representative 
and outsider were asked not to communicate verb- 
ally with each other and were then shown (a) the 
description of the issues, (b) the sheet on which 
each was to send his note to his constituent, and 
(c) his, and only his, team’s payoff matrix. (The 
union and management matrices were constructed so 
that any loss for one side resulted in an equal gain 
for the other. The only way the representative and 
outsider could avoid a loss for one or the other 
was to settle at a wage rate that yielded a zero 
payoff for both sides.) The representative and out- 
sider were instructed to read the issues description 
and send their messages to their constituents. After 
each had done so, he was given a list of four bar- 
gaining strategies, described below, which he was 
asked to rank in terms of appropriateness, After 
the representative and outsider each had received 
his constituent’s evaluation and directive, the bar- 
gaining began. After the negotiation over the four 
wages was complete, the subjects were asked to 
complete short questionnaires. As soon as each had 
completed his questionnaire, the subjects were 
brought together in the center of the room and 
debriefed. Each session lasted approximately 2 
hours.! 

Since the purpose of the study was to examine the 
process in which a trusted (vs. distrusted) and a 
cooperatively oriented (ys. a competitively ori- 
ented) representative bargained with an initially 
cooperatively bargaining outsider, the outsider’s ini- 
tial cooperative bargaining was requisite, This was 
obtained, in all conditions, by replacing each of the 
directives issued by the outsider’s constituent with 
one that instructed the outsider to be cooperative. 
The directive prior to the Wage Negotiat’on 1 read, 
“Try to win the opponent over to our side by being 
cooperative. This is very important at the first of 
the negotiation. Be cooperative even if it means 
taking some losses at first.” Prior to Wage Negotia- 
tions 2, 3, and 4, the directives, respectively, were 
as follows: “Be more cooperative in the next nego- 
tiation;” “Regardless of what has happened so far, 
cooperation, I think, is the best policy;” and “We're 
near the end now so try to get the other team to 
cooperate as much as possible.” In addit'on, the 
evaluations that were sent by the outsider’s constit- 
uent were replaced with ones which reflected mod- 
erate trust in the outsider, 


1 Readers wishing copies of the payoff matrices or 
the bargaining messages may request these from the 
author. 


ay 
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Also, since the study was concerned with the 
representative’s reactions to the constituent trust 
(vs. distrust) and not with his reaction or compli- 
ance to the constituent’s directives (this behavior 
had already been studied by Holmes, 1971, and 
Organ, 1971), the constituent’s directives were re- 
placed in all conditions by directives that designated 
no specific bargaining strategy (e.g., “This experi- 
ment puts you in the driver’s seat so do the best 
you can for us;” and “Remember, we are in the 
negotiation to make money”). 


Independent Variables 


Constituent trust. The constituent’s trust in his 
representative was manipulated by substituting a 
previously written note for the one written by his 
representative. In the trust condition (T) this note 
read, “We have a good case. Inflation is hurting us 
as much as it is them. I intend to be fair but as the 
union’s representative, I intend to put our interests 
first.” In the distrust condition (T) it read, “Even 
though I am the union representative, management 
has a good case. Inflation seems to be hurting them 
more than it is us and our wages seem high any- 
way.” 

Representative’s bargaining orientation. As men- 
tioned previously, after sending his message to his 
constituent, the representative was directed to rank 
according to appropriateness four bargaining strate- 
gies: (a) Be very cooperative in order to win the 
cooperativeness of the opposing representative; (b) 
Be very tough to win as much as possible for your 
team in each round; (c) Be slightly cooperative 
with the opposing representative; and (d) Be 
slightly competitive in dealing with the opposing 
representative. Those representatives indicating that 
a “very cooperative” or “slightly cooperative” strat- 
egy was most appropriate were considered to have a 
cooperative bargaining orientation (C) and those 
choosing a “very tough” or “slightly competitive” 
strategy were considered to hold a competitive (C) 
orientation. 


Dependent Variables 


Representative’s bargaining behavior. The repre- 
sentative’s initial and final demands in each wage 
negotiation were expressed as the difference between 
the current wage (Wage 1 = $2.00, Wage 2 = $2.25, 
Wage 3 =$2.50, and Wage 4= $3.00) and his de- 
mand. The strength of his bargaining message was 
assessed according to which of the previously scaled 
(Thurstone equal interval method) messages he 
chose to send to the outsider. 

Outsider’s perception of the representative. The 
outsider’s perception of the representative was as- 
sessed by having him rate the representative on a 
7-point competitive-cooperative scale. 

Outsider’s bargaining behavior. The outsider’s ini- 
tial and final concessions in each wage negotiation 
were expressed as the difference between the current 
wage and his concession; the strength of his mes- 


sage was assessed according to which scaled m 
he sent. 

Number of agreements reached. As mentii 
previously, the representative and outsider nego 
ated over four wages. Consequently, the məxir 
number of agreements was four; the minimum, : 


Design 


A 2X2 factorial design with two levels of c: 
stituent trust and two levels of representative bar 
gaining orientation was used to study the outsider’s 
perception of the representative. A 2X 2X4 fac- 
torial design with two levels of constituent trust 
two levels of representative bargaining orientation 
and four wage negotiations was used to study ( 
the representative’s initial demands, (b) the repre- 
sentative’s final demands, (c) the strength of the 
representative’s messages, (d) the outsider’s initial 
concessions, (e) the outsider’s final concessions, and 
(f) the strength of the outsider’s messages. 


RESULTS 
Manipulation 


The manipulation of constituent trust was 
checked by noting the constituent’s response 
on the question “How strongly do you feel 
your representative will represent your team’s 
interests in the forthcoming negotiation?” 
prior to the first negotiation. Those in the 
trust condition (T) indicated that they trusted 
their representative, while those in the dis- 
trust condition (T) indicated that they did 
not, F(1,26) = 51.71, p < .001. 


Constituent Trust 


Constituent trust had the predicted effects 
upon the representative’s initial demands 
(Hypothesis 1a). As can be seen in Table 1, 
the representatives under the T condition 
made higher initial demands than did the 
representatives in the T condition, F(1, 104) 
= 6.37, p < .01. However, as opposed to pre- 
diction (Hypothesis 1b), the representatives’ 
final demands in the T condition were larger 
than those in the T condition. An explanation 
for this finding is presented in the examina- 
tion of the outsider’s bargaining. Finally, Hy- 
pothesis 1c did not receive support; the differ- 
ence in message strengths between the trust 
conditions was not significant. 

As noted above, prior to each of the four 
negotiations, the outsider was instructed to 
bargain cooperatively with the representative. 
However, as predicted (Hypothesis 2a), the 
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TABLE 1 


MEANS OF REPRESENTATIVE DEMANDS AND 
OUTSIDER CONCESSIONS 


Wage negotiation 
Condition — 
1 2 3 4 
| 
Trust | 
Representative’s initial | 
demand 87 | .84 | .85 | .91 
Outsider’s initial 
concession 37 39 | .51 26 
Outsider’s final 
concession. 53 | .66 | .68 | .60 
Distrust | 
Representative’s initial | 
demand 89 | .89 | .96 | .95 
Outsider’s initial 
concession wy) drt Peed 18 
Outsider’s final | 
concession A3 | 55 | .52 43 


outsiders opposing the distrusted representa- 
tives perceived them to be more competitive 
than did the outsiders opposing the trusted 
representatives, F(1,26) = 4.21, p < .05, and 
consequently they bargained more competi- 
tively. As can be seen in Table 1, Hypothesis 


2b was supported. The outsiders in the T 
condition made lower initial concessions than 


ig those in the T condition, F(1,104) = 


9.09, p < .003. The large difference between 
the initial concessions in the Wage 1 negoti- 
ation suggests that the outsiders were very 
quick to retaliate against the distrusted rep- 
resentatives. This recalcitration probably ac- 
counts for the finding above that the repre- 
sentatives’ final demands were lower in the T 
condition. That is, in the T condition the 
outsiders’ low initial concessions may have 
forced the representatives continuously to 
lower their subsequent demands in that nego- 
tiation, while in the T condition the outsiders’ 
high initial concessions motivated the repre- 
sentatives to raise their subsequent demands. 

It also can be noted in Table 1 that the 
outsiders’ retaliation was not temporary; as 
predicted in Hypothesis 2c, outsiders facing 
distrusted representatives made substantially 
smaller final concessions in each negotiation 
than did those facing trusted representatives, 
F(1, 104) = 7.40, p< .008. However, con- 
trary to the prediction in Hypothesis 2d, trust 


had no effect upon the strength of the out- 
siders’ bargaining messages. 

Constituent trust had the predicted effect 
upon the number of agreements (Hypothesis 
3). The representative-outsider dyads under 
the T condition agreed upon only 23 wages 
(out of a possible 60 agreements); under the 
T condition, there were 40 agreements; the 
difference is significant: yë, corrected for con- 
tinuity, = 8.55, p < .005. This finding is par- 
ticularly meaningful in view of the fact that 
deadlock resulted in a $250 penalty for both 
negotiators. 

As can be seen in Table 2, although the dis- 
trusted representatives’ bargaining led to 
fewer agreements, it quickly proved their 
trustworthiness to their constituents. This 
finding, therefore, raises an important ques- 
tion: If the distrusted representatives rapidly 
won their constituent’s trust, why did their 
bargaining remain competitive? A plausible 
explanation is that the representatives’ bar- 
gaining triggered two deviation-amplifying 
mutual causal processes (Adams, in press; 
Maruyama, 1963): (a) As discussed previ- 
ously, the distrusted representatives’ competi- 
tive bargaining engendered the outsiders’ 
competitiveness which in turn fostered con- 
tinued competitiveness on the representatives’ 
part. (b) In addition, the constituents’ reac- 
tion to their representatives’ bargaining re- 
warded their competitiveness. The distrusted 
representatives’ initial demands for Wage 1 
were only slightly larger than those of the 
trusted representatives. However, the dis- 
trusted representatives’ competitiveness dur- 
ing the first wage negotiation was reinforced 
strongly by the rise in their constituent’s 
trust. Therefore, the distrusted representatives 
issued high initial demands in the second 


TABLE 


MEANS OF CONSTITUENT TRUST 


Prior to 


aE SA | ae Tn 
1 3 4 

Trust 1.66 2.75 2.58 

Distrust 5.13 2.36 1.78 


Note. 1 = high trust; 7 = low trust, 
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negotiation; saw their constituent’s trust rise 
further; issued another large demand, etc. 


Representative’s Bargaining Orientation 


The representatives’ bargaining orientation 
affected all facets of their bargaining as pre- 
dicted (Hypothesis 4a-c). The competitively 
oriented representatives bargained more com- 
petitively with the outsiders than did coop- 
eratively oriented ones, issuing larger opening 
demands, F(1,104) = 21.77, p< 001, and 
larger final demands, F(1, 104) = 18.20, p< 
.001; they accompanied their demands with 
stronger messages, F(1, 104) = 6.40, p < .01. 
However, contrary to Hypothesis 5a, the out- 
siders did not perceive the competitively ori- 
ented representatives to be more competitive 
than the cooperatively oriented representa- 
tives. As a result, they did not retaliate and 
the competitively oriented representatives, as 
opposed to the predictions of the Hypothesis 
5b and 5c, secured larger initial concessions, 
F(1, 104) = 4.50, p < .036, and final conces- 
sions, F(1, 104) = 14.83, $ < .001. Hypothe- 
sis 5d received no support; the difference in 
the outsiders’ message strengths between the 
two conditions was not significant. Finally, as 
opposed to the prediction of Hypothesis 6, 
the competitively oriented representatives se- 
cured 39 agreements, whereas the coopera- 
tively oriented representatives secured only 
24 (xë, corrected for continuity, = 6.55, $ < 
1028 


Discussion 


One purpose of this study was to investi- 
gate the effects of constituent trust and rep- 
resentative bargaining orientations upon the 
process and outcomes of intergroup bargain- 
ing. It was posited that constituent distrust 
would trigger interacting representative and 
outsider bargaining behaviors which would be 
detrimental to agreements. In addition, it was 
thought that the constituent’s reactions to his 
representative's bargaining tactics and success 
would reinforce or dampen this chain of 
events. The analyses of the subjects’ behavior 
provided support for many of the hypotheses 
but more importantly they supported the con- 
tention that bargaining should be studied as 
a sequential process. 
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The constituents’ initial distrust triggered 
mutually reinforcing competition between the 
representatives and the outsiders. The di 
trusted as opposed to the trusted representa- 
tives issued higher opening demands and the 
outsiders, although continually instructed to 
be cooperative, retaliated, giving lower con- 
cessions than the ones granted by outsiders 
facing trusted representatives, The constitu- 
ents’ reactions to the distrusted representa- 
tives’ bargaining unexpectedly reinforced the 
competitive circle. The rise in the constitu- 
ents’ trust was expected to give the represent- 
atives less felt need to prove their trustworthi- 
ness; consequently, the representatives’ com- 
petitiveness was expected to decrease. How- 
ever, the rise in trust reinforced rather than 
abated the competitiveness. 

These interpretations provide support for 
the Adams (in press) and Wall and Adams 
(1974) models, but they also suggest some 
revisions. First, constituent distrust has a 
dual effect: it initiates competitive representa- 
tive bargaining, but its reduction seems to 
encourage rather than reduce competitive rep- 
resentative bargaining. Second, feedback con- 
cerning bargaining success (agreement vs. 
deadlock) appears to have a relatively less 

2As noted previously, a 2X2 X4 analysis was 
used to study the bargaining data. This analysis 
assumes independence across the four wage negotia- 
tions. Since the same subjects were used in the four 
negotiations, absolute independence was not at- 
tained. Yet it was felt that the subject’s behavior 
in the second, third, and fourth negotiations was to 
some extent independent of his behavior in the 
preceding negotiations because he not only moved 
from one negotiation to another but also was ex- 
posed to both real constituent feedback (in the 
representative's case) concerning his bargaining and 
to the outsider’s behavior. The use of the 2X2 X4 
analysis was predicated on the validity of this sort of 
independence. 

A more conservative approach is a 2X2 analysis 
of the data means. For readers favoring this ap- 
proach, the results of this analysis follow: Con- 
stituent Distrust—(a) representative’s initial de- 
mands, F(1, 26) = 2.88, p < -10, (b) outsider’s initial 
concessions, F(1, 26) = 4.40, p < .046, and (c) out- 
sider’s final concessions, F(1, 26) = 2.66, p<.ll; 
Representative’s Bargaining Orientation—(a) repre- 
sentative’s initial demands, F(1, 26) =9.30, p< 
.005, (b) representative's final demands, F(1, 26) = 
6.10, p < .02, (c) outsider’s initial concessions, F(1, 
26) = 2.20, p<.15, and (d) outsider’s final con- 
cessions, F(1, 26) = 5.13, $ <03. 
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INTERGROUP BARGAINING 


powerful effect than originally hypothesized. 
The Adams model predicts that competitive 
representative bargaining will reduce constit- 
uent distrust but that the reduced bargaining 
success that this competitiveness engenders 
will raise the distrust. The present study sug- 
gests that the former influence is the stronger; 
the representative’s competitiveness arising 
under the distrust condition reduced the dis- 
trust even though it fostered significantly 
more deadlocks. 

The most important revision suggested by 
this study is the inclusion of the outsider’s 
perception of the representative. The present 
findings indicate that the outsider’s perception 
determines the effect of the representative's 
competitive bargaining. As seen in this study, 
the representative’s competitiveness fostered 
by distrust was perceived by the outsider, and 
therefore, the outsider bargained competi- 
tively himself, reaching few agreements with 
the representative. However, the representa- 
tive’s competitiveness fostered by his own 
competitive bargaining orientation was not 
perceived by the outsider. Therefore, the out- 
sider bargained cooperatively, reaching a con- 
siderable number of agreements. 

The puzzling question remaining is why the 

mpetitively oriented representatives were 
“a perceived as more competitive than the 
cooperatively oriented representatives when 
the distrusted representatives were perceived 
to be more competitive than the trusted rep- 
resentatives. Compared to the cooperatively 
oriented representatives, the competitively 
oriented bargained significantly more com- 
petitively. Not only did they issue larger 
opening and final demands, but they also 
accompanied their demands with stronger 
messages. Yet compared to the trusted repre- 
sentatives, the distrusted seemed to bargain 
only slightly more competitively, issuing only 
higher initial demands. Perhaps the constitu- 
ent distrust made the representatives feel 
tense or aggressive, and the face-to-face bar- 
gaining allowed them to reveal their tension 
or to exhibit their aggression—perhaps inad- 
vertently—to the outsiders by taking a long 
time to make their demands, frowning, or 
staring at the outsider, etc. 

Intergroup bargaining is a complex process 
in which the behavior of each participant is 
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both the cause and effect of the other partici- 
pants’ behavior. The main purpose in under- 
taking this study was to demonstrate that 
this bargaining should be studied in more 
complexity than usually has been the case. 
Some control deliberately was sacrificed for 
the sake of relevancy; however, this ap- 
proach seems worthwhile because it points 
out that the effects of factors reported in 
bargaining studies probably are contingent 
upon their interactions with other, controlled 
factors. For example, a representative’s reac- 
tion to an outsider’s competitive bargaining 
reported in the literature may also be affected 
by the representative’s constituent’s expressed 
approval or disapproval and by the outsider’s 
reaction to his bargaining. Also, reported find- 
ings concerning a constituent’s demands of 
and trust in a representative would be de- 
pendent upon the representative's bargaining, 
which in turn would be both influenced by 
and dependent upon the outsider’s bargaining. 
It is hoped that the model and method of 
study described here, limited as they are, pro- 
vide a framework for more detailed and rele- 
vant study of intergroup bargaining. 
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Self-Attributed Choice as a Function of the Consequence 
of a Decision 
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An experiment was conducted to test possible motivational processes in the 
self-attribution of choice. The procedure involved asking male and female 

¢ college students to decide which of two tasks another person would perform 
at a later time. Information about the tasks was very ambiguous, but subjects 
were given an opportunity to obtain information about the actual 
nature of the tasks. Some were given a difficult opportunity to obtain this 
information, while others were given an easy opportunity. After refusing the 
opportunity and making a decision, subjects were presented with information 
about the actual nature of the chosen task. Some were led to believe it would 
be very pleasant, while others were led to believe it would be very un- 
pleasant. Major findings were that (a) individuals who discovered that the 
task would be pleasant attributed a relatively high amount of choice to them- 
selves in making the decision and (b) individuals who discovered that the 
task would be unpleasant attributed a relatively low amount of choice to 
themselves in making the decision, but only if they previously had been 
offered an easy opportunity to find out about the task. The results are dis- 
cussed in terms of the processes of ego enhancement, ego defense, and dis- 
sonance reduction. 


If an individual selects one of two alterna- Harvey & Harris, 1975; Harvey & Johnston, 


tives and later discovers that the outcome of 
the selection will be unpleasant for another 
person, will he be motivated to minimize the 
amount of choice he had in making the selec- 

tion? That is, will he attempt to absolve 

himself of the responsibility for his ill-fated 
decision by lowering his retrospective per- 
ception of choice in making the selection? 
Similarly, what if an individual discovers 
that a choice he made will result in a quite 
pleasant outcome for another person? Will 
this knowledge motivate the individual to 
assume more responsibility for his decision by 
increasing his feeling of choice? 

An emerging area for research in social 
psychology concerns variables which affect 
perceived and attributed freedom of choice 
(eg., Bringle, Lehtinen, & Steiner, 1973; 
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1973; Jellison & Harvey, 1973; Kruglanski 
& Cohen, 1973). In previous work in this 
area, little attention has been given to the 
possible operation of motivational processes. 
The present study was designed to investigate 
some possible motivational processes in the 
self-attribution of choice. 

A basic assumption underlying this research 
was that an individual generally should view 
himself as responsible for his behavior to the 
extent that he feels he has freedom of choice 
in performing the behavior. Evidence sug- 
gestive of this linkage between perceived 
choice and perceived responsibility is wide- 
spread in the literature of social psychology 
(e.g, Collins & Hoyt, 1972; Cooper, 1971; 
Linder, Cooper, & Jones, 1967). Given this 
relationship, it seems possible that under cer- 
tain conditions an actor may be motivated to 
attribute choice to himself in taking an action 
so as either to accept responsibility or to re- 
ject responsibility for the action. Retrospec- 
tive feelings of choice should be susceptible 
to motivational influences only if the original 
choice situation is amenable to reinterpreta- 
tion in light of subsequent events (i.e. if 
information about the choice alternatives is 
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relatively ambiguous at the time of the deci- 
sion). 

When an individual makes a decision and 
later discovers that the decision will result in 
a pleasant outcome for another person, it may 
be predicted that the individual will attribute 
a relatively high amount of choice to himself 
in making the decision, This prediction is 
based upon the assumption that it is ego en- 
hancing to see oneself as responsible for good 
things that happen to others. Although this 
prediction has not been previously tested, it 
has at least been implied in previous theo- 
retical statements (e.g., Schwartz, 1970). 

When an individual makes a decision and 
later discovers that the decision will have an 
adverse effect upon another person, it may be 
predicted that the individual will attribute a 
relatively low amount of choice to himself in 
making the decision (this prediction presup- 
poses that the individual feels he could have 
foreseen and prevented the adverse effect). 
This prediction is based upon the assumption 
that it is ego threatening to see oneself as 
responsible for unpleasant things that happen 
to others, The idea that a lowered feeling of 
choice may be used as a way of absolving one- 
self of responsibility for socially reprehensi- 
ble actions is certainly not a novel proposal. 
Other theorists including Schwartz (1970) 
and Wolosin and Denner (Note 1) have made 
very similar proposals but have not provided 
any relevant evidence. 

While the “threat to ego” assumption, per 
se, would seem to be an adequate basis for the 
foregoing prediction, that prediction also 
might be derived from cognitive dissonance 
theory. Since the situation in question does 
involve personal responsibility for an action 
which has negative consequences, it would 
seem to fulfill the major criteria necessary for 
consideration as a powerful dissonance-arous- 
ing situation (see Collins & Hoyt, 1972). But 
can it be assumed that self-attributed choice 
may serve as a mode of dissonance reduc- 
tion? 

In a number of studies related to disso- 
nance theory, perceived choice has been ma- 
nipulated. And in quite a few of these studies 
(Brock & Buss, 1962; Calder, Ross, & Insko, 
1973; Linder et al., 1967; Sherman, 1970), 
measures of perceived choice have been taken 
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in an attempt to check on the effectiveness 
of the manipulation, The results of thes 
manipulation checks provide no evidence that 
perceived choice was used as a mode of dis- 
sonance reduction (i.e., perceived choice was 
found to be high in the conditions in which 
it was expected to be high, irrespective of the 
presumed level of dissonance in those condi 
tions). It should be noted, however, that 
there may have been more salient modes of 
dissonance reduction available to subjects. In 
most of these studies, attitude change was the 
principal dependent variable, and it seems 
likely that attitude change was also the most 
salient mode of dissonance reduction avail- 
able. Further, Brock and Buss (1962) found 
some evidence of dissonance reduction on a 
variable roughly equivalent to perceived 
choice: After having been induced to act in a 
counterattitudinal way by shocking others, 
subjects in high-shock (presumably high-dis- 
sonance) conditions indicated that they felt 
more obligated to aggress than did subjects 
in low-shock (presumably low-dissonance) 
conditions. Thus, there do seem to be some 
grounds for thinking that self-attributed 
choice may serve as a mode of dissonance 
reduction. 

The main experimental situation which was 
created in this study involved a subject’s de- 
cision to take one action as opposed to an- 
other and his later discovery that the conse- 
quence of that action would be either pleasant 
or unpleasant for another person. At the time 
of the decision the probable consequence of 
each action was not clear to the subjects 
(although they did know that one was likely 
to be pleasant and the other unpleasant), and 
an additional independent variable in this 
study was the ease with which subjects could 
obtain clarifying information about the conse- 
quences of the possible actions. Some sub- 
jects were given a relatively easy opportunity 
to obtain such information, whereas other 
subjects were given a relatively difficult op- 
portunity (only individuals who refused the 
opportunity were considered as subjects in the 


1 However, this effect has to be qualified in terms 
of a marginally significant Sex of Aggressor X Level 
of Shock interaction. Men reported more obligation 
than women under high shock and less under low 


shock (p < .08). 
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experiment). Our main interest in this varia- 
ble was in how it might interact with the 
consequences variable to affect self-attributed 
choice. 

When an individual discovers that his deci- 
sion will result in an unpleasant consequence 
for another person, he may feel considerable 
personal responsibility for the consequence, 
especially if he could have obtained clarifying 
information with relative ease. However, if 
exercising the opportunity to acquire clarify- 
ing information had required a large expendi- 
ture of effort, the individual may not feel 
much personal responsibility for the conse- 
quence. The more personal responsibility the 
individual feels for an unpleasant conse- 
quence, the greater should be the threat to his 
ego or, in dissonance terms, the greater the 
arousal of dissonance. Based on this reasoning, 
it was predicted that when a decision re- 
sulted in an unpleasant consequence, self- 
attributed choice would be lower when the 
opportunity had been easy than when it had 
been difficult. 

When an individual discovers that his deci- 
sion will result in a pleasant consequence for 
another person, this discovery may indicate to 
him that he was capable of making the best 
decision without expending additional effort; 
thus, whether the opportunity was easy or 
difficult should not appreciably affect his 
sense of personal responsibility for the conse- 
quence. It was predicted that when a decision 
resulted in a pleasant consequence, the ease 
of opportunity variable would not influence 
self-attributed choice. 


METHOD 

Subjects 

The subjects were 55 male and female college 
students who participated in order to fulfill a re- 
quirement in general psychology. Subjects were run 
individually and randomly assigned to conditions; 
the number of subjects per condition varied from 
8 to 10. The sex ratio was approximately constant 
across conditions. 


Procedure 

At the beginning of cach session, the first experi- 
menter introduced himself as a student who was 
substituting for the regular experimenter, a profes- 
sor temporarily occupied with other work. The sub- 
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ject was told that the experiment employed two 
types of matched subjects, “learners” and “decision 
makers.” Each subject was informed that he had 
been randomly assigned to the role of decision 
maker. The first experimenter indicated that one 
purpose of the study was to investigate a learner's 
behavior and perceptions in a situation which a 
decision maker had selected for him, The subject 
was told that his only task was to make a decision 
regarding the experiment in which the learner would 
participate and then answer a couple of questions 
about the decision, The decision was portrayed as 
very important, and the subject was informed that 
the decision-making process was also a problem 
being investigated in the study. In- addition, the 
subject was told that his participation in the study 
was voluntary and that he could leave and get full 
credit at any time. 

The subject was then given written descriptions of 
two alternative learning experiments from which to 
select. He was asked to examine the descriptions 
carefully, since the regular experimenter had men- 
tioned in passing that one of the experiments was 
very pleasant and the other very unpleasant. The 
written descriptions consisted of two mimeogra hagi 
pages with a lengthy paragraph on cach page. 
paragraphs, labeled Experiment 1 and Experiment 2, 
were composed of highly technical and abstruse 
descriptions of the equipment, conditions, and effects 
involved in each supposed experiment. These de- 
scriptions were designed to be equally ambiguous, 
and an analysis of subjects’ selections showed no 
significant preference for one description over the 
other. 

After the subject had been given an opportunity 
to examine the descriptions, the first experimenter 
interrupted him and said, “Say, I can see that those 
descriptions are pretty technical, I wish I could 
help you understand them, but as I said before, I'm 
just a substitute and don’t know anything about 
them myself.” The subject was told that it would, 
however, be possible for him to personally consult 
the regular experimenter to get clarification as to 
the nature of the tasks. Then the experimenter in- 
troduced the instructions designed to manipulate 
the c: of opportunity variable. In the easy op- 
portunity conditions, the regular experimenter was 
described as being in Room 104 of the building that 
the subject was currently in, In the difficult oppor- 
tunity conditions, he was described as being in Room 
104 of a psychology building on the other side of 
the campus (the location was described as being 
inaccessible by car). In all conditions, subjects were 
told that the second experimenter was pretty busy 
but that they could nevertheless interrupt him if 
they wanted the clarifying information. Subjects 
were given directions to the appropriate location and 
told that the first experimenter would wait for them 
if they elected to seek the information. They we: 
also told that if they went ahead and made the deci- 
sion without obtaining the clarifying information, it 
would have to be final and irrevocable. 
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In all conditions only three persons decided to 
exercise their option to seek more information. 
Two of these persons had been assigned to the diffi- 
cult opportunity conditions, and one had been 
assigned to the easy opportunity conditions. For 
these three individuals, the session was terminated 
at this point, and they were given a thorough de- 
briefing. 

After the subject had elected not to exercise the 
opportunity and had made the selection, he was 
shown a form and asked to sign it. For all subjects, 
this form contained a number of signatures in a 
column labeled decision makers and no names in an 
adjacent column labeled learners. The subject was 
told that at a later time in the semester there would 
be a meeting of the decision makers and the learn- 
ers to discuss the results of the study. The subject 
was told that learners would sign the form when 
they were run and that they would be able to sce 
who had selected for them. These instructions were 
designed to enhance the subject’s perception of the 
public nature of his selection and its possible conse- 
quences for him. 

At this point, the second (“regular”) experimenter 
entered the room, explained that he had finished 
his business a little early, and asked if the session 
had been completed. The first experimenter indicated 
that the subject had not yet filled out the final 
questionnaire, and the regular experimenter then 
offered to take responsibility for this final detail. 
He then asked the subject about which experiment 
he had selected. After the subject’s reply, the second 
experimenter indicated that the laboratory where 
that particular experiment would be run was just 
down the corridor. He then offered to show this lab 
to the subject. (It should be noted that each experi- 
menter was “blind” with respect to condition as- 
signment for the manipulation that he did not im- 
plement). 

In the unpleasant consequence conditions, the sub- 
ject was taken to a laboratory consisting of a large 
room containing an elaborate array of electronic 
equipment and wires. A sign on the door of the 
laboratory read “AUTHORIZED PERSONS ONLY-DAN- 
GEROUS ELECTRICAL EQUIPMENT.” Inside the lab, the 
subject was shown a chair with wires extending to 
it from a machine. The subject was told, 


This will be a learning experiment. . . . The 
subject will be hooked up to this machine and will 
receive electric shocks over a period of time. The 
subject's task will be to make certain discrimina- 
tions, and the shocks will be contingent upon his 
responses. Subjects report that they really dislike 
the experiment. 

*The ease of opportunity manipulation was de- 
signed so that subjects in all conditions would per- 
ceive that they had a viable opportunity to seek 
additional information. But it was also designed to be 
difficult enough so that in all conditions only a few 
subjects would exercise it. Of course, if a sizable 
number in either set of opportunity conditions had 
exercised the opportunity, the results might have 
been open to a self-selection interpretation. 
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In the pleasant consequence conditions, the sub- 
ject was taken to the same laboratory where a sign 
on the door read “AUTHORIZED PERSONS ONLY— 
DELICATE SENSORY EQUIPMENT.” In these conditions, 
the subject was shown the same arrangement of 
equipment and was told, 


This will be a learning experiment. « The 
subject will be hooked up to this machine and 
will receive pleasant sensory cues over a period of 
time. The subject’s task will be to make certain 
discriminations, and the cues will be contingent 
upon his responses. Subjects report that they 
really like the experiment, 


After the presentation of the instructions designed 
to manipulate perceived consequence, the subject 
was asked to fill out the final questionnaire. The 
questionnaire contained two items, each of which was 
answered on a 21-point scale. These questions and 
the respective end labels on each scale were as fol- 
lows: (a) How much choice did you have in se- 
lecting which experiment the learner would partici- 
pate in? (none at all-very large amount) and (b) 
How much do you think the learner will enjoy the 
experiment you selected for him? (none at all-very 
large amount). The subject was asked to consider 
each of the questions for a minute before answering 
it. 

Two additional conditions, difficult opportunity— 
no consequence and easy opportunity-no conse- 
quence, were run in order to provide baselines 
against which to compare conditions in which con- 
sequence was manipulated. In addition to the ma- 
nipulation of the opportunity variable, the procedure 
for these no consequence conditions included all 
aspects of the procedure employed in the experi- 
mental conditions up to the point after which the 
subject replied to the second experimenter’s question 
about which experiment he had selected for the 
learner. After this reply, the second experimenter 
merely took the subject to another room down the 
corridor and administered the final questionnaire. 

The session was terminated after the subject had 
completed the final questionnaire and after he had 
been questioned for suspicion concerning the alleged 
purposes of the experiment. Three subjects expressed 
suspicion as to the existence of the learner, and the 
data for these subjects were discarded from the final 
analysis; they were in three different conditions. 
Before leaving, each subject was given a thorough 
debriefing, 

In summary, there were four experimental condi- 
tions comprising a 2 X 2 (Easy or Difficult Opportu- 
nity X Pleasant or Unpleasant Consequence) factorial 
design; also, there were two control conditions in 
which Opportunity was manipulated but consequence 
of decision was not. 


RESULTS 


Effectiveness of Manipulation of 
Consequence of Decision 

It was expected that subject decision 
makers would perceive that learners would 
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enjoy the experiment that had been selected 
for them more in the pleasant consequence 
conditions than in the unpleasant consequence 
conditions. The data provided strong sup- 
port for this expectation. The mean for the 
pleasant consequence conditions combined 
(X = 15.3) was much higher than the mean 
for the unpleasant consequence conditions 
combined (X = 4.5). An analysis of variance 
for the experimental conditions, excluding the 
no consequence conditions, showed that the 
main effect of consequence of decision was 
highly significant, F(1, 34) = 97.26, p < .001. 
This analysis revealed no other significant ef- 
fects. These results provide good evidence 
that the manipulation of consequence of 
decision was successful. 


Measure of Self-Attributed Choice 


Means for the measure of self-attributed 
choice for all conditions are presented in 
Table 1. It was expected that for the condi- 
tions within the factorial design, ease of op- 
portunity would interact with consequence of 
decision. When a decision resulted in an un- 
pleasant consequence, self-attributed choice 
was expected to be less for individuals who 
previously had been offered an easy opportu- 


A than for those who previously had been 


offered a difficult opportunity; whereas, when 
a decision resulted in a pleasant consequence, 
self-attributed choice was not expected to be 
affected by the ease of opportunity variable. 
Inspection of Table 1 shows that the data 
are in line with this prediction. An analysis 
of variance for the conditions within the fac- 
torial design showed a significant Ease of 
Opportunity X Consequence of Decision inter- 
action, F(1,34) = 9.51, p < .01. Individual 
comparisons among means relevant to this 
interaction revealed (a) that self-attributed 
choice was less in the easy opportunity—un- 
pleasant consequence condition than in the 
difficult opportunity—unpleasant consequence 
condition (p < .01)* and (b) that there was 
no significant difference between the easy 
opportunity-pleasant consequence condition 
and the difficult opportunity—pleasant conse- 
quence condition. These results provide strong 
support for the predicted Ease of Opportu- 
nity X Consequence of Decision interaction. 


3 Two-tailed ¢ tests. 
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TABLE 1 


NS FOR THE MEASURE OF 
ATTRIBUTED CHOICE 


ME 


Information concerning 


OE annou yA consequence of decision 
larifying information| __ D 
about consequence 

of decision Td pe z 
Pleasant | picasant | None 
Difficult (9) (10) (9) 
16.8 10.7 10.0 
Easy (10) (9) (8) 
3.1 10.4 


It was predicted that (a) individuals who 
discover that a decision they made will lead 
to a pleasant consequence will attribute a 
relatively high amount of choice to them- 
selves in making the decision and (b) indi- 
viduals who discover that a decision they 
made will lead to an unpleasant consequence 
will attribute a relatively low amount of 
choice to themselves in making the decision. 
Tests of these predictions are provided by 
comparisons between the pleasant conse- 
quence conditions and appropriate no conse- 
quence conditions, and the unpleasant con- 
sequence conditions and appropriate no 
consequence conditions, Considering compari- 
sons for the pleasant consequence conditions, 
it can be seen from Table 1 that self-attrib- 
uted choice was greater in the easy opportu- 
nity~pleasant consequence condition than in 
the easy opportunity-no consequence condi- 
tion. A comparison revealed that the differ- 
ence between these two conditions was highly 
significant (p< .001). Further, Table 1 
shows that self-attributed choice was greater 
in the difficult opportunity-pleasant conse- 
quence condition than in the difficult oppor- 
tunity-no consequence condition, This differ- 
ence was also highly significant (p < .001), 
These results indicate that subjects in both 
of the pleasant consequence conditions attrib- 
uted a relatively high amount of choice to 
themselves in making their decision. 

With respect to the comparisons for the 
unpleasant consequence conditions, Table 1 
shows that self-attributed choice was much 
lower in the easy opportunity—unpleasant 
consequence condition than in the easy oppor- 
tunity-no consequence condition. A com- 
parison showed that the difference between 
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these two conditions was highly significant 
($ < .001). However, it can also be seen from 
Table 1 that there was little difference in 
self-attributed choice between the difficult 
Opportunity—unpleasant consequence condition 
and the difficult opportunity-no consequence 
condition. A comparison showed that the dif- 
ference between these two conditions did not 
approach significance. These results indicate 
that only in the €asy opportunity—unpleasant 
Consequence condition did subjects attribute 
a relatively low amount of choice to them- 
selves in making their decision. 


Discussion 


The principal purpose of this study was to 
investigate the operation of some motivational 
processes in the self-attribution of choice. The 
results of the study provide considerable evi- 
dence that motivational processes do affect 
self-attributed choice. First, the results 
showed that, as predicted, individuals who 
discover that a decision they made will lead 
to a pleasant consequence for another person 
will attribute a relatively high amount of 
choice to themselves in making the decision. 
This prediction was based upon the assump- 
tion that it is ego-enhancing to view oneself as 
responsible for a decision that leads to a 
pleasant consequence for another person and 
that self-attributed choice will be used as 
a means of assuming such responsibility. 
Second, the results for the easy opportunity— 
unpleasant consequence condition showed evi- 
dence for the prediction that individuals who 
discover that their decision will lead to an 
unpleasant consequence for another person 
will attribute a relatively low amount of 
choice to themselves in making the decision. 
This prediction was based upon the assump- 
tion that it is ego threatening or dissonance 
arousing to see oneself as responsible for a 
decision that leads to an unpleasant conse- 
quence for another person and that self- 
attributed choice will be used as a means of 
absolving oneself of such responsibility. 

Although it was expected that there would 
be some reduction in self-attributed choice in 
the difficult opportunity—unpleasant conse- 
quence condition (relative to the difficult 
opportunity—no consequence condition), the 
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results provide no support for this expecta- 
tion. This finding, however, would seem to be 
readily understandable in terms of how the 
ease of opportunity variable was expected to 
qualify the effect of the consequence of the 
decision upon self-attributed choice, Having 
had a difficult opportunity to get clarifying 
information about a decision may reduce an 
individual’s sense of responsibility when an 
unpleasant consequence is produced by the 
decision. The individual in this situation may 
feel that the difficulty of the opportunity 
justifies his having not exercised it. 

The results of the study provide strong 
support for the predicted Ease of Opportu- 
nity X Consequence of Decision interaction. 
When an unpleasant consequence resulted 
from a decision, self-attributed choice was 
less for individuals who previously had been 
offered an easy opportunity to obtain clarify- 
ing information about the consequence than 
for those who previously had been offered a 
difficult opportunity. On the other hand, 
when a pleasant consequence resulted from a 
decision, self-attributed choice was not af- 
fected by the ease of opportunity variables. 

Given the operation of a negative motiva- 
tional process in the easy opportunity—un- 
pleasant consequence condition, one may 
wonder why perceived pleasantness was as 
low as it was in that condition. In other 
words, why was there no reduction of ego 
threat or dissonance via an increased percep- 
tion of pleasantness in that condition? One 
explanation for this result may be found in 
the order and manner in which the measures 
of perceived pleasantness and self-attributed 
choice were taken, The self-attributed choice 
measure was taken first, and subjects were 
asked to contemplate this question for a brief 
period of time before answering it, possibly 
making this measure more salient as a mode 
of reducing ego threat or dissonance. A second 
explanation for this result may be found in 
the method of presentation of the conse- 
quence information. This information was 
intentionally presented by means of unequivyo- 
cal physical evidence such as a warning sign 
and a large array of electrical equipment de- 
scribed as designed to deliver unpleasant 
shocks. Because of this method of presenta- 
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SELF-ATTRIBUTED CHOICE 


tion, it seems likely that the perception of 
pleasantness would have been relatively un- 
available as a mode of reducing ego threat 
or dissonance. 

We have implied that either an ego threat 
process or a dissonance arousal process may 
account for the results of self-attributed 
choice in the easy opportunity-unpleasant 
consequence condition. However, it may be 
argued that these processes are not neces- 
sarily different. According to Aronson’s 
(1968) modification of Festinger’s (1957) 
original theory of dissonance, dissonance is 
most powerful in situations where the self- 
concept is threatened. This formulation is 
based on the assumption that most individu- 
als think of themselves as decent people who 
would not engage in behavior discrepant with 
this image, unless there was good reason to 
do so. When an individual does engage in 
such behavior without adequate justification, 
his conception of himself as a decent person 


* should be threatened and he should experi- 


ence considerable dissonance. This individual 
should attempt to reduce his dissonance via 
whatever mode of reduction is available. This 
“dissonance-self-concept” argument would 
seem to accommodate rather well the self- 
attributed choice data in the easy opportunity— 
pleasant consequence condition and, in so 
doing, makes more cogent the conclusion from 
these data that self-attributed choice may 
serve as a mode of dissonance reduction. 

The data of the present study suggest that 
the psychological experience of freedom of 
choice sometimes involves more than a simple 
processing of information. These data suggest 
that when appropriate stimulus conditions are 
created, people may interpret their freedom 
of choice in a manner concordant with their 
self-interests and psychological needs. Viewed 
in this light, the results of this study would 
seem to provide a valuable addition to our 
emerging understanding of the psychological 
experience of freedom of choice. 


REFERENCE NOTE 
1. Wolosin, R. J., & Denner, B. Three studies on 
the attribution of freedom to the self and to 
others, Unpublished manuscript, Indiana Univer- 
sity, 1970. 
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Effect of Group Success and Failure upon Interpersonal 
Attraction in Cooperating Interracial Groups 


Fletcher A. Blanchard, Leonard Adelman, and Stuart W. Cook 
University of Colorado 


The effect of group success and failure upon interpersonal attraction in 
cooperating interracial groups was investigated in a 2X 2% 2 factorially de- 
signed experiment. The level of participation in the group’s decision making 
(high or low) and race (black or white) served as the additional independent Ea 
variables. The 56 military subjects exhibited significantly greater attraction 
for groupmates under the success as opposed to the failure condition (p < 
001). No main effects for the race of the group member being evaluated 
or the level of participation in decision making were obtained, nor were there 
interactions among any of the three independent variables. Separate analyses 
of the attraction ratings given the white and the black groupmates further 
revealed that whatever elevated or depressed the ratings for one, similarly 
affected the ratings for the other. A general satisfaction-dissatisfaction inter- 
pretation is offered to explain the effect of group success—failure on inter- 
personal attraction. Implications for the scapegoat hypothesis are discussed. 


Reviews of the literature on interpersonal 
and attitudinal outcomes of personal contact 
with representatives of a disliked group 
make it clear that these outcomes vary 
widely, sometimes being positive, sometimes 
negative, and sometimes absent (Amir, 1969; 
Cook, 1957, 1962, 1969, Note 1; Cook & 
Selltiz, 1955; Eisenman, 1965; Lieberson, 
1961; Pettigrew, 1969). As a result, attention 
has shifted to a search for types of personal 
contact which produce positive outcomes. One 
type of contact which has received consider- 
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1 Although Wilson and Miller (1961) analyze their 
data from a different perspective, it is evident from 
the results which they present in Table 1 (p. 430) 
that liking from one’s partner in a cooperative dyad 
increased substantially more under success than under 
failure, 

2Tt should be noted, however, that because 
Shelley’s experiment focused on level of aspiration, 
and attraction measures were administered prior to 
the critical success—failure feedback, it is difficult to 
ascertain which groups felt more successful at the 
point in time when the intermember attraction data 
were collected. 


able emphasis is that involving cooperation, 
that is, means interdependence in the achieve- 
ment of joint goals (Sherif, 1958; Sherif, 
Harvey, White, Hood & Sherif, 1961; Sherif 
& Sherif, 1953). 

Cooperation in the achievement of joint 
goals often, but not always, results in greater 
liking and respect among the persons involved 
(Crombag, 1966; Deutsch, 1949; Raven & 
Eachus, 1963; Thomas, 1957; Wilson & 
Miller, 1961). Some evidence is available 
that the same is true when the cooperating 
individuals are from different racial groups 
(Cook, 1969, Note 1). 

Among the variables that may influence the 
extent to which interpersonal liking and re- 
spect develop in cooperating groups is success 
or failure in achieving the group goal. A 
number of reviewers have argued that task 
success is crucial to the development of at- 
traction among individuals engaged in a co- 
operative enterprise (Ashmore, 1970; Collins 
& Raven, 1969; Lott & Lott, 1965). Despite 
this apparent consensus, we feel that such a 
conclusion may be premature for two reasons. 

First, some studies do not support the suc- 
cess-attraction relationship. Although Deutsch 
(1959), Wilson and Miller (1961)*, and 
Shelley (1954)? have provided evidence that 
success on the task is associated with positive 
feelings about the members of one’s group, 
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the results obtained from four other investiga- 
tions clearly qualify such an interpretation. 

1. In his study of the effects of stress 
on cooperating groups, Lanzetta (1955) found 
greater friendliness among group members 
under conditions of failure (accompanied by 
stress) at the cooperative task than under 
conditions of success (no stress or mild 
stress). However, findings by Pepitone and 
Kleiner (1957) appear to qualify this some- 
what; they show that high-status teams that 
compete under conditions of low threat (i.e., 
an experimenter-predicted low probability of 
loss) show more increase in positive own- 
group choices than do teams who compete 
under high threat (i.e. a high probability 
of loss). 

2. Hoffman (1958) reported a significant 
relationship between intermember attraction 
and group success only in cooperative groups 
which had been carefully composed of persons 
with similar personality traits; no such rela- 
tionship obtained in groups heterogeneous 
with respect to personality. 

3. Thibaut (1950) found that success and 
failure at team tasks influenced the level of 
attraction shown by unpopular subjects for 
fellow group members but had less influence 
in the case of more popular subjects. 


LL 4. Streufert and Streufert (1969) made a 


similar point with respect to individual dif- 
ferences in cognitive style: Success and fail- 
ure influence attraction among cognitively 
“simple” subjects more strongly than among 
cognitively “complex” ones. 

A second reason for re-evaluating the 
success~attraction relationship is that some of 
the research purporting to demonstrate the 
importance of success has utilized experimen- 
tal settings so structured as to attach a high 
reward value to task outcome, while simul- 
taneously limiting the possibility that mutual 
respect and liking might develop from other 
aspects of the cooperative interaction. In 
three experiments (Heber & Heber, 1957; 
Stotland, 1959; Zander & Havelin, 1960) 
subjects worked independently under’ condi- 
tions of common fate prior to rating the at- 
tractiveness of their fellow group members. 
That is, there was no means interdependence 
in the sense of common discussion or coopera- 
tive planning among group members about 
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how they might best achieve successful 
outcomes. In two of the three studies (Stot- 
land, 1959; Zander & Havelin, 1960) 
subjects worked in separate cubicles and 
could not see each other. Not only was 
means interdependence eliminated in these 
three investigations, but instructions given 
the subjects in two of the three empha- 
sized that personally important conse- 
quences would attend success or failure at 
the task. In one of the two studies subjects 
were told that the results of their performance 
would be included in their personnel file 
(Zander & Havelin, 1960), and in the other, 
elementary school students were informed 
that their group’s grade would be recorded 
on their personal report cards (Heber & 
Heber, 1957). The meaning of such studies 
for cooperative groups in natural settings that 
are, on the one hand, characterized by less 
weighty consequences for success in the 
achievement of joint goals and, on the other, 
by means interdependence and by face-to-face 
interaction, seems unclear. Understanding of 
the role of success—failure in determining 
liking and respect among group members will 
remain limited until it is studied under a 
variety of such circumstances. 


Participation in Decision Making 


It is possible, of course, that group success 
and failure have different implications for 
intermember attraction depending upon the 
extent to which a group member feels he has 
played a significant role in determining the 
fate of his group, For example, a group mem- 
ber who feels himself to be important to his 
group’s effort might experience greater satis- 
faction following success than a group mem- 
ber who considers himself more peripheral to 
his group’s success. Although the impact of 
participation on interpersonal attraction has 
not been systematically investigated in the 
context of cooperative groups, tangential evi- 
dence supports the potential importance of 
this variable. Some studies suggest that 
simply taking an active part in the group 
operations is conducive to increased liking for 
groupmates. For example, Hare (1952) re- 
ported that limited opportunity to participate 
in a group discussion will result in dissatis- 
faction with that group. In a series of studies 
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concerning communication patterns in task- 
oriented groups Bavelas (1950), Bavelas and 
Barrett (1951), Christie, Luce, and Macy 
(Note 2), and Leavitt (1951) demonstrated 
that persons occupying central positions in 
communication networks (hence, participating 
more) had higher group morale than persons 
occupying more peripheral positions. Insofar, 
then, as liking the group as a whole is similar 
to liking the members of the group, these 
studies would suggest a relationship between 
level of participation and attraction for 
groupmates. 

Industrial studies focusing on a special type 
of participation, namely, participation in 
decision-making activities, have reached the 
same conclusion. Coch and French (1948) 
found that workers who fully participated 
with management in cooperatively planning 
job changes displayed more positive attitudes 
toward management than did workers whose 
participation was less pronounced. Likert 
(1961, p. 98) has stressed the importance of 
participation in decision making in gratifying 
the ego motives of employees and thereby 
producing both a sense of cohesiveness in the 
work group and mutual attraction between 
managers and subordinates. Thus, consider- 
able data seem consonant with the proposi- 
tion that participation, especially in the sense 
of being actively involved in group decisions, 
contributes both to satisfaction with the group 
and to liking for fellow group members. 


Race of the Group Member Being 
Evaluated 


In assessing the significance of group suc- 
cess and failure for determining intermember 
attraction in cooperating interracial groups 
(as distinct from racially homogeneous 
groups), the race of the group members must 
be taken into consideration. In particular, 
when a white group member is prejudiced, the 
race of his groupmates may influence the 
amount of liking and respect he develops for 
them. However, it has been pointed out on 
numerous occasions that such correspondence 
between attitudes to a disliked group and 
reactions to individual representatives of that 
group is often absent (e.g., Wicker, 1969). 
Moreover, Cook (1969, Note 1) has shown 
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that prejudiced whites may develop strong 
respect and liking for individual blacks with 
whom they work cooperatively." In addition, 
Burnstein and McRae (1962) report that 
under certain conditions prejudiced whites in 
cooperating interracial groups develop greater 
attraction for their black groupmates than for 
some of their white groupmates. This hap- 
pened to a greater extent under group failure 
(accompanied by “threat of failure”) than 
under group success (no “threat of failure”). 

Given these varying research results it 
would appear that under conditions of co- 
operative interdependence no clear prediction 
as to the effect of race of group member on 
interpersonal attraction for him may be 
formulated. It may be, on the other hand, 
that in line with the frustration-aggression 
hypothesis (Dollard, Doob, Miller, Mowrer, 
& Sears, 1939) an influence of race on attrac- 
tion might be found in the interaction of race 
with group success-failure. That is, one 
method which a prejudiced individual might 
use to cope with the frustration generated by 
task failure is to aggress toward or project 
blame upon a team member from a minority 
group. However, the empirical evidence with 
respect to this hypothesis is not consistent. 
Miller and Bugelski (1948) have reported 
that following an extremely frustrating ex- 
perience, subjects displaced hostility by at- 
tributing fewer positive and more negative 
traits to two ethnic groups. Attempted repli- 
cation of this study has proven both success- 
ful (Cowen, Landes & Schaet, 1959) and 
unsuccessful (Stagner & Congdon, 1955). 
Weatherley (1961) has indicated that aggres- 
sive reactions to frustration are more likely 
to be directed at representatives of a disliked 
ethnic group, Similarly, Berkowitz and Green 
(1962) have suggested that the hostility is 


likely to be displaced onto any disliked per- , 


son. However, in an earlier study, Lindzey 
(1950) reported that experimentally inducing 
frustration produced no differences in the 


®Cook notes that the attitude change shown by 
prejudiced whites under these conditions is substan- 
tial if it occurs at all. He suggests the interpretation 
that changes in strong attitudes occur only through 
the crystalization of an accumulation of change in- 
fluences, which to an observer appears as an over- 
reaction or conversion. 
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amount of displaced hostility shown by 
prejudiced and nonprejudiced subjects. 

The above considerations led to 
formulation of two hypotheses: 

1. A member of a cooperating interracial 
group that is successful will develop greater 
attraction for his groupmates than will a 
member of such a group that fails. 

2, A member of a cooperating interracial 
group who actively participates in the deci- 
sions made by that group will develop greater 
attraction for groupmates than will a mem- 
ber whose participation in decision making is 
more limited. 

Neither the research nor the theoretical 
formulations concerning the effects of race on 
interpersonal attraction in cooperating groups 
is sufficiently clear to warrant predictions 
involving this variable. 


the 


METHOD 
Subjects 


Research subjects in the experiment were 56 white 
males with limited military service, aged 18-21, who 
had volunteered knowing only that they would be 
paid $5 to $10 for their participation. Subjects 
selected from this pool of volunteers were from 
small towns in the southern United States. The 
intent of selecting subjects by geographical back- 
ground was to obtain individuals with preexisting, 
antiblack attitudes. Public opinion surveys have 
shown that antiblack attitudes are more prevalent 
in the rural South than elsewhere in the country 
(Brink & Harris, 1966; Greeley & Sheatsley, 1971). 
The use of a racial attitude inventory as a subject 
selection technique was considered but abandoned 
for fear of sensitizing the subjects to the experi- 
menter’s interest in the race variable.’ All indications 
from several preexperimental checks were that none 
of the subjects knew anything about the experiment 
before they participated in it. 


Experimental Task 


The cooperative task forming the context in which 
the experiment was conducted was a “management 
training activity.” It involved the operation of a 
railroad business with lines serving 10 cities and 
carrying a variety of items in 500 cars of six dif- 
ferent types. The business received orders to ship 
specified materials from one city to another. Profits 
depended upon having the appropriate types of cars 
in the proper city to fill each order quickly. The 
three-person staff filled orders, redistributed cars (at 
a loss if empty), kept financial records, ete. The 
business had overhead expenses and lost money when 
mistakes were made. Major increases in efficiency 
occurred as the staff learned (a) that they must 
redistribute empty cars, (b) that certain cities only 
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use certain types of cars, and (c) that overall oper- 
ating expenses are decreased by shipping certain 
materials in low-payment cars that can be utilized 
later at a given destination. The business is con- 
ducted during a series of five 15-min sessions called 
workdays. Observation of subjects has consistently 
shown that they become very excited and involved 
in the task, urging co-workers to perform their jobs 
quickly and reacting strongly to the success-failure 
feedback at the end of each workday. The activity 
was difficult enough to continually challenge the 
young adult participants throughout its 1.5 hour 
duration. 

There were three positions in the business. The 
executive oficer received the requests for shipment, 
consulted with other staff members, made the final 
decisions, and “phoned in” the information about 
how each order was to be executed. The shipping 
oficer suggested the type of car and route to be used 
and kept the financial records. The equipment oficer 
kept records regarding the distribution of cars, noted 
the type of available cars at the city of origin, and 
recommended the redeployment of cars to points 
where they might be needed. The research subject 
always occupied the latter position. The two other 
positions were filled by experimental confederates, 


Independent Variables 


Success-failure of the group, degree of participa- 
tion in decision making, and race of the group’s 
executive officer were systematically varied in a 
222 factorial design with a random assignment 
of subjects to conditions. 

The success-failure variable 
follows: 

1, The subjects were to!d at the beginning of the 
experiment that they would receive a $5 reward for 
exceeding the performance of a stated comparison 
group. 

2. The comparison groups were said to be mili- 
tary officers (high status) for the success condition 
and basic enlisted personnel (low status) for the 
failure condition. 


4 Although the sensi 


was introduced as 


tion problem precluded 
the use of a racial attitude inventory, subjects were 
pretested with a number of well-established measures 
of selected personality variables. We anticipated that 
success-failure would interact with personal needs 
for achievement and that level of task participation 
would interact with needs for affiliation and social 
dominance in determining attraction outcomes. How- 
ever, no personality data are reported because no 
interpretable findings were obtained. 

5Zander and Ledvinka (1971) indicate that 
greater task difficulty produces more favorable feel- 
ings about the group experience than less task diffi- 
culty when group success and failure are held con- 
stant. The intent of the present study was to hold 
task difficulty at a level which was high enough to 
challenge the participants yet not so high as to dis- 
courage them. Observation of the subjects indicated 
that this attempt was successful. 
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3. Success or failure was reported to the work 
group at the end of each workday by plotting on a 
display board the team’s progress in relation to the 
appropriate comparison group. In the success condi- 
tions the experimenter reported the team’s progress 
at the end of each of the five workdays as 
iollows: Day 1, even; Day 2, $500 ahead; Day 3, 
$500 behind; Day 4, $1000 ahead; Day 5, $3000 
ahead. The corresponding ”daily” feedback for the 
failure condition was as follows: even; $500 behind; 
$500 ahead; $1000 behind; $3000 behind. 

The subject’s level of participation in decision 
making was controlled by the behavior of the two 
trained confederates. Throughout the course of the 
task, major changes in the way the group operated 
its business were initiated in order to increase the 
group’s efficiency, In the low-participation condi- 
tion the insights underlying these changes were “dis- 
covered” and decided upon totally by the confed- 
erate occupying the executive role; the executive ex- 
plained the reasons for his decisions to his co-work- 
ers. By contrast, in the high-participation condition 
the confederates guided the discussion in such a way 
that the subject was likely to discover the principles 
of efficient operation. Furthermore, the executive 
officer urged the subject to make the decisions re- 
garding the implementation of changes. 

Race was varied as follows: For half the subjects 
the executive officer was black; for the other half he 
was white. When the executive officer was black, 
the shipping officer was always white, and vice 
versa. It is possible that the degree of attraction 
for either groupmate may have been affected by 
the fact that he was experienced in the context of a 
three-person interracial group, that is, when the sub- 
ject rated either of his two groupmates, the other 
groupmate was always of the other race. Generali- 
zations from our findings, then, must keep this con- 
straint in mind. In addition, there was a confound- 
ing of race with the personality and appearance of 
the two confederates. However, the importance of 
this confounding is minimized by the role training 
the confederates received. Their role prescriptions 
Specified not only the content of their exchanges with 
the research subject, but also the affect displayed 
toward him. 


Subjective Experience Variables 


Two rating scales were employed to assess the 
subjects’ perception of the success-failure and the 
high-participation—-low-participation variables. A 
three-item measure of perceived success and a four- 
item measure of perceived participation in decision 
making were developed for this purpose. Both scales 
exhibited good internal consistency (Cronbach's 
alpha > .90). Results indicated that the subject's 

ê Immediately following the completion of the 
postexperimental questionnaires in which the criterion 
data were gathered, the failure subjects were informed 
that an error had been made by the experimenter and 
that recomputation of their profits showed that they 
had indeed won the $5 bonus, 
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TABLE 1 
MEAN ATTRACTION FOR THE EXECUTIVE OFFICER 


Condition. e a na eT nalts 
Success 
High participation 140.43 137.86 
Low participation 139.86 144.86 
Failure 
High participation 130.43 125.43 
Low participation 129.71 129.14 


Note. Error (within); MS = 68.21. 


perceptions of success—failure and level of participa- 
tion corresponded with the objective manipulations; 
differences between subjects in different treatment 
conditions on the relevant perception measures were 
significant at better than the .001 level of confi- 
dence. To provide a check on the adequacy of the 
experimental manipulations, every session was tape 
recorded and later evaluated by a trained observer 
who judged the quality of the confederates’ efforts. 
When these were judged to be inadequate the sub- 
ject was discarded and replaced; this occurred in 
two instances. 


Criterion Variables 


Each subject's postexperimental attraction ratings 
for both the executive officer and shipping officer 
(i.e., the two confederates) were obtained in pri- 
vate with assurances that his group would not as- 
semble again and that the ratings were confidential. 
The confederates posing as enlisted men had been 
introduced as members of units located in a dif- 
ferent part of a large military base in order to 
minimize any expectations of continued acquaintance 
and thereby encourage frank and honest ratings by 
the subject. Attraction for each confederate was 
measured by a 21-item scale. Six of these items asked 
for ratings of respect and liking. For example, sub- 
jects were asked, “How much did you like the exec- 
utive officer?” In response, subjects then rated the 
executive officer on a 7-point scale ranging from 
“liked him very much” to “disliked him very much.” 
Nine of the items were adjective ratings from the 
evaluative factor of the semantic differential (Os- 
good, Suci, & Tannenbaum, 1957). The adjective 
ratings also utilized 7-point continua and included 
such dimensions as “friendly-unfriendly,” “honest- 
dishonest,” and “valuable-worthless.” The last six 
items were intended to specify real-life choices for 
the subjects, for example, how much they would 
like to work with each confederate on a military 
training program, and how much they would like to 
spend off-duty time with each. The 21-item attrac- 
tion scale showed high internal consistency (Cron- 
bach’s alpha>.90) when used to rate either the 
executive officer or the shipping officer. 

In order to minimize the likelihood that the pre- 
dicted attraction outcomes might be influenced by 
confederate expectations, the two confederates were 
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misled about the nature of our hypotheses. They 
were told that we expected subjects to like their 
groupmates more when the group failed and when 
their personal participation was minimal. The ra- 
tionales provided emphasized that stress and adver- 
sity tend to forge intimate bonds between people, 
and that the subject under low participation would 
be less threatened and would like his groupmates 
more because they relieved him of the burdens and 
risks associated with decision-making responsibili- 
ties. 


RESULTS 


Each of the 56 subjects supplied an at- 
traction score (21-item scale)’ for the exec- 
utive officer. These scores were distributed, 
seven per cell, in the eight cells of the 2 x 
2 X 2 design. The results of the analysis of 
variance are presented in the text where ap- 
proprieťe: The mean scores are found in Ta- 

le 1. 

Attraction ratings for the third group 
member (shipping-officer role) were also ob- 
tained and analyzed in the manner described 
above. The results of the analysis of variance 
are presented in the text and the means in 
Table 2. The findings are reviewed below. 

Hypothesis 1. A member of a cooperating 
interracial group that is successful will de- 
velop greater attraction for his groupmates 
than will a member of such a group that 
fails, 

The data show significant main effects for 
success-failure on attraction to both team 
members. For the executive oficer F(1, 48) 
= 29.91, p < .001, while for the shipping 
officer F(1,48) = 23.23, p< 001. Inspec- 
tion of the cell means presented in Table 1 
indicates that the differences in attraction are 
in the direction predicted by Hypothesis 1. 

Hypothesis 2. A member of a cooperating 
interracial group who actively participates in 
the decisions made by that group will develop 
greater attraction for his groupmates than 
will a member whose participation in decision 
making is more limited. 

No significant main effect for participation 
on attraction ratings for either the executive 
officer, F(1,48) = 1.14, p > .10, or the ship- 
ping officer, F(1,48) =.15, p> 10, was 
found. These results failed to support Hy- 
pothesis 2. 

The failure to find a significant main effect 
for participation in decision making stimu- 


1025 


TABLE 2 
MEAN ATTRACTION FOR THE SHIPPING OFFICER 


Black stimulus | White stimulus 


Condition 


person person 
Success 
High participation 140.29 138.57 
Low participation 138.29 143.71 
Failure 
High participation 130.71 128.29 
Low participation 123.57 128.29 


Note. Error (within): MS = 94.17. 


lated further analysis. We noted that success- 
ful subjects in the low-participation condition 
reported higher perceived participation in de- 
cision making than did failing subjects in the 
low-participation condition. This observation 
led us to carry out an analysis of variance in 
which perceived participation was the de- 
pendent variable. In addition to a significant 
main effect for objectively manipulated par- 
ticipation, F(1,48) = 29.34, p< .001, we 
found, as anticipated, a main effect for suc- 
cess-failure, F (1,48) = 39.06, p < .001, and 
a Success-Failure X Objectively Manipulated 
Participation interaction, F(1,48) = 6.25, 
p < .05. Success produced higher perceived 
participation. 

These results gave rise to the possibility 
that perceived participation in decision mak- 
ing, when treated as an independent variable, 
might show an impact on the attraction rat- 
ings.* Partial correlations were computed be- 


7 Cluster analysis of the 21-item attraction measure 
produced three item clusters. All analyses were re- 
peated using the scores derived from these clusters. 
However, since the results of these analyses exactly 
paralleled those obtained when the total scale score 
was used, only the latter data are presented. 

8 This evidence of the interdependence of success- 
failure and level of subjectively experienced partici- 
pation raises the question of whether our inde- 
pendent variables were, in fact, orthogonal. In 
response to this question several points should be 
considered. First, it seems reasonable to conceptualize 
a given subjective experience as an analogue of an 
environmental variable—related to but not iso- 
morphic with the environmental variable in ques- 
tion. Such a perspective implies that perceived par- 
ticipation is a legitimate variable in its own right, 
quite apart from the objectively manipulated level 
of participation. That is, we would argue that there 
are two imperfectly correlated classes of variables 
to which a person can respond, one subjective and 
one objective. An individual can react, in other 
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tween perceived participation and attraction 
for the executive officer and the shipping offi- 
cer with perceived success held constant. The 
partial correlations did not reach significance, 
Toy .<(54) = .20, p > -10 in both cases. These 
results clearly indicate that neither objectively 
manipulated participation nor perceived par- 
ticipation affected attraction ratings for 
either confederate. 


Race 


As noted earlier, no hypothesis concerning 
the effect of the race of the stimulus person 
was formulated. There were no significant 
effects on interpersonal attraction due to the 
race of the stimulus person, either by itself 
or in interaction with any other variable. 


Group Member's Role 


The data were such as to make it possible 
to examine the effect of a group member’s 
position (executive or shipping officer) on the 
level of attraction expressed for him. The im- 
portance of this issue derives from the fact 
that both the position titles and the respec- 
tive role requirements implied a differential 
in authority and, perhaps, status that might 
have influenced attraction ratings. The 56 
scores reflecting attraction for the black con- 
federate were analyzed in terms of the effect 


words, not only to subjectively experienced events, 
but also to objective environmental events without 
having these events veridically represented in sub- 
jective experience such that they can be accurately 
described in a verbal report. For example, social norms 
influence behavior, but the nature of their behavioral 
prescriptions cannot always be described by the 
responding individual, This perspective suggests that 
a more relevant check on the manipulation of the 
participation variable would be the behavioral ob- 
servations which we employed: An independent ob- 
server listened to a recording of the confederates’ 
behavior as they introduced the high- and low- 
participation conditions. When the confederates’ be- 
havior failed to elicit the specified type of participa- 
tion, subjects were discarded. As noted earlier this 
happened in only two cases. Insofar, then, as our 
high- and low-participation scripts dictated differen- 
tial subject participation in decision making, it is 
clear that behavioral variation along this dimens‘on 
did occur. 

9 This result is similar to the findings of Hurwitz, 
Zander and Hymovitch (1953) who reported no 
significant difference in the attraction ratings made 
by low-status group members for low- and high- 
status groupmates. 
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of three variables: (a) success-failure; (b) 
level of participation in decision making; and 
(c) position in the group (executive or ship- 
ping officer). A three-way analysis of vari- 
ance of these scores was carried out. A parallel 
analysis of variance was carried out for the 
attraction scores for the white confederate. 
The results indicated that the position held 
by the group member had no effect on ratings 
of liking and respect for him,” 


DISCUSSION 


Our results demonstrate that members of 
successful, face-to-face, cooperating groups 
liked and respected their groupmates more 
than did members of similar but failing 
groups. While previous research had suggested 
that success-failure is an important variable 
in determining interpersonal attraction in 
small groups, certain features of this research 
had raised doubts that its findings could be 
generalized to groups characterized by a high 
level of means interdependence in the pursuit 
of joint goals with extensive, face-to-face 
interaction. The results of the present study 
dispel such doubts. 

To understand the effect of group success— 
failure on liking and respect for groupmates 
it is helpful to postulate a mediating variable 
of satisfaction—dissatisfaction with the group 
experience. The satisfaction generated by 
group success (and the rewards accompanying 
success) is associated with those who are 
involved in the group activities. This reward 
generalization, or spread of affect, results in 
increased intermember attraction. A similar 
interpretation has been suggested by a num- 
ber of writers (e.g., Byrne & Wong, 1962; 
Lott & Lott, 1965). 

It is important to note that the prejudiced 
white subjects in the cooperating interracial 
groups of this study show no differences in 
attraction for their black and white team- 
mates. In view of the failure of other studies 
to find a predictable relationship between 
prejudice toward an ethnic group and reaction 
to individual members of that group (e€g., 
Wicker, 1969) this result is not surprising. 
What is needed, clearly, is to specify the 
circumstances under which positive reactions 
to individuals develop despite negative reac- 
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tions to the ethnic groups to which they be- 
long (Cook, 1969, Note 1). Our results sug- 
gest that the combination of experiences that 
characterizes membership in cooperating 
groups may represent one such circumstance. 
Among these experiences are: prejudiced 
whites behave counterattitudinally for a pe- 
riod of time; they acquire information that 
enables them to individualize the black team 
members; and they share a task-oriented ef- 
fort on equal-status terms. 

It is of particular interest that the lowering 
of respect and liking that occurs under group 
failure is no greater for black than for white 
groupmates. Although the evidence regarding 
displacement of frustration-induced aggres- 
sion onto persons toward whom one is preju- 
diced is inconsistent (see the introduction), 
such a dynamic conceivably might have op- 
erated in this study to further reduce attrac- 
tion for the black groupmate. That it did not 
do so may be due to the fact that the frus- 
tration in this study was caused by group 
failure. By contrast, the frustration in earlier 
studies in which displacement of hostility did 
occur had usually been induced by arbitrary 
deprivation or experimenter mistreatment. 
For example, Miller and Bugelski (1948) 
report that, following the experience of being 
arbitrarily prevented from attending a movie 
they had planned to see, subjects displaced 
hostility by attributing fewer positive and 
more negative traits to other ethnic groups. 
Weatherley (1961) found that prejudiced 
subjects expressed significantly greater fan- 
tasy aggression toward a member of a dis- 
liked minority group than did nonprejudiced 
subjects following “highly insulting and dep- 
recating comments to the subjects” by the 
experimenter. In neither of these studies were 
the subjects interacting with the ethnic groups 
in question. Berkowitz (1959) demonstrated 
that highly prejudiced subjects liked a team- 
mate with whom they had worked on a coop- 
erative task significantly less than did non- 
prejudiced subjects after experimenter mis- 
treatment of the following sort: telling the 
subject that she was late and sarcastically 
asking her whether this was always true, 
deprecating the subject’s performance on a 
concept formation task, and questioning her 
ability to do well in college. Berkowitz and 
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Green (1962) found that subjects who had 
been “criticized and insulted by the experi- 
menter” during a problem-solving activity 
liked an already disliked person significantly 
less than did subjects who had not experi- 
enced hostility. Presumably, subjects in these 
studies were inhibited from responding di- 
rectly to the high-status source of their dep- 
rivation or mistreatment and thus displaced 
their aggressive reactions onto a safe and 
harmless target.’° By contrast, as indicated 
above, the frustration in the present study 
was due to group failure. The reason for the 
failure was understood to be inadequate per- 
formance on the part of the group to which 
the subject belonged. Success and failure 
were judged in comparison to “past groups 
who had participated in the research.” This 
left no room for the subject to see the experi- 
menter’s behavior as capricious or arbitrary. 
Indeed, the experimenter expressed regret 
when reporting failure to the group members. 

While, as the foregoing indicates, we found 
no evidence that under failure black team 
members are liked and respected less than 
white ones, neither did we find the occurrence 
of more favorable feelings toward blacks 
which Burnstein and McRae (1962) observed 
under “threat of failure.” As noted earlier, 
these investigators found that prejudiced 
white subjects in groups that failed (accom- 
panied by threat of failure) ranked their 
black teammates relatively higher on attrac- 
tion than did subjects in groups that suc- 
ceeded (no threat of failure), They inter- 
preted this as indicating that threat of failure 
and the experience of shared coping with that 
threat makes a group more cohesive. This 
group cohesion, in turn, leads to increased 
attraction for all team members. Why this 
increase is relatively greater for the black 
than for the white teammates is not clear. 

An alternative explanation may be offered 
for the Burnstein and McRae results. Instead 
of shared “threat of failure,” the punitive 
actions of the experimenter may have ac- 


10 Other investigators have indicated that the extent 
to which frustration leads to the expression of ag- 
gression is related to the degree of arbitrariness of 
the frustrating agent; the greater the arbitrariness, 
the greater the likelihood of aggression (Allison and 
Hunt, 1959; Cohen, 1955; Pastore, 1952). 
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counted for the findings. Lanzetta (1955) 
has demonstrated that “high stress,” induced 
in cooperating groups of white subjects by 
task failure accompanied by experimenter 
badgering and ridicule, increased intermem- 
ber ratings of friendliness and cooperative- 
ness. It seems likely that this outcome is a 
defensive, united-front reaction to the humili- 
ating attack from the experimenter. While 
Burnstein and McRae were milder, they 
employed similar tactics in their manipula- 
tion of threat of failure. After each trial the 
experimenter not only reported the team’s 
failing progress but included negative state- 
ments about the team’s performance as well. 
After the final trial the team members were 
asked to stand and face each other (subjects 
had previously been visually isolated in sepa- 
rate cubicles) while the experimenter deliv- 
ered negative remarks about their poor per- 
formance.** 

It seems reasonable to infer that by these 
humiliating actions Burnstein and McRae led 
their subjects to project onto the experimenter 
any negative feelings that failure may have 
aroused and simultaneously promoted de- 
fensive or protective intermember attraction. 
Yet, we must still ask why the latter effect 
was most strongly reflected in the ratings 
made by prejudiced white subjects of the 
black groupmate. One answer to this question 
is suggested by a theory advanced by Katz 
(1970). Marshalling evidence to show that 
many persons retain a guilt-evoking recogni- 
tion that the targets of their prejudice are 
indeed treated unfairly, he argues that this 
ambivalence “amplifies” reactions to the ob- 
jects of prejudice. Unfriendly reactions be- 
come even more negative and friendly reac- 
tions more positive. As applied to the Burn- 
stein and McRae results we might postulate 
that responses reflecting positive intermember 
attraction in reaction to the experimenter’s 
hostile actions are “amplified” when the black 
team member is being rated.** 


31 Unlike the frustration—aggression studies where 
experimenter hostility was directed toward one per- 
son, the ridicule was directed at the group as a whole 
in the Burnstein and McRae investigation. 

12 Katz also interprets the Burnstein-McRae (1962) 
results in terms of ambivalence-amplification theory. 
His interpretation, however, is different from that 
offered here. 
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To the extent that this type of outcome is 
a function of experimenter behavior in intro- 
ducing failure rather than of failure itself, 
results like those of Burnstein and McRae, 
while extremely valuable from other perspec- 
tives, need not be viewed as contradicting 
results from studies which show decreases in 
attraction to black (and white) groupmates 
under group failure. 

Finally, although we had considered it pos- 
sible that a high level of participation in 
group decision making might enhance inter- 
member attraction and hence minimize the 
detrimental effects of group failure, such did 
not occur. No effect of level of participation, 
nor any interaction with group success-fail- 
ure was found. It is possible that this outcome 
might be a consequence of the fact that the 
extrinsic reward in the experimental situa- 
tion ($5 bonus) was associated solely with 
success. Participation in decision making, 
while it may have satisfied intrinsic ego mo- 
tives in the subjects, may not have engendered 
a sufficiently high level of satisfaction to be- 
come a factor in the overall affect generated 
by the experimental experience. If this were 
the case, some means of attaching an ex- 
trinsic reward to high levels of participation 
or, conversely, disassociating extrinsic reward 
from group success, might conceivably in- 
crease the impact of the participation varia- 
ble. 


SUMMARY 


The degree of liking and respect among 
members of cooperating interracial groups 
found to be influenced by the success or f 
ure of the group in achieving its goal. This 
effect was equally strong for attraction to 
black and to white groupmates. No effect 
upon intermember attraction was observed 
for race alone, nor for race in inte: 
with success-failure (projected blame). A 
high level of participation in group decision 
making, assumed to satisfy intrinsic ego 
motives, did not minimize the negative effects 
of group failure. 

The interpretation offered for these results 
is that an extrinsic financial reward associ- 
ated with group success or failure controlled 
the level of satisfaction of the research sub- 
jects with the group experience and that the 
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resulting affect was generalized to the persons 
with whom the experience had been shared. 
The individualizing potential of face-to-face, 
cooperating groups served to inhibit a poten- 
tial tendency to project blame for failure 
upon black members of the group. 

These findings would appear to have im- 
portant practical implications. Enforced de- 
segregation in such natural settings as schools 
and industry often provides the opportunity 
to assemble task-oriented interracial groups. 
The success or failure of such groups, when 
accompanied by significant extrinsic rewards, 
may well determine the degree of cross-racial 
liking and respect which develops. 
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Inferential Beliefs and Order Effects in Personality 
Impression Formation 


James J. Jaccard and Martin Fishbein 
University of Illinois 


The present study investigated (a) the role of inferential beliefs in the im- 
pression formation process and (b) order effects. It was found that considera- 
tion of a subject’s inferential beliefs significantly increased prediction of the 
evaluative impression formed by subjects. Four possible explanations of order 
effects were considered. Results tended to restrict further the discounting notion 
and were not supportive of an attention decrement, directed impression, or 


serial learning explanation. 


Research on impression formation has gen- 
erally used the same experimental paradigm: 
Subjects are presented a number of discrete 
characteristics ascribed to a stimulus person. 
After seeing this information, the subject is 
asked to form an impression, that is, to indi- 
cate the degree to which he likes or dislikes 
the stimulus person. Effects of different types 
of information are monitored, and inferences 
about the way the subject processes informa- 
tion to formulate his impression are made. 

The notion of relating effects of informa- 
tion to an overall evaluation of an individual 
is closely related to Fishbein’s (1963, 1967) 
theory of attitude formation. According to 
this theory, an individual’s attitude toward 
(evaluation of) some person is a function of 
(a) his beliefs about the person and (b) the 
evaluative aspects of those beliefs. Algebrai- 
cally, this may be represented by the follow- 
ing equation: 


Ap = s Biai, (1) 


i=l 


where Ap = the attitude toward some person 
p, B, = the perceived probability that p pos- 
sesses some trait Xj, a; = the evaluation of 
X;, and n = the number of beliefs. The above 
model has received considerable support with 
respect to attitude research (e.g., Fishbein, 
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1963; Kaplan & Fishbein, 1969) and has 
been applied to the impression formation area 
by Anderson and Fishbein (1965) to good 
effect. As such, the model may be of diag- 
nostic value in analyzing factors affecting 
evaluative impressions. 


Inferential Beliefs 


A number of studies have indicated that 
given a set of traits describing a stimulus 
person, a subject may often infer or attribute 
additional traits to that person. Research 
on trait attribution and implicit personality 
theory has considered trait interrelations in 
depth (e.g., D’Andrade, 1965; Hakel, 1969; 
Norman, 1963), and the work of Dustin and 
Baldwin (1966) and Wyer (1968, 1970) sug- 
gests indirectly that impressions may be based 
in part on inferential beliefs. 

Although these studies are suggestive, most 
studies of impression formation have excluded 
inferential beliefs in the analysis of evaluative 
impressions. Only the studies by Asch (1946) 
have explicitly dealt with traits inferred by 
individuals and how these may influence the 
resulting impression. One purpose of the 
present investigation is to test the importance 
of inferential beliefs in the prediction of 
evaluative impressions. Specifically, it is hy- 
pothesized that the correaltion between Ap 
and SB,a; should be higher when the XBya; 
measure includes inferential beliefs than when 
it does not. 


Order Effects 


Research on order effects has clearly estab- 
lished that order of presentation of traits in- 
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fluences a subject’s impression, usually in 
the direction of Primacy (e.g., Anderson, 
1968; Anderson & Hubert, 1963; Asch, 
1946). However, the mechanism underlying 
these order effects has remained elusive. The 
Present investigation considers four explana- 
tions of order effects. 


Change in Meaning 


A number of theorists (e.g., Asch, 1946; 
Chalmers, 1971) have hypothesized that 
order effects may be due to a “change in 
meaning” or “directed impression” mecha- 
nism. Asch has argued that within a given set 
of traits, some take a more central role than 
others and that these “set the direction for 
the further view of the person.” Adjectives 
in any position of the list may be central 
and, hence, Asch’s change in meaning expla- 
nation can account for either primacy or 
recency. Most interpretations of change in 
meaning, however, argue that adjectives pre- 
sented early in a list create an initial impres- 
sion which influences the interpretation of 
later adjectives. For example, if a set of 
adjectives is preceded by a set of favorable 
adjectives, the evaluative meanings of the 
latter set should shift in a more favorable 
direction. It is this interpretation of change 
in meaning that the present article is con- 
cerned with, 

Indirect evidence for a directed impression 
interpretation has been obtained in studies 
of context effects (eg., Wyer & Dermer, 
1968; Wyer & Watson, 1969). These studies 
have indicated that (a) evaluation of an 
adjective within a stimulus list differs from 
its evaluation in isolation, and (b) evalua- 
tion of the adjective in context is displaced 
toward the evaluations of the other adjectives 
in the stimulus list. Both of these findings are 
in accord with the “change in meaning” 
hypothesis." 


Attention Decrement 


Anderson and Barrios (1961) have sug- 
gested that the often obtained primacy effect 
may result from a progressive decrease in 


1 Anderson has interpreted these findings as sup- 
porting “halo effects” rather than change in meaning. 


attention over the adjectives of a given set 
According to this view, a subject’s attention 
to stimulus traits is positively and linearly 
related to order of presentation such that 
the subject attends most to adjectives pre- 
sented early in the list and least to adjective 
presented late in the list. The notion of atten 
tion decrement has been studied by Ander: 
and Hubert (1963), Stewart (1965), Ander- 
son and Jacobson (1965), Hendrick and 
Constantini (1970), and Chalmers (1971), 
The results of these studies are equivc 
For example, Anderson and Hubert read 
set of adjectives to subjects and later asked 
them to recall these traits. If subjects pay 
less attention to later adjectives, one woul 
expect the probability of recall to be lower 
for these adjectives than for earlier ones (to 
which they are more attentive). However, on 
the recall trial, a uniform recency effect in 
the recall cruve was observed. In contrast to 
this, Stewart and Hendrick and Constantini 
reported data, not based on recall. which 
are congruent with the attention decrement 
notion. 


Discounting 


Anderson and Jacobson (1965) have pre- 
sented a third explanation of order effects. 
According to these theorists, subjects may 
not believe that all traits presented by an 
experimenter are equally valid and hence may 
discount some of them in forming an overall 
impression. Anderson and Jacobson argued 
that those traits perceived as being inconsist- 
ent with other traits in the stimulus set tend 
to be discounted. Since information in im- 
pression formation studies usually comes in 
blocks of desirable or undesirable traits, one 
strategy a subject might use is to discount 
an entire block of information. Hendrick and 
Costantini (1970) argued that subjects tend 
to discount later rather than earlier informa- 
tion, the result being a primacy effect (ie., 
the later block of information is discounted), 

Support for the discounting notion has also 
been equivocal. Two experiments by Hendrick 
and Costantini (1970) revealed data incon- 
sistent with the discounting hypothesis, while 
research by Anderson and his associates (e.g., 
Anderson, 1965, 1968; Anderson and Jacob- 
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105) has yielded data which are in 
with a discounting explanation. 


Learning 


urth explanation of primacy—recency 

which has not been formally applied 

impression formation area, is one 

on serial learning curves (Hovland, 

Jersild, 1929; Miller & Campbell, 

According to this view, an individual 

o learn (e.g., recall) stimuli at the be- 

and ends of the list better than stimuli 

ag in the middle. Depending upon the 

» values of traits in the list, serial 

could explain primacy or recency 

For example, if the first part of a 

consisted of highly polarized positive 

traits and the last part consisted of slightly 

polarized negative traits, then a primacy effect 

would be obtained (assuming both parts are 

learned about equally well and the middle 

part least well). On the other hand, if the two 

ends were reversed, the latter end would have 

the higher affective value, and recency effects 

ould be obtained. From a learning perspec- 

en, a subject’s impression of a stimu- 

} n should depend on those traits that 

ar ved (i.e., recalled) and the evaluation 
of those traits, 

ond purpose of the present investiga- 

i o test the applicability of the above 

explan.uons to the analysis of primacy- 

recency effects. From the perspective of the 

Fishbein model presented earlier, the dis- 

counting notion should be related to belief 

strength, or the subjective estimate by a sub- 

ject of the probability the stimulus person 

< a given trait. If a subject “discounts” 

he may be said to assign a low prob- 

hat the person does possess the trait. 

wm decrement should be related to 

í traits. If, as suggested by Anderson 

| Hubert (1963), later words in a list are 

atteuded to less, these words should have a 

lower probability of recall. Attention decre- 

ment may thus be assessed by the analysis 

of recall curves. In addition, attention decre- 

ment may also be related to belief strength. 

If subjects pay less attention to later adjec- 

tives, it might be expected that subjects 

would assign a lower probability that the 
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person possesses a trait occurring later in the 
list. Thus, according to attention decrement, 
one might expect an orderly decrease of belief 
strength over portions of the list. 

Whereas attention decrement and discount- 
ing argue for differences in belief strength as 
the major cause of order effects, a directed 
impression notion holds that changes in 
evaluations are the primary factors. The rela- 
tive influence of change in meaning may 
therefore be scrutinized in terms of the extent 
to which a given trait is evaluated differently 
in different contexts. Finally, a serial learning 
explanation can be tested to the extent that 
subjects’ impressions follow the nature of the 
recall curve.” 


METHOD 
Subjects 


Subjects were 121 undergraduate female students 
enrolled in an introductory psychology course. Their 
participation fulfilled a course requirement. 


Design and Construction of Stimulus Lists 


A list of 134 adectives describing people was com- 
piled and given to 30 females who served as judges. 
Each judge rated each adjective on a 7-point good- 
bad semantic differential scale. From this list of 
134 adjectives, 12 were selected for use in experi- 
mental sessions: 3 very favorable (+3), 3 slightly 
favorable (+1), 3 slightly unfavorable (—1), and 
3 very unfavorable (—3). The following criteria 
were used to select the adjectives; (a) They should 
have a mean value as close to the desired point on the 
scale as possible, (b) there should be as much agree- 
ment among judges as possible (i.e., a small standard 
deviation), and (c) none of them should be a 
direct antonym of a word already on the list. The 
following adjectives were selected: very favorable— 
ambitious, loving, sincere; slightly favorable—quiet, 
rich, musical; slightly unfavorable—stout, critical, 
nervous; very unfavorable—ugly, rude, boring. 

Four groups of 31, 31, 29, and 30 subjects were 
read the list of 12 adjectives said to describe a 
person, For each experimental group the list was 
read in a different order. The order of presentation, 


2The attention decrement and serial learning ex- 
planations are conceptually similar in that each 
makes predictions about the adjectives the subject 
should learn. However, the attention decrement 
notion implies a progressive decrease in recall, 
whereas serial learning implies a bow-shaped learning 
curve and hence may make different predictions 
about the impression a subject will form. 
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in terms of scale values, was as follows: 
Group 1: +1+-1 1 +3+ 
Group 2: FI +343 +i+ 
Group ee ea e genes 
Group 4: =a -3- 


Thus two groups received lists in the pro-con order, 
and the order was reversed for the remaining two 
groups. In addition, within each half of the lists, the 
adjectives were arranged in a climax (+1 to +3) or 
anticlimax order (+3 to £1). The four lists there- 
fore represent a 2X2 (Pro-Con vs. Con-Pro X 
Climax ys. Anticlimax) analysis of variance design. 
According to the attention decrement, discounting, 
and directed impression notions, a significant pro-con 
versus con-pro main effect on evaluation of the 
stimulus person should occur. In contrast, the serial 
learning notion predicts no main effects and an 
interaction.® 


Procedure 


Subjects were run in small groups of approxi- 
mately 15. After all subjects were seated, a covered 
booklet was passed out. The following instructions 
were then read by the experimenter: 


We are doing a study on impression formation 
and would like to get a feel for how people form 
impressions and the things they take into account 
in doing so. I am going to read to you a list of 
traits that describe a person. On the basis of this, 
I am going to ask you to form an impression of 
the person. I will tell you exactly how I want you 
to do this later. First, let me read the traits that 
describe the person to you. The person is... 
[the list of adjectives was then read to the sub- 
jects.] Now turn to the first page of your book- 
let. I would like you to try to recall all the traits 
that I just read to you. List them on the lines 
as follows [a series of lines was drawn on the 
blackboard, and subjects were shown how to enter 
their answers]. In addition, I would like you to 
write down on the other lines any other adjectives 
that you think might characterize the person de- 
scribed. When you are finished, go on to the next 
page. Further instructions will be given there.* 


On the second page of the booklet, subjects were 
given instructions on the use of semantic differential 
scales (Snider & Osgood, 1969). On the third page, 
subjects rated the “person described” on 7 bipolar 
semantic differential scales. Six of these were 7-point 
scales which have been found to load consistently 
on an evaluative factor (e.g., good—bad, awful-nice). 
The seventh was the traditional like—dislike measure, 
also a 7-point scale.5 

Page 4 of the booklet instructed subjects to return 
to the first page with the following instructions: 


We would like to know how certain you are 
that each of the characteristics you have listed 
is a characteristic of the person. That is, how 
certain are you that your description accurately 
describes the person? 


3 
1 
1 
3— 
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ee ai ae eG ig 9 
aye ie Nemes On bea 
-1 434343 414141 

3 +14+14+1 434343 


For each trait listed by the subjects on page 
subjects were told to indicate their certainty on 
4-point scale ranging from not certain at all (0) t: 
quite certain (+3). This measure constituted tl 
“belief” component (i.e. the perceived probabilit 
that p has trait X) of the Fishbein model. Page 5 
of the booklet instructed subjects to return te the 
first page again and rate each trait listed on 
7-point good-bad scale. This measure constituted th 
“evaluative aspect of the belief” (i-e., evaluation « 
trait X) of the Fishbein model. 

Pages 6, 7, and 8 contained a list of 48 adjective 
among which the 12 adjectives orginally present 
were embedded. The remaining 36 adjectives we 
those used by Asch (1946) in asking subjects to f 
impressions of others (18 bipolar scales, 36 adji 
tives in all), For all 48 adjectives, subjects v 
asked to indicate on a 4-point scale how certain t 
were that “a person such as the one described w<: 
have that trait.” In addition, subjects rated eac! 
the 48 adjectives on a 7-point good-bad seman 
differential scale. 


Ba; Measures 


From subjects’ responses, Bia; measures were 
computed by multiplying the score on each belief 
measure by the score on the corresponding evalua- 
tion measure and summing these products over all 
beliefs. A number of different 2Bia: measures were 
calculated: (a) =Byia: based on traits recalled, (b) 


Bia; based on traits recalled and traits inferred 
(i.e, those that subjects listed on the first page as 


also being descriptive of the person described), (c) 
Biya; based on all traits originally presented (mea- 
sures used to construct this independent variable 
were those ascertained in the last part of the book- 
let), (d) Bia; based on the Asch adjectives, and 


3 The interaction prediction is obvious if one sums 
the scale values of the first and fourth quarters of 
each group and compares the respective sums. 

4 It should be noted that subjects were not toid in 
advance that they would be required to recall the 
traits. 

5 Results of a pilot study indicated similar results 
to those to be reported when subjects were asked 
to give the impression first and then to recall beliefs. 

6 Measures of belief and evaluative aspects of 
beliefs obtained at the beginning of the questionnaire 
were highly correlated with those measures obtained 
in the last part of the questionnaire. For example, 
a Bia; based on traits recalled was computed from 
responses in the first part of the questionnaire as 
well as from the sęcond part. These two measures 
were correlated .823 (p < .01). Similarly, a correla- 
tion between belief score for trait X in the beginning 


oy ee 


So 


+ 
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(e) ZBya, based on both the Asch adjectives and the 
original 12 items. 


RESULTS 


Consistent with expectations, the sum of 
the six evaluative scales on which subjects 
rated the stimulus person was highly related 
(r = .882, p < .01) with scores on the like- 
dislike scale. Since past research has tradi- 
tionally examined the like-dislike variable, it 
has also been selected as the major dependent 
variable in the following analyses. Analyses 
using the sum of all evaluative scales were 
also performed, and the results were compar- 
able. 


Inferential Beliefs 


Table 1 presents correlations relevant to 
testing the role of inferential beliefs in im- 
pression formation, As can be seen, the high- 
est correlation between Ap and ¥B;a; was 
observed when the SB;a; measure was based 
on traits recalled and traits inferred (r= 
669, p < .01). This correlation was signifi- 
cantly higher than the correlation between Ap 
and XB;a; based on all 12 traits presented 
(r = 481, tart = 5.00, p < .01). Indeed, the 
correlation based on recalled and inferred 
traits was significantly higher than all other 
correlations presented in Table 1. Thus, pre- 
diction of evaluative impression was signifi- 
cantly improved by taking into consideration 


` inferential beliefs. 


One implication of the Fishbein model for 
impression formation research is that both 
beliefs (B,) and evaluations (aj) are con- 
ceptualized as crucial variables in the predic- 
tion of evaluative impressions. If belief 
strength is assumed to be equal and constant 
for all traits, then the evaluative impression 
should be a function of simply the evaluations 
of traits in the stimulus list. However, this is 
not the case when belief strength may vary. 
Support for the notion that belief strength is 
important can be seen in Table 1, For ex- 
ample, the correlation (r = .669) between Ap 
and Bia, based on traits recalled and in- 
ferred was significantly higher (¢ = 3.45, p < 
.01) than the correlation (r = .535) between 


of the questionnaire and belief score for trait X at 
the end was computed for each trait. The mean 
correlation for the 12 traits was .775 (p < .01). 
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TABLE 1 


SELECTED CORRELATIONS BETWEEN MEASURES OF 
Bia; AND LIKE-DISLIKE SCALE 


Measure Like-dislike 
XB;a; traits recalled and inferred 669 
DBiaj traits recalled only A72 
XBia; all 12 traits presented 81 
Bia; Asch adjectives only .552 
DBia; Asch adjectives and the 
12 traits presented .565 
Daj traits recalled and inferred 535 
Xa; traits recalled only 352 
270 


Xai all 12 traits presented 


Note. N 121. All correlations are significant, p < 01. 
ZBiai r to s (1963, 1967) theory of attitude forma- 
tion, where Bi refers to and aj refers to evaluative 
aspects of these beliefs. 


belie 


Ap and Xa; (i.e., the sum of the evaluations 
of the traits recalled and inferred). Thus, 
prediction of Ap was significantly improved 
by taking into consideration belief strength. 


Order Effects 


To test for order effects, a 2 X 2 analysis 
of variance was performed on liking-disliking 
of the stimulus person. The first factor was 
the pro—con versus con-pro order factor; the 
second factor was the climax-anticlimax fac- 
tor, Only a significant pro-con, con-pro effect 
was observed, F(1, 117) = 8.83, p< .01. 
Analysis of the means indicated a primacy 
effect, with groups in which adjectives were 
presented in a pro-con order (Groups 1 and 
2) having a mean likability score of 4,23 and 
con-pro groups, a mean likability of 3.44. 
This finding is consistent with the attention 
decrement, directed impression, and discount- 
ing notions and is counter to the serial learn- 
ing hypothesis. (Table 2 presents the means 
for each of the four groups.) A similar 2 X 2 
analysis of variance was performed using 
XB,a; based on traits recalled and inferred as 
the dependent measure. This analysis pro- 
duced comparable results. 


TABLE 2 


MEAN EVALUATION OF STIMULUS PERSON As A 
FUNCTION OF PrO-CON, CON-PRO AND 


CLIMAX~ANTICLIMAX FACTORS 
Condition | Climax | Anticlimax M 
Pro-con | 3.90 4.55 4.23 
Con-pro 3.45 3.43 3.44 

M 3.67 3.99 
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TABLE 3 


Mean BELIEF STRENGTH PER QUARTER OF LIST AS 
A Function or Pro-Con, Con-Pro, 
CLIMAX-ANTICLIMAX, AND LIST 
Position FACTORS 


Quarter | 
Condition |———__—__——__—_|_ Mu 
First | Second | Third | Fourth 
Pro-con | | 
Climax 2.56 | 2.24 | 2.37 2.17 | 2.34 
Anticlimax | 2.42 | 2.44 | 1.67 | 1.83 | 2.09 
Con-pro | 
Climax 2.42 | 2.26 | 2.32 | 2.40 | 2.35 
Anticlimax | 2.19 | 2.31 1.78 1.99 | 2.07 
M 2.40 | 2.31 2.02 | 2.10 


Note. Belief strength is scored on a scale from 0 to 3. 


Discounting 


According to the discounting hypothesis, 
the mean belief strength for traits in the first 
half of the list should be greater than the 
mean belief strength for traits in the second 
half for all groups. This follows from the fact 
that for each group, the second half of the 
list is affectively inconsistent with the first 
half and hence should be discounted. 

A2Xx2x 2X2 analysis of variance was 
performed on belief strength. The four factors 
were (a) pro-con, con-pro, (b) climax—anti- 
climax, (c) the two halves of the list, and (d) 
the two quarters within each half. The de- 
pendent measure was the subjects’ mean be- 
lief strength for the three traits within each 
of the four quarters and was derived from 
measures obtained at the end of the question- 
naire (see Footnote 6). 

Consistent with the discounting notion, a 
main effect for halves of the list was observed, 
F(1, 117) = 23.69, p < 01, with the mean 
belief strength for traits in the first half of 
the list being significantly greater than the 
mean belief strength for traits in the second 
half (2.35 vs. 2.07). However, two significant 
interaction effects indicate that discounting 
is limited to certain conditions. First, a sig- 
nificant interaction, F¢1, 117) = 16.00, p < 
.01, was observed between climax—anticlimax 
alignment and the halves of the list. As seen 
in Table 3, subjects in the anticlimax condi- 
tion tended to believe less strongly the second 
half of the list as opposed to the first half 
(X = 2.34 in the first half vs. X = 1.82 in 
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the second), whereas those in the climax « 
dition tended to believe both halves wi 
equal strength. This suggests that if discoun' 
ing is causing lowered belief strength, th: 
effect is only operating in conditions of anti- 
climax as opposed to climax alignment. 

A significant interaction was also observed 
between the pro-con factor and the halves of 
the list, F(1, 117) = 3.94, p < .05. Subjects 
in the pro-con condition tended to believe 
traits in the first half of the list more strongly 
than traits in the second half ( first half X = 
2.41 vs. second half ¥ = 2.01), whereas those 
in the con-pro condition believed both halves 
with equal strength (2.29 vs. 2.12). Thus, 
subjects who heard positive traits first and 
then negative traits were likely to discount’ 
the negative traits, whereas subjects who 
heard negative traits first and then positive 
traits did not tend to discount the positive 
traits.” 


Attention Decrement 


According to the attention decrement hy- 
pothesis, subjects’ attention to stimulus traits 
should decrease linearly as a function of posi- 
tion during presentation. If later words are 
attended to less, these words should have a 
lower probability of recall. Thus, the recall 
curve should also reflect a linear decrease. 

A2X2xX2X 2 analysis of variance was 
performed on the number of words recalled 
in each of the four quarters of the list. The 
factors were identical to those in the previous 
analysis of variance for belief strength. Table 
4 presents the mean number of words recalled 
for each group by quarter of the list. A main 
effect for halves of the list was observed, 
F(1, 117) = 7.12, p< .01, and the means 
were consistent with the previous analysis on 


7 Although the analyses are not directly relevant 
to the theoretical explanations of order effects con- 
sidered in this article, there was an overall tendency 
for subjects to believe positive traits more than 
negative traits (X = 2.28 vs. 2.15, respectively) and 
to believe less polarized traits more than polarized 
traits (X = 2.28 vs. 2.14 respectively). In addition, 
subjects in the climax condition were more likely to 
believe traits than subjects in the anticlimax condi- 
tion (see Table 3). 

8It should be noted that the attention decrement 
notion is in conflict with most learning phenomena 
such as the Von Restorff effect. 
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belief strength. The mean number of words 
recalled in the first half of the list was 2.24 
as contrasted with 2.07 in the second half. 
This difference was, however, entirely due to 
recall in the first quarter of the list, F(1, 
117) = 4.74, p < .01. The mean number of 
words recalled in the first quarter of the list 
was 2.41 as contrasted with 2.07, 2.07 and 
2,07 for the remaining three quarters, respec- 
tively. Thus, contrary to the attention decre- 
ment notion, a progressive linear decrease in 
recall did not occur, but rather subjects 
tended to recall the first quarter of the list 
best with a leveling off of recall in the remain- 
ing three quarters. 

The analysis of variance also revealed a 
significant interaction between halves of the 
list and the pro-con, con—pro factor for re- 
call. Subjects in the pro-con condition tended 
to recall traits in the first half of the list (£ 


~ per quarter = 2.27) better than those in the 


second half (X per quarter = 1.96). In con- 
trast, subjects in the con-pro condition re- 
called both halves equally well (X per 
quarter in the first half = 2.20, X per quarter 
in the second half = 2.16).° 

This effect also parallels the halves of the 
list by pro-con interaction for belief strength. 
In order to further analyze this relationship 
between recall and belief strength, a point- 
biserial correlation was computed between re- 
call of a trait (1 = recalled, 0= did not 
recall) and belief strength for that trait. The 
correlation between recall and belief strength, 
averaging across the 12 traits, was .392 (p < 
01), Although these variables are related, this 
relationship is not necessarily a strong one. 
This suggests that even though a subject may 
recall a trait, he may not necessarily believe 
that the stimulus person does indeed possess 
that trait.’ 


Serial Learning 


According to the serial learning hypothesis, 
the recall curve for all subjects should be bow 
shaped, with traits being recalled best at the 
beginning and end of the list and less well in 
the middle. In contrast to this, analysis of 
traits recalled showed that for all groups con- 
sidered together, the first quarter of the list 
was recalled best and the remaining three 
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TABLE 4 


Mean NUMBER or WORDS RECALLED PER QUARTER 
or List As A FUNCTION OF Pro-—Con, Con—PRO, 
CLIMAX—ANTICLIMAX, AND LIST 
Posrrion FACTORS 


| Quarter 
Condition —— M 
Third | Fourth 
Pro-con 
Climax 2.81 1:97 | 1.94 | 2.42 | 2.28 
Anticlimax | 2.14 | 2.17 | 2.07 | 145 | 1.96 
Con-pro 
Climax 2.29 | 2.32 | 2.39 | 2.39 | 2.35 
Anticlimax | 2.40 | 1.80 | 1.90 | 2.00 | 2.02 
M 241 | 2.07 | 2.07 | 2.07 


quarters of the list recalled equally well. 
Further, had serial learning actually occurred, 
the impression of the stimulus person for 
Groups 3 and 4 should have been more favor- 
able than that for groups 1 and 2; however, 
this was not the case (see Table 2). 

Although the recall curve did not follow 
the serial learning pattern, learning theory 
would suggest that the resulting impression 
should approximate the observed recall 
curve."' A test of this notion may be made by 
examining the correlations in Table 1. The 
Xa; based on traits recalled (equivalent to 
recall, which is scored 1 = recall, 0 = did not 
recall, being weighted by the scale value) was 
correlated .352 with the observed impression, 
indicating that a person’s evaluation of a 
stimulus person is, in part, a function of the 
traits he recalls. However, this correlation was 
not as high as the correlation between XB;a; 
for traits recalled and the same criterion (r = 
472). In terms of the Fishbein model, the 
results of these analyses are not unexpected. 


It should be noted that, overall, subjects were 
more likely to recall positive versus negative traits 
(X per quarter = 2.23 and 2.06, respectively). 

10 Support for this notion is also reflected in effects 
that were significant for the analysis of variance on 
belief strength but not for recall. For example, there 
was a significant difference between the second 
quarter as compared with the third and fourth 
quarters on belief strength, yet these three quarters 
were equivalent in terms of recall. 

11 In fact, both the attention decrement and serial 
learning notions assume this; each, however, specifies 
a particular recall curve that should be obtained. 
For attention decrement the recall “curve” is a 
linear decrease, while for serial learning it is bow 
shaped. 


1038 


TABLE 5 


MEAN EVALUATION or Traits IN EACH HALF OF THE 
List as A Function or Pro—Con, Con—Pro 
ORDER OF PRESENTATION 


Condition First half Second half 
Pro-con 1.73 =1.28 
Con-pro =1.27 1.66 


One should only expect accurate prediction 
of impressions from recall curves to the extent 
that recall is linearly related to belief 
strength. As pointed out earlier, the correla- 
tion between recall and belief strength was 
not of high magnitude. It may be argued that 
(a) lack of recall implies low belief strength 
and (b) recall is a necessary but not sufficient 
condition for high belief strength. As such, 
one would not expect an impression to bear 
a systematic relationship to pattern of recall. 


Directed Impression 

Whereas the previously considered explana- 
tions of order effects focus on belief strength, 
the directed impression hypothesis holds that 
changes in the evaluations of traits are the 
primary cause of order effects. According to 
this view, traits preceded by a set of positive 
adjectives should be perceived as more favor- 
able than when presented in isolation, whereas 
traits preceded by a set of negative adjec- 
tives should be perceived as more negative 
than when presented in isolation. In terms of 
the present investigation, the mean evalua- 
tion of traits in the second half of the list 
for the pro-con groups should be higher than 
the mean evaluation of the same traits for 
the con-pro groups (since the former is pre- 
ceded by a set of positive traits, while the 
latter is not). Similarly, the mean evaluation 
of traits in the first half of the list for the 
pro-con groups should be higher than the 
mean evaluation of traits in the second half 
of the list for the con—pro groups. 

As seen in Table 5, these hypotheses were 
not borne out by the data. Overall, the evalu- 
ation of traits was not affected by the traits 
preceding it as predicted by the directed im- 
pression position.** 

DISCUSSION 

The present data indicate a number of im- 

portant points for impression formation re- 
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search, One major implication concerns iho y 


role of belief strength in the impression ‘ 
mation process. In many studies, traits sc 
read to a group of subjects, and it is assumod 
that the resulting impression is a function 
the scale value of these traits and only th” 
scale values. The assumption is that bt 
strength is constant for these traits. 
results of the present investigation indicate, 
however, that this may not always be th: 
case. For example, the correlation bet ce: 
Ap and XB;a; based on traits recalled : 
traits inferred was significantly higher | 
3.45, p < .01) than the correlation betw en 
Ap and Xa; (i.e., the sum of the evaluations 
of the traits recalled and inferred). Predic- 
tion of Ap was significantly improved by cou- 
sidering belief strength. This suggests that a 
subject may believe that some but not all of 
a set of adjectives presented are descriptive ~ 
of a stimulus person, and he or she may hold 
these beliefs with varying strength. Clearly, 
the degree to which this belief strength varies 
will affect the resulting impression. 

A second major implication of the present 
investigation is the finding that prediction of 
Ap was significantly improved by taking into 
consideration inferential beliefs. Again, many 
studies in impression formation assume that 


` 


the impression a subject forms of a stimulus sses 


person is a function of the traits presented to 
the subject and only these traits. Research in 
implicit personality theory, however, suggests 
that this may not be the case, Considerable 
research has demonstrated that certain traits 
imply other traits and these, in turn, may be 
important in describing a person’s impression 
of another. The results of the present study 
corroborate this notion, 

If inferential beliefs and belief strength are ` 
important determinants of the formation of 
evaluative impressions, then many of the 
current considerations in the impression for- 
mation area should be reevaluated. For ex- 
ample, Anderson and Hubert (1963) have 


12 Jt should be noted that a significant Pro-Cou 
Con-Pro X Climax-Anticlimax interaction s 
served as well as a Pro-Con, Con-Pro > 
within Halves interaction for this analy: 
results indicate that some change in meaning is 
occurring but not in the manner predicted by th 
directed impression hypothesis. 


‘a 


á argued that “the formation of the impression 
involves a memory process which is distinct 

; from, and not dependent on, the immediate 
verbal memory for the adjectives just heard.” 
They suggest a model of memory in which “it 
is assumed that as each successive adjective 
is received its meaning is extracted and com- 
bined with the current impression, thus yield- 
ing a changed impression. Once this is done, 
memory for the adjective per se is no longer 
necessary for the impression process.” (p. 
358) Anderson and Hubert used as primary 

oa support for their argument of separate mem- 

| ory systems the fact that primacy in impres- 

sion was observed for their subjects in con- 

junction with recency in recall of traits. As 

was pointed out earlier, this discrepancy in 

recall pattern and impression could be at- 

tributable to differences in belief strength (or, 

„for that matter, the nature of inferential be- 

liefs involved) rather than separate memory 
processes. 

; A second area in which belief strength and 
inferential beliefs may become crucial is with 
respect to the summation versus averaging of 
stimuli controversy. Again, most studies of 
averaging-adding have compared sets of traits 
selected to differ in terms of their additive or 
mean affective values, with little concern for 

da the “weights” placed on these traits or the 

belief strength. More specifically, the as- 

sumption of equal weighting has usually been 

made, This assumption may not always be 

warranted. For example, if information is per- 

ceived to be inconsistent with a subject’s 

prior beliefs (e.g., a stimulus person is de- 

scribed as kind, helpful, and then mean), it 

may reduce his subjective probability (a) 

that his prior information is true and/or (b) 

4 that the new information is accurate. Accord- 

ing to the Fishbein formulation, if the pre- 

sentation of new traits affects the belief 

2 strength of previously presented traits, the 

resulting impression could actually be reduced 

and thus appear to be in the direction an 

averaging model would predict (Fishbein & 

Ajzen, 1972). Furthermore, depending upon 

the nature of the beliefs the subject may 

infer, the conclusions of averaging-adding 
studies may be quite different."* 

In terms of order effects, the data were 

somewhat supportive of the discounting no- 


| 
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tion. However, discounting was observed only 
in conditions of anticlimax alignment. 
Further, subjects who were presented adjec- 
tives in a pro-con order tended to discount 
the second half of the list, whereas subjects 
who were presented adjectives in a con—pro 
order did not. The results of the present 
study, as well as previous ones, indicate that 
discounting is not a general phenomenon and 
is dependent upon (a) the order of presenta- 
tion and the nature of the adjectives pre- 
sented to subjects (see also Hendrick & 
Costantini, 1970) and (b) the task instruc- 
tions (e.g, Anderson & Jacobson, 1965) 
given to subjects. 

With respect to attention decrement, the 
data were not as supportive. If one assumes 
that attention is related to probability of 
recall, then a steadily decreasing recall curve 
should be the result. The recall data of the 
present study do not support this notion. 
Further, previous research by Anderson and 
Hubert (1963) noted a recency effect in re- 
call, where adjectives presented at the end 
of a list (to which subjects should, according 
to attention decrement, pay least attention) 
were recalled most often. Taken together, 
these data question attention decrement as 
an explanatory tool for order effects. 

The serial learning hypothesis did not re- 
ceive strong support. The recall data did not 
follow a bow-shaped learning curve. Further, 
the relationship between recall and belief 
strength was not a strong one (7 = 392, p < 
.01). Finally, a directed impression interpre- 
tation of order effects was not supported by 
the data, although some change in meaning 
was observed. 

It seems reasonable to conclude from the 
present investigation that the various explana- 
tions investigated are not mutually exclusive 
alternatives. All four factors could conceiv- 
ably contribute to order effects, and the ques- 
tion arises as to when one would expect a 


13 Although the present study was not designed to 
test the averaging versus adding combinatorial ques- 
tion, a =Bia; measure and >Byai/n measure were 
compared with respect to prediction of liking-dis- 
liking of the stimulus person. The former correlated 
.669 with the like-dislike measure, while the latter 
correlated .644 with same measure. The difference 
between these correlations was not statistically 
significant. 
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given interpretation to be operative. The 
present data indicate that a subject’s impres- 
sion is based, in part, on the adjectives that 
the subject learns about the stimulus person. 
However, only identifying those adjectives 
that are learned is not sufficient is accounting 
for the subject’s impression. In addition, one 
must consider the belief strength of the ad- 
jectives that are learned, and the concern 
then turns to the factors that affect belief 
strength. The discounting notion is one ex- 
planation of the types of variables that may 
influence belief strength (namely, affective 
inconsistency). Finally, one must consider 
the nature of inferential beliefs to completely 
understand the impression process. Future 
research might best be directed to the investi- 
gation of other factors that influence belief 
strength, evaluations, and inferential beliefs. 
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Some Informational Determinants of One's Own Liking 


for a Person and Beliefs That Others Will 
Like This Person 


Robert S. Wyer, Jr. 
University of Illinois at Urbana-Champaign 


A model of complex social inference phenomena developed by Gollob was used 
to identify the evaluative and descriptive aspects of information about two 
persons, A and B, that affect a subject’s (P's) beliefs that A would like B and 
also to investigate the extent to which similar factors affect P’s beliefs that he 
personally would like B. A’s liking for B was inferred in part from the evalua- 
ative implications of the adjective describing B and also in part from the 
descriptive implications of this adjective independently of its favorableness, 
In addition, the descriptive similarity of the adjective describing A and B and 
the evaluative similarity of these adjectives both contributed substantially to 
these inferences. The implications of the adjectives describing A, considered 
independently of the adjective describing B, also had small but significant 
effects. Similar factors affected P’s own inferences that he would like B; that 
is, the evaluative and descriptive implications of the adjective describing B 
and the similarity of these implications to those of the adjectives P used to 
describe himself all contributed independently to P’s liking for B. Results are 
discussed in terms of their implications for information integration and attri- 
bution processes and for the continued use of Gollob’s formulation in investi- 


gating complex inference phenomena. 


In experiments on information integration 
processes (for reviews, see Anderson, 1968b, 
1971; Wyer, 1974), a subject P is typically 
asked to estimate his liking for a person B 
on the basis of a set of one or more adjectives 
describing him. These judgments are often as- 


» sumed to depend primarily upon the evalua- 


tive, or affective, quality of the adjectives 
contained in this description (e.g., the favor- 
ableness of the attributes implied by these 
adjectives). However, P’s liking for B may 
also depend upon the descriptive, or denota- 
tive, implications of the information describ- 
ing him, Moreover, it may possibly depend 
upon attributes of P and the similarity of 
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these attributes to those of B (Byrne, 1971). 
This article is concerned with these possibil- 
ities and with the manner in which the con- 
tributions of these various factors can be 
isolated, 

Peabody (1967, 1970) found that when the 
evaluative and descriptive implications of an 
adjective conflict, attributes of a person de- 
scribed by the adjective may often be pre- 
dicted on the basis of its descriptive implica- 
tions. Although Peabody did not investigate 
the possibility directly, it seems reasonable to 
expect that the descriptive implications of an 
adjective also contribute substantially to esti- 
mates of how well a person described by it is 
liked. In this regard, although the favorable- 
ness of an attribute and the likableness of a 
person who possesses it are highly correlated 
and are often thought to be identical con- 
structs, there is a conceptual difference be- 
tween them. One’s liking for a person may be 
primarily based upon the quality of his per- 
sonal relations with this person. If this is 
true, information about the person’s attri- 
butes may affect judgments of his likableness 
only to the extent that the attributes have 
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implications for these relations. Certainly not 
all favorable (or unfavorable) attributes have 
such implications. Thus attributes such as 
hardworking or wealthy, which have little 
to do with interpersonal relations, may be 
considered favorable, and yet persons de- 
scribed by them may be judged relatively 
neutral in likableness. Moreover, an attribute 
such as lonesome, may imply behavior that 
would lead P to like a person who possesses 
it even though the attribute is regarded as 
unfavorable. This argument is consistent with 
Byrne’s (1971) suggestion that information 
about a person’s attributes affects P’s attrac- 
tion to him to the extent that these attributes 
imply that P’s approach responses to the per- 
son described will be positively reinforced. 
Additional considerations are suggested by 
research on P’s inferences that a third person, 
A, would like B (e.g., Feather, 1967; Gollob, 
1974a, 1974b; Rubin & Zajonc, 1969; Wyer, 
1974; Wyer & Lyon, 1970). This research 
suggests that such inferences are determined 
not only by the information describing B but 
also by the characteristics of A. Moreover, 
each piece of information may contribute in 
different ways to these inferences, depending 
upon the assumptions underlying the use of 
this information. For example, suppose P is 
told that A is kind and B is cruel and is asked 
whether or not A likes B. P would probably 
infer that A likes B if either (a) he assumes 
that persons are generally more apt to like 
one another than to dislike one another, re- 
gardless of their attributes, or (b) he assumes 
that kind persons are more apt to like others 
than are cruel persons, regardless of who 
these others are. On the other hand, P may 
infer that A dislikes B if either (c) he as- 
sumes that cruel persons are typically dis- 
liked, regardless of the attributes of the actor 
(A), or (d) he assumes that persons like 
others who are similar to themselves and dis- 
like others who differ from themselves. More- 
over, these alternative assumptions are not 
mutually exclusive; P’s inference may be 
based in part upon all four assumptions and 
thus may be a weighted function of the im- 
plications of each. In any event, if P’s own 
liking for B is governed by factors similar to 
those that affect his inferences about A’s lik- 
ing for B, then P’s liking for B may depend 
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upon the adjectives P uses to describe hi 
self, the similarity between the implicati 
of these adjectives and those of adjectis 
used to describe B, and the quality of ! 
attributes alone, 

Until recently, no systematic procedure w 
available for isolating and assessing the rei 
tive contributions of the various inform: 
tional factors suggested by the precedin; 
analyses. However, Gollob (1974a, 1974b 
has proposed a “‘subject-verb—object” (S~ 
O) formulation of social inference process: 
that can readily be used for this purpose. 11 
this article, an extension of Gollob’s formula 
tion will first be used to identify factors that 
affect P’s inferences about A’s liking for B, 
based upon information describing both A 
and B. Then, we will compare these factors to 
those that affect P’s own evaluations of B. 


INFERENCES ABOUT A’s EVALUATION oF B 


Since Gollob’s formulation has been dis- 
cussed in detail elsewhere (Gollob, 1974a, 
1974b; Wyer, 1974), it will not be elaborated 
here. To apply it to the problems of concern 
in this study, first consider a sentence of the 
form “[adj,] A [verbs] [adjr] B,” where 
adj, and adj, each describes either the posi- 
tive (p) or the negative (n) pole of an at- 
tribute dimension, and the verb is either likes 
(p) or dislikes (n). (In this context, the 
terms positive and negative are used for con- 
venience in discriminating the two poles of 
each dimension, and do not necessarily carry 
any implications for the favorableness of the 
attribute being described.) Eight different 
sentences may be formed from the various 
combinations of these components: ppp, ppn, 
pnp, pnn, npp, npn, nnp, and nnn, where the 
first symbol in each case represents the posi- 
tivity of adja, the second the positivity of the 
verb, and the third, the positivity of adj». 
Suppose P is presented all four combinations 
of adj4 and adj, and in each case is asked to 
infer both the probability that A likes B and 
the probability that A dislikes B. These infer 
ences are, in effect, judgments of the validity 
of each of the eight sentences described 
above. Gollob postulated four factors that arc 
most likely to affect these estimates, each 
responding to a different assumption P : 
make about the implications of the info 
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DETERMINANTS OF ONE’s LIKING FOR A PERSON 


tion presented: 


1. Verb (V) positivity. A contribution of 
this factor indicates the extent to which P 
assumes that persons are more (or less) apt 
to like one another than to dislike one an- 
other, regardless of the attributes describing 
them. 

2. Subject-verb (SV) balance. This factor 
indicates the similarity between the polarity 
of adj, and the polarity of A’s sentiment re- 
lation to B. The contribution of this factor 
indicates the extent to which P assumes that 
A’s liking for B depends upon the attributes 
of A alone, without regard to the attributes 
of B. 

3. Verb-object (VO) balance. This shows 
the similarity between the polarity of adjs 
and the polarity of A’s sentiment toward B. 
The contribution of this factor indicates the 
extent to which P assumes that A’s liking for 
B depends upon the characteristics of B, in- 
dependent of the attributes of A. 

4. Subject-verb-ob ject (SVO) balance. This 
indicates the extent to which A’s liking for B 
is inferred to be positive when adj, and adj, 
are either both positive or both negative and 
the extent to which A’s liking for B is in- 
ferred to be negative when these adjectives 
differ in polarity. The contribution of this 
factor indicates the extent to which P as- 
sumes that A’s liking for B depends upon the 
similarity between the attribute possessed by 
A and the attribute possessed by B. 


The magnitude of the contributions of the 
four informational factors described above 
correspond to the magnitudes of four of the 
seven orthogonal contrasts comprising an 
analysis of variance of judgments as a func- 
tion of the positivity of the two adjectives 
describing A and B and the positivity of the 
verb (for an elaboration of this point, see 
Gollob, 1974b; Wyer, 1974). To this extent, 
these judgments may be described as a linear 
function of the contributions of the four in- 
formational characteristics, or alternatively, 
the contributions of the assumptions under- 
lying their use.* 

In the preceding discussion, the criterion 
for labeling attributes positive or negative 
was intentionally left vague. In initial appli- 


è cations of his formulation, Gollob (1974a) 
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restricted his consideration to the effects of at- 
tributes which differed in polarity along eval- 
uative dimensions (i.e., kind-cruel, friendly— 
unfriendly, etc.). However, in principle, the 
adjectives describing A and B may vary along 
descriptive as well as evaluative dimensions. 
More recent applications of Gollob’s formula- 
tion (Wyer, 1974) have shown that the rela- 
tive contributions of different informational 
characteristics to inferences depend substan- 
tially upon the descriptive implications of the 
adjectives used to represent opposite evalua- 
tive poles. However, since the descriptive and 
evaluative implications of these adjectives 
were typically confounded, the relative con- 
tributions of these implications could not be 
determined, Thus, for example, while A was 
often expected to like B more when his own 
attributes were similar to those of B than 
when his attributes were dissimilar, it was 
unclear whether the primary basis for this 
inference was evaluative similarity or descrip- 
tive similarity, Would A be expected to like 
B more if A and B were described as cautious 
and timid, respectively (which are denota- 
tively similar but differ in favorableness), or 
if A and B were described as cautious and 
bold, respectively (which are descriptively 
dissimilar but equally favorable)? The fol- 


1 The contribution of V positivity would be in- 
ferred from the difference between the mean judg- 
ment of sentences in which V is postive (ppp, ppn, 
npp, npn) and the mean judgment of sentences 
in which V is negative (pnp, pnn, nnp, nnn); the 
contribution of SV balance would be inferred from 
the difference between the mean judgment of the 
sentences in which S and V are both positive or 
both negative (ppp, ppn, nnp, nnn) and the mean 
judgment of sentences in which the positivity of 
these components differ (pnp, pnn, npp, npn); the 
contribution of VO balance would be reflected by 
the difference between judgments of the sentences 
ppp, pnn, npp and nnn and the mean judgment of 
sentences ppn, pnp, npn, nnp; the contribution of 
SVO balance would be reflected by the difference 
between the mean judgment of sentences ppp, npn, 
nnp, and pnn and the mean judgment of sentences 
pnp, ppn, npp, and nnn. These comparisons corre- 
spond to the main effect of V and the interactions 
of SXV, V XO, and SX V XO in an analysis of 
variance. Alternatively, the magnitude of the con- 
tributions correspond to the weights in a linear re- 
gression equation relating judgments to the four 
types of informational characteristics considered 
(Gollob, 1974b). 
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lowing experiment was designed in part to 
investigate this question. 


Method 


Forty-four introductory psychology students par- 
ticipated in the experiment to fulfill a course require- 
ment, Stimulus materials consisted of 64 pairs of 
statements describing A and B. These pairs com- 
prised four stimulus replications of a 2X 2X2 x2 
design in which the evaluative or affective quality of 
the adjective assigned to A (Sx), the descriptive im- 
plications of this adjective (Sp), the evaluative qual- 
ity of the adjective assigned to B (Ox), and the 
descriptive implications of this adjective (Op) were 
all varied independently, The four adjectives com- 
prising each replication, which were selected from 
sets constructed by Peabody (1967), are shown in 
Table 1 along with an indication of their normative 
favorableness. (The assignment of p and n to the 
two levels of descriptive quality is, of course, arbi- 
trary.) Although perfect matching of the favorable- 
ness of adjectives representing the two descriptive 
levels was not always possible on the basis of Pea- 
body’s materials and norms, differences are not large 
and do not create difficulties in interpreting the re- 
sults to be reported. 

Within each stimulus replication, 16 pairs of state- 
ments describing A and B were formed by combining 
each of four statements about A (e.g., A is cautious, 
A is timid, A is bold, A is rash) with each of four 
similar statements about B (B is cautious, B is timid, 
B is bold, B is rash). The resulting 64 pairs of state- 
ments were then randomized and presented to sub- 
jects in a questionnaire containing two conditions. 
In one condition, subjects read each pair of state- 
ments and then indicated how likely it was that A 
would like B. In the second, they read each pair and 
then estimated how likely it was that A would dis- 
like B. To avoid confusion, all judgments of one 
type were made before judgments of the other; the 
order of reporting the two types of judgments was 


TABLE 1 
Trait ADJECTIVES AND NORMATIVE FAVORABLENESS 


Evaluative implications 


Descriptive 
implications 
? n 
Replication 1 
b Cautious (.9) Timid (—1,1) 
n Bold (1-1) Rash (—1.2) 
Replication 2 
p Gay (1.5) Frivolous (—1.2) 
Serious (1.3) Grim (—1.6) 


Secretive (—1.2) 


n 
Replication 3 
b Discreet (1.3) 


n Frank (1.8) Indiscreet (—1.4) 
Replication 4 x 

b Cooperative (1.6) | Conforming (—1.6) 

n Individualistic (2.0) | Uncooperative (—2.0) 


Note. The terms p and n ret fer to the positive or negative pole 
of an attribute dimension, Normative favorableness ratin; 
reported by Peabody (196' re based upon mean judgment 
along a scale from —3 (unfavorable) to +3 (favorable); they 
are shown in parentheses, 
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TABLE 2 


A’s LIKING FOR B As A FUNCTION 
oF Sp, Se, Op AND Og 


Polarity of valu 


Polarity of values along 
Sp and Sg ——— 


Replication 1 | 
pp (cautious) 
pn (timid) 
np (bold) 
nn (rash) 


Replication 2 


Pp (gay) 611 416 | —.213 | — 
pn (frivolous) AAS 465 | —.230 | — 
np (serious) —.075 | —.373| 481| 0 
nn (grim) —.193 | —.240 AB1) 0 


Replication 3 
Pp (discreet) 
pn (secretive) 
np n 
nn (indiscreet) 
Replication 4 
PP (cooperative) 
pn (conforming) 
np (individualistic) 
nn (uncooperative) 


Note, A's liking for B is defined as the difference between 
P's belief that A likes B and his belief that A dislikes B. Sp and 
Se denote the two subject dimensions, the former pertaining 
to the descriptive quality of the adjectiv lescribing A and the 
latter Moares to the evaluative quality of this adjective; 
Op ine Ox refer to the two object di. the former denot- 
ing the descriptive quality of the ad, ibing B and the 
latter denoting the evaluative quality is adjective. The 
terms p and » refer to the positive or negative pole of an attri 
bute dimension, 


counterbalanced over subjects. Judgments were re 
corded along an 11-category scale from 0 (not at all 
likely) to 10 (extremely likely) and were subse- 
quently divided by 10 to convert them into unit 
of subjective probability. This scale was selected 
since judgments along such a scale have been found 
in the past to behave as if the scale were approxi- 
mately ratio with an origin at 0 (Wyer, 1970, 1975). 


Results 


The mean difference between inferences 
that A likes B and inferences that A dislikes 
B, an index of the magnitude of A’s liking for 
B, is shown in Table 2 as a function of each 
combination of adjectives describing A and B 
and stimulus replications. Since the descrip- 
tions of A and B both vary along two dimen- 
sions rather than one, the identification of 
informational characteristics and their con- 
tributions using Gollob’s S-V-O model is 
somewhat more complex than that describ: 
in the introduction. However, similar proce- 
dures apply. Let the two “subject” dime: 
sions (one pertaining to the descriptive qu 
ity of the adjective describing A and the 
other pertaining to the evaluative quality of 


Di 
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TABLE 3 
CONTRIBUTIONS OF INFORMATIONAL CHARACTERISTICS TO INFERENCES OF A’s LIKING FOR B 


Replication | 
Informational characteristic - - -| M MS | MSemo^ | F(df = 1,43) 
1 2 3 | 4 
V positivity .059* .056* 023 | .038* 044 696 7,88" 
SpV balance | .066* 124 —.003 | .125* 078 195 21.95* 
SEV balance -016 .055* | —.004 | .118* 046 p2 12.44* 
VOp balance | .064* .150* :051. | .270* 134 247 51.17* 
VOrx balance .076* :139* 166* | .212* 148 334 46.71* 
SpVOp balance .528* .581* 3914 | .482* .495 2.276 76.03* 
SxVOx balance -054 .094* 139* | -131* 105 261 28.82" 


e, Significance of contribu 
Contributions of informational c] 
analysis of judgments as a function of the informational character: 
V denotes the verb dimension (like vs. dislike). SpV 
while SzV balance refers to the juative char: 

‘Op balance ref th 
fons of t 


ibing B; 


‘the adjectives describing 
he error term is based upon ana 
*p < 05, 


«this adjective) be denoted Sp and Sg, respec- 


tively; let the two corresponding “object” 
dimensions (pertaining to the adjective de- 
scribing B) be denoted Op and Ox, respec- 
tively, and let the verb dimension (like vs. 
dislike) be denoted V. Of the 31 possible 
orthogonal contrasts (or comparisons) com- 
prising an analysis of variance of judgments 
as a function of Sp, Se, V, Or, and Op, 7 are 
of primary interest. These contrasts reflect 
the tendency to infer that A is more apt to 
like B than dislike B (V positivity), the ex- 
tent to which this tendency depends upon the 
descriptive and evaluative characteristics of 
the adjective describing A (SpV balance and 
SV balance), the descriptive and evaluative 
implications of the adjective describing B 
(VOp balance and VO, balance), and the 
descriptive and evaluative similarity between 
the adjectives describing A and B (SpVOp 
balance and S~VO, balance).? The magni- 
tude and significance of these contrasts are 
shown in Table 3 both for each replication 
separately and averaged over replications.” 
V positivity. The contribution of V positiv- 
ity was consistently positive and did not sig- 
nificantly differ across stimulus replications, 
F (3,129) = 1.82, p > .10. That is, A was be- 
lieved more apt to like B than to dislike him, 
independent of the adjective describing either 
A or B. This effect is analogous to the posi- 
tivity bias reported in several previous stud- 


jective desc 
implications of the adj 
oth terms refer to the verb and obje 
‘ijectives describing A and B, while SzVOx. balance refers to the evaluative similarity 


ons within each replication are based upon analyses of data pertaining to each replication sep- 
acteristics are twice th magnitude of the regression weights obtait 
TA; 


e descriptive characte: i 
th of the: bj 
ibing B, and VOe balance FRN to the 
ct dimensions. SpVOp balance refers to 


lyses pooled over replications, 


ies (Rubin & Zajonc, 1969; Zajonc & Sher- 
man, 1967). 

SV balance. The average contributions of 
both SpV balance and SV balance were sig- 
nificant. However, these contributions varied 
in magnitude over replications, in each case, 
F(3, 129) > 5.00, p < .05, being less in Rep- 
lications 1 and 3 than in Replications 2 and 


2 These contrasts, which are calculated in a manner 
analogous to that described in Footnote 1, may also 
be computed from the difference scores shown in 
Table 2 for each replication. That is, the contribu- 
tion of V positivity is simply the mean of the 16 
difference scores; the contribution of S»V_ balance 
corresponds to the difference between the mean of 
the eight scores in which Sp is positive and the 
eight scores in which Sp is negative, and so on. 

%The significance of these contrasts was deter- 
mined in each case on the basis of a standard analy- 
sis of variance of judgments as a function of five 
within-subjects variables: Sp, Sx, V, On, and Or. 

Since there is no a priori basis for denoting poles 
along the descriptive dimensions used in each replica- 
tion as p or n and since the particular dimension 
involved in the four replications differed substan- 
tially, a pooling of judgments over replications may 
be somewhat misleading. However, only two infor- 
mational factors, SuV balance and VO» balance, are 
affected by this ambiguity. On the basis of the nota- 
tion described in Table 1, the contributions of these 
factors were positive in seven of eight cases and 
negligible in the remaining case. Thus, pooling over 
replications on the basis of this notation does not 
prevent the contributions of these factors from being 
detected and makes more salient the generality of 
these contributions as well as the contributions of 
the other factors being investigated. 
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4. The contribution of SpV balance indicates 
that the belief that A was more apt to like 
B than to dislike him (referred to hereafter 
as A’s liking for B) depended in part upon 
the descriptive implications of A’s attributes, 
independent of the favorableness of these at- 
tributes. That is, A’s liking for B was pre- 
dicted to be greater when A was cautious (or 
timid), gay (or frivolous), or cooperative (or 
conforming) than when he was bold (or 
rash), serious (or grim), or individualistic (or 
uncooperative), respectively. Although the 
adjectives representing the two levels of de- 
scriptive quality were not perfectly matched 
in favorableness (see Table 1), the contribu- 
tion of SpV balance does not appear to result 
from this partial confounding. (In Replica- 
tions 2 and 4, the contribution of this factor 
was actually opposite in direction to that ex- 
pected on the basis of such a confounding.) 
VO balance. The overall contributions of 
both VO, balance and VO, balance were sig- 
nificant. Although these contributions each 
varied in magnitude over replications (p < 
05), they were consistently positive, justify- 
ing the conclusion that A’s liking for B de- 
pended not only upon the favorableness of 
the adjective describing B but also upon the 
descriptive implications of this adjective, in- 
dependent of its favorableness per se. For 
example, in Replication 4, A’s liking for B 
was inferred to be greater when B was co- 
operative (M = .231) than when he was in- 
dividualistic (M = .057) and was greater 
when B was conforming (M = .115) than 
when he was uncooperative (M = —.251). 
SVO balance. The overall contributions of 
SzVO, balance and SpVOp balance were both 
significantly positive. The latter contribution, 
although large in all cases, varied in magni- 
tude over replications (p < .05). Thus, A’s 
liking for B was believed to be greater to the 
extent that A’s and B’s attributes were sim- 
ilar in favorableness, but also to the extent 
that these attributes had similar descriptive 
implications, independently of the favorable- 
ness of these attributes. For example, a cau- 
tious A’s liking for B was believed to be 
greater when B was timid (a denotatively 
similar but evaluatively dissimilar attribute) 
than when B was bold (an evaluatively sim- 
ilar but denotatively dissimilar attribute) ; 
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alternatively, a cooperative A’s liking 
was believed to be greater when B wa 
forming than when he was individualisi 

The possibility should be noted that i 
design of this study, the manipulation « 
scriptive similarity and of evaluative sin 
ity are somewhat confounded. This possi! 
results from the fact that when adjective 
both descriptively similar and evaluativ jy 
similar, they are identical and thus are + 
descriptively similar than when they are 
scriptively similar but evaluatively dissimil: 
However, the extent to which the contri! 
tion of descriptive similarity is indeed greater 
when adjectives are identical (e.g., A is cai- 
tious, B is cautious or A is timid, B is timiz 
than when they are not (A is cautious, B is 
timid) may be inferred from the contrast be 
tween the eight judgments along the two di- 
agonals of each 4 X 4 matrix in Table 2 and 
the eight judgments in the off-diagonal cells,‘ 
This contrast was nonsignificant, averaged 
over replications, M = .021; F(1, 43) = 2.43, 
p > .10. Thus, the confounding of evaluative 
and descriptive similarity does not appear to 
have had appreciable effects upon the results 
obtained. 

Other contrasts. In addition to the seven 
contrasts described above, nine others pertain 
to the contingency of V (i.e., of A’s liking for 
B) upon various combinations of Sp, Sy, Op, 
and Og. Averaged over replications, only one 
of these contrasts, corresponding to the inter- 
action of Sp, Sp, V, and Op, reached signifi- 
cance, F(1, 43) = 10.79, p < .05; this inter- 
action indicates that the contribution of 
SpVOp balance was greater when the adjec- 
tive describing A was favorable (M = .530) 


ne 
le- 
r= 


than when it was unfavorable (M = 461). 


The total variance accounted for by these 
nine contrasts (SS = 3.93) was substantially 
less than that accounted for by the seven 
contrasts assumed to be most relevant to the 
inferences being drawn (SS = 220.33), 


Summary and Evaluation 


According to Gollob’s (1974b) formula 
tion, judgments of A’s liking for B should be 


4 Appreciation is extended to Anthony Gr v 
for pointing out the nature of this confounding « 


appropriate contrast for investigating its effect. < 


{ 


| 
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a linear function of the contributions of the 
informational characteristics described above. 
However, the relative magnitude of these con- 
tributions may be worth noting. The implica- 
tions of the attributes describing A contrib- 
uted relatively little to inferences of A’s 
liking for B, regardless of whether these im- 
plications were evaluative or descriptive. This 
conclusion differs from that implied by Gol- 
lob’s (1974a) study in which SV balance 
contributed more substantially to such infer- 
ences. The difference between these sets of 
results suggests that the relative contribu- 
tions of informational variables depends upon 
idiosyncratic characteristics of the informa- 
tion presented. This is also suggested by data 
reported by Wyer (1974). 

The descriptive similarity between the at- 
tributes of A and B contributed much more 
to inferences than did the descriptive implica- 
tions of the adjective describing B alone, 
while the evaluative similarity of these at- 
tributes and the evaluative implications of 
the adjective describing B contributed about 
equally to inferences. Direct comparisons of 
the magnitude of evaluative and descriptive 
factors may be misleading, since the variation 
of stimuli along evaluative dimensions may 
not be comparable to their variation along 
descriptive dimensions. Nevertheless, the fact 
that the contribution of SpVOy balance was 
over three times that of any other informa- 
tional factor is noteworthy. 


INFORMATIONAL DETERMINANTS OF P’s 
Own LIKING For B 


The results of Experiment 1 indicate that 
while P’s inferences about another person’s 
(A’s) liking for B are partially contingent 
upon the characteristics of the adjective de- 
scribing B, they also depend upon attributes 
of A and the similarity of these attributes to 
those of B. If P bases his own liking for a 
person upon factors similar to those he uses 
to infer another’s liking for this person, then 
his liking for B should be partially a function 
of the adjectives he uses to describe himself 
and the similarity between these adjectives 
and those used to describe B. A second ex- 
periment was conducted to investigate this 
possibility. 


1047 


Method 


Subjects were 37 introductory psychology students 
who had not participated in the first experiment. 
Each subject (P) completed a six-page questionnaire, 
each page of which contained a randomly ordered 
list of the 16 single adjectives described in Table 1. 
On the first page, P estimated the likelihood that a 
person would possess each of the adjectives de- 
scribed; on the second, he estimated the likelihood 
that he would like a person described by each ad- 
jective; on the third, the likelihood that he was 
similar to a person described by each adjective; on 
the fourth, the likelihood that he would ke a 
person described by each; on the fifth, the likelihood 
that he was dissimilar to a person described by each; 
and on the sixth, the likelihood that he personally 
possessed each of the 16 attributes described. All 
judgments were made along an 11-point scale from 
O (not at all likely) to 10 (extremely likely) and 
were subsequently divided by 10 to convert them 
into units of subjective probability. The data per- 
taining to similarity and dissimilarity will not be 
discussed. 


Results 


Self-attributions, P’s beliefs that he person- 
ally possessed each of the 16 stimulus attri- 
butes and his estimates of the likelihood that 
a person in general would possess the same 
attributes were each investigated as a function 
of stimulus replication and the descriptive 
and evaluative implications of the attributes. 
The effect of descriptive and evaluative as- 
pects of these attributes, which were similar 
in each replication, are shown averaged over 
the four replications in Table 4. The magni- 
tudes of these effects were inferred from the 
mean difference between judgments at the 
two levels of descriptive positivity and of 
evaluative positivity. Averaged over replica- 


TABLE 4 


P’s INFERENCES ABOUT HIMSELF AND OTHERS AS A 
FUNCTION OF THE EVALUATIVE AND DESCRIPTIVE 


Descriptive 


implications | Beliefs about self 


b 691 | .472 | .582 | .611 | .569. 
n 676 | 316 | 496 | 1539 | 1472 
‘521 


M -684 | .394 575 
| 


Note. The terms p and n refer to the positive or negative pole 
of an attribute dimension, 
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tions, the descriptive implications of adjec- 
tives contributed positively (p < .05) to be- 
liefs about both oneself (M = .085) and 
others (M = .085); these contributions obvi- 
ously did not differ significantly from one 
another (F < .01, p> .10; moreover, F < 
3.00, p > .10 in analyses of each replication 
separately). In contrast, the evaluative qual- 
ity of these adjectives had much more impact 
upon subjects’ self-attributions (M = .290) 
than upon their beliefs about others, M = 
.055; F(1, 36) = 89.95, p < .001. 

Analyses of P’s liking for B. The mean dif- 
ference between P’s estimate of the probabil- 
ity he would like B and his estimate of the 
probability he would dislike B is shown in 
Table 5 as a function of the evaluative and 
descriptive quality of the adjective describing 
B. The magnitude of the contrasts corre- 
sponding to the contributions of V positivity, 
VO, balance, and VO» balance, as defined in 
Experiment 1, are summarized in Table 6. 
Although the magnitude of these contribu- 
tions varied significantly over stimulus repli- 
cations, in each case, F(3,108) > 5.99, p 
< .05, the relative magnitudes of these con- 
tributions were similar in three of the four 
cases. The contribution of V positivity indi- 
cates a general tendency for P to like persons 
more than to dislike them, regardless of their 
personal characteristics, The contribution of 
VOp balance indicates that the descriptive 
implications of the adjective describing B 
affected P’s belief that he would like B, inde- 
pendently of the favorableness of the adjec- 
tive; averaged over replications, the contri- 
bution of this factor was not appreciably 


TABLE 5 
MEAN LIKING FoR B As A FUNCTION OF Og AND Op 


Evaluative and | Replication | 
descriptive {ies an FE M 
implications | j i) 2 
(Op and Ox) | 1 2 |~3 | 4 

pp 278 | 462| 230| 603| 393 
pn | 021 .079 | —.089 | .603 | 064 
np :229| 1311] 1481) :538| 390 
nn | =+:321 | —432 | —.232 | —:560 | 386 


Liking for B is defined as the difference between P's 
hat he would like B and his belief that he would dislike 
abbreviations Op and Or refer to the two object dimen- 
ormer denoting the descriptive quality of the adjec- 
ing B and the latter denoting the evaluative quality 
e. The combinations of p and n refer to the 
positive or negative pole of an attribute dimension. 
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different from its contribution to P’s f 
that A would like B (see Table 3). On ‘se 
other hand, VO; balance appeared to - 
tribute substantially more to P’s own | 

for B than it contributed to P’s beliefs 
another (A) would like B. Since “subj 
characteristics (the quality of the adjecti 
describing P) are not controlled in this a: 
ysis, this latter difference could be due in | 

to a confounding of the affects of VO, | 
ance with the effects of similarity in att: 
butes between P and B (i.e., the effects 
SrVOy balance and SpVOp balance). For ex 
ample, since P typically uses evaluatively 
positive adjectives to describe himself (sec 
Table 4), his tendency to like persons de 
scribed by such adjectives more than those 
described by evaluatively negative adjectives 
could be due in part to his beliefs that he was 
descriptively or evaluatively similar to B as 
well as to the favorableness of B’s attributes 
per se. 

Individual differences in liking: effects of 
self-perceptions. The extent to which P’s lik- 
ing for B depends upon the descriptive and 
evaluative implications of his self-perceptions 
(that is, by SpV balance and SV balance), 
and upon the similarity of these implications 
to those of the adjective describing B (by 
SpVOp balance and SVO» balance), is more 
difficult to assess directly than is the extent 
to which analogous factors affect P’s infer- 
ences about A’s liking for B. This is because 
P typically does not describe himself by only 
one of the adjectives comprising the four 
combinations of evaluative and descriptive 
positivity being considered, but rather be- 
lieves himself to have each attribute with 
some probability less than 1. However, cor- 
relational analyses can be used to obtain some 
information about these matters. If the con- 
tributions of SpV balance, SV balance, 
SpVOp balance, and S,VO, balance are ad- 
ditive, as implied by Gollob’s formulation, 
then the magnitude of the relation between 
P’s belief that he personally has an attribute 
and his inference that he likes a person B 
with a given (similar or different) attribute 
should increase with the sum of the theoret 
ical contributions to this inference of the fou: 
factors described above. For example, suppose 
self-perceptions pertaining to the term can- 
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TABLE 6 


MAGNITUDE oF CONTRASTS CORRESPONDING TO THE CONTRIBUTIONS OF V Positivity, VOr BALANCE, 
AND VOp BALANCE, EXPERIMENT 1 


Replication 
Contribution of: M MS M Sorror F 
1 2 | | 4 
| 
V positivity .053* .065* 098" | .118* 083 4.11 15,.51* 
VOp balance -196* penSt SS .258* 163 3.94 17.70* 
VOr, balance .403* -638* .516* | .904* -616 56.25 174.90* 


Note. Significance levels within repli 
V denotes the verb dimension (like 


ions are based upon analyses of each replicati 
ke). VOp balance refers to the descripti 


separately. Abbreviations are as follows: 
plications of the adjective describing B, 


and VOx balance refers to the evaluative implications of the adjective describing B; both terms refer to the verb and object 


dimensions, 
*p <.05 (df = 1, 36). 


tious (a descriptively positive and evaluative 
positive attribute) are correlated over sub- 
jects with liking for a B described as cautious 
(ie. with the difference between P’s belief 
that he would like B and his belief that he 
would dislike B). Since Sp, Se, Op, and Ox 
are all positive, the magnitude of this correla- 
tion will increase with the magnitude of the 
contributions to liking of each of the four 
informational factors. On the other hand, the 
correlation over subjects between self-percep- 
tions of cautious and liking for a person B 
described as rash (a descriptively negative 
and evaluatively negative attribute) will in- 
crease with the contributions to liking of SpV 
balance and SV balance but will decrease 
(become more negative) as the contributions 
of SpVOp balance and Sy,VOy balance in- 
crease. More generally, suppose that within 
each stimulus replication the 16 possible cor- 
relations are computed between subjects’ be- 
liefs that they personally have each of the 
four attributes and their liking for a B with 
each of these attributes. The relative contri- 
butions of each of the four factors can be 
inferred from the mean difference between 
the eight correlations that should theoret- 
ically be positively affected by the factor and 
the eight correlations that should be nega- 
tively affected by the factor.’ These differ- 
ences are directly analogous to the orthogonal 
contrasts used in Experiment 1 to estimate 
the contributions of the same factors to infer- 
ences about A’s liking for B. 

The correlation over subjects between P’s 
self-perception of each attribute and his lik- 
ing for a person B described by each attri- 


bute was calculated separately for each stim- 
ulus replication. These correlations, averaged 
over replications after appropriate transfor- 
mations to Fisher’s Z, are shown in Table 7. 
The contributions of the four inference char- 
acteristics, estimated in the manner described 
above, are shown in Table 8 for each replica- 
tion separately and averaged over replica- 
tions. Analyses of variance of data pertaining 
to each contrast separately indicated that the 
contributions of SpVOp balance and Sy;VOy 
balance were large and significant and did not 
significantly differ over the four replications, 
in each case, F(3,21) < 1.00, p> .10; in 
contrast, the contributions of SpV balance 
and SpV balance were relatively small and 
nonsignificant. 

As noted in the analyses of inferences 
about A’s liking for B, a possible confounding 
exists in the manipulations of descriptive sim- 
ilarity and evaluative similarity, the effect of 
which can be inferred from the magnitude of 
the contrast between the eight entries in the 
diagnonals of the 4 X 4 matrix in Table 7 


5For example, let rı represent the correlation 
over subjects between P’s belief that he personally 
has attribute 7 and his liking for a person B de- 
scribed as having attribute j. Then, the contribu- 
tion of SpV balance in Replication 1 would be in- 
ferred from the difference between (a) the mean of 
Tory Fet, Yeb, Fers Fre, Tit, and ri and (b) the mean of 
the remaining eight correlations, where c, t, b, and r 
represent the terms cautious, timid, bold, and rash, 
respectively. Similarly the contribution of SeVOn 
balance would be indicated by the difference be 
tween the mean of fee, Feb, Fits Fir, Ton, Foe, rr, and 
re, and the mean of the remaining eight correlations, 
and so on, 
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TABLE 7 
MEAN TRANSFORMED CORRELATIONS BETWEEN P’s 
BELIEF THAT He Is Descripep BY EACH 
ADJECTIVE AND His LIKING ror a B 
DESCRIBED BY THis ADJECTIVE 


Op and Og 


Sp and Se — 
| pp pn np nn 

bp .730 338 145 | 248 

pn 061 740 | —161 053 

np 422 | —1128 679 | —.050 

nn —.388 | —.058 | — 14 679 


Nole, In this Table, Sp and Se refer to the descriptive and 
evaluative implications of the adjective used by P to de 
himself, and Op and Ox refer to the descriptive and evaluative 
i ions of the adjective describing B. All data are in units 

s Z. Combinations of p and n denote the positive and 
negative poles of an attribute dimension. 


and the eight entries in the off-diagonal cells 
in Table 7. While this contrast was moder- 
ately large (averaged over replications, M 
= .178), it did not reach significance, F(1, 7) 
= 1.56, p > .10, nor did it vary significantly 
over replications, F = .18, p > .10, 


Summary 


The data from this experiment, in combi- 
nation with those from Experiment 1, suggest 
that similar factors affect both P’s beliefs 
that he personally will like B and his beliefs 
that another person will like B; that is, both 
(a) the descriptive and evaluative similarity 
between the attributes of B and those of the 
person evaluating him and (b) the descrip- 
tive and evaluative implications of the adjec- 
tives describing B considered in isolation con- 
tribute substantially to these inferences, while 
(c) the descriptive and evaluative implica- 
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tions of the evaluator (A or P) cons d in 3 
isolation contribute appreciably less 
DISCUSSION 

The results of this study point « the 
potential complexity of social inference ohe- 
nomena but at the same time demonstra’ the 
value of using Gollob’s S-V-O model t in- 
vestigate systematically the nature of se 
complexities. Each adjective describing 4, id 


B (or for that matter, P) provides sc) al 
different informational cues, and its ef ct 
upon the judgment to be made appears to be 
a composite function of the implications of 
these cues. These implications not only te- 
pend upon the nature of the adjective alo ie 
but also upon its similarity along evaluative 
and descriptive dimensions to adjectives de 
scribing the other person involved in the rela- 


tion being judged. While the S-V-O model” 


cannot predict a priori the relative contribu- 
tions of different informational factors, it en- 
ables these contributions to be identified and 
compared systematically, 

Several specific aspects of the results re- 
ported are of interest. First, the results of 
Experiment 1 indicate that while the evalua- 
tive implications of information about a per- 
son B contribute to P’s beliefs that A will 
like him, an equally important contributor 
to these inferences is the descriptive implica- 
tions of the information about B and its 
similarity to the descriptive implications of 
the information about A. While the relative 
contributions of descriptive and evaluative 
factors should not be generalized beyond the 


TABLE 8 


CONTRIBUTIONS OF THE FOUR INFERENCE CHARACTERISTICS, SpV BALANCE, StV BALANCE, 
SpVOp BALANCE, AND S~VOr BALAN 


Replication 
Contributions of E paac os Seay, US Fee j 
informational factors I 
a S R Eg 4 
SDV balance [o7 | 163 | 076 | 104 | 17 | bar 
SxV balance 104 -086 -236 | 116 | 21 587 | 
SpVOp balance .390* .388* .460* | 463 3.44 105 
SrVOx balance .502* 443* .558* | 462 | 3.43 | 104 


Note. Significance levels within replications are based upon analy: 

are in units of Fisher's Z. Abbreviations are the following: SDV 

describing A, while SEV balance refers to the evaluative chara 

the subject and verb dimensions. SpVOp balance refers to the di 

while SeVOr balance refers to the evaluative similarity between the adjec 
*p <.05 (df = 1, 7). 


s of each replication j 
ce refers to the d characteristics of the 
s of the adjective desc: both of these term 
ptive similarity between the adjectives describing A a: 
escribing A and B. 


| 


DETERMINANTS OF ONE’s LIKING -FOR A PERSON 


s particular stimulus adjectives used in this 
study, the substantial contribution of descrip- 
tive factors may place limits upon the gen- 
erality of contemporary models of social in- 
ference that take into account only the 
affective implications of the information pre- 
sented. For example, Gollob (1974b) notes 
that the propositions comprising a triad of 
sentiment relations of the sort to which cogni- 


be interpreted as a single sentence of the 
form to which his S-V-O model is applicable 
(e.g., “Likable A likes likable B”). The con- 
tribution of S~VO, balance to inferences that 
A likes B in a sentence similar to that con- 
sidered in this study would be analogous to 
the contribution of the cognitive balance 


: tive balance theory (Heider, 1958) applies 
4 (e.g., “P likes A; P likes B; A likes B”) can 
} 

4 
P 

| 

| 


principle postulated by Heider, To this ex- 
tent, evidence from this study (see also Gol- 
lob, 1974b; Wyer, 1974) suggests that while 
cognitive balance may significantly contribute 
to such inference processes, the magnitude of 
its contribution is often much less than that 
of other aspects of the information presented 
and may depend upon the particular attri- 
butes of A and B that provide the basis for 
their likableness. A similar conclusion is sug- 
gested by Rubin and Zajonc (1969). More- 
over, the descriptive implications of the rela- 
tions comprising the triad may often con- 
tribute as much to these inferences as the 
effective quality of these relations. In this 
regard, Gollob (1974a) found that the rela- 
tive contributions of informational factors 
differed when verbs such as admires and 
despises were used to describe the relations 
being inferred instead of likes and dislikes. 
(Specifically, while SVO balance was the pri- 
» mary contributor to inferences about A’s 
admiration of B, inferences that A liked B 
were affected primarily by the description of 
A independently of its similarity to the des- 
cription of B.) In addition to reemphasizing 
the care with which the specific results of 
the present study should be generalized to 
inferences of affective relations other than 
liking, and to inferences based upon different 
attributes of A and B, Gollob’s finding points 
out the need to develop a clearer conceptual 
understanding of the descriptive implications 
of both the information presented and the 
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characteristic being inferred. In this regard, 
Peabody (1970) has differentiated adjectives 
of the type considered in this study along a 
“tight-loose” dimension, However, variation 
along this dimension would not account for 
the differences in descriptive implications over 
the four stimulus replications considered in 
this study. 

The evidence that subjects’ own liking for 
a person described by a set of adjectives is 
determined in part by descriptive as well as 
evaluative aspects of the information pre- 
sented, and that it also depends in part upon 
the adjectives subjects use to describe them- 
selves, is noteworthy in the context of im- 
pression formation research in which judg- 
ments are assumed to be primarily a function 
of the evaluative implications of the informa- 
tion presented. Since the various informa- 
tional factors identified in this study theoreti- 
cally contribute additively to judgments, the 
idiosyncratic effects of certain of these factors 
may be minimized when judgments are aver- 
aged over subjects. However, if the average 
contribution of these factors is not zero, their 
effects may create problems in applying 
models that theoretically provide exact quan- 
titative descriptions of information integra- 
tion processes. 

Experiment 2 may be interpreted as an in- 
vestigation of the determinants of variance 
in normative scale values of personality trait 
adjectives of the type compiled by Anderson 
(1968a). These values are typically assumed 
to be invariant over subjects and stimulus 
contexts. However, the results of Experiment 
2 clearly indicate that such values are de- 
termined not only by characteristics of the 
trait itself but also by the descriptive and 
evaluative similarity between the trait and 
those the subject uses to describe himself. 

In this regard, Kaplan (1971) found evi- 
dence that the combined influence of several 
pieces of information about persons was af- 
fected by subjects’ predispositions to evaluate 
persons positively or negatively in the absence 
of specific information about them. Such pre- 
dispositions may be determined in part by 
differences in subjects’ self-perceptions along 
evaluative and descriptive dimensions and 
their similarity along these dimensions to the 
person being judged. Certain of these differ- 
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ences can be taken into account in averaging 
models, such as Anderson’s (1968b, 1971) 
through the introduction of an “initial im- 
pression” term. This term, which pertains 
only to predispositions along an evaluative 
dimension (at least, when the judgment being 
made is evaluative), would correspond to the 
contribution of SV balance in the present 
study. However, the manner in which the 
effects of other individual difference factors 
(i.e., SpV balance, SpVOp balance, or $,VO; 
balance) can be incorporated a priori into 
such a model is unclear. 

The contributions of descriptive and evalu- 
ative similarity to judgments of B’s likable- 
ness are also of interest to consider in the 
context of Byrne’s (1971) similarity—attrac- 
tion formulation. Byrne has shown that one 
person’s attraction to another is posi- 
tively affected by information that these two 
persons hold similar attitudes toward various 
other persons and objects and also by infor- 
mation that they are similar in general per- 
sonality characteristics, as measured by stan- 
dard personality tests. However, the extent 
to which these judgments are a result of de- 
scriptive aspects of the information, evalua- 
tive aspects of the information, or both is 
difficult to determine from Byrne’s study. Pre- 
cedures similar to those employed in the ex- 
periments reported here may possibly be 
useful for assessing the relative contributions 
of these factors and also for identifying the 
contribution of factors in addition to simi- 
larity that affect attraction in the situations 
studied by Byrne. 

Evaluative factors appear to have con- 
tributed more to P’s own liking for B than to 
his predictions of A’s liking for B. However, 
conclusions based upon this evidence should 
be treated with some caution, since the con- 
tributions of evaluative and descriptive fac- 
tors depend upon the variation along the 
stimulus dimensions used in defining these 
factors. In the case of SV balance and SVO 
balance, this variation decreased in Experi- 
ment 2 relative to Experiment 1, since P 
typically believed himself to be described by 
the positive and negative poles of each sub- 

ject dimension with a probability of less than 
unity. If this decrease was greater along de- 
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scriptive dimensions than along e tive 
dimensions, the relatively lower cont tion 
of descriptive factors in Experiment! lay 
have been a result of this difference a us 
may not reflect any fundamental diii ::nce 


between the cues used by subjects whe. in- 
ferring their own liking for B and the ues 
they use to predict others’ liking for B. To 
this extent, results would be consistent + ith 
Bem’s (1972) general hypothesis that f- 
inferences and inferences about others 
based upon similar types of information. 

In interpreting data pertaining to sei- 
inferences, we have implicitly assumed th 
subjects infer their liking for others on t! 
basis of their self-perceptions and the sim 
larity of these perceptions to their perception 
of others. However, it is equally plausible to 
assume that subjects tend to describe them- 


selves as similar to persons they like and as» 


dissimilar to persons they dislike; to this ex- 
tent, their self-attributions may be a con- 
sequence of their liking for certain types of 
persons and not a determinant of their liking 
for these persons. The relative merits of these 
two alternative interpretations could be ex- 
ployed by experimentally manipulating sub- 
jects’ self-attributions and observing changes 
in their liking for others and by manipulating 
for certain types of persons and observing 
changes in self attributions. Gollob’s S-V-O 
model would be helpful in diagnosing i 
determinants of such changes. 
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Factors Affecting Entrapment in Waiting Situations: 
The Rosencrantz and Guildenstern Effect 


Jeffrey Z. Rubin and Joel Brockner 
Tufts University 


Seventy-two undergraduates took part in an experimental study of entrapment, 

in which the passage of time could be viewed either as an investment (in- 

creasing the likelihood of goal attainment) or as an expense. Subjects were 

given an opportunity to win a large sum of money (the jackpot), which de- > 


creased in value over time. In order to win the jackpot, a series of crossword 
puzzle words had to be solved, some of which were so difficult that the use 
of a necessary, scarce resource (a crossword dictionary) was required. And in 
order to obtain the use of this resource, subjects had to wait “in line” until 
it became available—which it never did. Three variables were varied in a 
2X2X2 design: rate of decrement of the jackpot, availability of a chart 
providing continuous information about this decrement, and information con- 
cerning subjects’ (presumed) position in line for the dictionary, Entrapment 


in this waiting conflict was found to be hi 
when (a) the size of the jackpot decre: 


gh overall, and was further heightened 
ased slowly rather than rapidly, (b) 


when no payoff chart was made available, and (c) when subjects were told 
they were first rather than third in line for the dictionary. 


We've traveled too far, and our momentum has 
taken over; we move idly towards eternity, without 
possibility of reprieve or hope of explanation. 

—Tom Stoppard, 
Rosencrantz & Guildenstern Are Dead 

Social psychologists such as Deutsch 
(1973), Schelling (1960), and others have 
long been interested in the circumstances 
underlying entrapment in a variety of conflict 
situations. One type of conflict in which 
entrapment commonly occurs, for example, 
is an auction, in which bidding typically 
escalates at increasing cost to the participants. 
In the presence of others with whom they are 
competing for some desirable object or re- 
source, bidders tend to increase the magnitude 
of their offers, often above and beyond the ob- 
jective value of the object in question. Indeed, 
Shubik (1971) has described an auction game 
in which entrapment is so intense that a 
dollar bill may be auctioned off for consider- 
ably more than the one dollar at stake! 
Gambling, similarly, can be interpreted as a 
conflict situation in which entrapment is 
likely to occur—particularly when the gam- 
bler wishes to recoup losses he has already 

incurred and therefore decides to roll the 
dice “just one more time.” 


This research was supported in part by National 
Institute of Mental Health Research Grant MH 
22605 to the first author. 


Still another type of entrapment conflict, 
and one which has received far less attention 
in the experimental literature, may be char- 
acterized as a situation in which entrapment 
occurs as a simple function of the passage of 
time. Consider the following examples: (a) 
A woman places a phone call in order to ob- 
tain some important piece of information from 
the party at the other end, and is put on 
“hold.” She must decide whether to continue 
waiting for the other party to return (as she 
expects), or hang up and try again later, (b) 
A man is confronted with the choice of walk- 
ing to his destination (for an appointment), 
or continuing to wait for the bus which he 
expects will get him to his destination faster 
and in greater comfort. (c) The United States 
is confronted with the choice of continuing a 
“holding pattern” of remaining in Vietnam 
(in the hope and expectation of victory) or 
withdrawing its troops. It is to an analysis 
of situations such as these, in which entrap- 
ment is a function more of the sheer passage 
of time than the decision maker’s active de- 
cision to invest additional resources (as in 
auctions and gambling), that the present re 
search is directed. 

Waiting conflicts of the type illu 
above share a number of important chara 


istics: 


c 
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ENTRAPMENT IN WAITING SITUATIONS 


1. In each, the same parameter (time) 
tends to be viewed both as an investment and 
an expense. The passage of time is an in- 
vestment to the extent that it is seen as in- 
creasing the likelihood of goal attainment 
(the return of the other party, arrival of the 
bus, victory in Vietnam), On the other hand, 
the passage of time is also an expense to the 
extent that it is viewed in relation to the costs 
incurred by waiting (wasting time on the 


‘phone without getting the information one 


seeks, being late for one’s appointment, the 
continued loss of American lives in Vietnam). 

2, As time passes, the cost associated with 
continued waiting increases, but so does the 
presumed proximity to the goal. Hence, the 
greater the passage of time, the greater the 
conflict. And the greater the conflict, the 
greater the pressures to act decisively—either 
by withdrawing or by committing oneself to 
remain in the situation. 

3. Of these two possible decisions (total 
commitment or total withdrawal), the former 
is probably more likely to occur, To use 
Lewin’s (1938) distinction between “driving” 
and “restraining” forces, the force restraining 
against continued involvement (the cost as- 
sociated with the passage of time) is offset 
by the presence of three driving forces—the 
reward associated with the goal, the presumed 
increased proximity to the goal, and the cost 
associated with “giving up” on one’s invest- 
ment (having spent 15 minutes on “hold” or 
at the street corner for “nothing,” having lost 
American lives in vain). Hence, waiting 
tends to beget waiting, and entrapment is the 
result. 

4. A waiting conflict of the type described 
can thus be considered as a special case of 
simple approach-avoidance conflict (Lewin, 
1935), in which the tendencies both to ap- 


1The importance of this third driving force can 
be seen in Lewin’s (1951) description of a housewife 
who is in conflict over whether or not to buy an 
attractive (but expensive) piece of meat. Assuming 
the housewife decides to buy the meat, Lewin writes, 
“the food passes the gate . . . The forces formerly 
opposing one another will now point in the same 
direction; the high price that tended to keep the 
expensive food cut is now the reason why the house- 
wife makes sure that through all the difficulties the 
meat gets safely to the table and is eaten.” (Lewin, 
1951, p. 76, cited in Deutsch, 1968). 
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proach and to avoid increase as a function 
of the passage of time. Note, however, that 
the gradient of approach is assumed to be 
steeper than the gradient of avoidance—an 
assumption directly contrary to that made by 
theorists such as Lewin (1935) and Miller 
(1944). A person is thus most likely to with- 
draw from a waiting conflict at the outset, 
when the tendency to avoid is greater than 
the tendency to approach. Once the decision 
to wait has been made, however, and given 
the passage of time, the tendency to approach 
(the driving force) increases more rapidly 
than the tendency to avoid (the restraining 
force)—and entrapment becomes increasingly 
likely to occur. 


In order to investigate some of the factors 
affecting entrapment in waiting conflicts with 
the above characteristics, it is of course neces- 
sary to translate such conflicts into the lan- 
guage of an experimental paradigm—the 
major requirement of which is that subjects 
be able to view the passage of time both as 
an investment and an expense. In one such 
paradigm, employed in the present investiga- 
tion, subjects are given an initial sum of 
money (the “initial stake”) as well as an 
opportunity to earn a far greater sum of 
money (the “jackpot”), and are told that 
they can quit the situation at any time. In 
order to win the jackpot, subjects need to 
solve a series of problems which are suffi- 
ciently difficult that the use of a valuable 
(scarce) resource becomes necessary. And in 
order to obtain the use of this valuable re- 
source, subjects need to wait “in line” until 
the resource becomes available, As time 
passes, the amount of money in both the 
initial stake as well as the jackpot decreases. 
Hence, the longer the time spent waiting for 
the resource (which is necessary to obtain 
the jackpot), the greater the expense to the 
subject—and also the greater the investment 
(the greater the likelihood that the resource 
will become available). The initial stake in 
this paradigm represents the “cost of playing 
the game”; as time elapses, this cost increases 
(the reward decreases). The jackpot, of 
course, represents the goal which subjects 
hope to attain, and as time elapses, the ob- 
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jective value of the reward associated with 
this goal decreases.? 

While a number of factors may well affect 
the likelihood of entrapment in waiting con- 
flicts, three appear to be of particular im- 
portance. 


Queue Position 


If our analysis is correct, it stands to rea- 
son that the greater the presumed proximity 
to the goal, the greater the goal gradient, and 
the greater the likelihood of entrapment. 
Given that the driving forces to approach the 
goal increase more rapidly than the restrain- 
ing force to avoid it, one might expect in- 
creased goal proximity (or the perception 
thereof) to result in increased attempts to 
locomote toward the goal. In the language of 
the experimental paradigm, the closer a per- 
son believes he is to having access to the 
resource he needs for goal attainment (the 
higher his position in line, or queue position), 
the greater the degree of entrapment should 
be. 


Salience 


Deutsch (1968) points out in his review of 
Lewin’s work that a goal-oriented individual 
placed in a cognitively unstructured situation 
is likely to experience intense conflict—he 
has little knowledge of “what leads to what.” 
And the greater the conflict in a waiting situ- 
ation, we have argued, the greater the pres- 
sure to act decisively by committing oneself 
to total involvement. Thus, the greater the 
ambiguity of the costs and rewards associated 
with waiting, the smaller the degree of cogni- 
tive structure, and the greater the driving 
forces to continue waiting. In terms of the 
experimental paradigm, then, the lower the 
salience of information concerning the decre- 
ment over time of the initial stake and the 


*Note that in a situation such as this, in which 
the passage of time is associated with a decrement 
in the objective reward value of the goal, one might 
expect very little entrapment to occur. After all, the 
longer one waits, the greater the objective cost, and 
the less objectively rewarding the goal becomes. 
However, to the extent that the goal is attractive 
not because of its objective, but rather its subjective, 
value, entrapment might nevertheless be expected to 
occur, 
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jackpot, the greater the degree of er ment = 
should be. 
Rate of Decrement 

It stands to reason, finally, that the < cater 
the objective value associated with ai ‘ing 
the goal, the greater should be the d ing 
forces locomoting the individual toward ‘his 
goal, and the greater the likelihood of e p- 
ment. In the language of our experi il 
paradigm, the lower the rate of decrem f 
the jackpot, the greater its objective vali. at 
any point in time and the longer it will ù -e 


the actual costs incurred (represented by tse 
decrement of the initial stake) to offset {ve 
potential jackpot winnings. The lower the re e 
of jackpot decrement, in other words, the 
greater the temptation to continue waiting, 


and the greater the degree of entrapment 


should be. 
METHOD 
Subjects 


The subjects were 72 male and female under 
graduates at Tufts University, recruited for voluntary 
participation in a study of problem solving in four- 
person groups. Subjects were told they would be 
paid for their participation, and that they could earn 
“up to $8.00.” In reality, the experiment was struc- 
tured in such a way that it was virtually impossible 


to earn this much money, and at the experiment’s W 


conclusion, subjects were actually paid somewher 
between $2.00 and $2.40. All subjects were run on 
at a time, 


Procedure 


Upon their arrival at the laboratory, subjects were 
ushered into individual cubicles and were seated at 
a table, on top of which lay $2.40 in cash, a set of 
written instructions, a stopwatch, and a switch (con- 
nected to a light in the adjacent, experimental con- 
trol room). After a few minutes of “stage setting,” 
in which subjects were led to believe that the three 
others in their group (none of whom actually 
existed) had arrived earlier and were already seated 
in three adjacent cubicles, they were asked to read 
through the set of instructions on their table, 

The instructions began by informing subjects that 
they would be taking part in a study concerned wit! 
“how groups go about solving problems, especially 
when the resources necessary for doing so 
scarce . . . and how familiarity among group mer 
bers affects this process,” Subjects were told tha 
some groups people would therefore be permitted | 
meet and speak with one another before engag 
in a problem-solving task, but in the subjec: 
group (the “control” group), no such interaction 
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would be possible. The instructions then continued: 


You are one in a group of four people, who will 
each be given an identical task. In a few minutes 
we will ask each of you to attempt to solve the 
same short crossword pussle. You will probably 
find that some of the words in this puzzle are easy 
to figure out, while others are perhaps not so easy. 


As soon as you have finished reading the instruc- 
tions, you may take the $2.40 on your desk. This 
is your money, and we will refer to this $2.40 as 
your INITIAL STAKE, However, whether you 
get to keep all of this Initial Stake, go home with 
less, or as much as a grand total of $8.00, will 
depend entirely on your performance in the ex- 
periment. 


In a few minutes you will be given a crossword 
puzzle, containing ten words, which you will be 
asked to solve. The amount of money that you 
make will be determined by two things: (1) the 
number of words you solve and (2) the amount 
of time you take to solve them. 


If you are able to solve at least eight of the ten 
words in the puzzle, in three minutes or less, you 
will win a total of $8.00. We will refer to this 
$8.00 prize as the JACKPOT, However—and this 
is quite important—in order for you to be eligible 
for the Jackpot, or any portion of it, you must 
correctly solve at least eight of the ten words in 
your puzzle. Only if you solve at least eight of the 
words can you take home all, or some portion, of 
the $8.00 prize. 


Subjects were told that assuming they were able 
to solve at least eight words and were therefore 
eligible for the jackpot, the amount of money they 
would win would be determined by the length of 
time they had taken. Depending on the experimental 
condition, subjects were told that for each additional 
minute taken beyond the first three minutes they 
would lose an additional 10%, either from the $8.00 
jackpot (low-decrement conditions) or from the 
remainder in the jackpot (high-decrement condi- 
tions). Thus, in the low-decrement conditions, if they 
solved eight or more words before the end of the 
fourth minute, the subjects would win 90% of 
$8.00, or $7.20; if they took five minutes to solve 
the words, they would win 80% of the 8.00 jackpot, 
or $6.40; etc. In the high-decrement conditions, sub- 
jects would win 90% of $8.00 (7.20) if they solved 
eight or more words before the end of the fourth 
minute, and 80% of the $7.20 remainder ($5.75) if 
they took five minutes, etc. 

Subjects were also told that if they were unable to 
solve at least eight words they would be ineligible 
for the jackpot and their pay would be based on 
the initial stake, as well as the amount of time they 
had spent in the experiment. For each additional 
minute beyond the first three, they would lose an 
additional 10% from their $2.40 initial stake (win- 
ning $2.16 before the end of Minute 4, $1.92 before 
the end of Minute 5, etc.). 


At this point, a second experimental manipulation 
was introduced in the instructions. Subjects in high- 
salience conditions were told: 


In order to make it easier for you to understand 
the amount of money you can win as time passes 
in the experiment, we have constructed a chart 
which shows your potential earnings at the end of 
each minute. This chart is located on the wall 
next to your desk. Take a moment now to 
familiarize yourself with it. You may refer to it 
at any time during the experiment. 


Subjects in the low-salience conditions had no such 
chart available, and the above paragraph was there- 
fore deleted in the instructions they received. 

The instructions then continued: 


You will have a total of 13 minutes in which to 
work on your puzzle, You are free, of course, to 
stop your participation in the experiment at any 
time you wish. Whether you choose to continue 
working on the puzzle or decide to stop is com- 
pletely up to you. As soon as you have solved eight 
or more words, you will obviously want to stop— 
and take home the portion of the Jackpot money 
you have won at that point. If you have not yet 
solved eight or more words, however, you may 
decide to continue working or to simply stop. By 
continuing to work, you may solve the eight or 
more words necessary for the Jackpot. By stop- 
ping, you forfeit your eligibility for the Jackpot, 
and earn a portion of your Initial Stake deter- 
mined by the time you have already spent. 


Because some of the words are more difficult to 
solve than others, you may want to make use of 
a crossword puzzle dictionary which the experi- 
menter has available. This dictionary contains the 
answers to all of the words used in constructing 
your puzzle. However, because we are interested in 
studying group problem solving when resources 
are scarce, there will be one, and only one, dic- 
tionary available for the four of you to use and 
share, as you see fit. 


In order to obtain the dictionary from the ex- 
perimenter, subjects were told, they should simply 
move the switch on their table into the “on” posi- 
tion. If the dictionary was not currently being used 
by one of the other group members, the experimenter 
would hand it to them at once, and they could then 
use the dictionary for as little or as much time as 
they wished. The instructions then stated: 


If, however, the dictionary is being used by some- 
one else, at the point you request it, it will be 
given out on a first come-first served basis—and 
the experimenter will hand you a card with a 
number on it (either a 1, 2, or 3), indicating your 
place in line. For example, if you receive a card 
with the number 1 on it, this means that the 
dictionary is in use by a member of your group, 
and that you are next in line to get it as soon 
as he is through. . . 
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If the dictionary is not available when you request 
it, and the experimenter hands you a card indicat- 
ing your number in line, he will ask you to hand 
him your puzzle, while you wait. Thus you cannot 
wait for the dictionary and continue to work on 
your puszle at the same time. If at any time you 
decide that you would prefer not to wait any 
longer, and/or that you would like to continue 
working on your puzzle, you can get your puzzle 
back by once again moving your switch into the 
“on” position. The experimenter will then return 
your puzzle and take back the card he has given 
you. This means, of course, that you will lose 
your place in line for the dictionary. 


Finally, the subjects were told they could request 
the dictionary at any time and as many times as 
they wished. They were reminded of the fact that 
they could stop the experiment at any time (by 
signaling the experimenter), and that their pay 
would be based on the number of words solved (less 
than eight vs. eight or more) and the time spent in 
the experiment, 

The experimenter then answered any and all ques- 
tions about the rules of the experiment, handed the 
subjects a crossword puzzle (face down), and on 
signal asked them to start the stopwatch on their 
table and begin work on the puzzle. All subjects were 
presented with the identical puzzle, deliberately con- 
structed to be so difficult that eight or more words 
could not be correctly solved without the use of a 
dictionary. Indeed, none of the 72 subjects was able 
to solve eight words. Moreover, there was, in actu- 
ality, no crossword puzzle dictionary. Thus, the sub- 
jects were placed in a situation in which they could 
choose to wait for a valuable, necessary resource 
which never became available. 

At the end of 13 minutes (or earlier, if the subject 
terminated the experiment before that time), sub- 
jects were given a short postexperimental question- 
naire to complete, were fully debriefed, and were 
paid for their participation. Regardless of the amount 
of time subjects had spent in the experiment, they 
were paid a minimum of $2.00, and as much as $2.40 
(if they stopped the experiment before the end of 
the third minute). 


Experimental Design and Manipulations 


Three treatment variables were incorporated into 
a completely between-subjects, 2X 2 X 2 factorial 
design: (a) decrement, manipulated by varying the 
rate of decrease of the amount of money in the 
jackpot—high versus low; (b) salience, manipulated 
by the presence or absence of a chart showing the 
rate of decrement, over time, of the jackpot and the 
initial stake—high versus low; and (c) queue posi- 
tion, manipulated by varying the information sub- 
jects were given about their position in line for the 
“dictionary”—first (next in line) versus third (last 
in line, behind the current “user” and the two 
“others” waiting ahead of the subject). Subjects 
were thus forever waiting in line for a dictionary 
which they never received? 
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RESULTS 
Game Behavior 
Given our interest in the condition ler 
which people become entrapped in wing 


conflicts, the data of primary impor 
this study were tapped through a variet) of 
measures of game behavior. The cond. >n 
means for these measures are presented `n 
Table 1. 

The experimental situation was struct 
in such a way that subjects’ waiting beh: 
could be examined in relation to sev 
“break-even” points. One of these points 
represented by the end of Minute 3, afte 
which the size of the initial stake began t 
decrement at a rate of 10% per additiona! 
minute. Thus, to the extent that subjects re 
mained in the experiment longer than three 
minutes, they presumably did so because they 
expected to solve eight or more words and 
obtain some portion of the jackpot. As can 
be seen in Table 1, 87% of the subjects 
(collapsing across experimental conditions) 
remained in the experiment longer than three 
minutes. Chi square analysis of this index, 
following a procedure for frequency data in 
multiple classification designs (Sutcliffe, 
1957), revealed no significant differences as 
a function of the three treatment variables. 
The only effect to approach significance was 
a salience main effect, x?(1) = 3.07, p < .10. 
Thus, a greater percentage of subjects tended 
to remain in the experiment beyond this first 
break-even point of three minutes when they 
were in low- rather than in high-salience con- 
ditions, 

A second break-even point is represented by 
the time beyond which subjects could make 
no more than $2,40—even if they won what 
was left in the jackpot! For subjects in high- 
decrement conditions, the jackpot dropped to 
$2.40 during the seventh minute; while in 
low-decrement conditions, it dropped to $2.40 
during the tenth minute. Thus by waiting 
beyond the seventh and tenth minutes, re- 


3 Not all of the subjects asked for the dictionary 
Indeed, this was the case for 15 of the participants 
Because these subjects never asked for the dictio 
none were exposed to the queue-position v 
and their data were therefore eliminated, lea ; 
total of 72 subjects unequally distributed among the 
eight experimental conditions. 
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TABLE 1 


MEAN INDICES OF GAME BEHAVIOR AS A FUNCTION 


or DECREMENT, SALIENCE, AND QUEUE POSITION 


High decrement Low decrement 
index High salience Low salience High salience Low salience x 
First | Third | First | Third | First | Third | First | Third 
| Gr = 9) | = 8) | = 9) | Ge = 9) | = 9) |e = 9) | Cw = 9) | (n = 10) 

Percentage of subjects | | 

remaining in experiment 

beyond minute three 77.8 | 62.5 | 88.9 | 100.0 | 100.0 77.8 88.9 | 100.0 87.0 
Percentage of subjects | | 

remaining in experiment | 

beyond jackpot-initial stake! | 

break-even point 55.6 55.6 | 22.2 77.8 70.0 53.9 
Overall winnings (cents) 106.67 | 74.67 | 157.33 | 40.00 | 57.60 | 104.32 
Total wait for dictionary (sec) 157.78 | 302.89 | 71.89 | 425.78 | 243.70 | 243.52 
Maximum wait for dictionary | | 

(sec) 275.22 | 138.12 | 275.00 | 108.89 | 302.22 


spectively, subjects could win no more by 
further waiting (for the dictionary) than they 
would have by quitting the experiment before 
the end of the third minute and taking home 
their $2.40 initial stake. As can be seen in 
Table 1, more than half (53.9%) of the sub- 
jects in the experiment continued to wait 
beyond this “point of no return,” having been 
entrapped in a conflict from which there was 
no longer any satisfactory escape. Chi square 
analysis of this measure, by conditions, re- 
vealed only one marginally significant effect— 
a main effect for salience, y*(1) = 2.98, p < 
.10. Thus, as in the case of the three-minute 
break-even point, a greater percentage of 
subjects in low-salience conditions tended to 
wait beyond the point when the jackpot had 
dropped to $2.40. 

Perhaps the simplest indicator of degree of 
entrapment is represented by subjects’ overall 
winnings, since this measure is a direct reflec- 
tion of the amount of time spent in the ex- 
periment. As can be seen in Table 1, subjects, 
across conditions, made only $1.04—less than 
half of the $2.40 they could have obtained by 
quitting before the end of the third minute. 
Analysis of variance of this index revealed 
significant main effects for decrement, F(1, 
64) = 4.24, p<.05, and salience, F(1, 
64) = 10.16, p < .005, as well as a mar- 
ginally significant effect due to queue position, 
F(1, 64) = 3.65, p < .10. Thus, as expected, 
overall winnings were lower (subjects were 


entrapped to a greater extent) when the size 
of the jackpot decreased slowly rather than 
rapidly, when no payoff chart was made avail- 
able (low salience), and when subjects were 
told they were first rather than third in line 
for the dictionary. No significant interaction 
effects emerged. 

While the above results clearly suggest 
that subjects became entrapped in the experi- 
mental situation, and that their stay in the 
experiment varied as a function of the three 
treatment variables, these results are of little 
help in explaining what subjects actually did 
throughout their stay. In particular, to what 
extent did they spend their time waiting for 
the dictionary? Table 1 presents the subjects’ 
total waiting time for the dictionary. The 
grand mean of 243.52 sec (4.06 min) reflects 
the fact that approximately one third of the 
experiment was spent waiting for a valuable 
resource that never arrived. Analysis of vari- 
ance of this index revealed significant effects 
for salience, F(1, 64) = 4.02, p < .05, and 
queue position, F(1, 64) = 17.04, p < .001. 
Thus, the total wait for the dictionary was 
longer when salience was low and when the 
subjects were given information which indi- 
cated that they were next in line for the 
dictionary. Again, no significant interactions 
emerged, 

Finally, in order to better understand the 
nature of the above condition differences in 
time spent waiting for the dictionary, it may 
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be useful to consider one additional index of 
game behavior. One might expect entrapped 
subjects to have not only waited longer for 
the dictionary overall, but once having re- 
quested it, to have waited longer (to have had 
a greater maximum waiting time) before tem- 
porarily giving up and going back to work on 
the puzzle. Note that maximum waiting time 
should be negatively correlated with the num- 
ber of times the dictionary was requested— 
and indeed this was found to be the case, 
(70) = —.22, p < .05. 

Table 1 presents the subjects’ maximum 
waiting time for the dictionary. Analysis of 
variance revealed a significant effect due to 
queue position, F(1, 64) = 20.48, p < .001, 
as well as a marginally significant Decrement 
X Salience interaction, F(1, 64) = 3.43, p < 
-10. Maximum wait for the dictionary was 
thus greater when subjects were told they 
were first, rather than third, in line. More- 
over, this wait was longer in high-decrement, 
high-salience than in low-decrement, high- 
salience conditions, being longest, as one 
might expect, in low-decrement, low-salience 
conditions. 


Postexperimental Questionnaire Responses 


The postexperimental questionnaire con- 
sisted of several items designed to assess sub- 
jects’ post hoc explanations for their behavior. 
The most important of these items, with re- 
spect to understanding subjects’ waiting be- 
havior, was a four-part question in which 
subjects were to indicate the importance of 
each of four reasons for waiting for the dic- 
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tionary as long as they did. The reasons were 
as follows: (a) “As- time went by, I felt 
increasingly confident that I would get the 
dictionary soon”; (b) “By getting the dic- 
tionary, I thought I could earn more money 
than I already had”; (c) “I had already 
waited so long, it would have been foolish not 
to wait a little longer”; and (d) “I did not 
realize how much money I had lost. If I had, 
I surely would have quit sooner.” Subjects 
rated the importance of each reason on a 
61-point, dotted scale extending from “not 
at all important” (1) to “very important” 
(61). Although each reason was rated inde- 
pendently, and all four could therefore have 
been judged to be equally important or un- 
important, one might more reasonably expect 
these ratings to have been made in concert 
and to have varied as a function of the 
perimental manipulations. This, indeed 
pears to have been the case. 

The condition and grand means for sub- 
jects’ importance ratings of the four reasons 
are presented in Table 2. As can be seen, 
none of the four reasons for having waited 
for the dictionary was seen as particularly 
important. In fact, judging from the means 
obtained by collapsing across conditions, three 
of the four reasons were seen as rather unim- 
portant. A rather different picture emerged, 
however, when these ratings were considered 
as a function of the experimental treatments. 
Analysis of variance of ratings of the first 
reason (feeling increasingly confident) re- 
vealed one significant effect: for queue posi- 
tion, F(1, 64) = 26.95, p < .001. Subjects 


ex- 
ap- 


TABLE 2 


MEAN Importance RATINGS OF Four REASONS FOR WAITING FOR DICTIONARY AS A FUNCTION OF DECREMENT, 
SALIENCE, AND QUEUE POSITION 


High decrement 


Low decrement | 


Low salience 


RAA High salience Low salience High salience | 3 
] 
First Third First Third First Third First | Third | 
(n = 9) | (n = 8) | (n =9) | (n =9) | (n =9) | (n = 9) | (n =9) | (» = 10)| 
Increased confidence 28.00 | 12.75 | 36.56 | 5.78 | 27.89 7.11 33.78 | 16.40 | 21.03 
Making more money 33.44 | 20.62 | 53.00 | 24.44 | 37.11 | 22.11 | 43.33 | 47.00 3513 
Foolish not to continue waiting) 24.56 8.88 | 41.22 | 21.56 | 22.89 | 7.89 | 22.78 | 32.50 | 22:79 
Not realizing amount of r | | 2 
money already lost 13.44 | 14.62 | 38.33 | 27.33 | 14.11 | 5.89 | 22.00 | 25.40 


20.14 


Nolte. The larger the magnitude of a condition mean, the greater the rated importance of a reason (out of 61). 


i 


a) 


who were led to believe they were next in line 
for the dictionary rated their confidence that 
they would obtain it soon as far more im- 
portant than those who were told they were 
third in line. A roughly comparable pattern 
appeared in the variance analysis of the sec- 
ond reason (expecting to make more money). 
Here, two significant effects emerged: for 
salience, F(1, 64) = 7.57, p < .01, and queue 
position, F(1, 64) = 7.09, p < .01. Subjects 
„Were thus more likely to rate the possibility of 
earning more money than they already had 
as an important reason for waiting for the 
dictionary when they had no payoff chart 
available, and when they were told they were 
first in line, 

Again, when one turns to the third reason 
for waiting (foolish not to continue waiting), 
a similar pattern emerges. Analysis of vari- 
ance of this measure revealed significant ef- 
‘fects for salience, F(1, 64) = 7.12, p < .01, 
and queue position, F(1, 64) = 4.05, p < .05. 
Thus, subjects in low-salience conditions, and 
those given the first queue position, cited the 
foolishness of not waiting a little longer as 
a more important reason for “holding out” 
for the dictionary. Finally, analysis of ratings 
of the fourth reason (not realizing the amount 
of money already lost), revealed a significant 
salience effect, F(1, 64) = 11.51, p < .005, 
“reflecting the fact that subjects in low-salience 
conditions were less cognizant of their losses 
(and rated this as a more important reason 
for their waiting) than those in high-salience 
conditions. 


Discussion 


Following from a conceptual analysis of 
the characteristics of waiting conflicts, an 
experimental paradigm was developed in 
which it was predicted that entrapment would 
be especially likely to result. That entrap- 
ment did, in fact, occur is clear. Overall, 87% 
of the subjects remained in the experiment 
beyond the point (Minute 3) at which con- 
tinuation began to be associated with increas- 
ing monetary cost. More than half of the 
subjects continued in the experiment beyond 
the jackpot —initial stake break-even point, 
the point at which they could make no more 
by reaching their goal than they would have 
by quitting the experiment earlier and settling 
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for the initial stake. Moreover, subjects spent 
almost a third of the experiment waiting for 
a necessary resource (the dictionary) that 
never became available and ended up earning 
far less ($1.04) than the initial stake of 
$2.40 with which they began the task. Taken 
together, these findings thus point to the 
success of the experimental paradigm in cap- 
turing the ingredients necessary for entrap- 
ment. 

It was predicted and found that degree of 
entrapment in this situation varied as a func- 
tion of three factors: salience, queue position, 
and decrement. In contrast with subjects who 
were provided with charts depicting the de- 
crement over time of the initial stake and 
jackpot (high salience), those in low-salience 
conditions: (a) spent more time waiting for 
the dictionary, (b) earned less money, and 
(c) tended to remain in the experiment be- 
yond both Minute 3 and the jackpot — initial 
stake break-even point. Similarly, subjects 
who were told they were next in line for the 
dictionary spent more time waiting for the 
dictionary (both overall and on their longest 
single wait), and tended to earn less money 
than those subjects who were told they were 
third in line. Finally, as expected, subjects 
made less money when the size of the jack- 
pot decreased slowly (low-decrement condi- 
tions) than when it decremented more rap- 
idly. 

In interpreting the particular effects of the 
experimental variables on subjects’ behavior, 
the postexperimental questionnaire data are 
of some additional help. For unlike Rosen- 
crantz and Guildenstern, who “move idly 
toward eternity, without possibility of re- 
prieve or hope of explanation,” (Stoppard, 
1968, p. 121) our subjects appear to have 
had a pretty fair idea of why they became as 
entrapped as they did. Recall the four rea- 
sons which subjects were asked to rate and 
the significant effects obtained for each: (a) 
the feeling of increasing confidence that they 
would get the dictionary soon (significant 
queue position effect only); (b) the belief 
that by getting the dictionary, they could 
earn more money than they already had 
(queue position and salience main effects); 
(c) the belief that they had already waited 
so long, it would have been foolish not to 
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wait a little longer (queue position and sa- 
lience main effects); and (d) the nonrealiza- 
tion of the amount of money already lost 
(salience effect only). 

The first three reasons represent verbal 
translations of the three driving forces we 
have postulated keep a person in a waiting 
conflict, thereby engendering entrapment: the 
presumed increased proximity to the goal, 
the reward associated with attaining the goal, 
and the cost associated with giving up on 
one’s investment. The restraining force 
against continued waiting (the cost associated 
with the passage of time) is represented by 
the last reason. Thus, subjects told they were 
first in line may have become as entrapped as 
they did because of the enhanced importance 
in their eyes of the three driving forces: The 
longer they waited, the closer they believed 
they were to a goal which was associated with 
reward, and the more prone they thus became 
to justify their effort by continued waiting. 
The explanation for the entrapment of sub- 
jects in low-salience conditions is slightly dif- 
ferent, and may be attributable to the en- 
hanced importance of two of the three driving 
forces, as well as the relative unimportance 
of the sole restraining force. Because these 
subjects thought their continued waiting in- 
creased the likelihood of their making more 
money than they already had, and because 
they were unaware (or so they said) of the 
costs associated with this waiting, they too 
tended to justify their effort by continued 
waiting—and entrapment. 

While the questionnaire data permit us to 
better understand the effects of the salience 
and queue-position variables on subjects’ be- 
havior (and reasoning), the exact functioning 
of the third (and perhaps least important) 
variable, rate of decrement, is less clear. That 
this is so can be seen by inspection of those 
conditions in which entrapment was most and 
least likely to occur. On the one hand, it 
was found (as expected) that subjects in the 
low-decrement, low-salience, first-queue-posi- 
tion condition were most likely to wait beyond 
the jackpot — initial stake break-even point, 
won the least money in the experiment, had 
the longest (overall and maximum) waiting 
times for the dictionary, and rated all four 
of the reasons for waiting for the dictionary 
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as most important. This, then, was t ndi- 
tion in which entrapment was con ntly 
most likely to occur. 

It was expected, conversely, that 1p- 
ment would be least likely to emerge he 
high-decrement, high-salience, third e- 
position condition, and indeed the overall 


winnings data suggest that this was the cise, 
Other measures, however, present a somewi:> 
different picture. As the treatment means fo 
the total wait and maximum wait for dic- 
tionary indices indicate, the smallest values 
were obtained in the /ow-decrement, high-sa- 
lience, third-queue-position condition, But 
why should this have been the case? And 
what do these apparently contradictory find- 
ings imply for our analysis of the effects of 
the decrement variable on entrapment? 
Perhaps a subject in the high-decrement, 


high-salience, third-queue-position condition 


reasoned more or less as follows: 


Hmm. I'm in a rather precarious situation, I ca 
see from my charts that the value of the jackpot 
decreasing rather quickly. So, if I’m to have any 
chance of winning a large sum of money, I'd bett: 
make my move and try to get that dictionary. [} 
requests dictionary.] Hmm. I’m third in line. Wel’, 
I'd better stick it out for a while and keep waitiny 
It’s now or never. And if I don’t get the dictionary 
prettly soon, TIl just have to quit the experiment, 


A subject in the low-decrement, high-sa- 
lience, third-queue-position condition, on the 
other hand, may have reasoned somewhat 
differently: 


Hm. Things look pretty good. I can see from my 
charts that the value of the jackpot is decreasing 
pretty slowly, so I can afford to stay in the experi- 
ment for quite a while. In order to win the jackpot, 
though, I need that dictionary, so I might as well 
try to get it now. [He requests the dictionary.] 
Hmm. I’m third in line. Well, I’ve got time. I might 
as well stop waiting for the dictionary, go back to 
work on my puzzle, and try asking for the dictionary 
a little later. 


This analysis appears to account for the 
data and, if correct, it implies that degree of 
entrapment was indeed less intense in the 
high-decrement, high-salience, third-queue- 
position than in the low-decrement, high-sa- 
lience, third-queue-position condition—as ex- 
pected. Subjects in the former condition tried 
to win the jackpot by requesting the diction- 
ary and then “sticking it out” for a longer 


mee 


S 
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period of time—but ended up quitting the 
experiment sooner (hence their greater win- 
nings and lesser entrapment). The above rea- 
soning, by the way, may help explain the 
emergence of a Decrement X Salience inter- 
action trend for the maximum wait for dic- 
tionary measure, in which it was found that 
waiting time was longer in high-decrement, 
high-salience than in low-decrement, high- 
salience conditions. 

Future research on entrapment in waiting 
conflicts might fruitfully move in one or more 
of the following directions. First, it may be 
useful to attempt the development of a mathe- 
matical model that would permit quantitative 
predictions to be made about how one should 
behave in waiting conflicts. Decision theorists 
such as Edwards (1965) and Rapoport and 
Wallsten (1972) have developed a Bayesian 
model (the “optional stopping” model) which 
deals with individual decision behavior in 
certain multistage decision tasks. The decision 
maker, in the optional stopping model, is 
presented with a set of data, is required to 
estimate the likelihood of occurrence of some 
outcome, and is permitted access (at a cost) 
to information which may affect the accuracy 
of this estimate. While such a model cannot 
be readily fitted to the present data (in part 
because the three experimental variables are 
nominal-ordinal rather than interval/ratio) , 
it does suggest the feasibility of developing a 
model of normative behavior in waiting con- 
flicts, 

Second, in light of our analysis of entrap- 
ment in terms of driving and restraining 
forces, it may be useful to further explore 
their relative importance. For example, is the 
reward associated with the goal and its (pre- 
sumably) increasing proximity as important 
in shaping behavior as the cost associated 
with the passage of time (and the “giving up” 
on one’s investment)? An answer to such a 
question might be obtained by experimentally 
directing a subject’s perspective either for- 
ward in time (toward the goal he believes he 
is nearing) or backward (toward the cost 
he has already incurred to get where he is 
going). 

Finally, given that our analysis of behavior 
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in waiting conflicts began with several ex- 
amples from the world of everyday experi- 
ence, it may be useful to return to such ex- 
amples and study the circumstances in which 
entrapment does in fact occur. Investors, for 
instance, often tend to become “wedded” to 
their stocks, Having made a substantial in- 
vestment, they hold onto their stock—even 
in the face of its declining value. If our analy- 
sis of entrapment is correct, the length of 
time an investor in a “bear” market remains 
wedded to his investment should vary as a 
function of at least three factors: the rate at 
which his stocks are declining in value, the 
amount of information he has about this rate 
of decline, and his estimate of how long it 
will be before the market turns “bullish” once 
again. Such a set of predictions could ob- 
viously be subjected to experimental test. 


REFERENCES 


Deutsch, M. Field theory in social psychology, In 
G. Lindzey & E. Aronson (Eds.), The handbook 
of social psychology (Vol. 1). Reading, Mass.: Ad- 
dison-Wesley, 1968. 

Deutsch, M. The resolution of conflict: Constructive 
and destructive processes. New Haven: Yale Uni- 
versity Press, 1973. 

Edwards, W. Optimal strategies for seeking informa- 
tion; Model for statistics, choice reaction times, 
and human information processing. Journal of 
Mathematical Psychology, 1965, 2, 312-329. 

Lewin, K. A dynamic theory of personality. New 
York: McGraw-Hill, 1935. 

Lewin, K, The conceptual representation and mea- 
surement of psychological forces. Contributions to 
Psychological Theory, 1938, 1, No. 4. 

Lewin, K. Field theory in social science. New York: 
Harper, 1951. 

Miller, N. E. Experimental studies of conflict. In 
J. McV. Hunt (Ed.), Personality and the behavior 
disorders (Vol. 1). New York: Ronald Press, 1944, 

Rapoport, A., & Wallsten, T. S. Individual decision 
behavior, Annual Review of Psychology, 1972, 23, 
131-176. 

Schelling, T. C. The strategy of conflict. Cambridge, 
Mass.: Harvard University Press, 1960. 

Shubik, M. The dollar auction game: A paradox in 
noncooperative behavior and escalation. Journal of 
Conflict Resolution, 1971, 15, 113-120, 

Stoppard, T. Rosencrantz & Guildenstern Are Dead, 
New York: Grove Press, 1968. 

Sutcliffe, J. P. A general method of analysis of fre- 
quency data for multiple classification designs. 
Psychological Bulletin, 1957, 54, 134-137. 


(Received January 7, 1974) 


it 


Journal of Personality and Social Psychology 
1975, Vol. 31, No. 6, 1064-1072 


Integration Theory Applied to Opinion Attribution 


Samuel Himmelfarb 
Tel-Aviv University, Tel Aviv, Israel 


Norman H. Anderson 
University of California, San Diego 


An integration model of opinion attribution was tested in two experiments. 
Subjects judged on a militarism-pacifism dimension the true opinion of stimu- 
lus persons who were characterized by two militaristic and one pacifistic state- 
ment of opinion. Three different instructional conditions were used to vary 
the degree of choice that the stimulus person had in making these statements. 
According to attribution theory, degree of choice should affect the weight given 
to each statement. According to integration theory, these weight manipulations 
should produce specific qualitative and quantitative effects on the judgments. 


The integration model was well supported in the main data, 
was one discrepancy in some auxiliary data. Responses to sets 
statements supported the principle of information integration 


although there 
of inconsistent 
rather than a 


principle of cognitive consistency. Overall, the results indicate the usefulness 
and power of the integration-theoretical approach to social attribution. 


We frequently face the problem of inferring 
a person’s true opinion from what he says 
and does. But, he may express different opin- 
ions on different occasions. Also, he may 
express opinions in which he does not believe, 
or with a conviction he does not feel. How 
are these various behavioral expressions pro- 
cessed to arrive at an inference about the 
person’s true opinion? 

This article attacks opinion attribution 
with a mixture of ideas from information 
integration theory and attribution theory. 
According to integration theory (Anderson, 
1971), the inferred opinion is simply a 
weighted average of relevant information. 
This averaging formulation has received ex- 
tensive support, but it has not been tested in 
the area of opinion attribution. 

In opinion attribution, the weight of each 
piece of information will, in general, depend 
on various situational constraints. Essentially 
this problem has been the main concern of 
attribution theory as developed by Jones and 
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his colleagues (Jones & Davis, 1965; Jones 
& Harris, 1967; Jones, Worchel, Goethals, 
& Grumet, 1971). 

Both theories employ the theoretical con 
cept of weight, but it is not known whether 
these two usages are equivalent. Integratio: 
theory provides a formal definition of weight 
in conjunction with the concept of scale value 
In attribution theory, however, the concept 
of weight is not well formalized and the con- 
cept of scale value is largely lacking. In prin- 
ciple, however, the informational manipula- 
tions employed by Jones should correspond 
directly to the weight parameter of integra- 
tion theory (Himmelfarb, 1972, 1974). One 
main purpose of the present research was to 
assess this correspondence between the two 
usages of the concept of weight. 

In the present experiments, subjects judged 
the true opinion of persons on a militaristic— 
pacifistic dimension. Each person was said to 
have made three statements, two militaristic 
and one pacifistic, each statement having been 
made on a different occasion in school essays 
about topics of war and peace. 

An important independent variable was the 
degree of choice that the person had in 
writing the essays. In accord with Jones’s 
formulation, the choice variable should have 
predictable effects on the weight parameters. 
These predictions can be given quantitative 
form within the averaging model of integra- 
tion theory. For simplicity of exposition, these 
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A predictions are discussed below in parallel 
. with the data. 


METHOD 


l Two similar experiments were run, both of which 
. used three instructional conditions in which a set 
of three statements was made by a single person. 
The subject’s task was to infer a person’s opinion 
from the three statements. Experiment 1 also in- 
| cluded a group condition, in which each statement 
was said to have been written by a different person, 
and the subject was to judge the average opinion 
of the group. 

In Experiment 1, the four conditions were given 
in the same order for all subjects as follows: group, 
own feelings, own feelings plus one pacifistic essay, 
and militaristic essays. In Experiment 2, the group 
| condition was omitted, and the three person condi- 
| tions were run in all six possible orders. No effect 
i of order was found, and there were only minor 
| differences between the two experiments. 


Stimulus Statements 


A large number of opinion statements were 

selected from several previous scales designed to 
| measure attitudes toward militarism and pacifism 
eo (Shaw & Wright, 1967). Preliminary ratings were 
| used to select eight statements each of extremely, 
moderately, or mildly militaristic value, and six 
statements each of extremely, moderately, or mildly 
pacifistic value. 


Stimulus Design 


The basic stimulus design was a 3X 3 factorial. 
The row factor was a pair of militaristic statements, 
and the column factor was a single pacifistic state- 
ment. The three levels within each factor were the 
three degrees of extremity of statement value. The 
row-column imbalance was chosen so that any dis- 
counting effects would work against the single paci- 
fistic statement and thus appear as specific inter- 
actions in the design (Anderson & Jacobson, 1965). 

Set construction and balancing. Four 3 X 3 stimu- 
lus designs were constructed by random choice 
without replacement of two of the eight statements 
at each level of militarism, and one of the six state- 
ments at each level of pacifism. In Experiment 1, a 
Latin square was used to balance these four designs 
across instruction conditions. For further balancing, 
this procedure was repeated to form a second Latin 
square. 

For Experiment 2, the procedure was similar, 
l From each group of four designs constructed in 
Experiment 1, three designs were selected at random. 
Each group of three stimulus designs was then 
balanced with the three instructional conditions, and 
with order of presentation, in a Greco-Latin square. 


Instructions 


Subjects were given an initial set of general in- 
structions about the problem of judging people’s true 
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opinions in everyday life. They were then told that 
they would judge the true opinion of high school 
students on a militarism—pacifism dimension making 
use of statements made by these students in short 
weekly essays in a composition course. The subjects 
were told that the essay topics were assigned by the 
instructor, and that sometimes the students were 
instructed to express their own feelings while other 
times the direction of their essay was assigned. Thus, 
the subject was to read the set of statements and 
try to judge the student’s true opinion, 

Specific instructions for each experimental condi- 
tion were as follows: 

Group condition (Experiment 1 only). Subjects 
were told that the statements on each card were 
made by different persons, randomly put together. 
They were to imagine that these people formed a 
group and to judge the average or general opinion 
of the group. 

Person conditions. In each of the three person 
conditions, subjects were instructed that each state- 
ment on a card had been written by a single person 
at different points during the composition course. 
The statements had been placed on each card in 
random order, 

In the own-feelings conditicn, each statement was 
said to come from an essay in which the stimulus 
person had been told to express his own feelings. 

In the own-feclings-plus-one-pacifistic-essay con- 
dition, the statements were said to come from essays 
in which the stimulus person had been told to 
express his own feelings, except that on one occasion 
he was to write a pro-pacifistic essay. The statement 
written on that occasion was not specificed, 

In the militaristic-essays condition, all statements 
on a card were said to come from essays in which 
the stimulus person had been told to write pro- 
militaristic essays. 


Procedure 


Each set of statements was typed on an index 
card, and these were shown to the subject one at a 
time in random order. Subjects responded by placing 
a pin in a 20-cm graphic rating scale, labeled Ex- 
tremely Militaristic and Extremely Pacifistic at either 
end. The experimenter read the response to the 
nearest millimeter on a scale at the back of the 
rating scale. 

Each subject made 29 judgments under each of 
the four conditions of Experiment 1, and under each 
of the three conditions of Experiment 2. Eighteen 
of these judgments represented two runs through the 
basic 3 X 3 design, The remaining 11 judgments were 
of five filler sets, three pairs of militaristic state- 
ments, and three single pacifistic statements which 
defined the rows and columns of one 3X 3 design, 

Subjects. Subjects were students who were fulfill- 
ing a course requirement in introductory psychology 
at the University of California, San Diego. There 
were 26 and 30 subjects in Experiments 1 and 2, 
respectively. 
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RESULTS 


Three aspects of the data require discus- 
sion. First is a qualitative analysis, noting 
how the instruction conditions affect the 
weight parameters and how that governs the 
elevation, slope, and vertical spread of the 
data. Second is an exact test of the model 
based on the parallelism property. Last is a 
discussion of some auxiliary data in which a 
discrepancy from the model was found. 


Person Conditions 


Weight parameters. According to attribu- 
tion theory, the instructions should have dif- 
ferent effects on the weight parameter in each 
of the three person conditions. The rationale 
is as follows. 

In the own-feelings-plus-one-pacifistic-essay 
condition, each essay was said to have been 
written under instructions to express own feel- 
ings, except that on one occasion the person 
had been instructed to write a pacifistic essay. 
In this condition, therefore, the single paci- 
fistic essay is not very informative, since it 
accords with the situational constraints. It 
should, therefore, receive a low weight in the 
inferred opinion. 

In the own-feelings condition, each essay 
was said to have been written under instruc- 
tions to express own feelings. In this condi- 
tion, there are no situational constraints so 
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Ficure 1. Mean attributed opinion as a function 
of instruction condition and scale values of the 
three opinion statements from essays ascribed to 
the stimulus person. (Data in left panel are from 
Experiment 1; data in right panels are averaged 
over Experiments 1 and 2. Abbreviations are as 
follows: MOD refers to moderate and EXT refers 
to extreme.) 
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that the single pacifistic statement «!-uld * 
receive its natural weight in the fc:red 
opinion. 

In the militaristic-essays condit 1l 


essays were said to have been written 
instructions to write militaristic essays, i- 
bution theory then implies that the 
pacifistic statement should be very infor 
tive, since it goes against the situational c 
straint. It should, therefore, receive a hig 
weight in the inferred opinion. In this pars 
ticular case, the situational constraint should 
have a second effect, causing also a reduc- 
tion in the weight of the pair of militaristic 
statements. 

The data for each person condition is 
shown as a 3 xX 3 panel in Figure 1. (The 
leftmost panel, for the group condition, will 
be discussed below.) The three person Ati 
tions are arranged so that the weight of the 
single pacifistic statement should, by the 
rationale just above, increase from left to 
right. According to the integration model, 
this weight parameter controls three aspects 
of the data which will be discussed in turn. 

Elevation. As the weight of the pacifistic 
statement increases, the attributed opinion 
should become more pacifistic. Thus, the ele- 
vation of the three panels in Figure 1 should 
increase from left to right. This is an obvious 
prediction and it is clearly verified in the 
graph. The F ratios among the overall means 
of the three person conditions were F(2, 50) 
= 147.80 in Experiment 1, and F(2,48) = 
51.01 in Experiment 2. i 

Slope. The weight parameter of the paci- | 
fistic essay controls the overall slope of the i 
curves in Figure 1. As the elevation increases 
across conditions, therefore, the slope should ll 
also increase. Just this pattern can be seen in \, 


Figure 1. | 
The theoretical rationale for this prediction _ 1 

is that the overall slope is a direct function 

of the weight multiplied by the range in scale 

value between the mild and extreme pacifistic 

statements. Since the range in scale value is ~ 

the same in all conditions, the slope of the : 

curves reflects the weight parameter directly. 

This prediction holds both for adding and d 

averaging models. | 
The slope prediction may be assessed by f 

comparing the linear trend of the pacifistic 


Statements across the three person conditions. 
As can be seen in Figure 1, slope was least 
in the own-feelings-plus-one-pacifistic-essay, 
greater in own-feelings, and greatest in the 
militaristic-essays condition. These differences 
were significant, #(2,110) = 6.70. 

Vertical spread. The vertical spread of the 
three curves in each panel of Figure 1 is a 
direct index of the effective weight of the pair 
of militaristic statements. Since the weights 
"must add to one, according to the averaging 
model, the effective weight parameter of the 
pair of militaristic statements must vary in- 
versely with the effective weight parameter 
of the pacifistic statement. The vertical 
spread should then decrease from left to right, 
and this pattern is clearly visible in Figure 1. 
The mean values of the vertical spread were 
4.35, 4,09, and 3.28, respectively. 

This trend in vertical spread was tested 
by using the linear-by-linear component of 
the interaction between the three person con- 
ditions and the three values of the pair of 
militaristic statements. This trend component 
was significant, 7(1,55) = 6.86. 


AAN 


Parallelism Property 


According to the equal-weight averaging 
model, the three curves in each panel of data 
«din Figure 1 should be parallel. This parallel- 
ism property is clearly visible in the graph. 
Statistically, parallelism is tested by the 
interaction term in the analysis of variance 
which should be nonsignificant. The F ratios 
were .17, 1.62, 1.78, and 1.16 in the four 
respective panels of Figure 1, none of which 
approaches significance. These data, there- 
fore, support the model as an exact rule of 
stimulus integration. 

»* The parallelism property depends on two 
conditions that need to be kept in mind. The 
first condition, of equal weighting, requires 

p that the informational stimuli within each 

factor of the design all have the same weight 
among themselves. In particular, the three 
pacifistic statements are required to be 
equally important or informative. If their in- 
formational content varied appreciably, then 
systematic deviations from parallelism would 
be expected, at least within the averaging 
model (Anderson, 1971). Similarly, the three 
y pairs of militaristic statements are required 
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to be equally important among themselves. 
With the present stimulus materials, equal 
weighting was expected to hold because the 
statements were chosen to carry equal amounts 
of information. 

The second condition for parallelism is 
that the stimuli do not interact: the scale 
value and weight of any stimulus item are 
assumed to be constant across the various 
stimulus sets. This condition would be vio- 
lated if there were any appreciable effects of 
inconsistency. In particular, the extremely 
pacifistic statement might be discounted 
owing to inconsistency when combined with 
the pair of extremely militaristic statements, 
but not when combined with the pair of 
mildly militaristic statements. Such discount- 
ing would in general produce systematic 
deviations from parallelism (Anderson & 
Jacobson, 1965; Himmelfarb & Senn, 1969), 
Accordingly, the obtained parallelism implies 
that discounting did not occur. 

A further check on stimulus interaction 
comes from the comparison of the group and 
own-feelings conditions. No interaction is 
expected in the group condition; there is no 
real inconsistency in this condition because 
the three members of the group might well 
hold quite different opinions. Thus, the group 
condition has an important methodological 
role. If there were stimulus interaction, then 
the own-feelings condition would be expected 
to be systematically different from the group 
condition. But since these two conditions gave 
very similar results, it follows that stimulus 
interaction was not present. 

The group condition was used only in Ex- 
periment 1. The statistical analysis for that 
experiment showed no significant differences 
between group and own-feelings conditions. 
Experiment 2 was run to provide a further 
check on the parallelism property for the 
three person conditions. Overall, therefore, 
the data provide substantial evidence against 
the interaction hypothesis. 


Judgments of Singles and Pairs 


Inferred opinions were also given for per- 
sons described either by the single pacifistic 
statements, or by the pairs of militaristic 
statements. These data were obtained mainly 
for the sake of completeness, as a check on 
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the normative scale values. However, they 
showed an apparent discrepancy from the 
averaging model that requires a more detailed 
discussion, 

Theoretically, these judgments should show 
a pattern of elevations more or less similar 
to that already seen in Figure 1. The left 
panel in Figure 2 shows the opinions inferred 
from the single pacifistic statement. As ex- 
pected, the greater weight of the statement, 
the more pacifistic is the judgment. A similar 
pattern of elevations is also seen in the right 
panel for opinions inferred from the pair of 
militaristic statements. 

The data patterns in Figure 2 may appear 
obvious, but they do require theoretical expla- 
nation. Since the three curves in each panel 
represent the very same statements, the dif- 
ferences in elevations need to be related to 
the weight parameter. Within averaging 
theory, differences in elevation come about 
because the overt response is a weighted 
average of the given statement information 
and the “initial impression” or a priori 
expectation (Anderson, 1974c). The initial 
impression is assumed to have the same near- 
neutral value in all three person conditions; 
the instructions should not affect a priori 
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Ficure 2. Mean attributed opinion as a function 
of instruction condition and scale value of the single 
pacifistic statement, or a pair of militaristic state- 
ments ascribed to the person. (Data are averaged 
over Experiments 1 and 2. Abbreviations are as 
follows: MOD refers to moderate and EXT refers 
to extreme.) 
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expectation about the person’s opini hes 
greater the absolute weight of the « nt | 
information, therefore, the less is the je 
or effective weight of the initial im; cy 


This interpretation accounts directly { e 
elevations of the curves in the left pa r 
the single pacifistic statement. It is also con- 
sistent with the pattern of elevations in 
right panel, since only in the militarisi 
essays condition were the instructions ey 
pected to have a direct effect on the weight’ 
parameter of the militaristic essays them- 
selves. This one curve should be higher, 
therefore, and the other two should be about 
equal, and just this pattern appears in 
the graph. 

Averaging theory also makes a more spe- 
cific prediction about the general shape of 
each panel of data in Figure 2. Each panel 
represents a 3 X 3 design in which the row, 
factor represents the weight of the statement 
information, and the column factor represents 
scale value. The parallelism prediction does 
not apply here because the equal-weight 
assumption does not hold across rows. In fact 
the theoretical expression for the vertica 
range of each separate curve can be show 
(Anderson, 1971, Eq. 8) to be cw/(w + wo) 
where w and wo are the weights of the state 
ment information and the initial impression, 
respectively, and c is a constant equal to the 
difference between the scale values of the 
extreme and mild statements. Thus, the 
vertical range of each single curve should 
be an increasing function of w, the weigh‘ 
parameter for that condition. 

Theoretically, therefore, each panel oi 
curves in Figure 2 should diverge as state- 
ment value becomes more extreme. Although 
the expected amount of divergence is rather’ 
small, there is no sign of it in the data. 
Indeed, the right panel actually showed 
significant convergence. 

The main locus of this discrepancy appears 
to lie in a single condition, Curve C, in the 
right panel of Figure 2. The opinion attrib- 
uted to a person who wrote two mild mili- 
taristic essays when under instructions to 
write militaristic essays actually seems to be 
slightly pacifistic, 11.32 on a 0-20 scale. A 
more pertinent comparison is provided by the 
data of Experiment 1. Mean response to the, 
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pair of mild militaristic statements was 9.46 
and 11,54 in the group and militaristic-essays 
conditions, respectively. Similarly, mean re- 
sponse to the pair of moderate militaristic 
statements was 5.83 and 8.60 in the two 
respective conditions. Both differences are sig- 
nificant. Thus, the inferred opinion is more 
pacifistic than would be warranted by the 
statement information per se. 

It is premature to make much of this re- 
sult, since it was obtained incidentally in 
auxiliary data. There is no sign of a similar 
effect in the main data. However, the main 
data are based on heteropolar sets that con- 
tained both pacifistic and militaristic state- 
ments; homopolar sets might have different 
effects. Because of this, and because the re- 
sult does deviate from theoretical expectation, 
a brief discussion seems in order. 

To the subject, this particular condition 
may well seem ambiguous. Since the stimulus 
information consists of a pair of militaristic 
statements produced under instructions to 
write militaristic statements, the subject can 
reasonably suppose that they should not be 
taken at face value. An extreme statement 
could well reflect true opinion, but a mild 
statement might well reflect forced compli- 
ance. If the subject were sensitive to this, it 
would tend to produce the given result. 

This result is similar to a “contrast” effect 
obtained by Jones et al. (1971) and reinter- 
preted by Lopes (1972) in terms of an 
expectancy-discrepancy model (see also An- 
derson, 1974b, Section III.D.6). Applied to 
the present experiment, it would be assumed 
that the instructions in the militaristic-essays 
condition set up an expectation for somewhat 
extreme militaristic essays. The mild militar- 
istic essays would be discrepant from that 
expectancy, and the discrepancy term would 
act to produce a slightly pacifistic attribution. 

Lopes’s (1972) model has special theoreti- 
cal interest, since it provides a theoretical 
explanation of the “contrast” effect. It does, 
however, assume interactive or configural in- 
tegration and so is more difficult to. treat 
quantitatively. 

The present discussion, as already noted, 
may be premature without more solid evi- 
dence for the effect in question. At face value, 
however, the effect is quite interesting and 
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has potential theoretical importance. Further 
work with more emphasis on homopolar sets 
would thus seem desirable. 


Other Results 


A few minor results remain to be noted. 
The analysis of Experiment 2 showed no 
major effects of the order in which the three 
conditions were run. Also, the analyses be- 
tween the two experiments showed only two 
minor differences, both in the militaristic- 
essays condition. The overall mean and the 
response to the triplets containing the ex- 
tremely pacifistic statement were both higher 
in Experiment 1. These differences do not 
affect any of the above conclusions, and so the 
pooled data have been presented. 

One implication of these analyses deserves 
mention. The group condition was run first in 
Experiment 1, and the explicit instruction to 
judge the average opinion could itself have 
set up an averaging tendency in the subse- 
quent conditions. However, this possibility 
can be ruled out in view of the general 
agreement between the common conditions of 
Experiments 1 and 2. 

Within each separate experiment, statistical 
tests were made in terms of the complete 
design. In tests based on data pooled over 
the two experiments, 4 subjects were ran- 
domly eliminated from Experiment 2 in order 
to equalize n. However, the linear trend tests 
included the data from all 56 subjects. All 
such trend tests employed the standard or- 
thogonal polynomial coefficients. This proce- 
dure is statistically valid, though it can suffer 
loss of power when ostensible nonlinearity is 
present. In the present data, this loss of 
power would be small and in any case irrele- 
vant, since the results were significant with 
the given test. 

The normative scales from which the state- 
ments were taken did not prove completely 
satisfactory despite the additional preliminary 
screening employed here. The difficulty, as 
can be seen in Figure 1, is that the mild and 
moderate pacifistic statements turned out to 
have essentially equivalent effects. This 
reduces somewhat the efficiency of the 
experiment but fortunately has no serious 
consequence. 


Discussion 
Integration Theory and Social Attribution 


The present data provide substantial sup- 
port for the integration-theoretical approach 
to opinion attribution. For the main data, the 
averaging model did quite well, both in quali- 
tative aspects and in the exact test of fit. One 
discrepancy from the model was obtained in 
the auxiliary data, and it deserves study. As 
an initial investigation, however, the present 
approach shows considerable promise. 

Attribution theory. The problem of inte- 
grating stimulus information has been rela- 
tively neglected in previous treatments of 
social attribution. The work by Jones and his 
associates (Jones & Davis, 1965; Jones & 
Harris, 1967; Jones et al., 1971) has been 
mainly concerned with a single set of state- 
ments made on a single occasion, and has 
used experimental designs such that the inte- 
gration operation is not of explicit concern. 
However, opinion inferences are ordinarily 
based on multiple sets of information ob- 
tained over a variety of occasions. A political 
figure, to take a prime example, makes hun- 
dreds of speeches and actions. Inferring his 
true opinion on any issue requires integration 
of diverse information. 

Kelley’s (1972) causal schemata explicitly 
allow for multiple information. However, 
Kelley’s schemata remain at the level of 
actuarial analysis of data tables and do not 
become a quantitative model. Similarly, his 
use of analysis of variance remains analogical 
rather than becoming an analytical tool. 

It is this emphasis on the underlying pro- 
cesses of information integration that distin- 
guishes the present from previous approaches 
to social attribution. Reviewers have objected 
to this article with the argument that the 
effects of the instructions on the weight pa- 
rameter are trivial. However, that objection 
applies only to the one prediction about ele- 
vation in Figure 1. It does not apply to the 
parallelism prediction which requires an exact 
model. Neither does it apply to the prediction 
about vertical spread which distinguishes be- 
tween the adding and averaging models. 
These results stem from the integration-theo- 
retical approach, and they set the stage for 
further quantitative analysis. In a general 
and fundamental way, therefore, information 
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integration theory provides a significant ex- 
tension of previous work on attributi of 
opinion. 

Weights and scale values. The central theo- 
retical role of the weight parameter deserves 


specific comment. It ties together data ; 
terns across several experimental cond: 
Also, it provides a unifying link bety 
information integration theory and Jones’: 
attribution theory which provided a rationale 
for the hypotheses about the way in 
which the weights depended on instruction 
conditions. 

It has been suggested that the main results 
might be interpreted to mean that the in- 
struction conditions affect the scale values 
rather than the weights. In order to account 
for the lesser vertical spread in the militar- 
istic-essays than in the own-feelings condi- 
tion of Figure 1, that interpretation would 
require a corresponding lessening of the dif- 
ferences between the scale values of the mili- 
taristic statements. No theoretical justifica- 
tion for this kind of change has been offered, 
whereas the theoretical justification for the 
weighting interpretation seems clear. Further, 
the scale value interpretation is contrary to 
the greater vertical spread of Curve C than 
of Curve B in the right panel of Figure 2, 
though this latter result also causes difficulty 
for the averaging model as noted above. 

A related issue has arisen in person per- 
ception in which extensive experimental study 
has been devoted to the question of whether 
personality-trait adjectives change meaning as 
a function of what other adjectives they are 
combined with. Although a definitive answer 
is elusive, there is little support for the 
change-of-meaning interpretation and substan- 
tial evidence against it (Anderson, 1974b, 
Section III.F). In the present task, the issue 
should be more readily resolvable, since t} 
effect of the instructions is in question rather 
than the effect of the other statements in the 
combination. 

The averaging hypotheses. The ability of 
the averaging model to account for the pres- 
ent data is perhaps not surprising in view of 
the considerable body of previous work on 
cognitive algebra. Nevertheless, this question 
has not been studied in opinion attribution, 
and the present results provide a useful exten- 
sion of previous work. By the same token, 


1e 
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„they suggest that other results from previous 
“ work on integration theory will also generalize 
to the area of opinion attribution. 

A special advantage of the averaging model 
is that it provides a conceptual basis for dis- 
tinguishing between the concepts of weight 
and scale value. An adding model does not in 
general allow this distinction except in an arbi- 
trary way, as in standard regression analysis. 

Further, functional measurement methodology 
ypens the way to measuring the weight and 
scale value parameters. It should be empha- 
sized, however, that functional scaling is not 
a routine computational procedure. It rests 
on establishing a substantive integration 
model. Furthermore, there are various tech- 
nical problems of parameter uniqueness and 
of experimental design that can be trouble- 
some in scaling weight (see Anderson, 1974a, 
1974b, 1974c; Leon, Oden, & Anderson, 
1973). 


Cognitive Consistency Theory 


Evidence on the consistency principle. An 
important, though perhaps surprising aspect 
of the main data is the lack of apparent stim- 
ulus interaction within any given instruction 
condition. In some cells of the design, the 
Statements were quite inconsistent. According 
to the various theories of cognitive consist- 
ency (see Abelson, Aronson, McGuire, New- 
comb, Rosenberg, & Tannenbaum, 1968), 
discounting or other interactions should have 
been expected in these cells. Indeed, the 
present design was patterned after that of 
Anderson and Jacobson (1965) in order to 
facilitate the detection of inconsistency inter- 
actions. Since such interactions would cause 
the parallelism prediction to fail, its success 
js contrary to the assumption of a general 
consistency principle. And, as already noted, 
the similarity of the group and own- 


vad feelings conditions provides further evidence 


against the adequacy of a general consistency 
principle. 

Previous evidence also indicates that people 
may be relatively unaffected by inconsistency 
in the stimulus information. Extensive work 
on the primacy effect in person perception 
has generally failed to support the cognitive 
consistency interpretations (see Anderson, 
1974b, Section III.F). With simultaneous 
‘presentation, Anderson and Jacobson (1965) 
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did obtain a discounting effect, but it was 
relatively small. Himmelfarb and Senn (1969) 
obtained no discounting in an analogous ex- 
periment on judgments of social class despite 
very strong apparent inconsistency in the 
stimulus information. 

The present own-feelings condition is very 
similar to the experiment of Weiss (1963), 
who used capital punishment rather than 
militarism as an opinion issue. Weiss obtained 
a significant interaction and argued for the 
presence of configural or patterning effects. 
Unfortunately for Weiss’s interpretation, sta- 
tistical interaction does not necessarily imply 
psychological interaction (Anderson, 1972). 
Graphical study of Weiss’s data fails to show 
any clear evidence for discounting, but in- 
stead suggests that the statistical interaction 
resulted from end-effect bias in the rating 
scale. 

Lack of stimulus interaction may relate to 
the multidimensional nature of the stimulus 
information. The statements dealt with a wide 
variety of beliefs about the need for arma- 
ments and military preparedness, the condi- 
tions when war is justified, the various conse- 
quences of war, etc. Two statements could 
be at the opposite ends of the attitude scale 
without being logically contradictory. That 
kind of disagreement among a person’s state- 
ments or actions may be tolerable without 
provoking any mechanism for resolving 
inconsistency. 

It has previously been argued that a con- 
sistency principle is an inadequate base for a 
general theory of social judgment (Anderson, 
1971, 1974b). Since it only applies when in- 
consistency is present and operative, it is too 
narrow a base for a general theory. Most 
consistency theories, moreover, have remained 
undeveloped experimentally. In contrast, a 
principle of information integration can pro- 
vide an adequate conceptual foundation, to- 
gether with an articulated superstructure of 
experimental analysis. The present results 
provide further support for this position. 


Implicit Personality Theory 


The above discussion suggests that naive 
judges are realistic in their view of human 
nature. They do not expect personalities of 
uniform consistency. Indeed, they may accept 
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different statements from a given person that 
would be markedly inconsistent according to 
standard attitude scales. Mischel (1968) has 
emphasized the low correlations between be- 
havior and measures of generalized personal- 
ity traits. Although these correlations may be 
perplexing to traditional personality theory, 
they may be entirely in accord with the 
outlook of the man in the street. 

In the study of implicit personality theory, 
Rosenberg has stressed the use of trait co- 
occurrence instead of trait similarity (see 
Rosenberg & Sedlak, 1972). He has argued 
that measures of trait similarity may be 
unduly influenced by purely semantic factors 
that may be avoidable with measures of trait 
co-occurrence. The present results support 
Rosenberg’s concern about the operation of 
merely semantic factors. However, something 
of this same criticism can be made of his trait 
co-occurrence measures as they are obtained in 
practice. There would seem to be considerable 
difference between what people will accept as 
compatible given information and what they 
will verbalize as likely to go together. The 
former seems more appropriate in many 
respects for implicit personality theory. The 
present results thus agree with Rosenberg’s 
emphasis on the need for careful consideration 
of the judgmental data base in studying 
implicit personality theory. 
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Based on a theoretical model developed by Fishbein, it was predicted that an 
individual’s intention to engage in a specific family planning behavior is a 
function of two components: (a) one’s beliefs about the consequences of per- 
forming that behavior multiplied by the evaluation of those consequences 
(2Bia,) and (b) one’s normative beliefs multiplied by one’s motivation to 
comply with those perceived norms (=NB[Mc]). A questionnaire assessing 
the model’s components for three family planning behaviors—having a two- 
child family, having a child during the next 2 years, and using birth control 
pills—was administered to a random sample of 270 married women in Urbana- 
Champaign, Illinois, stratified by religion and socioeconomic status. Consistent 
with the model, =Bia; and SNB(Mc) explained an average of 60% of the 
variance in intentions. It was also found that as the measure of attitude 
deviates in specificity from the measure of intention, the attitude-intention 
correlation decreases. Accordingly, the more specific attitude-toward-the-act 
measure was a better predictor of family planning intentions than the tradi- 
tional attitude-toward-the-object measure, 


The results of several studies reported over 
the past two decades have led to marked 
skepticism among demographers and family 
planners regarding the utility of psychological 
variables in fertility research. Two major fer- 
tility studies, the Indianapolis Study (Whelp- 
ton & Kiser, 1946-1958) and the Princeton 
Study (Westoff, Potter, & Sagi, 1963; West- 
off, Potter, Sagi, & Mishler, 1961) failed to 
find any noteworthy relationships between 
psychological variables and aspects of family 
planning and fertility. The Indianapolis Study 
tested some 23 hypotheses relating social and 
psychological variables to practice of contra- 
ception and planned family size among cou- 
ples; the study revealed numerous significant 
relationships with respect to the social varia- 
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bles considered but almost none for the psy- 
chological variables. Subsequent critiques of 
the study suggested a number of possibilities 
for methodological improvement (see Kiser, 
1967), and it was with these in mind that the 
Princeton Study was designed. 

The Princeton Study was derived, in part 
from the findings of the Indianapolis Study 
and was planned to parallel the more demo- 
graphically oriented Growth of American 
Families Study (Whelpton, Campbell, & Pat- 
terson, 1966). Its aim, like the Indianapolis 
Study, was to isolate the social and psycho- 
logical determinants of human fertility, and 
to do so employed a longitudinal design in- 
volving three separate stages of interviewing. 
Results from the first stage revealed extremely 
low correlations between all psychological and 
fertility variables; in the next two stages the 
findings were too insignificant to warrant re- 
porting. 

The failure to explain variations in fertility 
and family planning behaviors in the United 
States from psychological variables is discour- 
aging, since it is in this country that these 
types of variables should have the greatest 
influence. Numerous investigators have noted 
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that contraception is used almost universally 
by fecund couples in the United States 
(Campbell, 1962; Freedman, 1962; Westoff 
et al., 1963) and that as couples approach the 
number of children desired in their family, 
effectiveness of contraceptive use increases. 
Freedman (1962) has concluded that “the 
patterns of family planning followed are suc- 
cessful in enabling most couples to have the 
number of children they want,” (p. 219) and 
that “the plans and expectations of married 
couples will guide their behavior” (p. 217). 
It seems clear that these “plans and expecta- 
tions” should be related to psychological var- 
iables, and as such, the failure to find rela- 
tions between family size intentions and social 
psychological variables is somewhat surprising. 

One problem with past research in popula- 
tion psychology has been the lack of theory 
guiding research efforts. Most of the large- 
scale field studies have not been based on any 
underlying or organizing theory but rather 
have been restricted to the testing of a large 
number of “interesting” hypotheses. Indeed, 
as Kiser (1962) has pointed out, although the 
members of the Steering Committee of the 
Indianapolis Study “made a special effort . . . 
to formulate an ‘organizing principle’ or ‘body 
of theory’” to guide their work, they found 
this task “rather frustrating,” and this effort 
“was finally abandoned” (p. 163-164). 

Recently, Fishbein (1967, 1972) has pro- 
posed a theory of social behavior which may 
prove fruitful in the analysis of family plan- 
ning and fertility related behaviors. Articles 
by Fishbein (1972) and Fishbein and Jaccard 
(1973) have discussed the application of this 
analysis to population psychology in detail; 
the present article attempts to test some of 
these applications. 

According to the theory an individual’s be- 
havior (B) is assumed to be a function of his 
intention to perform that behavior, that is, 
his behavioral intention (BI), which, in turn, 
is a function of two factors: (a) his beliefs 
about the consequences of performing that be- 
havior and the evaluation of those conse- 
quences and (b) his beliefs about what others 
think he should do, that is, his normative 
beliefs, and his motivation to comply with 
those others. Specifically, the theory is ex- 
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pressed by the following formula: 


n m 
B~BI=[ 2 ByaiJWit+ [> NB:;Mc&]W, (1) 
i=1 i=l 

where B = overt behavior, B= the behav- 
ioral intention to perform that behavior 

= the belief (perceived probability) that per 
forming the behavior will lead to some conse- 
quences X;, a; = the evaluation of X;, NB; 
= the perceived expectations of referent i, 
Mc; = the motivation to comply with referent 
i, and W, and Wz = empirically determined 
regression weights. 

The first component of the model (¥B;a;) 
is actually a measure of attitude. The formu- 
lation is explicitly drawn from Fishbein’s 
(1963) theory of attitude formation in which 
an individual’s attitude toward some object 
is viewed as a function of his beliefs about 
that object and the evaluations of those be- 
liefs. However, it must be emphasized that 
the first component in Equation 1 deals with 
an individual’s attitude toward the behavior 
per se and not an attitude toward the object 
of that behavior. Equation 1 may be alterna- 
tively expressed as the following: 


B ~ BI = [Aact]Wi+ [2 NBiMc,]W., (2) 
i=l 

where Aact = the attitude toward the behav- 

ior in question and the remaining symbols are 

as previously defined. 

A number of studies have provided empir- 
ical support for the model proposed in Equa- 
tions 1 and 2 (for a review of the literature, 
see Ajzen & Fishbein, 1973). However, most 
of these studies have tested the model exclu- 
sively in laboratory settings with a college 
population. One of the major purposes of the 
present research was to test the generalizabil- 
ity of the Fishbein formulation to (a) a wide 
variety of respondents and (b) a new sample 
of behaviors, that is, family planning behav- 
iors. In this context, two basic hypotheses 
were tested in a field survey design: (a) A 
respondent’s intention to perform a given be- 
havior can be predicted from a linear com- 
bination of his attitude toward performing 
that behavior and his normative beliefs 
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weighted by his motivation to comply with 
those norms. (b) An individual’s attitude to- 
ward performing a given behavior is a func- 
tion of his beliefs about the consequences of 
performing that behavior and the evaluation 
of those consequences. 

Perhaps the most distinctive feature of the 
Fishbein (1967, 1972) formulation is that it 
directs attention away from attitudes toward 
objects, the variable most frequently used in 
studies of the attitude-behavior relation, and 
focuses on attitudes toward acts. To date, the 
evidence suggests that the more specific atti- 
tudes-toward-act measure is a better predictor 
of intentions and subsequent behavior than 
attitudes toward object (e.g., Ajzen & Fish- 
bein, 1973; Schwartz & Tessler, 1972). In a 
recent extension of this analysis, Fishbein and 
Jaccard (1973) have noted, that a behavioral 
intention can be viewed as being composed of 
four elements: an action, the object the action 
is directed toward, the situation in which the 
action is to be performed, and the time at 
which the action occurs. Each of these ele- 
ments may vary on a dimension of specificity. 
The maximum attitude-intention correlation 
can be expected when the attitude and inten- 
tion measures are co-ordinate with respect to 
specificity. For example, in order to predict a 
woman’s intention to have a child during the 
next two years, one must assess her attitude 
toward having a child in the next two years 
and: not her attitude toward having children 
or children per se. Therefore, following Fish- 
bein and Jaccard, it is hypothesized that 
as the measure of attitude deviates in specific- 
ity from the measure of intention, the ob- 
tained correlations are expected to decrease. 


METHOD 
Survey Sample 


The respondents consisted of a stratified random 
sample of 270 women in Urbana-Champaign, Illinois. 
To be included in the sample, women had to meet 
the following criteria: (a) currently married, (b) 
not legally separated from husband, (c) between the 
ages of 18 and 38, (d) Caucasian, and (e) not cur- 
rently married to a university student. These women 
were stratified in terms of three levels of socioeco- 
nomic status (SES) and two levels of religious affil- 
iation (Catholic and Protestant). Religion and SES 
were selected as stratification variables because a 
substantial proportion of previous research has docu- 
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mented the effect of these variables on family plan- 
ning intentions and behaviors. 


Sampling 


The sampling procedure was designed to select 
randomly 45 women, each meeting the five above- 
mentioned criteria, for each cell of the 2 X 3 design. 
A set of names was randomly selected from the city 
directory, which lists every residence in the Urbana- 
Champaign cities. For each residence, the directory 
indicates occupation for head of household and, if 
married, the spouse’s name and occupation. On the 
basis of this information, some preliminary decision 
could be made as to the family’s SES. This decision 
was made by checking the socioeconomic index score 
(Duncan, 1961) of the husband’s occupation. Re- 
spondents were contacted via telephone by trained 
female interviewers. Calls were made between 9:30 
a.m. and 8:30 p.m., with up to four “callbacks” for 
those who could not be reached the first time. Using 
telephones to contact our respondents seemed justi- 
fied, since only a very small percentage of households 
in the Urbana-Champaign area do not have tele- 
phones, 

During the course of a brief phone interview, the 
interviewer obtained the necessary information to 
determine (a) if the woman met the previously men- 
tioned sampling ¢riteria, (b) the cell of the design of. 
which she was a member, (c) if she was willing to 
fill out the final questionnaire, and if yes, (d) to set 
a date for the interview, From additional informa- 
tion obtained in the phone interview (e.g. attitude 
toward family planning) and from the demographic 
data we already had for each potential respondent, 
it was possible to ascertain if some subset of the 
population selectively eliminated itself from the sam- 
ple by refusing to participate in either the home or 
phone interview. 


Interviews 


Approximately 1 hour was required for the re- 
spondent to complete the questionnaire. During the 
first stage of the interview, the interviewer presented 
an oral introduction to the study and a standardized 
set of instructions concerning the questionnaire. The 
respondent then filled out a brief practice section, 
This section contained examples of each type of scale, 
and the interviewer could thus detect any misunder- 
standing before actual data collection, Also, the prac- 
tice session assured at least a minimal familiarity 
with the scales for all respondents and tended to 
eliminate any “warm-up” effects from the data, The 
interview was self-administered. Hopefully, this min- 
imized any effect of “interviewer” on the subject's 
responses. The interviewer did, however, remain in 
the house while the respondent was completing the 
questionnaire to answer any questions the respondent 
might have. Each respondent received $10 for com- 
pleting the questionnaire. 


Measurement Procedures 


* The majority of data were obtained in responses 
to a closed-format questionnaire. However, open- 
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ended questionnaires were used for eliciting the in- 
formation necessary for the construction of these 
final questionnaires. The first stage of the research 
consisted of elicitation interviews with 55 women 
randomly selected from the same stratified popula- 
tion as the women in the final sample. By perform- 
ing a content analysis of these questionnaires, a set 
of 10-12 most commonly mentioned consequences of 
engaging in various behaviors was obtained for use 
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in the final questionnaire. Similarly, a list of 5-1 
most frequently mentioned referents was obtained. 

The measurement procedures for the closed-format 
questionnaire have been developed and refined 
through previous research (e.g., Ajzen & Fishbein, 
1969, 1970; Fishbein, 1963; Triandis, 1964). The 
technique is largely based on the semantic differen- 
tial approach. Examples of the types of measure 
employed are as follows: 


. Behavioral Intention: I intend to have two and only two children i in my completed family 


unlikely. ; likely 
2. Beliefs about the Act: Having a two- child family i is selfish 
unlikely. : likely 
3. Evaluation of the consequence: Being selfish i is 
good : bad 
4. Aact: Having two and only two children i in my completed family would be 
good. 2 3 5 m bad 
awful. : : nice 
pleasant i unpleasant 
5. Normative Beliefs: My parents think i should have a two child family 
unlikely. likely 


6. Motivation to Comply: Generally speaking, 
I want to do 
what my 
parents think 


I do not want 

to do what my 

parents think 
: —I should do 


I should do 


All measures of the theoretical constructs were 
scored on a +3 to —3 scale. Bia, was obtained by 
multiplying the score on each belief statement by the 
score on the corresponding evaluation of that belief 
and then summing these products for all beliefs. 
2NB(Mc) was obtained by multiplying the score on 
each normative belief by the score on the corre- 
sponding motivation to comply and summing these 
products for all normative beliefs. A measure of 
Aact was obtained by summing the three evaluative 
scales. The order of presentation of the measures of 
the components was randomized. A number of items 
were presented twice in the questionnaire to obtain 
reliability checks. In addition to the above measures, 
the final questionnaire also contained measures of 
various demographic variables and other attitudes 
toward fertility-related behaviors and concepts. 
Three different behavioral intentions were studied: 
(a) the intention to have a child in the next 2 years, 
(b) the intention to have a two-child family, and 
(c) the intention to use birth control pills. In addi- 
tion, two other measures of BIs were also obtained, 
although the corresponding components of the model 
were not: (a) the intention to use birth control pills 
in the next 2 years and (b) the intention to support 
legislation to legalize abortion. 


RESULTS 


Methodological Checks 


Checks for biases in sampling. Careful rec- 
ords were kept to determine if some subset of 
the population selectively eliminated itself 
from the sample by refusing to participate in 


either the home or phone interview. Eighty- 

five percent of the potential respondents that * 
were contacted accepted the phone interview. 

It will be recalled that prior to the phone 

interview, an SES rating for each potential 

respondent was obtained. No significant effect 

of SES on refusal rate was detected by a 

chi-square analysis. 

For each respondent accepting the phone 
interview, data were obtained regarding her 
religious affiliation and her attitudes and opin- 
ions toward family planning (as measured by 
three questions). Among those women meet- 
ing the sampling criteria, 79% accepted the 
home interview. No significant effect of re- 
ligion, attitude toward family planning, or 
SES on refusal rate was detected by chi- 
square analyses. Thus, it appears that the 
sampling procedures did not bias the com- 
position of the final sample. 

Checks for interviewer bias effects. To test 
for the effect of the interviewer on the sub- 
jects’ responses, a one-way multivariate anal- 
ysis of variance (i.e., one interviewer factor 
with 11 levels corresponding to the 11 inter- 
viewers) was performed on the five behavioral 
intention variables. The intention measures 
were selected for this analysis because it was 
felt that they would be most susceptible to 
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TABLE 1 
ls CORRELATIONS, REGRESSION COEFFICIENTS, AND MULTIPLE CORRELATIONS OF ĐB;a; AND ENB (Mc) on BI 
Correlation coefficients | Regression coefficients 
R for BI 
Subjects Behavior Rfor BI | corrected for 
2Biai —BI| ZNBMO | spa, — Br | 2NB(Mo) einige 
NTY -740 -685 517 -386 804 801 
All subjects* TOE -631 629 A05 400 112. 707 
BCP 695 -141 AOL 510 810 807 
| NTY 744 611 „585 290, .782 775 
Catholic subjects” TCF 675 -603 -490 319 723 713 
BCP 701 691 453 430 784 JIT 
A NTY 734 -760 A38 -500 .837 .832 
Protestant subjects? TCF. 560. | .639 Stl -477 -692 -681 
BCP 091 .786 +352 581 837 832 
; NTY A33. -105 -489 423 811 801 
Low-SES subjects® TCE 577 714 247 S71 -142 728 
BCP -650 714 373 513 .780, 768 
NTY 769 -127 .519 -410 835 -827 
Middle-SES subjects® TCE 634 .591 440 340, -693 677 
= BCP 137 749 Add 475 827 818 
NTY 752 -638 .576 .309 «194 «783 
High-SES subjects” TCF 737 598 584 -279 473 760, 
BCP -681 771 355 506 823 814 


“ = S R 
Note, All zero-order correlations, regression coefficients, and 


multiple correlations are significant (p < .01), Abbreviations are 


as follows: NTY = having a child in the next 2 years; TCF = having two (and only two) children in the completed family; 


BCP = using birth contr 


pills. BI = behavioral intentio: 
havior multiplied by the 


uation of those consequenc 


Biai = one's beliefs about the consequences of performing a be- 
B(MC) = one's normative beliefs multiplied by one’s motivation 


to comply with those perceived norms. SES denotes socioeconomic status, 


xN = 270, 
ON = 135. 
"N = 90, 


interviewer influence. The multivariate inter- 
viewer effect was nonsignificant (F = 1.289, 
ns), and no significant univariate effects ap- 
peared, 

Validity and reliability of measures. The 
reliability and validity of the measures em- 
ployed in the present research were deter- 
mingg using the multimethod—multitrait par- 
adigm (Campbell & Fiske, 1959). The mea- 
sures demonstrated high reliability and both 
convergent and discriminant validity (see 
Davidson, 1974, for these data). 


Tests of Hypotheses 


The major test of the Fishbein (1967, 
1972) model concerns the degree to which the 
attitudinal and normative components predict 
BI. As seen in Table 1, the model provides 
highly accurate predictions of family planning 
intentions. For the total sample the multiple 
correlations ranged from a low of .707 to a 
high of .807 (all ps < .01). Similarly, the 
Separate regression equations computed for 
Catholics, Protestants and high-, middle- and 


low-SES groups yielded multiple correlations 
that ranged from .677 to .832 (all ps < .01), 
with a mean of .778. 

The multiple correlations regressing SBya; 
and SNB(Mc) on BI were also computed for 
each behavior within each of the six cells of 
the sampling design. These correlations ranged 
from .639 to .862 (all ps < .01), with a mean 
of .796, To determine if the model was dif- 
ferentially predictive of BI for different sub- 
sets of the population, 2 X 3 chi-squares anal- 
yses, with two levels of religious affiliation 
(Catholic, Protestant) and three levels of 
SES (high, middle, low)' (see Jones, 1968, 
for an elaboration of this statistical tech- 


1 SES was based on indices by Duncan (1961) and 
Hollingshead (1965). The two measures were highly 
intercorrelated, r= .910 (p < .01). The mean scores 
on the Hollingshead measure for high-, middle- and 
low-SES respondents were 13.48, 34.10, and 55.07, 
respectively (on a scale that ranged from 11 to 77). 
These mean scores indicate that our definition of 
high, middle, and low SES is quite comparable to 
that used in other recent research (see, e.g., Kohn, 
1969), 
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nique, 1968) were performed on the multiple 
correlations for each BI. These analyses re- 
vealed no significant main effects for religion 
or SES and no significant interactions. It can 
be concluded, therefore, that the model is 
highly predictive of family planning inten- 
tions and that these correlations are equally 
high in each of the major subsamples included 
in the present study. 

Table 1 also presents the multiple correla- 
tions on BI corrected for shrinkage. That is, 
the estimated population validity of the sam- 
ple regression equation.” These corrected mul- 
tiple correlations represent only a mild reduc- 
tion in the obtained multiple correlations. 
The greatest reduction in the magnitude of 
a correlation was .016. 

To provide an additional validation of the 
regression equations, “double cross-valida- 
tion” (cf. Wiggins, 1973) was performed for 
each intention for the total sample and each 
of the five subsamples included in Table 1. 
As expected, this procedure yielded results 
quite comparable to those obtained from the 
correction for shrinkage formula. The double- 
cross-validation validity estimates were, how- 
ever, slightly higher than the correction for 
shrinkage estimates in 17 out of 18 cases. On 
the basis of the results from these two meth- 
ods for estimating the true validity of a re- 
gression equation, it is safe to assume that 
the obtained equations could be applied to 
the population with very minimal reduction 
in predictive utility. 

The data in Table 1 also suggest that both 
of the components of the Fishbein model are 
necessary for the prediction of family plan- 
ning intentions. For each sample and each 
behavior the correlation coefficients of the 
model’s two components with behavioral in- 
tention were all significant (p < .01). Sim- 
ilarly, the standardized regression coefficients 
of both components with BI were all signifi- 
cant (p < .01). This indicates that although 
XB;a; and SNB(Mc) are significantly inter- 
correlated (using birth control pills, 7 = .58, 
p< .01; having a child during the next two 


2The formula used for estimating shrinkage is 
based on Darlington’s (1968) modification of the 
Lord and Nicholson formula. The only statistical 
assumption made is that the scores on all variables 
form a multlivariate normal distribution. 
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years, r = .58, p < .01; having ‘idren 
in the completed family, r = 01); 
both components contribute indepe: var- 
iance to the prediction of BI. 

Across samples, the two compon ake 
on about equal weighting in the pred of 
BI. Examination of the correlations & re- 
gression coefficients did not provide any « var 


indications that the relative weights of he 
model’s predictors depend on the type. of 
person responding. 

Inspection of Table 1 did reveal, however, 
that the magnitude of the multiple correla- 
tion varied appreciably with the behavior in 
question.. For the total sample and each of 
the five subsamples, the multiple correlation 
for the intention to have two (and only two) 
children in the completed family was lowe: 
than the multiple correlations for the twc 
other behaviors. Scrutiny of the raw data 
indicated that the relatively lower R for this 
intention was attributable to one specific 
group of women. This group already had 
more- than two children in their families and 
at the same time had a very positive attitude 
toward having two (and only two) children 
in their completed families. In a sense, these 
women were saying that if they could do it 
over again, they would have a two-child 
family but that it was now too late and they 
intended to have three or four children. 

Using two measures included in the ques- 
tionnaire, it was possible to identify empiri- 
cally these “unpredictable” women and ther 
to use these data to modify the regressic 
equation and improve predictability. Spec’ - 


cally, each woman was asked what :ne 
thought the ideal number of children would 
be for someone like herself and how many 
children she presently had in her fam. If 
the woman’s actual family size exceeds: her 


ideal family size, she was given a scoie of 1 
on the moderator variable; if the actual 
family size was equal to or less than the 
ideal family size, she was given a score of 0. 
The moderator variable could therefore be 
considered a measure of “regret.” 

If this present analysis of “unpredictabil- 
ity” is correct, including the Moderator X 
XB;a; interaction term in the regression equa- 
tion should increase the size of the multiple 
correlation. As anticipated, when the moder- 
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‘ator variable and the Moderator x XB;a; in- 

teraction term were added to the two com- 
ponents of the model, the multiple correlation 
for the total sample increased from .709 to 
-790, This represents an increase of 12 per- 
centage points in the amount of variance ac- 
counted for in the criterion variable. 

In summary, the data of the present study 
provide strong support for the first hypothe- 
sis, which predicted that a respondent’s be- 
havioral intention could be predicted from a 
linear combination of the two components 
of the Fishbein model. The data also indicate 
the usefulness of both components in the 
prediction of family planning intentions. In 
addition, the high multiple correlations within 
all subsamples of the present study provide 
evidence for the generalizability of the Fish- 
bein model to noncollege samples of hetero- 
geneous educational and socioeconomic levels. 

The second major hypothesis derived from 
the model is that an individual’s attitude 
toward performing a given behavior (Aact) 
should be highly correlated with his beliefs 
about the consequences of performing that 
behavior and the evaluations of these conse- 
quences (XB;a;). Consistent with Fishbein’s 
model and Hypothesis 2, a high correlation 
was observed between the two measures for 
each behavior. Specifically, the correlations 
between Aact and ¥B;a; were the following: 
806 (p < .01), for having a child during the 
next 2 years; .749 (p < .01), for using birth 
control pills; and .710 (p < .01), for having 
two (and only two) children in the completed 
family. 

‘The correlation between Aact and ¥B;a; 
was also computed for each behavior within 
each of the six cells of the sampling design. 
These correlations ranged from .617 to .845 
(all ps < .01), with a mean of .753. To de- 
termine if the magnitude of the correlation 
coefficient varied as a function of the type 
of respondent, a 2 X 3 analysis of the correla- 
tions was performed for each BI (see Jones, 
1968), Chi-square analyses revealed no sig- 
nificant main effects for religion or SES and 
no significant interactions. Thus in support 
of Hypothesis 2, there is a high correlation 
between SB,a; and Aact for each of the three 
family planning behaviors, and this correla- 
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TABLE 2 


CORRELATIONS BETWEEN SELECTED ATTITUDINAL 
VARIABLES AND BEHAVIORAL INTENTIONS 


Attitudinal variable r 


Using birth control pills 


Attitude toward using birth control pills SAL: ai 
Attitude toward birth control pills .497** 
Attitude toward birth control +202, 


Attitude toward family planning -109 
Having two children in the completed family 
Attitude toward having two children in 

the completed family 
Attitude toward having children A 
Attitude toward children .001 
Having a child in the next 2 years 
Attitude toward having a child in the 


next 2 years AORTE 
Attitude toward having children 229%% 
Attitude toward children —.038 


Note. N = 270. 
*p < 05, two-tailed test, 
** p <.01, two-tailed test. 


tion exists within each of the major subsam- 
ples included in the present study. 

The third hypothesis stated that as the 
measure of attitude deviates in specificity 
from the measure of intention, the obtained 
correlations are expected to decrease. The 
attitude-intention correlations presented in 
Table 2 are consistent with this hypothesis. 
The intention to have a child in the next 2 
years correlated: (a) —.038 (ns) with atti- 
tude toward children, (b) .225 (p< 01) 
with attitude toward having children, and (c) 
-798 (p < .01) with attitude toward having a 
child in the next 2 years. Following a similar 
pattern, the intention to use birth control pills 
correlated: (a) .109 (ns) with attitude to- 
ward family planning, (b) .202 (p< .01) 
with attitude toward birth control, (c) .497 
(p < .01) with attitude toward birth control 
pills, and (d) .778 (p < .01) with attitude 
toward using birth control pills. This pattern 
of correlation strongly supports Fishbein’s 
(1967) recommendation of replacing the 
traditional attitude toward the object with 
attitude toward the act if one is attempting 
to predict a specific behavior. 


Applied Analyses 


Differences in BI. In order to test for dif- 
ferences in behavioral intentions as a function 
of religion and SES, a 2 x 3 (Catholic vs. 
Protestant, high vs. middle vs. low SES) 
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multivariate analysis of variance was per- 
formed on the five behavioral intention mea- 
sures included in the questionnaire. Only a 
significant main effect for religion was ob- 
served (multivariate F = 5.92, p< .001). 
The univariate analysis of variance for re- 
ligion yielded significant differences on two 
of the intentions, the intention to have a 
two-child family (F = 6.85, p < .01), and 
the intention to support legislation to legalize 
abortion (F = 30.31, p < .001). Catholics 
were less likely to intend to have a two-child 
family (X = —.64 on a +3 to —3 scale) 
than were Protestants (XY = .50). Similarly, 
Catholics were less likely to intend to support 
legislation to legalize abortion (X = —1.60) 
than were Protestants (XY = .50). These data 
are in accord with previous demographic 
studies relating religion to fertility ideals 
(e.g., Westoff et al., 1961; Whelpton et al., 
1966). 

Differences in beliefs. As previously men- 
tioned, the ¥B;a; and SNB(Mc) components 
received significant regression weights in the 
prediction of each of the BIs studied. This 
suggests that an analysis of the beliefs should 
give additional insights into variations in in- 
tentions. Simultaneous confidence intervals 
comparing the difference between means on 
the belief, evaluation, normative belief, and 
motivation to comply variables for those who 
intend to perform the behavior and those who 
do not intend to do so were computed for each 
behavior. (For a discussion of simultaneous 
comparisons of multiple sets of means, see 
Morrison, 1967.) 

The beliefs that differentiate women who 
intend to have a two-child family from wo- 
men who do not intend to do so concern the 
effect on the children of growing up in a 
“small” two-child family and the costs of 
children to the parents in both time and 
money. That is, women intending to have two 
children thought it more probable than non- 
intenders that a two-child family would re- 
sult in (a) children having enough com- 
panionship, (b) a variety of interests and 
personalities in the family, (c) having the 
right number of children for the financial 
budget, and (d) having time for social in- 
terests. Concerning the behavior of using birth 
control pills, women intending to use the pill 
thought it less probable that the behavior 
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would lead to negative side effects (e.g., bond 
clots, weight gain) and more likely t d 
to positive side effects (e.g., increased i 
pleasure, regulation of the menstrual « 
than did women not intending to use t} 
For the behavior of having a child dur! 
next 2 years, intenders thought it more 
that this behavior would “fulfill my 
life” and “make my marriage < 
less likely to “be too much of a phy 
emotional strain” than did nonintenders. 
With regard to the normative component 
for each of the three behaviors, intenders 
thought it more likely that their husband, 
close friends, and parents wanted them to 
perform the behavior than nonintenders. It is 
interesting to note that the perceived expec- 
tations of “my religion” did not differentiate 
between the two groups for using birth con- 
trol pills. This finding reflects the results of 
the multivariate analysis of variance analysis 
(reported previously) which did not detect 
a religious difference on this intention. 


Discussion 


The hypotheses derived from the Fishbein 
model of BI were supported by the data. 
For the total sample, the model provided 
highly accurate prediction of family planning 
intentions. The high multiple correlations of 
the model’s components with BI observed 
within each of the major subsamples indicate 
that for the prediction of family planning 
intentions, the model is probably generaliz- 
able to the majority of the American popula- 
tion, The predictability of the model was 
unaffected by such factors as SES, level of 
education, or religion. In addition, as pre- 
dicted by Fishbein’s model, there was evi- 
dence that an individual’s attitude toward 
performing each of the three behaviors was 
highly correlated with the beliefs about the 
consequences of performing the behavior and 
the evaluations of these consequences. This 
correlation between SBj,a; and Aact was also 
unaffected by SES, education, and religion. 

The present study also provides support 
for an important feature of the model of BI, 
namely, the focus on specific measures (e.g., 
attitudes toward Aact) as opposed to more 
general and traditional variables such as atti- 
tudes toward objects. The superiority of 
specific attitude measures in predicting BI 
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5 was clearly demonstrated in the data. As the 


; 


attitudinal measures moved along a con- 
tinuum of specificity with regard to the in- 
tention measure (from attitude toward a 
class of objects, through attitude toward the 
respective object, to attitude toward the act 
in question) their correlations with BI in- 
creased significantly. 

It should be emphasized that the present 
investigation dealt with family planning in- 
tentions and not behavior per se. This should 
not detract from the importance of its find- 
ings. Two major problems may be identified 
for those interested in understanding (family 
planning) behaviors: (a) the identification of 
those variables that determine intention to 
engage in various (family planning) behav- 
iors and (b) the degree to which these in- 
tentions are predictive of the overt behaviors 
in question. The present article has focused 
on the former problem, a problem which 
demographers and psychologists have studied 
for years. The latter problem is, of course, 
crucial and must be studied empirically (for 
a discussion of factors affecting predictability 
of B from BI and a review of relevant family 
planning research, see Fishbein & Jaccard, 
1973). In this respect, the authors plan to 
monitor the present sample in a panel design 
to determine, for example, the degree to which 
the intention to have a child within the next 
2 years does predict whether or not the re- 
spondent has a child during this time and the 
kinds of factors that may influence this rela- 
tion. 

One of the more interesting findings of the 
present study was the inability of the model 
to predict the intention to have a two-child 
family for a small group of women, namely, 
those women whose ideal family size was 
smaller than their actual family size. On the 
basis of their attitude toward having a two- 
child family, one would predict that these 
women intend to perform the behavior, but 
“on the basis of their prior behavior it was 
impossible for them to do so. This finding 
is important because it points out the useful- 
ness of measuring the components of the 
model prior to the performance of the be- 
havior rather than following the behavior. It 
is interesting because it suggests the necessity 
of a feedback loop for a dynamic theory of 
social behavior in which the performance of 
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the behavior “feeds back” to the beliefs 
about the consequences of performing that 
behavior which, in turn, may change the 
person’s attitude. 

From the perspective of population re- 
search, the present investigation has a num- 
ber of implications. With respect to religion 
and SES differentials in behavioral intentions, 
only Catholic—Protestant differences were ob- 
served for two intentions, the intention to 
have a two-child family and the intention to 
support legislation to legalize abortion. It is 
interesting to note, however, that no religious 
differences were observed for intentions to 
use birth contro] pills. Although this may 
seem somewhat surprising, the data are con- 
gruent with recent research. Over the past 15 
years, the proportion of Catholics conforming 
to the Catholic doctrine on birth control has 
been steadily declining. Westoff and Ryder 
(1970) reported that the percentage of fecund 
white Catholic women conforming was 70% 
in 1955, 62% in 1960, 47% in 1965, and 
only 36% by 1970. 

The fact that no socioeconomic differences 
in intentions was observed is also consistent 
with current trends in American fertility. As 
Freedman (1962) has pointed out: 


In the postwar studies, unlike the Indianapolis 
Study, correlations with socio-economic status mea- 
sures are non-existent or very modest, In fact, all of 
the traditional differentials with respect to education, 
occupation, income and rural-urban origin are des- 
tined to disappear or to contract sharply if current 
desires and expectations are realized in action 
(p. 220). 


The present study obtained both: direct 
(Aact) and indirect (=Bja;) measures of atti- 
tude toward performing the behavior. As 
noted above, these measures are highly inter- 
correlated. The SB,a; measure is more costly 
in terms of questionnaire construction and 
administration time. However, in an applied 
setting, the 5B;a; measure provides consider- 
able information over and above the direct 
Aact measure. If, as in the present research, 
the salient beliefs are elicited from the popu- 
lation of interest, then an analysis of the be- 
liefs not only indicates an individual’s atti- 
tude but also suggests why he holds that 
attitude. For example, analyses on beliefs 
about birth control pills indicated that both 
those who intend and those who do not in- 
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tend to use the pill believe equally strongly 
that the pill does not lead to morally un- 
desirable consequences. However, these groups 
do differ in the strength of their beliefs that 
using the pill leads to undesirable physical 
side effects. This type of information identifies 
those items that differentiate between in- 
tenders and nonintenders and suggests which 
beliefs one might seek to change in order to 
influence an individual’s family planning in- 
tentions. 

In conclusion, the present investigation in- 
dicates that fertility-related intentions may 
be understood in terms of the theoretical 
model of BI. Generally speaking, demogra- 
phers have been unable to identify the specific 
factors which underly important variables 
such as intended family size, etc. Attempts 
to predict such variables from a psychological 
perspective have for the most part been un- 
successful. In contrast, the present data indi- 
cate that family size intentions can be pre- 
dicted when approached from a strong 
theoretical framework. Further, as improve- 
ments and modifications of the theory are 
made, so should our understanding of family 
planning behaviors be improved. 


REFERENCE NOTE 


1. Hollingshead, A. B. Two factor index of social 
position, Unpublished manuscript, 1957. (Available 
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Observational Learning in Children: Effects of Sex 
of Model and Subject’s Sex Role Behavior 


David G, Perry and Louise C. Perry 
University of Queensland, Queensland, Australia 


The present article studied the relationship between children’s sex role be- 
havior and children’s acquisition of behaviors exhibited by same- and opposite- 
sex adult models, Elementary school children of both sexes were classified as 
either masculine or feminine on the basis of their scores on a projective test 
of sex role behavior. They then watched a short film depicting the behavior 
of an adult male and an adult female model under one of two conditions: free 
verbalization (subjects described aloud the models’ actions) and passive ob- 
servation. Children of both sexes classified as masculine recalled more of the 
behavior of the male model (p < .01). In addition, while watching the film, 
masculine children in the free-verbalization condition spontaneously verbalized 
more of the actions of the male than of the female model (p < .01). Parallel 
results were not found for subjects classified as feminine. Results were inter- 


preted in the light of some recent developments in social learning theory. 


Social learning theories of sex typing (e.g., 
Bandura & Walters, 1963; Mischel, 1966) 
usually assume that the development of ap- 
propriate sex role behavior in children de- 
pends on two complementary processes: (a) 
identification with a same-sex model and (b) 
reinforcement for performing imitated sex- 
typed behaviors. Bandura (1965, 1972) has 
suggested that children learn or acquire many 
social behaviors by symbolically encoding a 
model’s enactment of them but perform the 
behaviors only if they anticipate a positive 
outcome for doing so. One popular interpreta- 
tion of the sex-typing process (Mischel, 
1966) holds that appropriately sex-typed 
children perform sex-appropriate behavior be- 
cause their socialization histories lead them to 
expect reward for such behavior and punish- 
ment for sex-inappropriate behavior. A sec- 
ond possible interpretation (not incompatible 
with the first) is that appropriately sex-typed 
children have for some reason learned more 
. about the behavior of like-sex than of oppo- 
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Perry, Department of Psychology, University of 
Queensland, St. Lucia, Brisbane, Queensland, Aus- 
tralia, 4067, 


site-sex models, and obviously one cannot 
perform what one has not learned. 

Some recent experiments by Grusec and 
Brinker (1972) bear on this issue. These in- 
vestigators discovered that elementary school 
boys learn more about the behavior of same- 
sex than of opposite-sex models (a similar, 
but less reliable, result was found for girls). 
The authors argued that since children are 
ordinarily reinforced for imitating like-sex 
models, it becomes advantageous for them to 
learn as much as possible about what like-sex 
models do. In this way, they are in a better 
position to maximize the rewards they might 
later receive for imitation. Two additional 
and carefully controlled experiments by the 
same authors indicated that children who are 
reinforced for imitating one of two adults 
are indeed able to recall more of the subse- 
quent behavior of that adult. 

Although the Grusec and Brinker (1972) 
study thus suggests that children generally 
learn more about like-sex models than oppo- 
site-sex models, there no doubt exist within 
each sex marked individual differences in the 
strength of this tendency. Indeed, social learn- 
ing theories assume that a major reason why 
some children show inappropriately sex-typed 
characteristics is that these children have 
been reinforced for imitating opposite-sex 
models. If a child is consistently rewarded 
for imitating predominantly opposite-sex 
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models, then one might expect the child to 
develop a generalized tendency to learn more 
about the behavior of opposite-sex models 
than same-sex models. 

It was the purpose of this study to explore 
the possibility that although appropriately 
sex-typed children learn more about the be- 
havior of like-sex than of opposite-sex models, 
this tendency may be attenuated or even re- 
versed in inappropriately sex-typed children. 
Elementary school children were first given a 
test of sex role behavior. On the basis of 
their scores on this test, the children were 
divided into four groups: masculine boys, 
feminine boys, masculine girls, and feminine 
girls. Children were then shown a film depict- 
ing an adult male and an adult female model 
displaying distinct behavioral sequences and 
were subsequently tested for recall of the 
filmed behaviors. On the assumption that 
masculine children of both sexes have experi- 
enced reinforcement histories for imitating 
primarily male models, it was predicted that 
these children would learn more about the 
male model’s behavior. For parallel reasons, 
feminine children of both sexes were expected 
to learn more about the female model’s be- 
havior. In other words, it was predicted that 
a child’s social learning history as inferred 
from his current sex role behavior is an im- 
portant variable affecting the sex of model he 
is likely to imitate. 

A second aim of this research was to inves- 
tigate one possible mechanism which may help 
to account for a child’s learning more about 
one model’s behavior than another’s. Grusec 
and Brinker (1972) tested the hypothesis that 
one reason children learn more about models 
they have been rewarded for imitating is that 
they pay closer attention to such people. 
However, their experiment, in which chil- 
dren’s eye movements were recorded, showed 
that in fact children pay no more attention 
to these models than to ones they have not 
been reinforced for imitating. Hence, the au- 
thors rejected this possibility and suggested 
that reinforcement for imitation may lead 
subjects to process differentially the behav- 
iors of different models at a more central 
level. Bandura (Bandura, 1972; Bandura, 
Grusec, & Menlove, 1966) has suggested that 
symbolic coding, primarily in terms of covert 
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verbalizations, may be necessary for ac- 
quisition and retention of modeling stin\.!. It 
may very well be the case that childre we 
learned to encode spontaneously the av- 
iors of models they have been rewarde for 
imitating to a greater extent than the be iv- 


iors of other classes of models. One wav to 
test this possibility is to ask children expo, +d 
to models to verbalize aloud what they sce 
the models doing. If children spontaneouy:y 
verbalize more of the behavior of a model 
they have previously been rewarded for imi- 
tating (for masculine children, a male model; 
for feminine children, a female model), then 
support will have been gathered for this posi- 
tion. In the present study, this possibility was 
explored by asking half of the children in each 
group, while viewing the film, to describe in 
their own words the actions of the models. It 
was hypothesized that boys and girls classified 
as masculine would spontaneously verbalize 
more of the actions of the male model, where- 
as children classified as feminine would verb- 
alize more of the female model’s behavior. 


METHOD 


Determination of Sex Role Behavior and 
Selection of Subjects 


As a preliminary step, 188 boys and girls in th 
third and fourth grades of an upper-middle-class 
public school in Brisbane, Queensland, Australia, 
were administered a specially constructed projective 
test of sex role behavior. Existing tests of sex rok 
preferences were deemed unsuitable for use wit 
Australian children. For example, the It Scale f 
Children (Brown, 1956) includes stimulus materi: s 
which may not be familiar to some Australian chil- 
dren (e.g., figures of American Indians). The pios- 
ent test materials consisted of 60 3 X 5 inch 
cards each of which depicted in color some 
game, activity, or object typically associated v 
one sex. Half of the cards depicted some masculi 
activity or object, as determined by the concurre 
judgment of four adult judges (two male, two fe- 
male). Examples of the cards included® tools and 
household articles (e.g., hammer, washing machine), 
games (e.g., cricket, jump rope), and toys (¢.g., 
sling shot, doll’s house). 

Each child was individually tested by a same-sex 
experimenter. The child was shown a blank index 
card and instructed to pretend that a picture of a 
child named It was on the card. This “imaginary 
It” procedure is one successfully adopted by Fling 
and Manosevitz (1972), and their article contains 4 
discussion of its merits. The subject was then shown 
each card, one at a time in a predetermined order, 
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and asked to tell whether or not It would play with 
or use what was in the picture. A subject’s score 
on this test was determined by calculating the per- 
centage of choices he made for It which were keyed 
as masculine. In order to eliminate subjects who 
showed a response “bias to make either a dispro- 
portionately large or small number of choices for It, 
any child who made more than 45 or fewer than 
15 choices for It was deleted from the sample. 

The design of the study required 24 children in 
each of four groups: masculine boys, feminine boys, 
masculine girls, and feminine girls. To control for 
grade of child, it was also required that half of the 
children in each group be third graders and the 
other half, fourth graders. To fill these require- 
ments, the following steps were taken: Frequency 
distributions of children’s test scores were constructed 
by sex and grade of subject. The 12 highest- and the 
12 lowest-scoring subjects in each group, respectively, 
were taken as the masculine and feminine subjects 
for that distribution. The average percentage of 
masculine choices made for It by the 24 subjects in 
the masculine boys group was 95.6; for feminine 
boys, 26.8; for masculine girls, 75.8; and for femi- 
nine girls, 10.4, 

The average age of the total 96 children in the 
study was 8 years 11 months. 


Procedure 


Children were brought individually from their 
class to a spare room at the school by a same-sex 
experimenter who was blind to the child’s sex role 
status. As soon as the children entered, the experi- 
menter explained that each child was to see a film 
involving two people. Before showing the film, the 
experimenter displayed to the child and verbally 
labeled each of the objects that were used by the 
models in the film, This step was taken to make it 
easier for children to supply verbal labels for the 
objects if subsequently asked to verbalize the ac- 
tions of the models during the film. 

Each child was then seated in front of a movie 
screen. One half of the subjects in each group were 
assigned to the free-verbalization condition and were 
given these instructions: 


Now when the film starts, I want you to tell me 
what the people are doing with these things. Of 
course, there won’t be time for you to tell me 
everything you see them do, so don’t worry if you 
leave some things out. But try to tell me as much 
as you can. 


The other half, assigned to the passive observation 
group, were simply told to watch what was going on 
in the film, For subjects in the former group, the 
experimenter activated a tape recorder to record 
the subjects’ verbalizations. Only half of the chil- 
dren in each group were asked to verbalize in order 
to control for any effect which verbalization instruc- 


1 Complete information on test materials, admin- 
istration, and scoring may be obtained from the 
present authors, 
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tions might have on recall scores. However, no spe- 
cific prediction was made concerning the effect of 
free-verbalization on recall, since previous research 
has produced conflicting results (see, e.g., Bandura 
et al., 1966, who found that free verbalization facili- 
tated recall in children aged 6-8, and Coates & Har- 
tup, 1969, who found that free verbalization facili- 
tated recall in 4-year-olds but interfered with it in 
7-year-olds). 

The film showed the male model and the female 
model, each situated on one side of the screen and 
each displaying a series of different behaviors. There 
Were 12 sequences of behavior, half displayed by 
the male model and half by the female model. At 
the end of each sequence, the positions of the models 
on the screen were reversed. Included among these 
sequences were the following: While one model un- 
screwed a bottle lid, smelled the contents of the 
bottle, held his nose, and dropped the bottle in a 
basket, the other model split a tissue in half, blew 
his nose, put one tissue to his ear, and dropped the 
tissue into a basket. Again, while one model stretched 
a rubber strap, hooked its ends together to form a 
loop, stepped into the loop, and jumped up and 
down, the other model placed two drinking glasses 
over his eyes, then over his ears, placed the glasses 
together, and then put them on his head. 

Two films were shown, which were identical ex- 
cept that the behaviors the models performed were 
reversed. Half of the children in each condition saw 
one film, while the rest of the children saw the 
second one. All of the sequences of behavior in- 
cluded in the film were neutral as far as sex typing 
was concerned. 

Immediately after the film, the children were 
asked to demonstrate all of the models’ responses 
they could recall. For each matching response cor- 
rectly reproduced, the children were praised and 
earned a point toward a prize to be given later (a 
piece of candy). A standardized cuing procedure 
ensured that the order of reproduction was the 
same for all subjects. The experimenter first dis- 
played to the child the props used by both models 
in the first part of the film (the first two sequences) 
and asked the child to show what the models did 
with them. When the child appeared not to remem- 
ber anything else from those sequences, the experi- 
menter prompted, “Can you think of anything else 
they did with these things?” He then continued in a 
similar way through the rest of the sequences. This 
procedure prevented a child from recalling all he 
could of one model’s behavior before trying to re- 
call the other model's behavior. The experimenter 
recorded the children’s matching responses on a 
checklist, In order to achieve interscorer reliability, 
three experimenters tested pilot subjects until they 
agreed on scoring criteria and were in perfect agree- 
ment on at least 90% of the specific matching 
responses, 

As noted above, the verbalization of children as- 
signed to the free-verbalization condition during the 
film were recorded on tape. In order to determine 
how many actions of each model the subject spon- 
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taneously verbalized, these tapes were played back 
and scored by the experimenter. For each verb a 
child used to correctly describe a model’s behavior, 
he earned 1 point toward his verbalization score for 
that model. Of course, in describing a model’s ac- 
tion, children usually verbalized the subject and 
object as well as the verb, but since these were 
assumed to be understood when omitted, they 
were not a formal criterion for scoring the verbali- 
zation. In order to estimate interscorer reliability, a 
second male experimenter listened to the tapes of 21 
of the boys who were in the verbalization condition, 
and a second female experimenter listened to the 
recordings of 21 of the girls who were in the verb- 
alization condition. The Pearson correlation between 
the two male experimenters on the boys’ verbaliza- 
tions of the male model was .83, on their verbali- 
zations of the female model, .88; the correlation 
between the two female experimenters on the girls’ 
verbalizations of the male model was .85, on their 
verbalizations of the female model, .86. 


RESULTS 
Recall Scores 


The mean number of items recalled by chil- 
dren in each group is presented in Table 1. A 
five-way analysis of variance involving four 
between-subjects factors (sex role status, sex 
of child, film, and verbalization condition) 
and one repeated factor (sex of model) was 
performed on these data. This analysis yielded 
a statistically significant main effect of sex 
of model, F(1,80) = 9.70, p<.01, with 
subjects recalling more of the male model’s 
behavior than the female’s. Of major interest 
was a significant Subjects’ Sex Role Status x 
Sex of Model interaction, F(1,80) = eons 
p < .05. Further analysis of this interaction 
by £ tests indicated that masculine subjects 
remembered significantly more of the male 


TABLE 1 


MEAN NUMBER or ITEMS RECALLED or Each Mopet’s 
BEHAVIOR IN EXPERIMENTAL CONDITIONS 


Masculine sex role | Feminine sex role 


behavior behavior 
Verbalization 
condition 
Male Female Male Female 
model model model model 
Boys 
Free 13.6 13.3 125 |e 
Passive 13.9 11.0 13.7 12.3 
Girls 
Free 14.6 12,3 135 12.6 
Passive 12.4 11.2 12.3 13.8 


Note. Number of observations per cell = 12, as data for the 
two film conditions are combined. 
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TABLE 2 
MEAN NUMBER OF ACTIONS VE 


BALIZED ov EACH 


Mobet’s BEHAVIOR IN VERBALIZATION CONDIT 
Masculine sex role 
behavior 
Sex of child 
Male | Female | Male | Female 
model | model | model | model 
Boys 17.4 15.4 17.8 | 16.6 
Gids 17.2 15.0 | 163 17.5 
| i 
Nole. Number of observations per cell = 12, as data for the 


two film conditions are combined. 


model’s behavior than the female’s, (80) = 
3.84, p < .01, two-tailed; ¥ for male model 
= 13.5; X for female model = 11.9, but 
there was no significant difference for fe: 
nine subjects, £ = .57; X for male model 
13.0; X for female model = 12.8). 

Four additional interaction effects reached 
significance in the analysis. However, none oí 
these necessitated a qualification of the above 
two-way interaction in terms of any of the 
other experimental variables. Because none of 
these interactions was predicted, each is only 
briefly noted, and no ¢ tests are reported. 
Significant interactions were the following: 
Film x Sex of Model, F(1,80) = 37.89, p < 
01; Sex of Child x Verbalization Condition 
X Sex of Model, F(1,80) = 9.25, p< .01; 
Film X Verbalization Condition X Sex of 
Model, F(1, 80) = 4.41, p < .05; and Sex of 
Child x Film x Verbalization Condition x 
Sex of Model, F(1,80) = 7.572, p <.01. 


Verbalization Scores 


The mean number of actions verbalized by 
children in each verbalization group is given 
in Table 2. 

A four-way analysis of variance involving 
three between-subjects factors (sex role status, 
sex of child, and film) and one repeated factor 
(sex of medel) was performed on these data. 
This analysis yielded a significant main effect 
for sex of model, F(1,40) = 5.82, p < .05, 
with subjects verbalizing more of the male 
model’s behavior than the female’s. Of prime 
interest was a significant Subjects’ Sex Role 
Status x Sex of Model interaction, F(1, 80) 
= 4.97, p < .05. Further analysis of this in- 
teraction by ¢ tests indicated that masculine 
subjects verbalized more of the male model’s 
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behavior than the female's, (40) = 3.28, p < 
.01, two-tailed; ¥ for male model = 17.3; X 
for female model = 15.2, but there was no 
significant difference for feminine subjects, 
¢=.12; X for male model = 17.1; X for 
female model = 17.0. These results on the 
verbalization index clearly paralleled the re- 
sults for the recall scores. No other effect in 
the analysis reached statistical significance. 


Discussion 


The hypothesis that children of both sexes 
classified as masculine would remember more 
of an adult male than an adult female’s be- 
havior was supported. The parallel hypothesis 
for feminine children was not supported; femi- 
nine children recalled equal amounts of each 
adult’s behavior. It is of interest that this 
result parallels the Grusec and Brinker (1972) 
findings for male and female children. These 
investigators found that boys show a marked 
tendency to acquire more of a male model’s 
behavior but that girls do not show as strong 
a tendency to learn more about a female mod- 
els behavior. Apparently, there is a stronger 
tendency for boys (or masculine children of 
either sex) to learn more about a male mod- 
el’s behavior than there is for girls (or femi- 
nine children of either sex) to learn more 
about a female model's behavior. 

Interpretations of this finding are specula- 
tive, but at least one possibility comes to 
mind, A study by Grusec and Mischel (1966) 
found that children who interacted with a 
powerful model were later able to recall more 
of his behavior when given incentives for do- 
ing so than were those who interacted with a 
less powerful model. One reason why femi- 
nine children may learn as much about a male 
model as a female model is that males are 
generally perceived as possessing greater so- 
cial power, in terms of making decisions and 
controlling desired resources, than are fe- 
males, Thus, feminine children may learn 
about the behavior of different-sex models for 
different reasons: about women’s behavior be- 
cause they have received reinforcement for 
imitating women and about men’s behavior 
because they perceive men as powerful indi- 
viduals. In contrast, children with masculine 
sex role preferences would appear to have two 
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good reasons for learning about men’s be- 
havior but no apparent reason for learning 
about women’s: Not only have they been re- 
warded for imitating men, but they also per- 
ceive men as powerful. Hence, masculine chil- 
dren learn more about men, 

Although it is possible to interpret the find- 
ing that children generally recalled more of 
the male model’s behavior in terms of the 
greater social power of males, an alternative 
explanation is worth considering. The male 
model was taller, occupied more space on the 
film screen, and generally tended to perform 
the modeled behaviors more distinctly. Thus, 
his superior modeling ability as well as his 
greater perceived social power probably con- 
tributed to the effect. Ideally, of course, re- 
search should employ several models of each 
sex as well as control for distinctiveness of 
modeling stimuli across models before drawing 
firm conclusions about the effects of model 
sex, 

It should be emphasized that the present 
investigation selected for study the extremes 
on sex role status, and consequently, approxi- 
mately half of the possible subjects were not 
studied. Presumably, these unstudied children 
would have learned equally the behavior of 
the two models. 

One of the most interesting findings of the 
present study was the discovery that mascu- 
line boys and girls spontaneously verbalize 
more of the actions of a male than of a female 
model. If children who receive masculine sex- 
typing scores have in fact been reinforced for 
imitating male models, they should learn that 
it is to their advantage to learn as much as 
possible about the behavior of such models. It 
follows that it would be in their best interests 
to employ whatever strategies can help them 
to learn and remember what such people do. 
Several studies with both children (Bandura 
et al., 1966; Coates & Hartup, 1969) and 
adults (Bandura & Jeffery, 1973; Gerst, 
1971) have confirmed that one such strategy 
consists of verbalizing the model’s actions. 
Masculine children spontaneously verbalize 
more about men than women probably be- 
cause they have discovered that this aids in 
the acquisition of masculine behaviors. 

It is of some interest that instructing chil- 
dren to verbalize did not generally increase 
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their recall scores. It is possible that the 
third- and fourth-grade children who were 
assigned to the passive observation condition 
were spontaneously, but covertly, verbally 
encoding the relatively simple modeling stim- 
uli used in the present study, even without 
being explicitly requested to do so. If this 
were the case, then it is not surprising that 
explicit verbalization instructions would add 
little, if anything, to their baseline acquisition 
scores. Possibly, requiring children of this age 
to say aloud what they normally do silently 
merely permits us to observe and measure 
whatever encoding processes are ordinarily 
occurring naturally but covertly. 

In the present research, children’s sex typ- 
ing was assessed by a single projective test of 
sex role behavior, no attempt being made to 
assess separately the different aspects of the 
sex-typing process. Biller and Borstelmann 
(1967) have proposed three dimensions of sex 
role development (sex role preference, sex 
role adoption, and sex role orientation) and 
have suggested that it is possible for a child 
to be high (or low) on one dimension with- 
out necessarily being high (or low) on either 
of the other two.? The usefulness of this 
scheme is already being recognized (e.g., 
Fling & Manosevitz, 1972), but as yet there 
is little agreement in the literature on the 
best way to assess each aspect. Once consen- 
sus is reached as to operational definitions for 
these dimensions, it should be of interest to 
examine the relationships between children’s 
scores on each aspect and their tendencies to 
learn about models of different sexes. For 
example, presumably a child with masculine 
sex role preference but feminine sex role ori- 
entation would find it to his advantage to 
learn more about male models. Is this in 
fact the case, or does a feminine orientation 
predispose a child to acquire feminine char- 


2 The reader is referred to Biller and Borstelmann 
(1967) for a description of these dimensions. 
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acteristics despite his conscious pre‘:-ence 
for the masculine role? 
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A 2X23 factorial design was employed to study conditions that influence 
members’ evaluation of their own group’s product and that of a competing 
group. Group members were told that they either would or would not con- 
tinue working together in future competitive confrontations with another 
group. After the competition, group members were informed that their product 
had either won or lost or that a decision about the winner had not yet been 
made. Subjects were then asked to evaluate their own group’s product and 
that of the competing group either in public or in private. The results showed 
that in “continuing” groups, members of groups declared the winner showed 
less overevaluation of their own product when ratings were given in public 
than when they were given in private. The reverse effect was found in the 
losing groups. These effects were interpreted to result from subjects’ desire to 
avoid complacency in winning groups and to avoid despair in losing groups. 
Subjects in the “discontinuing” groups overevaluated their own group’s product 
more in public than in private. This effect was found in winning as well as 
losing groups and was interpreted as showing subjects’ desire to leave the 
group with a positive expression. The value of communication of group 


product evaluation was discussed in light of the obtained results. 


The proposition that “how the world looks 
depends on from where you are viewing it” 
has nowhere been better demonstrated than 
in the research on the evaluations of group 
products. It has been found with almost total 
consistency that group members tend to over- 
evaluate their own group’s products and per- 
formance and underevaluate the performance 
of outgroups or competing groups (e.g., Blake 
& Mouton, 1961; Ferguson & Kelley, 1964; 
Sherif, Harvey, White, Hood, & Sherif, 1961). 
This overevaluation has been found when 
estimates are made of the ingroup product in 
relation to an outgroup product (Blake & 
Mouton, 1961; Ferguson & Kelley, 1964) or 
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when the evaluation of the ingroup product 
is compared with actual reality (Sherif et al., 
1961). Overevaluation has been observed with 
both public and private ratings of group 
products, although the effect is reported to 
be stronger in the former situation (Schopler, 
Insko, & Hinkle, Note 1). 

While the studies on overevaluation have 
varied the type of product to be evaluated, 
the age of the subjects, and the size of the 
groups, all of the research in this area has 
elicited the evaluations before the group mem- 
bers learned whether their product had been 
declared the winner or loser in the intergroup 
competition, Further, in the majority of re- 
search, the possibility that the groups might 
compete on other projects was expressly ruled 
out (Schopler et al., Note 1) or was left 
ambiguous (Blake & Mouton, 1961; Sherif 
et al., 1961). Subjects in the Ferguson and 
Kelley (1964) study were told that they 
would be working together on three separate 
tasks but that they would not be in competi- 
tion with the other group. 

In a situation where group members are 
unaware of the success of their group’s prod- 
uct and are unclear as to whether they will 
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be competing as a group in the future, there 
are several reasons to expect overevaluation 
of the ingroup product. Blake and Mouton 
(1961) suggested that overevaluation may be 
used to attempt to influence the judges (if 
any) who will be deciding on the winner of 
the competition. Blake and Mouton also sug- 
gested a balance interpretation of overevalua- 
tion. From balance theory (Heider, 1958) it 
would be expected that an individual who is 
positively associated with a group (+ unit 
relation) will positively evaluate the group 
and its product. The balance interpretation 
has been supported by Ferguson and Kelley 
(1964) and Schopler et al. (Note 1). It is 
also reasonable to suppose that group mem- 
bers, waiting to discover which group has won 
an intergroup competition, may be interested 
in fostering a positive atmosphere in the 
group and that high evaluations of the in- 
group product may serve this purpose. 

If, however, members of a group expect 
several competitive confrontations with an 
outgroup rather than one single competitive 
encounter, the factors influencing overreevalu- 
ation may be different. For example, although 
overevaluation of the ingroup product may be 
useful during the period before the outcome 
of competition is known, high estimates of 
the relative quality of the ingroup product 
may be disadvantageous after a group mem- 
ber learns whether his group has won or lost 
an early round of the competition. A member 
who increases his estimate of the ability or 
performance of his group after finding that 
his group has won the first round of competi- 
tion may feel that he and his fellow group 
members need not prepare or work as hard 
in future confrontations. It is, however, to 
the advantage of all group members that this 
feeling of complacency among group members 
does not occur. Drastically overevaluating 
the ingroup’s product or underevaluating the 
outgroup’s product could create a general 
overconfidence which might result in group 
members “slacking off” and not working as 
hard in future competition. This possible de- 
cline in effort by the group members could 
cause the group to lose future confrontations 
with other groups. Thus, it might be expected 
that group members who anticipate future 
competition with an outgroup and who find 
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that they have won an early round wi!’ at- 
tempt to guard against complacency. “xe 
means of guarding against a group atmosp™«+e 
of overconfidence is for group member: o 
underplay or evaluate less highly their a 
group’s performance or evaluate more hi 

the performance of the opposing group. | 
ther, this effect should be strongest when t 
product evaluations are public. Through pul 
lic underevaluations, an individual may con 
municate to members of his own group tha! 
they must continue to work hard and tha 
the competition is still a threat. The use o! 
group product evaluations to combat over- 
confidence should be most evident when the 
evaluations have the capacity to serve as 
communication to the group members. 

A different situation confronts members of 
a group which has lost one round of a com- 
petition and which faces future contests. The 
loss may be expected to decrease the mem- 
bers’ estimates of their group’s competency 
and might result in the members feeling that 
they cannot win in future confrontations. The 
concern of members of groups in this situation 
should not be with overconfidence but with 
feelings of incompetency and a “what's the 
use of trying” attitude. In this case it is im- 
portant that motivation be sustained within 
the group by bolstering the members’ evalua- 
tion of the group’s ability. One way that this 
can be accomplished is to increase the evalua- 
tion of the ingroup product and decrease the 
evaluation of the outgroup product. By bring- 
ing the evaluations of the ingroup and out- 
group products close together, the individual 
is in essence saying, “We barely lost, and 
there is hope we can win the next time.” This 
overevaluation of the ingroup product shoul: 
be most evident when the product evaluation: 
can serve as communications, that is, wher. 
they are public. 

Group members who expect no future inter 
group competitions should not have the same 
concern about the psychological atmosphere 
in the group. There is no need to “keep up” 
the spirit of the group for the future, and it 
could be expected that members of groups in 
this situation would only be concerned with 
the immediate group environment. The mem- 
bers of a winning group should be interested 
in “savoring the delight of victory” and in 
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“playing up” their group’s performance. 


Thus, marked overevaluation of the ingroup 
product and underevaluation of the out- 
group’s performance would be expected of 
winning groups in this situation. Members 
of losing groups, on the other hand, should 
feel the “agony of defeat” and should be dis- 
satisfied with their group’s performance. 
There is little to restrain the members of 
these losing groups from transfering this dis- 
satisfaction to their estimates of the group 
products, and considerable disparagement of 
the ingroup product may be expected. It 
might also be predicted that these effects will 
be more evident in public expressions of the 
product evaluations than in private expres- 
sions. The communication of such evaluations 
in groups facing no future competition can 
be a tactic of expressing appreciation to the 
other members of a winning group or ex- 
pressing displeasure with the other members 
of a losing group. 

The present experiment was designed to 
test these predictions. Group members’ an- 
ticipation of future competition, their belief 
that the evaluations would be given publicly 
or privately, and the assignment of a win or 
loss to the group’s product were manipulated. 
A set of conditions was also included in which 
the members’ evaluations of the group prod- 
ucts were given before the members learned 
who had won the competition. These condi- 
tions were included to replicate the findings 
of previous studies in which the outcome of 
the competition was not known at the time 
the product evaluations were made. In order 
to rule out the possibility that group members 
might feel that their product evaluations 
could influence the outcome judgment, the 
outcome of the competition in the present 
study was alleged to depend upon a com- 
puter analysis of the group products. 

Ratings of attraction for the group were 
also collected in the present study. It was 
expected that members of winning groups 
would be more attracted to the group than 
members of losing groups, since the former 
had received positive reinforcement for their 
association with the group while the latter 
had received negative reinforcement (cf. 
Thibaut & Kelley, 1959). It was also pre- 
dicted that group members who expected to 
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be together in future contests would show 
more cohesiveness (greater attraction) than 
would members of groups that were to be 
disbanded after one event. 


METHOD 
Subjects 


Subjects were 336 undergraduate students enrolled 
in introductory psychology courses at the University 
of North Carolina at Chapel Hill. They were run 
in groups of three or four. There was an attempt to 
keep the proportion of three- and four-person group 
sizes constant across conditions so that group size 
would not differentially influence the results. No 
condition had more than five nor less than 3 three- 
person groups, and the results failed to show any 
significant differences due to group size. Participation 
in the experiment was in partial fulfillment of a 
course requirement. Each of two experimenters ran 
four groups in each cell of the design. 


Overview of the Design 


The experiment was a 2 X 2 X 3 complete factorial 
design. Subjects were told that their group would 
either continue competing with another group present 
during the same session or would compete on one 
task only (continue-discontinue manipulation), Sub- 
jects rated their own group's product and a product 
purported to be that of the competing group under 
public conditions or confidentially (public-private 
manipulation). Following the competition but before 
the rating of the group products, some groups were 
told they had won the competition, some were told 
they had lost the competition, and others were given 
no information concerning the outcome of the com- 
petition (outcome manipulation). 


Procedure 


At the beginning of each session, from six to eight 
subjects reported to a large experimental room. The 
subjects were told that they would be participating 
in a study concerned with the generation of new 
ideas. Supposedly the study was being sponsored by 
the “National Institute for Social Work Research,” 
an organization which was evaluating several of its 
social work programs and required the responses of 
groups of subjects untrained in social work theory, 
Subjects were assured that their participation would 
be of value, since, it was alleged, previous research 
had revealed that the social work programs designed 
by nonexperts were often as good as those designed 
by experts. The subjects were told they would be 
divided into two competing groups, placed in separate 
rooms for privacy, and would be given 15 minutes to 
develop, as a group, a rehabilitation program based 
on the Johnny Rocco case (e.g., Schachter, 1951). 
They were also told that the groups would be in 
competition with each other to ensure that they tried 
as hard as possible to develop a good rehabilitation 
program. Each member of the winning group would 
receive a prize of $2.50. 
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The experimenter explained that the Institute had 
developed a computer program that could judge the 
groups’ programs on a number of dimensions. The 
groups’ solutions would be judged on the basis of 
three criteria: creativity, practicality, and effective- 
ness. (Subjects’ attention was directed to a remote 
teletypewriter located in the large room with which 
the experimenter would supposedly submit their 
solutions to the computer.) The experimenter noted 
that the computer had proven to be an effective and 
impartial judge. 

Subjects were then randomly divided into two 
groups with the constraint that all members of each 
group be unacquainted prior to the experiment. 
Groups of four always consisted of two males and 
two females, and groups of three were never filled 
by members of one sex. After the groups were sepa- 
rated into two rooms, the continue-discontinue 
manipulation was executed. Subjects in groups as- 
signed to the discontinue conditions were told, “This 
is the only problem on which you will be working 
in competition with the other group. After this, the 
groups will be disbanded.” In contrast, subjects in 
the continue conditions were told, “This problem will 
not take the entire time, so your group will be 
competing against the other group on some other 
problems.” The group members did not know, of 
course, what instructions had been given to the mem- 
bers of the outgroup. Following these instructions, 
the groups were left for 15 minutes to work on their 
rehabilitation programs. 

The experimenter was blind, up to this point, to 
the levels of the public-private and outcome factors 
to which each group would be assigned, During the 
work period these assignments were determined by 
consulting a randomization schedule. After the group 
had worked on the Johnny Rocco case for 15 
minutes, the experimenter collected their solutions 
and purportedly typed them into the computer via 
the remote teletypewriter. After much clattering of 
the teletypewriter (which was clearly audible to 
the subjects), the experimenter returned to each 
group and told the members that their group's 
product had either won or lost the competition. In 
either case subjects were shown a computer output 
which indicated that the program of the winning 
group (their own group in win conditions and the 
other group in Jose conditions) was judged superior 
on the creativity and effectiveness criteria and not 
significantly different on the practicality criteria. 
Groups in the ambiguous-outcome conditions were 
told that the computer had not completed its analy- 
sis. Thus there were three levels of the outcome 
manipulation: win, lose, and ambiguous, 

Following this manipulation, the experimenter read 
aloud the rehabilitation program of the subjects’ own 
group and a program reported to be that of the 
other group. Subjects were then asked to rate each 
program on the three judgment criteria so that the 
experimenter could obtain supplemental information 
on the subjects’ own perceptions. In fact, the pro- 
gram alleged to be that of the other group was a 
standard program designed by the experimenters 
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and was the same in all conditions, The > blic- 
private factor was manipulated through the ruc- 
tions about the questionnaire. Half of the ups 
were told the ratings would be strictly conf lial 
(private conditions), and half were told th ich 
member’s ratings would be announced to thc up 
(public conditions). Finally, a second questi: re, 
which was to be confidential in all conditions, vas 
administered to assess subjects’ perceptions of oir 


own group, including their affect toward the i 
The subjects were then debriefed and pledge: 
secrecy. 


RESULTS 


Because of the high degree of interacti 
among subjects in the groups and becaus: 
groups of subjects were randomly assigned to 
experimental conditions, the group avera; 
was the unit of analysis in all of the tests. 
Preliminary analyses revealed no effects 
the experimenter who conducted the session. 
Therefore, the design was collapsed over this 
factor, yielding eight groups in each cell of 
the 2 x 2 X 3 design. 


Differences Between Ratings of Ingroup an’ 
Outgroup Products 


The predictions about product evaluatio: 
were generated from consideration of possib! 
subjects’ motivation to overevaluate the ij 
group product relative to the outgroup prox 
uct. Therefore, the primary interest of th 
study was in differences between the rating: 
of the two rehabilitation programs. For this 
reason, a “relative evaluation index” was 
computed which consisted of the sum of the 
differences between ratings of the ingroup and 
outgroup products on two of the judgmen 
criteria, creativity and effectiveness. (Sinc 
subjects in the win and lose conditions were 
told that the two products did not differ on 
the practicality criterion, ratings of practi 
ity were not expected to show the hypo 
sized effects and were not used in the con- 
struction of the index.) Positive values of the 
relative evaluation index reflect overevaliia- 
tion of the ingroup product vis-à-vis the 
standard outgroup product, while negative 
values indicate underevaluation of the ingroup 
product relative to the outgroup product. The 
means and analyses of the evaluation index 
are presented in Tables 1 and 2. 

For continuing groups, it was predicted 
that the relative evaluation index scores of 
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winning and losing groups in the public con- 
ditions would converge toward zero, This 
convergence was expected to result from ef- 
forts by subjects in these groups to combat 
complacency (in winning groups) and “giving 
up” (in losing groups) on future tasks by 
members of their group. Less convergence was 
expected in the continuing groups in the pri- 
vate conditions, since the value of private 
product ratings as a means of communication 
to other group members was obliterated. 
Groups in the discontinue conditions, on the 
other hand, were expected to show a very 
different pattern of effects, The relative eval- 
uation index scores of winning and losing 
groups in the public conditions were expected 
to diverge in comparison with the private 
ratings of similar discontinue groups. This 
hypothesis was based on the belief that sub- 
jects in the discontinue-public groups would 
be likely to use the ratingg for mutual con- 
gratulation (in win conditions) or mutual 
disparagement (in lose conditions). 

These predictions were partially confirmed 
by the significant Continue—Discontinue x 
Public-Private X Outcome triple interaction 
(see Table 2). Internal analyses revealed a 
significant Public-Private x Outcome inter- 
action in the continue conditions, F (2,84) = 
33.99, p < .001, but not within the discon- 
tinue conditions (F < 1.0). Subjects expect- 
> ing to continue competing on future tasks and 
~/ who were told they had.won the first compe- 

tition overevaluated their own product rela- 
tive to the outgroup product less when the 
ratings were public than when the ratings 
were private, F(1, 84) = 32.79, p < .001. In 
contrast, members of continuing groups in the 
lose conditions gave higher relative evalua- 


TABLE 1 
CELE MEANS or RELATIVE EVALUATION INDEX 


p = 


Continue Discontinue 
Outcome — j a 
Public Private Public | Private 
Win 2.227 5.668 7.448 J 

Lose —1.135 | —4.302 | —4.148 | —5.051 
Ambiguous 4.385 2.521 5.433 4.020 
Nole. n = 8 groups per cell. Higher values indicate greater 
Overevaluation of the ingroup product relative to the outgroup 


product. 


p 
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TABLE 2 


ANALYSIS OF VARIANCE OF RELATIVE 
EVALUATION INDEX 


Source MS df F 
Continue-discontinue(A) 15.33 1 10.62* 
Public-private (B) 15.99 EEEE id 
Outcome (C) 772.63 | 2 | 535.178** 
AXB 1.96 1 1.36 
AXC 49.09 | 2 | 34.00** 
BXC DIVE YASS [iP Was Wess oa 
AXBXC 23.97 | 2 | 16.61** 

Error 1.44 | 84 
*p <.005. 
** p <.001, 


tions in the public than in the private condi- 
tion, F(1,84) = 27.80, p < .001. Subjects in 
continuing groups in the ambiguous outcome 
conditions showed greater overevaluations 
with public than with private rating instruc- 
tions, F(1,84) = 9.62, p < .003. In the dis- 
continue conditions, the public-private ma- 
nipulation yielded a main effect on the rela- 
tive evaluation index, F (1,84) = 10.09, p < 
002, such that public evaluations of the 
products resulted in higher values than did 
private evaluations. The predicted Public- 
Private X Outcome interaction failed to ma- 
terialize. Thus, the results on the relative 
evaluation index are congruent with the pre- 
dictions for subjects in the continue condi- 
tions, but they were not exactly as predicted 
in the discontinue conditions. 

Of the significant double interactions and 
main effects reported in Table 2, all but two 
are qualiñed by the triple interaction. The 
main effect for the purported outcome of the 
competition was highly significant within all 
combinations of the continue-discontinue and 
public-private factors. The Continue—Discon- 
tinue X Outcome interaction was significant 
even within the private conditions, /(2, 84) 
= 3.66, p < .05. Inspection of the means (see 
Table 1) reveals that subjects in continuing 
groups were less responsive to the outcome 
manipulation than were those in discontinuing 
groups. Stated differently, the members of 
continuing groups showed less private over- 
evaluation of their own products in the win 
and ambiguous conditions and less under- 
evaluation in the lose condition than did the 
members of discontinuing groups. 
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Absolute Evaluations of the Products 


Subjects’ absolute evaluations of the in- 
group and of the outgroup product were ana- 
lyzed in an attempt to determine the locus of 
the effects described above. That is, the rela- 
tive evaluation results could have been due to 
differences in the evaluations of the ingroup 
product, differences in the evaluations of the 
outgroup product, or both. (The absolute 
evaluations were generated by summing aver- 
age responses of members in each group on 
“creativity” and “effectiveness” scales.) 

These analyses revealed a significant Con- 

tinue—Discontinue X Public-Private x Out- 
come interaction for ratings of the ingroup 
products but not for ratings of the outgroup 
product (F < 1.0). The means and analysis 
of variance of the absolute evaluations of the 
ingroup product are presented in Tables 3 and 
4. Inspection of Table 3 reveals that the 
pattern of means observed with absolute 
evaluations of the ingroup product was very 
similar to that found on the relative evalua- 
tion index. There was a significant triple 
interaction (see Table 4). The Public-Private 
X Outcome interaction was significant within 
the continue conditions, F (2, 84) = 30.17, p 
< .001, and was not significant within the 
discontinue conditions (F = 1.11, ns). The 
Public-Private X Outcome interaction within 
the continue conditions was of the same form 
as that observed for the relative evaluation 
index; subjects in continue—win groups rated 
their own product less highly under public 
than private rating instructions, while mem- 
bers of continue-lose and continue-ambigu- 
ous groups were more favorable in their rat- 
ings of the ingroup product when the ratings 
were public than when they were private. 


TABLE 3 


CELL MEANS OF ABSOLUTE EVALUATIONS OF THE 
INGROUP PRODUCT 


Continue Discontinue 
Outcome 
Public Private Public Private 
Win 12.176 | 14.771 | 15.341 | 14.135 
Lose 12.176 8.949 8.976 7.948 
14.343 | 13.386 | 14.276 | 14.122 


Ambiguous 


Note. Higher values indicate higher evaluations of the ingroup 
product. 
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ANALYSIS OF VARIANCE OF ABSOLUTE EVAL INS 
OF THE INGROUP PRODUCT 


us |u | 


TABLE 4 


Source 

Continue-discontinue (A) .67 | 1 i 
Public-private (B) 10.56 | 1 oe 
Outcome (C) | 221.46 | 2 ee 
AXB SSPE SS 
AXC | 24.18 | 2 hae 
BXC 602 A i 147 
AXBXC O TERA 17. a 

Error | 1.14 | 84 | 

* p < 005. 

** p < .001, 


The absolute evaluations of the outgra > 
product showed significant main effects for 
the continue-discontinue manipulation 
marginal means were 11.07 for continuing 
groups and 10.01 for discontinuing groups, 
F(1, 84) = 21.30, p < .001) and for the out- 
come manipulation (marginal means of 8.66 
for winning groups, 13.17 for losing gronps, 
and 9.94 for ambiguous-outcome groups; / (? 

84) = 164.55, p < .001). In addition, | 


significant double interactions were obtai: 
on this variable. The Public-Private X © i- 
come interaction was significant, F(2, 84 
5.29, p < .01, and was revealed by internal 
analyses to be due to significantly lower + 
ings of the outgroup product under pub::c 
instructions than under private instructions 
only when the groups were in the ambiguous- 
outcome conditions (within the ambiguou 
level the test for the public-private simple 
effect yielded F(1, 84) = 8.92, p < .005). A 
significant Continue—Discontinue X Outconie 
interaction was also observed on the ab e 
evaluations of the outgroup product, F (2, $4) 
= 4.39, p< .015, which was the result of 
higher ratings of the outgroup produci by 
subjects in continuous—ambiguous groups than 
discontinue-ambiguous groups, F(1,84) = 
6.69, p < .011, higher ratings by subjects in 
continue-win than discontinue-win groups, 
F(1, 84) = 23.02, p < .001, but no significant 
difference between the ratings of subjects in 
continue-lose and discontinue-lose groups (F 
< 1.0). The ratings of the outgroup product 
did not mimic the relative evaluation ratings. 
The close correspondence between the ef- 
fects observed on the absolute ratings of the 
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ingroup product and those found with the 
relative evaluation index would seem to sup- 
port the idea that subjects in continue-public 
groups used their evaluations of the ingroup 
product rather than their evaluations of the 
outgroup product to attempt to ensure the 
continued motivation of their co-workers, 


Other Responses to the Ingroup 


Subjects in all of the groups were asked to 
complete a confidential questionnaire which 
included several items concerning affect for 
the ingroup and perceptions of its processes. 
The most general effect of the experimental 
manipulations on these measures was a main 
effect for the outcome factor such that the 
members of win and ambiguous groups showed 
greater intragroup affect than did groups in 
the lose conditions. These results are best ex- 
emplified by a measure consisting of the 
summed responses to questions asking how 
much the subject enjoyed the group, how 
much he would like to work with the group 
on some future task, and how much he liked 
the other members of the group. The marginal 
means for this overall measure on a scale of 
1 (not at all) to 27 (very much) were 22.12 
for winning groups, 21.02 for ambiguous 
groups, and 17.40 for losing groups. The out- 
come main effect for this measure was highly 
significant, F(2,84) = 28.33, p < .001. 

Another effect observed on responses to 
the final questionnaire was a main effect for 
the continue-discontinue manipulation on a 
question asking how much the subjects agreed 
with the other members of their group con- 
cerning the rehabilitation program, F(1, 84) 
= 6.33, p< .014. Members of continuing 
groups felt they were in better agreement on 
the program than did members of discontinu- 
ing groups (marginal means of 7.30 and 6.74, 
respectively). 


Discussion 


Among the most important implications of 
the results reported above are the following: 
(a) that overevaluation in ongoing groups 
may vary in a rather complex manner and 
(b) that processes other than those related 
to balance phenomena must be considered to 
explain this complexity. While studies such 
as that reported by Ferguson and Kelley 
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(1964) have demonstrated that a balance 
interpretation is sufficient to account for the 
minimal conditions engendering the overeval- 
uation effect, several findings of the present 
study indicate that balance alone will not 
explain the effect as it emerged in this ex- 
periment, For example, there is very little 
correspondence between the pattern of results 
observed on measures of liking for the ingroup 
and that found with the relative evaluation 
index, The within-cells correlation in this 
study between the general affect measure and 
the relative evaluation index was .298 (p < 
.05). Although the correlation is significant, 
only about 9% of the residual variance of the 
overevaluation score is accounted for by the 
affect measure. This finding seems to indicate 
the necessity of considering other factors for 
accurate prediction of the magnitude of over- 
evaluation. 

The predictions advanced earlier concern- 
ing the product evaluations of continuing 
groups were confirmed. These predictions de- 
veloped from the basic assumption that group 
members would use public evaluations of the 
products as communcations designed to sus- 
tain a high level of motivation among their 
co-workers. Although the use of communica- 
tions to maintain motivation has not been 
investigated previously in situations of inter- 
group competition, much the same idea has 
been advanced by Zander (1968) in connec- 
tion with his work on group levels of aspira- 
tion and by Emerson (1966) in a case study 
of communications during the completion of 
a very difficult group task (climbing Mount 
Everest). Emerson’s results generally con- 
firmed his hypothesis that optimistic com- 
munications would evoke pessimistic replies 
from other group members, and vice versa, 
in an attempt to create high uncertainty 
about the outcome of the task and thus main- 
tain motivation. The results of the present 
study demonstrate that communications con- 
cerning product evaluations in conditions of 
continuing intergroup conflict show a similar 
effect. That is, objective feedback, which was 
met with less positive public evaluations, 
while negative feedback was met with more 
positive evaluations (compared to subjects’ 
ratings in the private conditions), 

The present study offers results to suggest 
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how evaluation of group products are used 
to increase members’ motivation. The simi- 
larity of effects found with absolute evalua- 
tions of the ingroup product to those obtained 
with the relative evaluation index suggests 
that subjects in continuing groups, when at- 
tempting to sustain the motivation of their 
co-workers, addressed their attention to the 
ingroup product. Although no predictions 
were advanced concerning which product 
rating would contribute most to the hypothe- 
sized relative evaluation effects, it seems 
reasonable that subjects would choose the 
ratings of the ingroup product for their 
motivation-sustaining communications. It was, 
after all, the ingroup product over which the 
subject and his co-workers had control and 
which would be affected by the presence or 
absence of motivation in the group. 

In addition to the effects attributable to 
the communication value of the product 
evaluations, there was another result of 
the continue-discontinue manipulation on the 
relative evaluation of the products. The sig- 
nificant Continue—Discontinue X Outcome in- 
teraction within the private conditions was 
apparently the result of less influence of the 
outcome manipulation upon the relative eval- 
uations of members of continuing groups than 
upon the ratings of members of discontinuing 
groups. It seems likely that subjects in the 
continue conditions tempered their evalua- 
tions of the products in anticipation of addi- 
tional information about the groups’ abilities 
from later competitions. 

Within the discontinue conditions, subjects 
tended to overevaluate their own group’s 
product more in public ratings than in private 
evaluations. It was expected that subjects 
would use public evaluations for mutual con- 
gratulation in the win condition and mutual 
disparagement in the lose condition. It seems, 
however, that subjects in the disbanding groups 
used the public ratings to foster a more positive 
group atmosphere regardless of the purported 
outcome of the competition, as is indicated 
by the public-private main effect within these 
cells. That is, rather than being motivated 
to express appreciation or displeasure with 
the group’s performance, these subjects may 
have been motivated to obtain whatever posi- 
tive outcomes they could from a favorable 
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group atmosphere during the last minui: of 
the group’s existence and used the pi uct 
evaluations, when these were public, tos. ify 
this motivation. Schopler et al. (Note 1: ob- 
tained a similar effect for the publicity ne 
evaluations. 

One possible alternative explanation fo: he 
results deserves exploration. It is possible | it 
subjects in the continue-public conditions ‘it 


the greatest pressure for their evaluations lo 
be correct, since they had to continue wo:! 

ing in their group and the other group mei: 

bers would hear their evaluations. In order to 
“play it safe” in public, they tended to rate 
the two performances relatively equal. Per- 
sons in the private-continue groups and 
both public-discontinue and private-discon- 
tinue groups may not have felt such a strong 
pressure to be correct. This explanation 
would account for the greater difference be- 
tween the public and private ratings in the 
continue groups than in the discontinue 
groups. This explanation has merit and can- 
not postively be ruled out. However, it seems 
to have two drawbacks. First, why should 
subjects in the ambiguous-continue condi 
tion give greater difference ratings in public 
than in private conditions? These people had 
no information on which to base their evalua 

tions, and a “cautious strategy” approach 
would suggest greater differences in private 
than public ratings. Second, the results indi- 
cated that subjects in the continue—public— 
lose condition actually evaluated their own 
group’s products higher than subjects in the 
continue-private-lose condition. If subjects 
had a greater desire to be correct in the 
public condition than in the private condi 

tion, it would seem that the lower rating 
should occur in the public cell. Thus while 
the desire-to-be-correct explanation cannot be 
ruled out, it does not seem to adequatcly 
handle all the data. 

In general, the results of this study indicate 
the need and the value of consideration of 
communication and motivational processes 
in the explanation of evaluation of group 
products. While there was no direct evidence 
that subjects were using their evaluations to 
increase their own group members’ motiva- 
tion to work, the results were such that an 
explanation based on this hypothesis would 
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be very plausible. Future research might 
focus on the effects on member motivation of 
having ingroup and outgroup products judged 
as being either close together or far apart in 
quality. If the present reasoning is correct, 
members of a group who feel that their 
group’s product won or lost a competition by 
a close margin should be more motivated to 
work hard in the next round of competition 
than would members who felt their group’s 
product had won or lost by a large margin. 


REFERENCE NOTE 
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Naturalistic Studies of Aggressive Behavior: Aggressive Sti 


uli, 


Victim Visibility, and Horn Honking 


Charles W. Turner, John F. Layton, and Lynn Stanley Simons 
University of Utah 


Three studies extended laboratory research on aggression to a naturalistic setting 
which involved horn honking from drivers as a measure of aggression; the 
studies were adapted frem Doob and Gross. The results from a survey (Study 
1) of 59 drivers suggested that they were frequently irritated by and aggres- 
sive toward other drivers, A second study (using a 3 X 2 factorial design with 
92 male drivers) indicated that manipulations of a rifle in an aggressive con- 
text and victim visibility (dehumanization) both significantly influenced horn 
honking rates subsequent to obstruction at a signal light. A third study with 
137 male drivers and 63 female drivers examined the interactive effects of a 
rifle, an aggressively connotated bumper sticker, and individual subject char- 
acteristics (sex and an exploratory index of self-perceived status) on horn 
honking, The results for three studies in naturalistic settings offer possible 
extensions of laboratory based findings on aggression. The role of inhibitions 
in modifying the pattern of results was also discussed. 


There has been considerable recent contro- 
versy about the validity of laboratory studies 
of aggression (Buss, Booker, & Buss, 1972; 
Feshback & Singer, 1971; U.S. Surgeon Gen- 
eral’s . . . Committee on Television, 1972). 
For example, some researchers have suggested 
that the commonly used measures of aggres- 
sion may not have external validity. Thus, 
variables which affect laboratory-based re- 
sponses may not influence naturally occurring 
aggressive responses. 

One possible limitation of the laboratory 
setting is that subjects may sharply modify 
their behavior if they believe that someone 
is carefully monitoring and evaluating their 
reactions. According to Turner and Simons 
(1974), when subjects were led to believe 
that the experimenter was monitoring their 
responses to weapons, they were less likely 
than nonaware subjects to shock their part- 
ners. Awareness of the experimenter’s purpose 
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apparently caused subjects to inhibit ag: 
sive behavior. In order to reduce the sub/eci’s 
inhibitions, researchers have introduce 1 
variety of deceptions to minimize beliefs 

the experiment was designed to evaluate `- 
gressive behavior. However, many subje is 
today may be too sophisticated for the dec: o- 
tions commonly used in the laboratory. Ti 
some laboratory findings could be artifact) 1 
for two reasons. First, subjects may be >- 
sponding primarily to awareness of deceptions 
rather than the experimental treatments 
(Page & Scheidt, 1971; Stricker, Messick, & 
Jackson, 1969). Second, their primary moti 
vation may often be to portray themsely 
in a favorable light to the experiment r 
(Rosenberg, 1969). 

Since some laboratory results may be pro- 
duced by experimental artifacts such as evalu- 
ation apprehension, suspicion, negativism, and 
sophistication, it is important that attempts 
be made to investigate aggressive behavior in 
subjects who are not aware that they are being 
studied. The primary purpose of the present 
research was to assess whether naturalistic 
manipulations conceptually similar to labora- 
tory procedures can affect human aggressive 
responses. Laboratory researchers have at- 
tempted to manipulate arousal, aggressive 
stimulation (Berkowitz, 1974; Geen & O'Neal, 
1969), and dehumanization (Milgram, 1965; 
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Zimbardo, 1969). In an exploratory attempt 
to extend laboratory research to a naturalistic 
setting, a rifle in an aggressive context (ag- 
gressive stimuli) and victim visibility (dehu- 
manization) were manipulated in the present 
study for obstructed (and possibly aroused) 
drivers at a signal light. 

In order to develop an appropriate natural- 
istic setting to measure aggressive responses, 
guidelines were adapted from Webb, Camp- 
bell, Schwartz, and Sechrest’s (1966) analysis 
of unobtrusive measures. The following cri- 
» teria were adopted to reduce subjects’ percep- 
tions that they were being studied and to 
develop adequately sensitive and independent 
measures of aggression: (a) There should be 
relatively low inhibitions about the behavior 
so that the base level responding would be 
considerably above zero probability, (b) the 
response should not be likely to produce con- 
tagion effects on other’s aggression, (c) the 
subject’s anonymity should be preserved, (d) 
the experimental setting should be reasonably 
naturalistic so that the procedures would not 
be an unusual imposition on the subjects or 
endanger them in any way, (e) the subjects 
should remain in the experimental setting for 
short periods of time such that they would 
not be exposed to more than one experi- 
mental treatment, and (f) the experimental 
, treatments could be randomly assigned to 
subjects. 


Stupy 1 


Doob and Gross (1968) have offered one 
possible procedure for a naturalistic study of 
aggression. Their findings suggest that horn 
honking might be an aggressive reaction 
toward low-status drivers who prevent the 
flow of traffic at a signal light. Anecdotal 
evidence suggests that many drivers become 
ıggressive when “frustrated” by the behavior 
of other drivers. Parry (1968) surveyed 
English drivers concerning their aggressive 
reactions while driving. His findings suggest 
that hostile reactions while driving included 
facial expressions, verbalizations such as 
swearing, hand gestures, tailgating, light flash- 
ing (high beams), and horn honking. Some 
drivers also reported actual fist fights or at- 
tempts to chase other drivers off the road. 
Parry’s findings suggest that many drivers 
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may become angry and aggressive while 
driving. In order to determine whether similar 
hostile reactions occur in Salt Lake City, a 
survey was conducted based on Parry’s ques- 
tionnaire. Thus, Study 1 was designed to 
determine whether there was a sufficiently 
high base rate of anger and aggressive re- 
sponses from drivers so that experimental 
treatments might be expected to produce 
reliable results. 


Method 
Subjects and Procedure 


The subjects were sampled from the population of 
frequent drivers in Salt Lake City. One hundred 
homes were randomly selected from the city address 
directory. When an investigator located a residence, 
the most frequent male or the most frequent female 
driver (randomly determined) was asked to complete 
Parry’s (1968) driving survey. Acceptable subjects 
were located in 93 homes. The subjects were given 
a stamped envelope to return the questionnaire. If 
they did not respond within 2 weeks, they were 
again encouraged to complete the questionnaire, The 
subjects were assured of the complete confidentiality 
of their responses. Fifty-nine (63%) of the delivered 
questionnaires were returned. 


Results and Discussion 


Twelve questions (of 77) from Parry’s 
(1968) questionnaire were selected for analy- 
sis, and the results (see Table 1) are re- 
ported separately for males and females. 
The findings suggest that a high proportion of 
“frequent” drivers sometimes become angry 
or are irritated by the driving behaviors of 
other drivers, For example, 77% of males 
and 56% of females reported “swearing under 
their breaths” at other drivers, while 50% of 
males and 15% of female drivers reported 
“flashing their lights in anger” at other 
drivers. While overt hostile responses were 
not reported by a majority of drivers on 
every question (e.g., Questions 1, 2, 10, and 
12), there does appear to be evidence that 
hostile reactions to other drivers are a fre- 
quent occurrence. If the verbal reports are 
accurate reflections of actual driving situa- 
tions, then a large number of drivers might 
be frequently irritated by the behavior of 
other drivers. This anger or irritation could 
sometimes lead to an overt aggressive re- 
sponse if such a response is readily available 
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TABLE 1 


PERCENTAGES OF FEMALE AND MALE RESPONDENTS 
REPORTING HOSTILE REACTIONS TO 


QUESTIONNAIRE ITEMS 
| Male | Female 
Ee respon- | respon- 
Questionnaire item | dents dental 
(%) (%) 
I am easily provoked when driving. 23 18 
I lose my temper when another 
driver does something silly. 40" 41 


I have been known to flash my 
lights at others in anger. 50 15 
I get annoyed if the traffic lights 
changed to red as I approached 


them, 23 23 
I make rude signs at other motorists 

when I am provoked. 15 11 
At times, I’ve felt that I could 

gladly kill another driver. 12 18 
If someone suddenly turns without 

signaling, I get annoyed. 58 92 
I swear out loud at other drivers. 23 41 
I swear under my breath at other 

drivers, 77 56 
I have given chase to a driver who 

has annoyed me. 12 4 


If the driver behind has his lights 
shining in my mirror, I pay him 


back in some way. 23 12 
I am usually impatient at traffic 
lights. 19 1 


Note. The samples are based on the responses of 26 men and 
27 women. 
a One subject did not complete the item. 


(e.g., horn honking following an obstruction 
at a signal light). 


Stupy 2 


The present research was primarily designed 
to extend laboratory-based procedures for 
investigating possible determinants of aggres- 
sive reactions in a naturalistic setting. Hence, 
the procedure of Doob and Gross (1968) was 
adapted in order to manipulate exposure to 
aggressive stimuli and to attempt manipula- 
tions of inhibitions by dehumanization of the 
subject’s “victim.” The alleged victim (an 
experimental confederate ) would potentially 
frustrate all subjects by obstructing them at 
a signal light. 

Berkowitz and LePage (1967) manipulated 
aggressive stimulation by exposing some sub- 
jects to a pistol and a shotgun. One possible 
analogous field manipulation of aggressive 
stimuli would be to present a rifle in the gun 
rack of a pickup truck, especially since rifles 
are often carried that way in Utah. However, 
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a high proportion (perhaps 50% ) 

males have used rifles frequent in a 
“sporting” context. It is possible th: weap- 
ons (a rifle or a pistol) are not al per- 
ceived as aggressive stimuli when t have 
been observed frequently in a nona ssive 
context; for example, the rifles may per- 
ceived as sporting equipment somewh kea 
fishing pole or skis. In the present e3 ent, 


an attempt was made to vary the sali of 


an aggressive meaning for a rifle by p ing 
it with an ostensible bumper sticker ho ing 
an aggressive or a nonaggressive label. his 


manipulation is somewhat analogous to one 
employed by Berkowitz and Alioto (1975). 
They led some subjects to believe that a 
filmed football game was a “grudge” matc.:, 
while other subjects were encouraged to think 
of the game simply as a sporting event. Sub- 
jects watching an apparent grudge match 
were more likely to see an aggressive meanings, 
to the players’ actions and were more likely 
to shock a partner who had previo ly 
angered them. Berkowitz and Alioto’s (1973) 
findings suggest that the context of stimt us 
materials may play an important role in 
determining whether the material is viewed 
with an aggressive meaning. 

In an attempt to vary the subject’s inl 
tions about being aggressive, the mutual \ ‘si- 
bility of the victim and subject was varicd. 
According to findings by Milgram (1965), 
subjects appeared to be more willing to ad 
minister shock to an ostensible fellow subject 
when they were both less likely to see and 
to be seen by the victim. In addition, 
Zimbardo (1969) has proposed that deindi- 
viduation of both the subject and victim (de- 
humanization) can increase the probability of 
aggressive behavior. 


Method 
Subjects 


Experimental treatments were randomly assigned 
to 92 male drivers who served as subjects. Nine 
additional subjects were dropped from the sample, 
since they were females (n= 4) or male drivers of 
older vehicles (n =5). These subjects were approxi- 
mately evenly distributed across conditions. The sub- 
jects were an arbitrarily selected sample of drivers 
of late-model vehicles (less than 6 years of age) in 
a 20 X 20 block region of a mixed busi residen- 
tial district of Salt Lake City. Only newer car 


Utah | 


le ae 
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drivers were employed because high-status victims 
seem to lead to inhibitions in honking (Doob & Goss, 
1968). It is possible that older car drivers would 
perceive themselves as having low status relative to 
the victim, which could lead to inhibitions masking 
the effects of independent variables. The experimen- 
tal treatments were run on Saturdays from 9 a.m. 
to 5 p.m. It was assumed that Saturdays would 
produce a broader based sample of drivers from the 
potential population of all drivers, since fewer would 
be working. Moreover, the influence of “rush hour” 
traffic conditions could be minimized by testing on 
Saturdays, 


Experimental Design 


A 3 X2 between-subjects factorial design was em- 
ployed to manipulate aggressive stimulation and 
victim visibility (dehumanization), The subject was 
obstructed at a signal light for 12 sec. by an older 
model (1964) pickup truck with a gun rack in the 
rear window. The aggressive stimulation variable 
had three levels: (a) The gun rack was left empty 
(control), (b) a .303-calibre military rifle was placed 
in the gun rack and a bumper sticker was attached 
to the truck in order to reduce the perceived ag- 
gressiveness of the rifle (Rifle & “Friend” bumper 
sticker), or (c) the rifle was paired with a bumper 
sticker designed to increase the perceived aggressive- 
ness of the rifle (Rifle & “Vengeance” bumper 
sticker). The bumper stickers were attached to the 
tailgate of the truck directly in line with the sub- 
ject’s vision, and they could be easily removed and 
reattached after each trial. The bumper stickers 
measured approximately 4X 15 inches (102 X 381 
mm), and the words (3 inches, or 76 mm, high) 
were printed with broad lettering (È inch, or 9.5 
mm, thick), so that they could be easily read at 50 
feet (15 m). The words friend and vengeance were 
selected from the aggressive or altruistic lists of 
Parke, Ewall, and Slaby (1972). Ratings by 30 col- 
lege students indicated that the word vengeance was 
highest (without also being rated high in anxiety) 
and friend was lowest on an aggressive-nonaggres- 
sive dimension from the words in Parke et al.’s lists. 

Victim visibility (dehumanization) was manipu- 
lated by closing a curtain across the rear window 
of the pickup (without obstructing the view of the 
gun rack) in half of the conditions (low visibility) 
and leaving the curtain open in the other conditions 
(high visibility), The experimental conditions were 
run in blocks of six such that each condition was 
completed before any condition was replicated. 


Procedure 


The procedure was modeled closely after Doob 
and Gross (1968). An experimental confederate 
driving a pickup truck timed his arrival at an 
intersection at approximately the same time that the 
light turned red. If a male driver of a late-model 
apparently privately-owned vehicle came to a com- 
plete stop behind the confederate before the light 
changed to green, the driver confederate started the 
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trial (if the conditions were not satisfied, the trial 
was aborted). When the light turned green, the driver- 
confederate started a stopwatch, faced straight 
ahead, and kept his brake lights on to avoid any 
indication that he might be having trouble with the 
pickup. At the end of 12 sec., the confederate moved 
forward with the traffic. Thus, the subjects were 
obstructed at the light for 12 sec. The first driver 
in line behind the confederate was always considered 
to be the subject. An observer was placed in an 
inconspicuous spot at the intersection so that the 
subject would be unlikely to see him. The observer 
rated demographic characteristics of the subject be- 
fore the trial began (e.g., age and sex of subject; 
age of car, make of car, number of occupants, and 
general traffic density). Based on the observer’s 
ratings, nine subjects were dropped from the sample, 
since they were either females or male drivers of older 
vehicles. The deleted subjects were inadvertently 
exposed to treatments when the driver-confederate 
could not see them clearly in his mirror. The driver’s 
side-view mirror was partly obstructed with tape 
so that his reflection would not be visible to the 
subjects behind him when the curtain was closed. 
The tape also prevented the confederate from clearly 
seeing the subject's vehicle, and he misjudged the 
sex or vehicle age of some subjects, The observer's 
judgments were employed to establish the vehicle 
age or sex of subjects. The observers started a stop- 
watch when the light turned green and recorded the 
latency and frequency of honks from the subject. 
The observers received at least 1 hour of pretraining 
in the rating procedures. 


RESULTS AND Discussion 


Subject’s honking responses were dichoto- 
mized into those honking (scored as 1) and 
those not honking (scored as 0). According 
to Lunney (1970) and D’Agostino (1971), 
analysis of variance procedures may be 
applied to dichotomous data when sample 
sizes are reasonably large (for error, df = 20) 
and the sample proportions (means) fall be- 
tween .20 and .80. Typically, hypotheses 
about proportions are tested with the bino- 
mial distribution, but the binomial is closely 
approximated by the normal distribution for 
n greater than 10, especially when the 
“population proportion” or null hypothesis is 
approximately .50. Since these conditions 
were satisfied with the present data (ñ = 15.2; 
$ = 42), the analysis of variance procedures 
were employed. 

The rates of honking are reported in 
Table 2. Five independent planned con- 
trasts (Kirk, 1968, pp. 73-76; 178-182) 
were computed on the six cell means using 
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TABLE 2 


PERCENTAGE OF HORN HONKING IN THE EXPERIMENTAL 
CoxnnDITIONS OF STUDY 2 


| Aggressive stimulation 
Victim Rifle present 
visibility | Di | 
| Control | ] Mean % 
| | Friend [Veni e 
bumper | bu 
| sticker | st 
Low visibility 33.31 | 46.71 76.5111 52.2 
(cur osed) | (n = 15) | do 5 15) | (= 17) 
High visibility 2idiv | 29.4y | 42.9v1 3 
(curtain open) | (n = 14) (n= 17) (n = 14) 
27 38.0 59.7 


Mean % | 


Cell entries refer to the percentages of subjects pro- 
g at least one honk. Numbers in parentheses refer to cell 
ze: “rhe Roman numerals specify the order of the contrast 
yeis which were applied to means, For example in Contrast 

B (—}, —4. +1, 0, 0, 0), the contrast weight +1 was applied 
to Condition IIL (rifle present/vengeance bumper sticker). 


an unweighted-means solution for the unequal 
sample sizes. The contrast weights for each 
mean are presented in order according to the 
cell subscripts (I-VI) which are reported in 
Table 2: Contrast A= —4, —4, —4, +4, 
+4, +4; Contrast B = —}, —4, +1, 0, 0, 0; 
Contrast C = es +1, 0, 0, 0, 0; Contrast D 
=0, y 0, —4, +1; Contrast E = 0, 0, 
Mami T, Y The results of the planned 
contrast analysis indicated that the closed 
curtain significantly increased the rate of 
honking compared with the open curtain 
treatment, for Contrast A: F(1,86) = 4.43, 
p< .05. In addition, the honking rate for 
the rifle/vengeance condition (XY = .765) was 
significantly higher than the average (X = 
400) of the other two conditions ([rifle/ 
friend + control] x 4) when the curtain was 
closed, for Contrast B: F(1,86) = 5.98, 
p< .02, but the effect was not significant 
when the curtain was open, for Contrast D: 
F(1,86) = 1.37, p > .20. The two remain- 
ing contrasts (C and E) which compared the 
rifle/friend condition to the control condition 
were also nonsignificant (F < 1.0). 

The results of the present study are gen- 
erally consistent with the reasoning that led 
to the procedures, since both victim visibility 
and the rifle/vengeance condition increased 
horn honking. Thus, the present findings 
tentatively suggest that dehumanization and 
the presence of a rifle which is perceived 
as an aggressive stimuli can increase the prob- 
ability of aggressive responding in a natural- 
istic setting. The rifle did not significantly 
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influence the rate of honking when it was in a 
friendly or “prosocial” context (Contrasts C 
and E), nor did the rifle/vengeance oo idi- 
tion significantly influence honking whe» the 
victim was visible (Contrast D). The iset 
that the rifle/vengeance condition hor og 
rate was significantly higher only when ‘he 


victim was not visible may be consistent w::h 
laboratory procedures used to study aggi 
sive behavior, That is, most researchers i 
the laboratory typically isolate the victi: 
from the subject in order to lower inhibitions 
about giving shocks. Similarly, in the present 
study, reduced visibility of the victim (when 
the curtain was closed) might have increased 
the rate by reducing inhibitions. 


Stupy 3 


One limitation of the procedure in Study 2 
was that the rifle and the vengeance bumper 
sticker were not independently manipulated. 
Thus, the findings for the rifle/vengeance 
condition might have been due to either ob- 
ject alone or to the interactive effects of both 
objects. In Study 3, the rifle and the ven 
geance bumper sticker were independently 
manipulated so that their hypothesized inter 
active effects on horn honking could be tested 
The vengeance bumper sticker was designed 
to increase the likelihood that subjects woul: 
perceive an aggressive connotation to the 
rifle. Previous laboratory findings suggest 
that uninhibited subjects would be more ag- 
gressive when they viewed stimuli with an 
aggressive meaning (Berkowitz & Alioto 
1973). Since it was possible that inhibitions 
might mask any effects of the rifle and th: 
vengeance bumper sticker, the dehumani 
tion (curtain) manipulation of Study 2 was 
employed for all conditions in an attempt to 
lower inhibitions. 

The work of Doob and Gross (1968) sug- 
gests that there may be strong individual dif- 
ferences (e.g., status and sex differences) in 
driver’s reactions to obstruction at a stop- 
light. Based, in part, on Doob and Gross’s 
findings, male drivers of older vehicles and 
female drivers were not used as subjects in 
Study 2 because it was assumed that they 
would inhibit horn honking. Doob and Gross 
found that male subjects honked less at 
higher than lower status victims, possibly be- 
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cause high-status victims produced inhibitions 
about honking. Presumably, the lower the 
subject’s self-perceived status, the higher the 
victims status (relatively) is likely to appear. 
Thus, self-perceived status relative to the 
victim may influence willingness to honk as 
an aggressive response. 

In the present investigation, the subjects 
were divided into two groups based on the 
age of the vehicle they were driving. This 
procedure was employed in an attempt to 
derive an exploratory measure of subject’s 
self-perceived status. It is possible that the 
older a person’s car, the less likely he would 
be to perceive himself as higher in status than 
the confederate in the pickup truck. Since the 
vehicle age variable could reflect other differ- 
ences than self-perceived status (e.g., differ- 
ential likelihoods of being frustrated due to 
different experiences with stalled automo- 
biles), the variable was included only as an 
exploratory assessment of possible status 
differences." 

Several researchers (summarized by Bard- 

ick, 1971, chap. 7) have found different 

tterns of male and female aggressive behav- 

r. Further, Doob and Gross (1968) found 
different horn honking reactions in men and 
women, since women had longer latencies for 
their first honks. Based on these findings, it 
was assumed that males and females might 
not respond with horn honking in the same 
way to the manipulations of the present study 
if horn honking reactions to obstruction at 
a signal light reflect aggressive responses. 
Hence, the subject’s sex was recorded to 
permit separate comparisons of the experi- 
mental manipulations for males and females. 
't is possible that the effects of the rifle or 

vengeance bumper sticker would be sig- 
niicant only with male subjects driving new 
vehicles, since other subjects might inhibit 
horn honking responses. 


1Status was varied by vehicle age rather than 
model for several reasons. It was assumed that dif- 
ferent models might not be consistently perceived 
as representing high or low status by all drivers 
(e.g., sports cars vs. luxury sedans). Moreover, it 
was difficult to determine how subjects would judge 
older model, expensive vehicles relative to newer 
model, less expensive vehicles. There might be con- 
siderable inconsistency in perceptions of status for, 
model-cost and model-age variations. 
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Method 

Subjects 

Male (m=137) and female (n= 63) drivers of 
apparently privately owned vehicles served as sub- 
jects. Subjects were selected by the same procedures 
used in Study 2 except that no restrictions were 
placed on the age of drivers’ vehicles or sex of sub- 
jects. Four additional subjects were dropped from 
the sample due to recording errors (i.e., not recording 
age of vehicle or sex of subject). 


Experimental Design 


Each subject was exposed to one level of a weapon 
(rifle vs. no rifle) manipulation and one level of 
the bumper sticker (vengeance bumper sticker vs. no 
bumper sticker) manipulation in an attempt to in- 
dependently manipulate perceived aggressiveness of 
the rifle. The status of the subjects was also classi- 
fied according to a median split (approximately) on 
the age of the subjects’ vehicle (new vehicle: less 
than or equal to 4 years old; old vehicle: more than 
4 years old). 


Procedure 


The procedure was identical to Study 2 except for 
the independent manipulation of the bumper sticker 
and the rifle. In addition, both male and female 
drivers were exposed to the treatments. Two pickup 
trucks (1969 models) were used to introduce the 
experimental conditions. These trucks were 5-6 
years newer than the truck used in Study 2. Hence, 
the victims’ perceived status in Study 3 might have 
been higher than that of the victim in Study 2. 

Hidden observers started a stopwatch when the 
light turned green and recorded the latency and fre- 
quency of honks. The raters also recorded informa- 
tion about subject’s age, sex, and age of vehicle. 
The raters received at least 1 hour of pretraining 
in the rating procedures. In order to assess the reli- 
ability of observer ratings, two raters were employed 
for two separate samples of subjects (Sample A = 62 
subjects from Study 3; Sample B = 46 subjects from 
an unpublished study testing other hypotheses about 
the effects of a rifle). The percentage agreement 
between the raters for presence or absence of a honk 
was 100% in Sample A and 96% in Sample B. The 
reliability (ree) for rated age of subject's auto was 
-83 for Sample A and .78 for Sample B; most of 
the disagreements occurred for older age autos. Reli- 
abilities for frequency of honking were .94 in 
Sample A and .87 in Sample B; reliability for 
latency of honking was .90 for both Samples A and 
B. No attempt was made to record or rate duration 
of honks, since reliabilities of ratings would have 
been too low, and adequately sensitive, portable tape 
recorders were not available to record the honks. 


RESULTS AND DISCUSSION 
As in Study 2, subject’s horn honking re- 


à Sponses were dichotomized into those honking 


(scored as 1) and those not honking (scored 
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as 0). According to the reasoning advanced 
above, it was assumed that the effects of the 
rifle would be most pronounced when it ap- 
peared in an aggressive context (ie., the 
bumper sticker). Predictions about the interac- 
tive effect of the rifle and the bumper sticker 
can be tested most directly by a planned com- 
parison procedure. The primary interaction 
hypothesis was tested by a contrast that com- 
pared the rifle/vengeance condition to the 
average of the other three rifle and vengeance 
combinations (Contrast A: —4, —4, —4, 
+1). Two additional independent contrasts 
were selected: The second contrast com- 
pared the rifle/no bumper sticker condi- 
tion to the two no rifle conditions (Contrast 
B: —4, —}, +1, 0); the third contrast com- 
pared the two no rifle conditions (Contrast C: 
—1, +1, 0, 0). The pooled results for Con- 
trasts B and C represent deviations from the 
main hypothesis. The mean rates (propor- 
tions) of subject honking which are reported 
separately for new- and old-vehicle male 
drivers and for female drivers are presented 


TABLE 3 


PERCENTAGE OF HoRN HONKING IN THE EXPERIMENTAL 
Conpitions oF STUDY 3 


Experimental conditions 


No rifle present Rifle present 


Vengeance Vengeance 
b No br 
Siker CD) | alimen | sticker (D | Amt, 
Male drivers of new vehicles 
50.0 33.3 30.4 65.0 
(n = 18) (n = 12) (n = 23) (n = 20) 
Male drivers of old vehicles 
56.2 38.1 46.2 14.3 
(n = 16) (n = 21) (n = 13) (n = 14) 
Female drivers 
80.0 50.0 50.0 50.0 
(n = 15) (n = 16) (n = 16) (n = 16) 


Note, Cell entries refer to the percentages of subjects pro- 
ducing at least one honk. Numbers in parentheses refer to cell 
sizes. The Roman numerals specify the order of the contrast 
weights which were applied to means. For example, in Contrast 


A (—3, —3, —4, and +1), the contrast weight +1 was applied 
to Condition IV (rifle present-vengeance bumper sticker). 
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TABLE 4 
SUMMARY OF THE CONTRAST ANALYSIS 7 [ORN 
HONKING RATES IN STUDY 3 
Source | a | us 
I ea 
Males drivers of new vehicles 
Hypothesis (Contrast A) | 1. | .948 | 3" 
Deviation from hypothesis | | | 
(Contrast B + C) ha e A SD ie 
| | | 3 
Male drivers of old vehicles 
Hypothesis | 1} 1.228) 5.23* 
Deviation from hypothesis 
(Contrast B + Č) 2) 128) <1 
Error for males 129 | .235 
Female drivers 
Hypothesis 1} U7} <1 
Deviation from hypothesis 
(Contrast B + C) 2) A741 1.92 
Error for females 59 | .244 


| 


Note. The harmonic mean (ñ) for new-vehicle males was 
17.21, for old-vehicle males, ñ = 15.48, and for females, 
ñ = 15,72. 

*p <.05. 


in Table 3. The results for the contr: 
analysis are summarized in Table 4, 


New-Vehicle Male Drivers 


The results for the planned compariso 3 
indicated that the rate of honking in ù < 


rifle/vengeance condition with new-vehicle - 


male drivers was significantly higher than the 
average of the other three rifle/bumpe 
sticker conditions, for Contrast A: F(1, 12° 

= 4.03, p < .05. The other three conditi: 

did not differ significantly from each oth r, 
for Contrast B + C: F < 1.0. It should 4.50 
be noted that there is no single alternative 
contrast to A for new-vehicle male drivers 
which could be significant, maximum alterna- 
tive contrast F (1, 129) = 1.6, p > .20. Heuce, 
the results tentatively support the predictions 
leading to the present procedures. Although 
the rifle/vengeance condition was signif- 
cantly different from the average of the other 
three conditions, a careful inspection of the 
means reported in Table 3 indicates that it 
was not significantly different from the con- 
trol condition. One possible explanation for 
the somewhat weaker results obtained in 
Study 3 is based on the fact that the con- 
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federate-victims drove newer vehicles (3-4 
years old) in Study 3 as compared to an 
8-9-year-old vehicle in Study 2. Perhaps some 
subjects inhibited honking to victims in the 
newer trucks in Study 3 because the victims 
were perceived to be of relatively high status. 
Since Doob and Gross (1968) found evidence 
of inhibitory reactions toward high-status 
victims, perhaps the weaker findings in the 
present study were partly accounted for by 
inhibitory processes.* 


Old-Vehicle Male Drivers 


The three planned contrasts (A, B, C) 
were employed to assess the effects of treat- 
ments for old-vehicle male drivers. The re- 
sults for Contrast A indicated that the rifle/ 
vengeance condition produced significantly 
lower rates of honking than the other three 
conditions, F(1, 129) = 5.23, p< .05. The 
other three conditions did not differ signifi- 
cantly (Contrast B + C: F < 1.0). One pos- 
sible explanation can be offered for these find- 
ings: When the old-vehicle drivers were 
exposed to the rifle in an aggressive context 

1e bumper sticker), they were more likely 
to perceive an aggressive meaning to the rifle 
and hence to their own honking responses. 
If they perceived their honks as potentially 
aggressive, they might have inhibited reac- 
tions in the presence of a higher status victim. 
These interpretations are somewhat similar to 
those offered by Ellis, Weinir, and Miller 
(1972), who found that subjects produced 
lower levels of shock giving in the presence 
of a rifle and a pistol. Apparently for their 
subjects, the weapons produced inhibitions 
about being aggressive. The present findings 


In order to assess the robustness of the aggressive 
stinulation manipulation which was reflected in the 
comparison of the rifle/vengeance condition to the 
control (no rifle/no bumper sticker) condition, the 
results for Study 2 and Study 3 were reanalyzed. A 
2X2 factorial analysis of variance was computed 
using Study 2-Study 3 as one factor and the aggres- 
sive stimulation manipulation as the second factor. 
The results indicated that the aggressive stimulation 
factor was significant, F(1,56) =5.78, p< .02, 
while neither the other factor (study replication) 
nor the interaction was significant (F < 1.0). The 
absence of a significant interaction suggests that the 
Pattern of results was similar in the two studies 
for the effect of the rifle/vengeance condition versus 
the control condition. 
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tentatively suggest that the presence of a rifle 
in an aggressive context (like the bumper 
sticker) for some male subjects may pro- 
duce inhibitions rather than stimulate more 
aggression. 


Female Drivers 


The planned contrasts (A, B, C) were also 
applied to the cell means for female subjects. 
Neither Contrast A (F < 1.0) nor the de- 
viation Contrasts B+C, F(2,59) = 1.92, 
p < .20, were significant. The differences be- 
tween conditions also were not significant 
when female subjects were divided by the 
age of their vehicles (new and old). However, 
the results for the females must be interpreted 
cautiously, since there were fewer subjects 
in any condition; thus, the condition differ- 
ences might be significant with sample sizes 
as large as those obtained for the male sub- 
jects. The lower frequency of female subjects 
resulted from the fact that most drivers were 
males, at least on Saturdays. 


GENERAL Discussion 


The primary reason for employing a natu- 
ralistic paradigm in the present research was 
to explore the possibility that laboratory pro- 
cedures could be extended to a setting where 
subjects were unaware that they were being 
studied. While there are many advantages to 
naturalistic studies, one disadvantage results 
from the fact that it is difficult to obtain va- 
lidity or manipulation checks from subjects to 
determine their perceptions of the experimental 
treatments. For example, it was not possible in 
the present research to assess directly any 
effects of the vengeance bumper sticker on 
subject’s perception of the rifle or to measure 
independently possible differences in inhibi- 
tions produced by the victim visibility or 
vehicle age variables. As a consequence, any 
inferences about possible mediating principles 
can be offered only very tentatively, since 
alternative interpretations can be offered for 
the present results. Additional research is re- 
quired before any firm conclusions are war- 
ranted about the present manipulations and 
the dependent measures. 

The results of a survey and two naturalistic 
experiments in the present research tenta- 
tively suggest that findings somewhat analo- 
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gous to laboratory research on aggression 
can be produced in a naturalistic setting. For 
example, male subjects in Study 2 were more 
likely to honk at a victim when he was not 
visible. Similarly, Milgram (1965) found in 
a laboratory setting that subjects were more 
likely to harm a victim who was not visible. 
Zimbardo’s (1969) construct of dehumaniza- 
tion provides one possible explanation for the 
effects of victim visibility. According to the 
construct, inhibitions against harming a vic- 
tim are lowered in the absence of cues which 
“humanize” a victim. The present curtain 
manipulations might have “dehumanized”’ the 
victim by removing visual cues from him 
which might have reduced possible inhibi- 
tions against horn honking as an aggressive 
response. 

However, there is another interpretation 
possible for effects of the curtain manipula- 
tion. For example, the horn honking subse- 
quent to obstruction at the light might be 
interpreted better as a “signal” response 
rather than as aggression. Since the subjects 
could not see the driver-confederate when the 
curtain was closed, they might have thought 
that he was being inattentive at the light. 
Thus, they might have used their horns to 
signal that the light had changed. Anecdotal 
evidence suggests that drivers often honk at 
others to attract their attention or to warn 
them about some danger. This alternative 
interpretation of the horn honking measure 
cannot be dismissed. Nevertheless, the pattern 
of findings (including the results of the 
survey in Study 1) suggests that drivers may 
become frustrated and angry at other drivers, 
and this anger or frustration can lead to vari- 
ous hostile reactions such as light flashing, 
swearing, or hand gestures. Presumably, horn 
honking might also be perceived as an aggres- 
sive response by subjects, especially in the 
presence of aggressive stimuli. 

Male drivers of new vehicles in Study 2 
gave more honks when they were exposed to 
the rifle/vengeance bumper sticker condition 
but only if they could not see the confederate. 
In one sense, the findings of Study 3 replicate 
the results for Study 2, since the honking 
rate in the rifle/vengeance condition was sig- 
nificantly higher (Contrast A) only for male 
drivers of new vehicles (the confederate was 
not visible for any subjects in Study 3). 
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In the present research, the 
bumper sticker condition was add: ) the 
rifle manipulation in an attempt to -xtend 
laboratory studies of an aggressive c< zt for 
stimuli (Berkowitz & Alioto, 19) 1a 
naturalistic setting. The vengeance iper 
sticker context was selected in an ati t to 
increase the salience of the aggresive cou. ota- 
tion for the rifle. Otherwise, it was po: ble 
that an aggressive meaning of the rifle w- ild 
not be salient in the gun rack of a pic! ip 
truck (e.g., it might be viewed as sporting 


equipment such as a fishing pole or skis). 
The present results are somewhat analogous 
to the results of Berkowitz and Alioto 
(1973), who found that the context in whic! 
a football game was presented played ai 
important role in determining what aggressiv‘ 
reactions followed the game. When the filmed 
game was characterized as a “grudge match’ 
as opposed to a sporting event, subjects were 
more likely to perceive an aggressive meaning 
to the football player’s actions, and they were 
more likely to shock a partner who had prev « 
ously insulted them. Similarly, the vengea 
bumper sticker might have modified h- n 
honking responses in the presence of the » de 1 
because its aggressive meaning was m: re 
salient. 

One important finding in Study 3 was ine 
strong individual differences in subjects’ re: c- 
tions. Although male drivers of new vehicles 
with nonvisible victims honked more whe i 
exposed to both the rifle and the vengear -e 
bumper sticker, the higher rates did jot 
occur for all subjects, When male drivers of 
new vehicles could see their victim, or wien 
male drivers of old vehicles and female drivers 
were exposed to the rifle/vengeance condition, 
they did not honk more. One possible expla- 
nation for the lower honking rates is that 
these subjects might have inhibited horn 
honking responses, especially in the presence 
of the rifle/vengeance condition. For example, 
if male drivers of old vehicles perceived them- 
selves to be of lower status than the con- 
federate, they might have inhibited horn 
honking as an aggressive response due to fears 
of retaliation from the high status driver in 
front of them. The results suggest the pos- 
sibility that the presence of aggressive stimuli 
might lead to lower levels of aggression from ~ 
many individuals due to inhibitions about á 
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engaging in aggressive behavior. Hence, there 
might be important limitations on the gen- 
eralizability for the effects of aggressive 
stimuli on horn honking responses and, pos- 
sibly, on other aggressive and antisocial 
responses, 

As with the victim visibility manipulations, 
there are several possible explanations for the 
present aggressive stimulus manipulations. 
For example, the rifle/bumper sticker combi- 
nation might have served as a classically con- 
ditioned aggressive stimulus which elicited 
aggression (Berkowitz & LePage, 1967); it 
might have served as a retrieval cue (Tulving 
& Thomson, 1973) to remind subjects of pre- 
vious experiences with aggressive stimuli (e.g., 
violent portrayals in the mass media), or it 
might have served as a cue which changed 
the subject’s perceptions of the aggressive 
meaning of their responses (Berkowitz & 
\lioto, 1973; Berkowitz & Turner, 1974). 

Since there are alternative explanations for 
the present findings which cannot be dis- 
missed, no firm conclusions can be offered 
about which principles best explain the results 
until additional research is completed. Still, 
the present research provides procedures 
which might be used to extend laboratory 
esearch to naturalistic settings where sub- 
‘ects do not know that they are being studied. 
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Territoriality and Control: A Field Experiment 


Julian J. Edney 
Arizona State University 


The relationship between territoriality and control was examined by compar- 
ing the behavior of college students in their own dormitory rooms with 
visitors to those rooms, when control dynamics between these residents and 
visitors were artificially manipulated. Results indicate that territory residents 
experienced more passive (resistive) control than visitors; residents also saw 
their territories as more pleasant and private than visitors there, and were 
perceived by visitors to be more “at home.” Visitors tended to attribute 
behavior on the territories to different causes (mainly personality) than resi- 
dents in the same room. A second kind of control (active control, uncorrelated 
with passive control) was found to relate to other forms of spatial behavior— 
crowding tolerance, interpersonal distance, perceived room size, and minimum 


space needs per person. 


One interesting aspect of territorial behav- 
ior among animals is that it shows dominance 
(a social behavior) to be affected by geo- 
graphic factors. There have been several 
demonstrations of the enhanced fight-winning 
capacities of territorial individuals when they 
are on home ground; in fact, winning or 
losing a fight is often better predicted by 
which individual is on home territory than 
by which is bigger and stronger (e.g., 
Braddock’s, 1949, demonstration with fish). 
Likewise among monkeys (Leary & Moroney, 
1962) territory often acts as a social equalizer 
for low-status individuals by enhancing the 
dominance rank of any individual at home. 

Just how closely one can draw the parallels 
between animal and human territoriality is 
still an open question. There is a saying that 
a man’s home is his castle, implying that on 
home territory a person does enjoy a special 
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sovereignty that he may not elsewhere 
Casual observation also shows that people 
often behave more unconstrained, relaxed, 
and at ease at home; as visitors in somebody 
else’s territory they are often constrained, 
restricted, and circumspect. 

A few empirical studies have addressed t’ s 
territory-dominance phenomenon in hum: s, 
with mixed results. Esser, Chamber) n, 
Chapple, and Kline (1965) observed a w rd 
of 22 mental hospital patients over 16 we ks 
using time-sampling methods; they rel ed 
patients’ territorial behavior (defined as :t- 
tachment to one particular location on tre 
ward for more than 25% of the observa- 
tions—an “exclusiveness of use of place’ 
criterion) to their dominance behavior (c °- 
fined by a combination of the number of 
social contacts a person made, whether he 
initiated these, and their duration). Generlly 
speaking, inmates in the top third of the domi- 
nance hierarchy moved freely over the ward 
and defended no personal territories. ‘The 
bottom third tended to claim spaces in 
remote areas of the ward; the middle third 
claimed larger territories in more centra! loca- 
tions. Using the same paradigm, Esser (1968) 
later studied a group of hospitalized children 
for 6 weeks. A relationship between domi- 
nance (defined by degree of social contact and 
by staff reports) and territoriality emerged 
which weakly resembled the previous finding. 

In a similar setting Esser (1970) again 
studied the spatial and dominance behavior 
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> of 20 adults over 6 weeks. Here, territorial- 


- ity was defined as occupation of a particular 


space for more than 15% of the observations 
and standing ground against dominance at- 
tempts of higher-ranking patients. Territori- 
ality was shown by 7 of the 20 patients by 
these criteria; these 7 men won 85% of all 
acts of dominance on home territory, and 
55% of those off home ground. Later, Paluck 
and Esser (1971) noted the spatial behavior 
of three small groups of aggressive, retarded 
boys in free play areas over 3 weeks; an 
“exclusiveness of use of places” criterion was 
again used for territoriality. Normally these 
boys were reprimanded for aggressive behay- 
ior; and while this proved effective in some 
cases, it did not reduce aggressive behavior 
related to defense of territory. 

A final study of Esser’s (1973) involved 
17 institutionalized boys over a period of 25 
weeks. In this study, using similar criteria, 
territory holders were generally found to hold 
higher dominance ranks in the group than 
nonholders (contrary to Esser et al., 1965). 

Noting discrepancies among Esser’s find- 
ings, Sundstrom and Altman (1974) investi- 
gated dominance-territory relationships over 
10 weeks in a residential facility for 23 male 
juvenile delinquents. Using two operational 
definitions for territoriality, (a) a person 
limiting his space ‘usage to one or a few places 
and (b) an area being used more or less 
exclusively by one person (formulas were 
provided for these measurements), their re- 
sults showed a variable relationship between 
dominance and territoriality: A positive asso- 
ciation appeared during the first of three un- 
equal time periods, and a (nonsignificant) 
negative association in the final period. 
Dominance was defined here as a relationship 
in which one person had the ability to 
influence another. 

De Long’s (1970) study, deviating from 
the usual paradigm, involved a college semi- 
nar group for 16 weeks. Individual member’s 
dominance ratings (based on “demonstrated 
leadership ability”) correlated nonlinearly with 
their positions around the seminar table (par- 
ticipant’s territories were determined accord- 
ing to three related criteria, reflecting pre- 
dominant use of a particular position); the 


1109 
correlations also changed with shifts in 
leadership within the group. 

It is not known whether the general un- 
reliability of dominance-territory relation- 
ships in these studies stems from the problems 
of defining the respective concepts, other 
methodological problems (e.g., the use of un- 
usual subject populations), or from the ab- 
sence of any such reliable relationships among 
humans at large in the first place. We pro- 
pose that part of the problem is conceptual: 
Instead of dominance, control may prove to 
be more meaningfully related to territory. 
Control is a concept related to dominance, 
but broader; instead of referring only to an 
active, social behavior it accommodates the 
influence a person has over other people, 
inanimate spaces, and even ideas, and in both 
active (initiating or offensive) and passive 
(resistive or defending) ways. Since most of 
the studies conducted to date have dealt 
exclusively with that subset of control 
characterized by active interpersonal behav- 
ior, their scope has been unnecessarily narrow 
and their findings may therefore have been 
influenced by idiosyncratic aspects of the 
settings involved. 

The relation between territoriality and con- 
trol has been mentioned by a number of 
writers. Sommer (1966) states “| territory] is 
an area controlled by an individual, family, 
or other face-to-face collectivity.” Goffman 
(1961) says: “personal territory [is a] pri- 
vate claim on space, where the individual 
develops some comforts, control, and tacit 
rights. . . .”” Roos (1968) says: “People . . . 
use their territories to increase power, com- 
fort, and control.” Proshansky, Ittleson, and 
Rivlin (1970) say: “Territoriality [is] de- 
fined as achieving and exerting control over 
a particular segment of space.” For the latter, 
control is instrumental in achieving a funda- 
mental goal of humans: freedom of choice in 
their environments. 

Empirical validations of the territory- 
control relationship are lacking, however, and 
one focus of the present study is to test 
it, using a paradigm involving residential 
territories of college students and simultane- 
ous manipulations of control dynamics. Since 
relatively little is known about other aspects 
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of human territoriality, a second purpose of 
this study was to gather exploratory data 
through several dependent measures. Of par- 
ticular interest was the relationship between 
territory and other forms of spatial behavior 
(crowding, interpersonal distance). It has 
been previously proposed that control is a 
psychological variable influencing both crowd- 
ing tolerance (Zlutnick & Altman, 1972) and 
interpersonal distance (Duke & Nowicki, 
1972), presumably because it enables the 
individual to handle the stresses of interper- 
sonal closeness. If territory provides control, 
it follows that people should tolerate more 
of a crowd and shorter interpersonal distances 
when on home ground than when not, In 
other words, control may be a psychological 
factor linking territory, crowding, and inter- 
personal distance together. 


METHOD 
Subjects and Settings 


One hundred sixty male undergraduates at Yale 
participated in partial fulfillment of a course re- 
quirement. Subjects were run in pairs; 60 of the 
pairs were run in as many different dormitory rooms 
and each pair was comprised of a resident (subject 
in his own dormitory room) and a visitor. Thus 
the basic unit of study was the behavior differences 
between members of a pair. The remaining 20 pairs 
comprised a separate experimental group which met 
in a comparable room belonging to neither subject. 
Data from 6 pairs were discarded before analysis 
either because of serious interruptions in the ses- 
sion or because the assistant (see below) did not 
follow the procedure correctly. Extra pairs were run 
to equalize cell ns. 


Procedure 


Territorial residency manipulation. Members of 
each subject pair were initially unknown to one 
another, One (the future “resident”) was randomly 
selected from each pair of names on the experi- 
ment sign-up sheets, telephoned, and asked to make 
his dormitory room available for the study (these 
rooms were normally shared; each resident had the 
responsibility of asking his roommates to leave for 
the experiment).1 The other member of the pair 
was then phoned and asked to report to the resi- 
dent’s dormitory room for the study (initially de- 
scribed as a study in housing problems). This pro- 
cedure established 60 resident-visitor pairs altogether. 

Control manipulation. An aide met each resi- 
dent-visitor pair at the dormitory room to be used 
for that session (to blind the experimenter to resi- 
dent/visitor statuses of the subjects) and assembled 
both subjects outside in the adjacent hallway. The 
experimenter then took over; he told the pair he 
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would need one of them to “double as an 
to help run the session, It was explained 
assistant would need complete cooperation 
other subject—in fact the other subject wo 
to sign a release form (read to the pair 
point) to indicate his acceptance of the 3 
position of authority and his willingness 
orders from the assistant. As each pair agreed 
arrangement, the experimenter tossed a coin 
termine who would get the assistant’s r 
winner was then given a short booklet of 
tions which would enable him virtually to rur e 
experiment by himself. (The booklet was array 
in simple flowchart format, and given a few minui: 
familiarity with it assistants were, in fact, able to 
lead sessions successfully.) The toss winner, who was 
either resident or visitor, spent 5 minutes alone 
with the booklet in the dormitory room for fa- 
miliarization with the procedure to come; the loser 
remained in the hallway with the experimenter and 
was requested to reread and sign the release form 
In sum, one subject in each pair was placed in 
control over the other. This procedure was used for 
60 of the pairs. For those pairs in dormitory rooms, 
this meant that either a resident controlled a visitor, 
or vice versa; for the 20 visitor—visitor pairs meet- 
ing in a room belonging to neither, one was also 
placed in control over the other. As a baseline group, 
an additional 20 resident-visitor pairs (see Figure 
1) met at dormitory rooms as before, but neith: 
subject was placed in control (i.e., the experimen 
ran these sessions without an assistant 
Sequence of events during each session, The co 
troller (assistant) led the session by directing 
other subject through a number of steps, many i 
which pertained to separate dependent measure: 
the experiment (however, the controller comple! d 
the same dependent measures as the controllee). \ 
number of deliberately arranged events established 
and maintained the assistant’s controlling role; these 
were introduced into the session by being writter 
into the assistant’s instruction booklet: (1) The cor 
troller (assistant) called the other subject into the 


1 Grateful thanks go to the 120-odd roommates 
who were kind enough to cooperate by comply: 
with this request. 


2 In this condition, both resident and visitor were 
simultaneously handed a length of string after the 
introduction and instructed to claim nonoverlapping 
portions of the space by entering the room, one at 


a time, and laying their strings across the floor, 
Which subject should enter the room first was un- 
specified by the experimenter, and it was noted 
who initiated this event. Subjects in this condition 
then completed the same dependent measures as 
those in the other conditions, the experimenter di- 
recting the sequence instead of a controller. Results 
showed that neither residents nor visitors in this 
condition entered the room first a significantly 
greater number of times. However, residents did 
appear to control this choice more frequently; they 
either entered first or told the visitor to enter first 
18 out of 20 times ($ < .001). 
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room to hand over the release form, then sent him 
back outside; (2) the controller called the other 
subject back and ordered him to move two pieces of 
furniture or decor across the room (reinforcing his 
order-giving role) then sent him back outside; (3) 
the controller established control spatially by claim- 
ing more than 50% of the room with a string 
which divided the floor area. He then directed the 
other to claim a personal area from the remaining 
space (always less than 50%). Having established 
his role, the controller now led the dependent mea- 
sures sequence: (4) First he cleared the floor of 
the string dividers, directing the other to help. At 
this point the experimenter directed a two-part 
interpersonal distancing test: (a) one subject was 
randomly selected and distanced himself comfortably 
from the other by approaching toward the other’s 
stationary standing position in the center of the 
room; (b) subjects then reversed positions for a 
second test. Final distances were marked with paper- 
clips which subjects placed by their toes after each 
approach; this enabled the experimenter to stay 
outside the room, (5) The controller supervised the 
replacing of room-dividing strings (see 3, above), 
gave a self-report questionnaire to the other subject 
and himself, and told the other subject to sit down 
and fill out the questions. The controller then started 
on an identical questionnaire himself. 

Dependent measures. These fell into two groups: 
spatial measures and evaluative, attitudinal, and per- 
ceptual measures. Spatial measures included the 
following: (a) interpersonal distance (see above) ; 
(b) density tolerance (a questionnaire item asking 
for an estimate of the maximum number of people 
that could occupy the room without overcrowding) ; 
(c) an estimate of room dimensions; and (d) mini- 
mum space needs per person (estimated room area/ 
maximum number of people before overcrowding). 
Evaluative, attitudinal, and perceptual measures 
were included to tap subjects’ ideas about them- 
selves, each other, and the territory itself, such as a 


TABLE 1 


ITEMS FROM SELF- AND OTHER RATINGS AND THEIR 
Loapincs on THE “AT Home” Factor 


Item Loading 
Relaxed-Tense -78 
Predictable-Unpredictable 
Powerful-Weak 
Free/Unconstrained-Restricted/Constrained -12 
Aggressive-Retiring 
Secure-Insecure .59 
Warm-Cold .54 
High Status-Low Status 
Organized-Disorganized 
Initiating-Hesitating 
Directing-Following 

Strong Identity-Anonymous 
Impulsive-Reserved 54 
Uninhibited-Inhibited 81 
In Control-Lacking Control (over situation) 
In Control-Lacking Control (over the other) 


1111 
TABLE 2 

TreMs FROM THE Room RATINGS AND THEIR LOADINGS 

Loading 

Item Fi 
Factor 1| T i reels 
Pleasant smy: Privacy. 

Good-Bad 72 
Valuable-Worthless -60 35 
Pleasant-Unpleasant T3 36 
Active—Passive 75 
Calm-Agitated =,39 A4 
Strong—Weak 78 
Light-Heavy 
Large-Small 71 
Inviting—Hostile -60 
Individualized-Anonymous | .42 37 55 
Familiar-Unfamiliar JA 
Friendly-Unfriendly 7 
Private—Public 83 
Reassuring-Unsettling 74 
Stimulating—Tranquilizing 77 
Unusual-Commonplace 61 
Liberating-Constraining ot 
Integrating—Disorganizing 


number of 9-point bipolar dimensions included on 
the questionnaire (see Tables 1 and 2). Second, an 
exploratory measure was also included to determine 
if subjects differed in their explanations of their 
behavior in the settings: Each subject was asked to 
weigh five explanations for his own and the other 
subject's behavior: (a) reaction to the other person, 
(b) the surrounding room, (c) own personality, 
(d) being in an experiment, (e) other. Finally, 
eight items were selected from the control scale of 
Schutz’s (1970) FIRO-B personality test on the 
basis of representativeness and face validity. Four 
of these items measured “active” control (Schutz’s 
“expressed” control, or the degree of outward con- 
trol a person exhibits) and four measured “passive” 
control (Schutz’s “needed” control, or the degree 
one resists the control attempts of others). Scores 
tapping these two types of control were virtually 
uncorrelated (r=.07) indicating separate dimen- 
sions. 

A number of other dependent measures, paper- 
and-pencil items, were originally included for ex- 
ploratory purposes. However, these did not provide 
additional significant information and are therefore 
not discussed in this report. A full description of 
the measures can be found in the original disserta- 
tion manuscript (Yale University). 

In summary, the experiment design involved 80 
pairs of subjects. Sixty of these pairs were run in 
personal dormitory rooms; members of these pairs 
were either residents or visitors (cells 1 through 6, 
Figure 1). An additional 20 pairs (cells 7 and 8) 


8Scores from each set were later split at the 
median (see Results) for the purposes of supple- 
mentary analyses. 
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Resident 


Visitor 


Resident 
in Control 


7 | Controller 


Visitor 
in Control 


8 | Controlled 
No 
Controller 


Ficure 1. Design of the experiment. 


met in a room belonging to neither. One (baseline) 
condition (cells 5 and 6) involved dormitory rooms 
but no controlling subject; in all other conditions, 
one subject controlled the other. Main effects for 
territorial residency were analyzed by comparing 


cells 1, 3, and 5 versus cells 2, 4, and 6, Main 
effects for control emerged from comparison of 
cells 1, 4, and 7 versus cells 2, 3, and 8. In all ses- 
sions, data from subjects run together were assumed 
to be correlated and in the analysis their responses 
were treated as repeated measures. Pairs of subjects 
were, of course, nested within each row of Figure 1 
and within cells 7 and 8. 


RESULTS 
Territorial Residence 


A number of differences emerged between 
residents and visitors, regardless of who was 
assigned the controller’s role. These are sum- 
marized in Table 3. 

Residents indicated on the passive FIRO-B 
control items that they characteristically 
allowed others to control them less than visi- 
tors did. Since pasive control is defensive in 
nature, it is reasonable that this is the type 
of control linked to territory. It should be 
recalled, however, that the FIRO-B was a 
trait measure, not a situational one (see 
below). 

On the “other-ratings’ (see Table 1), 
which were factor analyzed, visitors generally 
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saw residents as being higher on 
(composed of the dimensions relaxe 
secure, impulsive, warm, and unin) 
than residents saw visitors. Given the 
cal context, this factor was label ‘at 
home.” No complementary differences : lf- 
perceptions occurred, however. 

Residents also differed from visitors in © eir 
perceptions of the physical territory, ne 
room-evaluation dimensions (see Table ) 
were factor analyzed; they revealed t 
residents saw the rooms as more “pleasant 
(Factor I) than visitors and more “private” 
(Factor II). 

Finally, on the attribution measures, visi- 
tors weighed personality as an explanation— 
for both their own and resident’s behavio: 
—more heavily than did residents. Residents 
tended to attribute their behavior more to 
the room and the attribution (about residents 
only) was supported by visitors. 


ctor 
free, 
ted) 


Control Manipulation 


Controllers saw the physical territory a 
being more “pleasant” (Factor I, Table 
4.37 vs. 4.98; t(59) = 3.06, p< .005) aii 
more “stimulating” (Factor II, Table 
4.82 vs. 5.30; #(59) = 2.74, p<.01) tha 
did controlees. Presumably these effects w 
due to projected feelings on the controll: 
parts. 


Control and Spatial Measures 


Neither the direct manipulation of contri 
nor the territorial residence manipulation ho 
straightforward effects on the spatial mci- 
sures apart from one: Visitors saw the ter:i- 


TABLE 3 
MAIN EFFECTS FOR TERRITORIAL RESIDENCE 
Effect 
Measure Es 
Resident M Visitor M F df | p 
Passive control 15.78 14.26 5.59 1,57 05 
Other ratings 
Appearing to feel “at home” 5.25 4.80 4.51 1,57 05 
Explanations for behavior 
Personality (of both subjects) 5.12 6.36 14.10 RTA L005 
Room (own behavior) 4.43 3.65 4.6 DS .05 
Room (other’s behavior) 4.17 5.02 44 IESI .05 
Room evaluations 
Pleasant (Factor I) 4.88 3.82 23.0 1, 57 001 
Privacy (Factor IV) 441 3.85 5.6 E S7 05 
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ies as capable of holding larger crowds 
san did residents (8.21 vs. 6.63 people, 
(1,57) = 5.47, p<.05). Since it was 
vought that being on home ground might 
rovide a sense of control to residents, who 
uld then tolerate greater crowding and 
maller interpersonal distances, this finding 
vas unexpected. 

Individual personality differences in control 
(measured by the FIRO-B) were next ana- 
lyzed for their effects on spatial behavior. 
An internal analysis using a median split on 
the active control items showed (a) high 
active controllers gave significantly larger 
crowd capacities for the rooms than did low 
controllers (8.29 vs. 6.18 people; #(158) = 
3.18, p < .01); (b) high active controllers 
saw minimum space needs per person (esti- 
mated room size/crowd capacity) to be sig- 
nificantly less than did low controllers, 31.35 
versus 47.27 square feet (2.92 vs. 4.20 sq. m), 
#(158) = 2.72, p< .01. These differences 
support Zlutnick and Altman’s (1972) and 
Duke and Nowicki’s (1972) suggestions on 
the relationship between personal space and 
control, 

In sum, territory is linked with a passive 
form of control, while other forms of per- 
sonal space (especially crowding) are linked 
with a separate, active form of control. 

Further’ internal analyses revealed that the 
two remaining spatial measures, interpersonal 
distance and estimated room size, were af- 
fected when the personality trait and the 
experimental manipulation reinforced each 
other. Table 4 shows that subjects high on 
active trait control who were also given the 
controller’s role tended to use shorter inter- 
personal distances than any other subjects, 
F(1,116) = 3.66, p < .059, The same pat- 
tern followed for estimates of the room size 
(here marginally significant, see Table 5). 


TABLE 4 


INTERPERSONAL DISTANCE 
(in inches) 


Role 
Trait control (active) 
Controller Controlled 
High 9.73 14.90 
Low 13.69 13.24 
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TABLE 5 


Estmatep Room Size 
(in square feet) 


Role 
Trait control (active) 
Controller Controlled 
High 170.9 198.7 
Low 234.1 186.2 


Finally, when high active trait controllers 
were put in a role consistent with their per- 
sonality dispositions, they tended to see the 
rooms as smaller than did others, F(1, 116) 
= 3.18, p < .078. 


DISCUSSION 


The results generally support the idea that 
control is broader than the unitary concept 
of dominance and that one form (passive 
control) can be articulated meaningfully to 
territory, while another form (active control) 
relates to other spatial behaviors. That these 
relationships involve mainly the trait forms 
of control is partly predictable from Zlutnick 
and Altman (1972) and Duke and Nowicki 
(1972), but it is interesting because it sug- 
gests situational or temporary determinants 
of control are relatively ineffective as modi- 
fiers of interpersonal spatial behavior (e.g., 
crowding). 

Why did the situational factor of territory 
affect a trait measure of passive control (see 
Results)? Two explanations suggest them- 
selves: (a) specifically passive control items 
appeared only in the FIRO-B measure, thus 
this was the only measure on which these 
differences could emerge; (b) the FIRO-B 
passive control items involved were phrased 
so as to ask respondents what they “charac- 
teristically” did. Even if both residents and 
visitors were characteristically passively con- 
trolling on home ground, the present environ- 
ment cued memories of this behavior better 
for residents, resulting in residents’ higher 
scores. 

A number of results add to our under- 
standing of the territorial phenomenon at the 
cognitive level. Regardless of who held con- 
trol in the situation, visitors tended to rate 
residents as feeling more at home than vice 
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versa. Residents also saw their territories as 
more pleasant and private than did visitors 
to the territories. 

Visitors tended to use “personality” to 
explain both subjects’ behavior on the terri- 
tory, while residents attributed their own 
behavior more to the room itself. These dif- 
ferences can be explained by reference to the 
meaning of the environment: To the resident 
at home, the environment was unique, the 
only place that was Ais with all the special 
connotations of a home. To the visitor, the 
same room was merely one of a multitude 
of different environments qualifying as for- 
eign territory. It therefore lacked distinctive- 
ness and significance, making it unlikely to 
be selected as a special cause of behavior. 
Hence visitors weighted that alternative 
lightly and focused more on the individuals 
in the situation and their personalities. These 
differences in attribution of behavior are sig- 
nificant because they raise the question of 
ecological validity in previous attribution 
literature (see Nisbett, Caputo, Legant, & 
Maracek, 1973; Storms, 1973). If a person’s 
explanations of behavior depend partly on his 
geographic location, this limits the generaliz- 
aility of laboratory studies. 

Relating the present territory-control find- 
ings to previous empirical work (see Altman 
& Haythorn, 1967; Altman, Taylor, & 
Wheeler, 1971; Cheyne & Efran, 1972; 
Edney, 1972a, 1972b; Edney & Jordan- 
Edney, 1974; Efran & Cheyne, 1973, 1974; 
Roos, 1968; Sommer & Becker, 1969), it is 
significant that none of the latter have suc- 
cessfully used spontaneous dominance at- 
tempts (in the sense of unprovoked aggres- 
sion) as a criterion of territoriality, even 
though territoriality has been traditionally 
linked to aggression (cf. Ardrey, 1966; 
Lorenz, 1969). In the very few (and statisti- 
cally nonsignificant) cases where active ag- 
gression has been involved in territoriality 
(e.g., Esser et al., 1965) it has always ap- 
peared as a defense, thus falling under our 
rubric of passive (i.e. resistive) control. 
Dominance, in the sense used by Esser et al. 
(1965), Esser (1970), Esser (1971), Esser 
(1973), Sundstrom and Altman (1974), and 
De Long (1970), is difficult to classify as 
it was based on behaviors such as “social 
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contact” and “leadership ability? which may 
include both active and passive control, a fact 
which probably contributed to the variation 
in their findings. 

Finally, data from the present crowding, 
interpersonal distance, and minimum-space- 
per-person measures generally indicated that 
the more a person was controlling, the less 
personal space he needed. This is paradoxical 
because part of our method of establishing 
control required certain subjects to claim, 
with string, larger spaces in the room than 
the other subjects; this manipulation proved 
effective since space claimed by this method 
correlated (r = .44, p < .01) with amount of 
control as judged by the other subject in each 
pair. One possible answer to thi ontradic- 
tion (suggested by Stokols, 1972) is that 
dynamics of personal space follow analo- 
gously to those of personal freedom: ‘The less 
one has, the more one wants—a Spatial ana 
logue, in fact, to reactance theory (Brehm, 
1966). Research is currently being developed 
to test. this hypothesis. 
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Overjustification, Competing Responses, and the 
Acquisition of Intrinsic Interest 


Steven Reiss and Leonard W. Sushinsky 
Universily of Illinois at Chicago Circle 


The overjustification hypothesis proposes that expectation of reward for an 
inherently interesting activity produces less interest in the activity when re- 
ward is subsequently unavailable. Two experiments designed to test this hy- 
pothesis are reported, Experiment 1 replicated previous findings that a single 
trial of noncontingent, promised reward undermines young children’s interest 
in the rewarded play activity. Experiment 2 discenfirmed the overjustification 
hypothesis by demonstrating that a multiple-trial, contingent reinforcement 
procedure did not produce an overjustification effect. Both of these findings 
are understandable in terms of a competing response hypothesis but not in 
terms of overjustification. Accordingly, previous findings of an overjustification 
effect are reinterpreted as having demonstrated that curiosity in a novel play 
activity can be undermined by presentation of distracting stimuli, The follow- 
ing additional conclusions are drawn: (a) The overjustification hypothesis is 
conceptually vague, (b) self-perception theory seems to have been concerned 
with variables relatively unimportant for understanding effects of tangible 
reward procedures on overt behavior, and (c) recent criticisms that token 
economies undermine intrinsic interest were based on a failure to appreciate 
differences between single-trial, noncontingent reward and multiple-trial, con- 


tingent reinforcement procedures. 


Several theorists have proposed that tangi- 
ble reward procedures undermine intrinsic 
interest in rewarded activities (e.g., Calder 
& Staw, 1975; deCharms, 1968; Deci, 1971, 
1972; Lepper, Greene, & Nisbett, 1973; 
Macmillan & Forness, 1970). If these proce- 
dures have this effect, it is obviously impor- 
tant that the empirical conditions under 
which the effect occurs be identified. Other- 
wise, unwarranted criticism of tangible re- 
ward procedures can have the practical conse- 
quence of discouraging the use of effective 
methods for modifying behavior and allevi- 
ating human suffering. 
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Recent tests of the overjustification hy- 
pothesis have provided data seemingly rele 
vant to this issue (Lepper & Greene, 1975; 
Lepper et al., 1973; Lepper, Greene, & 
Sagotsky, Note 1; Ross, Note 2). This hy 
pothesis was derived from self-perception 
theory and is based on a distinction between 
intrinsic and extrinsic motivation. Intrinsic 
motivation is defined as engaging in an 
activity for “its own sake,” while extrinsic 
motivation is defined as engaging in an activ 
ity to obtain reward or to avoid punishment. 
Accordingly, playing with a toy without 
expectation of reward exemplifies intrinsic 
motivation, while playing with a toy with 
expectation of reward exemplifies extrinsic 
motivation. The overjustification hypothesis 
predicts that subjects who expect reward for 
intrinsically motivated behavior will reattrib- 
ute the cause of their behavior from internal 
(e.g, “I engage in this activity because I 
like it”) to external factors (e.g., “I engage 
in this activity to obtain reward”). As a 
result, intrinsic motivation is decreased; that 
is, when rewards are no longer available, 
subjects will engage in the activity less often. 
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overjustification hypothesis specifies 
properties of reward procedures that 
ease intrinsic interest. The reward con- 
zencies must be salient, unambiguous, and 
cient to explain to the subject why he 
formed the behavior, and the reward must 
expected (Lepper et al., 1973). Under 
ese conditions, an inferential process result- 
z in reattribution of behavior to reward is 
edicted. Lepper et al. tested this hypothesis 
1 a three-phase experiment with nursery 
school children. Phase 1 was conducted during 
free-play periods in the subjects’ classroom 
and served as a pretest of initial interest in 
the experimental activity of drawing with 
novel crayons. During Phase 2, subjects were 
induced to draw in an experimental room 
under one of three conditions: Subjects in the 
promised reward condition were shown and 
promised a tangible reward prior to com- 
mitting themselves to draw, and they received 
reward at the conclusion of the activity; sub- 
jects in the unexpected reward condition were 
neither shown nor promised a tangible reward 
prior to committing themselves to draw, but 
they received reward at the conclusion of the 
activity; and subjects in the control condi- 
tion were neither shown nor promised reward 
prior to drawing, nor did they receive reward 
at the conclusion of the activity. Experi- 
mental Phase 3 served as a free-play post- 
test of interest in the experimental activity 
and was conducted in the subjects’ classroom. 
As predicted by the overjustification hypothe- 
sis, children in the promised reward condition 
spent less time drawing during posttest than 
did children in both the unexpected reward 
and the control conditions. This finding, to 
be termed the “decreased play effect,’ was 
replicated in similar studies by Lepper and 
Greene (1975) and by Lepper et al. (Note 1). 
The Lepper et al. (1973; Note 1) studies 
have been interpreted as providing important 
support for the overjustification hypothesis 
(cf. Bem, 1972; Calder & Staw, 1975). 
Nevertheless, the results of these studies are 
open to alternative explanations, as the fol- 
lowing comments indicate: 


1. The Lepper et al. studies provided in- 
sufficient evidence that expectations of reward 
were elicited and no direct evidence that sub- 
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jects learned to attribute play to reward. 
Rather, these cognitive processes were as- 
sumed to have occurred primarily on the basis 
of the verbal instructions promising reward. 
This assumption is questionable because some 
subjects, particularly those who were only 40 
months old, may have misunderstood the 
instructions (cf. Caldwell & Hall, 1970; 
Gruen, 1966). 

2. The Lepper et al. studies confounded 
presentation of a potentially distracting 
stimulus with verbal instructions defining this 
stimulus as a promised reward. As far as the 
present investigators could determine, the 
decreased play effect has been produced only 
under conditions in which a salient and poten- 
tially distracting stimulus—such as the re- 
ward (Lepper et al., 1973; cf. McCullers & 
Martin, 1971; Miller & Estes, 1961; Spence, 
1971), attractive toys (Barker, Dembo, & 
Lewin, 1941), a television with lights (Lepper 
& Greene, 1975), or an attractive box (Ross, 
Note 2)—was exposed to experimental sub- 
jects prior to and/or during experimental 
training. Moreover, Ross found that promise 
of reward without exposure to a tangible 
stimulus did not decrease interest in the play 
activity, although promise of reward with 
such exposure did. Thus, the Lepper et al. 
studies could be interpreted as effects of 
salient and potentially distracting stimuli. 

Previous investigators may have ignored 
distraction as a possible explanation of the 
overjustification effect because the distracting 
stimuli presented during experimental train- 
ing were not present during free-play post- 
tests. The underlying assumption might have 
been that unlike the effects of verbal instruc- 
tions, the effects of distracting stimuli dis- 
appear when the stimulus is withdrawn. This 
assumption is implausible; for example, mem- 
ory is not reestablished when interpolated 
distracting stimuli (e.g., nonsense syllables) 
are withdrawn. 

3. The Lepper et al. studies confounded 
different practice experiences with experimen- 
tal training conditions. Specifically, compared 
with other subjects, promised reward subjects 
were found to have practiced hurried and/or 
low-quality play prior to reward. Practiced 
response topologies often become learned 
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characteristics of the response (Logan, 1972) .1 
Thus, subjects who practice hurried and/or 
low-quality play on a novel activity prior to 
reward, may be likely to exhibit hurried 
and/or low-quality play following reward. 
Since subjects who practice hurried or low- 
quality play would seem likely to find play 
less enjoyable, the decreased play effects re- 
ported by Lepper and his colleagues could 
be explained without appeal to cognitive 
learning consisting of reattribution of play 
from intrinsic to extrinsic factors.” 

Findings of an overjustification effect in 
young children can be explained more parsi- 
moniously by what is termed the “competing 
response hypothesis” (cf. Child & Water- 
house, 1952), This hypothesis proposes that 
after reward contingencies are terminated, 
children will be less interested in play activi- 
ties to the extent that responses are elicited 
that interfere with play behavior prior to 
termination of the contingencies. Promised 
reward procedures preexpose subjects to a 
salient rewarding stimulus either by verbal 
or visual means. Exposure to a salient reward- 
ing stimulus can elicit many responses that 
interfere with play. For example, play could 
be disrupted by perceptual distraction, cogni- 
tive distraction (e.g., thinking about reward), 
excitement in anticipation of reward (Miller 
& Estes, 1961; Sheffield, 1966), or frustration 


1As Logan (1972, p. 1057-8) has noted, “A 
person who practices a response in a sloppy or in- 
efficient manner learns to behave in a sloppy and 
inefficient manner.” 

“In related studies on expectation of reward with 
adult subjects (e.g, Deci, 1971, 1972), it is neces- 
sary to consider the possibility that experimental 
procedures offering reward were aversive. Culture 
and significant others provide considerable training 
concerning when tangible rewards are appropriate 
and when they are not. Consequently, offers of re- 
ward are insulting under some conditions, are re- 
garded as suspicious under others, and are sometimes 
perceived as manipulative. Thus, demonstrations 
that offers of reward can decrease an adult’s interest 
in rewarded activity can be interpreted in terms of 
the overjustification hypothesis, the competing re- 
sponse hypothesis, and the possibility that the re- 
ward procedure was aversive, unless the procedure 

is first demonstrated to produce a reinforcement 
effect. However, as yet, no study on overjustifica- 
tion has produced a reinforcement effect during 
training and a decreased interest effect during post- 
tests. 
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resulting from delay or withdrawal of ard 
(Barker et al., 1941). 

Under the competing response hyp is, 
the Lepper et al. (1973; Note 1) es 
are regarded as additional data demonst:: ‘ng 


mildly distracting effects of tangible re 1, 
Barker et al. (1941) had previously )- 
duced a decreased play effect by exposin, t- 
tractive toys to young children while 7 
were playing with less attractive toys. 
cause of their theoretical interests, Barke 

al. interpreted the decreased play effect a 
result of frustration. Child and Waterho: 
(1952) reinterpreted the Barker et al. stu: 
in terms of a competing response hypothesi 
More recently, McCullers and Martin (1971 
Miller and Estes (1961), and Spence (1971 
have all demonstrated and commented upor 
the distracting effects of tangible rewards. 

The capacity of a salient rewarding stimu- 
lus to elicit responses that interfere with on- 
going behavior can be assumed to habituate 
with repeated stimulus exposures (cf. Reiss 
& Wagner, 1972). When rewards are intro- 
duced, children often become excited and con 
sequently exhibit low-quality performances 
For this reason, it is not surprising that th 
overjustification effect has been found after 
single trial of reward in which hurried and /< 
low-quality performance occurred prior t 
reward. However, this effect should be le: 
likely to occur following repeated trials be- 
cause subjects’ initial excitement would be 
expected to subside. 

The competing response hypothesis doe 
not deny that expectation of reward can pre 
duce decreased play effects. This hypothesis 
views thinking about reward as only one of 
many possible responses that can interfere 
with play behavior. However, a critical dif- 
ference between the two hypotheses concerns 
whether or not multiple-trial, contingent re- 
inforcement procedures produce decreased 
play effects. Since the overjustification hy- 
pothesis predicts decreased interest regardless 
of whether expectation of reward is induced 
by verbal promises or by repeated trials, this 
hypothesis predicts overjustification effects 
for multiple-trial, contingent reinforcement 
procedures. However, since these procedures 
typically do not elicit responses that inter- 
fere with rewarded behavior, the competing 


| 


: 
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ise hypothesis does not predict a de- 

€ d play effect under these conditions. 
he present article reports two studies 
© ened to test for an overjustification effect 
owing termination of reward procedures 
t do and do not produce reinforcement 
cts, In Experiment 1, the promised reward 
cedures were designed to maximize the 
bability that the subject’s play behavior 
uld be disrupted prior to reward. Accord- 
igly, this study employed promised reward 
rocedures similar to those employed in the 
Lepper et al. studies. In Experiment 2, expec- 
tation of reward was induced by repeated 
operant trials of reinforcement. Accordingly, 
this study employed promised reward proce- 
dures typically employed in behavior modifi- 
cation situations. The competing response hy- 
pothesis predicted a decreased play effect in 
Experiment 1 but not in Experiment 2, while 
the overjustification hypothesis predicted a 

decreased play effect in both experiments. 


EXPERIMENT 1 


The primary purpose of Experiment 1 was 
to replicate the Lepper et al. (1973; Note 1) 
findings and to extend them to procedures 
that elicit expectation of reward without con- 
founding exposure to a tangible stimulus with 
verbal instructions that define this stimulus 
as a promised reward. This was accomplished 
in an experiment employing a 2 X 2 factorial 
design. During experimental training, subjects 
listened to a song under one of four condi- 
tions resulting from the pairing of two factors 
(a) preexposure or no preexposure to a tan- 
gible stimulus and (b) promise or no promise 
of reward. In the promised reward conditions, 
the verbal instructions were repeated until 
subjects could correctly answer questions 
concerning the nature of the reward contin- 
gency, thereby providing evidence that expec- 
tations of reward were in fact elicited. At the 
conclusion of the experimental activity, all 
subjects were rewarded and returned to their 
classroom. Three to five hours later, free-play 
posttests were conducted to assess subject’s 
interest in the experimental song. 

Under the competing response hypothesis, 
the following predictions were made. Since it 
seemed plausible that thinking about forth- 
coming reward would interfere with listening 
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to a song and given that expectations of 
reward (competing responses) were clearly 
elicited, it was predicted the subjects in the 
promise-preexposure condition would listen 
less during posttests than would subjects in 
the no promise—no preexposure condition, and 
a main effect of promising reward was also 
predicted. The former effect would replicate 
the Lepper et al. findings. Although the pri- 
mary purpose of this study was to produce 
decreased play effects by promising reward, a 
main effect of preexposure could be explained 
ad hoc by the competing response hypothesis 
but not by the overjustification hypothesis. 
A main effect of preexposure was not pre- 
dicted because the present methodology did 
not permit assessments of whether or not 
preexposure elicited any competing responses. 


Method 
Subjects and Experimenters 


The subjects, who were randomly assigned to 
experimental conditions, were 32 first-grade girls, 
aged 6 to 7 years. Both of the two experimenters 
and the two observers were female adults who 
were unaware of the study’s purposes. Each experi- 
menter trained half of the subjects in each experi- 
mental condition. Analysis of the data revealed that 
the results were not a function of the experimenter’s 
personality. 


Experimental Setting and Materials 


The study was conducted in a large room equipped 
with tables and chairs. During experimental train- 
ing, subjects listened to one of three songs, “Frosty 
the Snow Man,” “Rudolph the Red Nose Reindeer,” 
or “Jingle Bells.”® Each song was played by a 
different cassette tape recorder, onto which an 
appropriate drawing symbolizing the song had been 
taped. Play with an attractive doll served as a 
reward, while exposure of the doll prior to listening 
served as a distracting stimulus. A large timer, with 
both minute and second hands and a buzzer that 
sounded at zero time, served as a second distracting 
stimulus. 


Experimental Training 


The experimenter individually escorted the sub- 
jects to and from the experimental room. Upon 
arrival, each subject was seated at a table on 
which a cassette tape recorder had been placed. 
For subjects in the two exposure conditions, the 
doll and timer were also located on the table such 
that each subject could see but not touch them. If 


3 Extensive pretesting had revealed that these 
songs elicited approximately the same degree of 
interest in most young children tested. 
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promised reward instructions were to be given, the 
experimenter described the attractive properties of 
the doll and then elicited from the child a positive 
response that she would like to play with it. At 
this point, the timer and its buzzer were demon- 
strated, and the subject was told that she could 
play with the doll if she would listen to the song 
for 5 minutes. It was emphasized that when the 
5-minute period expired, the buzzer would go off and 
the subject could play with the doll, At this point, 
the subject was asked, “What will you win by listen- 
ing to [song’s name] ?” and “What must you do to 
win play with the doll?” If the subject answered 
either question incorrectly, the verbal instructions 
defining the reward contingency were repeated until 
both questions were answered correctly. Thus, the 
exposure-promise procedure was designed to maxi- 
mize the likelihood that the subject would wait for 
the buzzer to go off rather than listen carefully to 
and thus enjoy the music. However, for subjects in 
the exposure-no promise condition, the experimenter 
described the attractive qualities of the doll and then 
said that she had better put it away lest she forget 
to take it home. At this point, the doll was placed 
in a closet and the subject listened to the song while 
the timer “counted down” a 5-minute interval. 

For subjects in the two no exposure conditions, 
the doll and timer had been placed in a closet prior 
to the subjects’ entry into the room. Within these 
conditions, the promise and no promise procedures 
were essentially the same as those described above, 
with only minor modifications dictated by the ab- 
sence of the doll and timer. For example, when the 
experimenter described the attractive properties of 
the doll, she said, “I have a doll that can be 
dressed and undressed” . . . “rather than look at the 
doll you can dress... .” 

As noted, promised reward instructions were re- 
peated until the child could correctly state the con- 
tingencies. The mean number of times it was neces- 
sary to repeat these instructions was 2.4. 

After listening to the song for 5 minutes, the 
Subject was permitted to play with the doll for 3 
minutes and was then escorted back to her class- 
room, 


Posttests 


Five-minute posttests were conducted on the same 
school day, 3-5 hours following experimental train- 
ing. On these occasions, a recorder had been placed 
on each of three tables that were positioned so that 
a child sitting at one table could not hear the music 
from the other two recorders, Each recorder played 
a different song. Thus, the posttest was a free-play 
period during which the child could choose to listen 
to any of the three songs. 

One of the investigators escorted subjects to the 
experimental room in groups of four, East testing 
group consisted of one subject from each experi- 
mental condition, but all four subjects had been 
previously rewarded for listening to the same song. 
Upon arrival, the two observers, who had been 
sitting in adjacent corners of the room, were intro- 
duced to the subjects by the investigator as sub- 
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stitute teachers who had some work to finish. The 
investigator then brought to the subjects ntion: 
the three songs they could listen to, instructed them 
to do whatever they wanted, and left the room, after 
which a free-play period followed. 

The observers recorded when each subject began 
and stopped listening to the experimental song, 
operationally defined as sitting or standing within 
5 feet (1.524 m) of one of the recorders. This proce- 
dure permitted an assessment of interrater relia- 
bility on time of onset and termination of each 
listening response, Since recorders were physically 
separated, near perfect agreement was achieved (r 
= 99). 


Treatment of Data 


Some subjects who ran around the room were 
observed to have listened to their target song sev- 
eral times but for brief durations. Clearly, these 
subjects were expressing little interest in the target 
song. Accordingly, only listening responses greater 
than 30 sec. were summed to obtain each subject’s 
total time score. Otherwise, the data were treated 
in a manner similar to that reported in the Lepper 
et al. studies, That is, the data were then converted 
into proportions of total time spent listening to the 
target song and a log transformation (Y=log |) 
+1]) was then performed to produce homos: 
ous treatment variances. 


Results 


The mean proportions of total time sub- 
jects chose to listen to the target song during 
posttest, following the data transformations 
described above, are presented in Table 1. 
A comparison of the effects of exposure- 
promise procedure with those of the no expo- 
sure-no promise procedure was planned as a 
conceptual replication of the Lepper 
findings of an overjustification effect. A spa- 
rate one way analysis of variance was per- 
formed on the data for these conditions. It 
was found that subjects in the exposure- 
promise condition listened to the target song 
significantly less than did subjects in the 
no exposure-no promise condition, F(1, 14) 
= 11.24, p < 01.4 

Inspection of Table 1 indicates that the 
effects of promising reward were in the pre- 
dicted directions. These data were submitted 
to a 2 X 2 analysis of variance that indicated 
a significant main effect for promising reward, 
F(1, 28) = 5.67, p< .05. The main effect 
for preexposure was not statistically reliable. 


+ All statistical tests reported are two-tailed prob- 
abilities even though some of the effects had been 
predicted. 


TABLE 1 
Mi PROPORTIONS OF TOTAL TIME ON TARGET SONG 
i ING Posrrests AFTER LOG TRANSFORMATION 
\D BEEN PERFORMED ON THE PROPORTIONS 


Exposure to distracting, visual stimuli 


nise of Reward 


Yes No 
Yes 739 839 
No 1,104 1,302 


These results indicate that expectation of 
eward can produce less subsequent interest 
in the rewarded activity. This finding extends 
the Lepper et al. findings in two ways: (a) 
The present study provided evidence that 
expectations of reward were in fact elicited, 
and (b) unlike the Lepper et al. studies, the 
present study did not confound exposure to 
a tangible rewarding stimulus with verbal 
instructions that defined this stimulus as a 
promised reward. 

The present results can be explained both 
vy the competing response and overjustifica- 
tion hypotheses. The competing response hy- 
pothesis suggests that expectation of reward 
interferes with listening to a song more than 
loes preexposure to a doll and a clock. 


EXPERIMENT 2 


\ Experiment 1 replicated the overjustifica- 
tion effect under experimental conditions 
* that typically do not produce reinforcement 
effects. Specifically, the promised reward 
procedure provided a single trial of noncon- 
tingent reward and was designed, with the aid 
of a timer, to maximize the likelihood that 
subjects would be thinking about reward at 
the expense of listening to and enjoying 
music. Experiment 2 was designed to test for 
an overjustification effect following termina- 
tion of a token economy. The overjustifica- 
‘tion hypothesis predicts decreased play ef- 
fects following withdrawal of token economies 
(e.g, Lepper et al., 1973; Lepper et al., 
Note 1) because repeated reinforcement trials 
induce expectations of reward. However, the 
competing response hypothesis clearly pre- 
dicts that when a reinforcement effect is pro- 
duced during experimental training such that 
behavior is not interfered with, a decreased 
play effect will not occur. 
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Predictions of decreased behavior fre- 
quencies following termination of procedures 
known to produce reinforcement effects are 
equivalent to predictions of a phenomenon 
termed “successive behavior contrast” (cf. 
Dunham, 1968); that is, behavior frequencies 
increase and remain above baseline levels 
when reward is available but decrease below 
baseline frequencies when reward contingen- 
cies are withdrawn, It is emphasized that the 
term successive behavior contrast is employed 
here only as a description of an empirical 
phenomenon. Some readers familiar with the 
behavior modification literature might con- 
sider the present study superfluous, the in- 
validity of the overjustification hypothesis 
already having been established by many 
behavior modification studies that» failed to 
find evidence of successive behavior contrast. 
However, since behavior modifiers rarely at- 
tempt to modify play behavior, it seemed 
possible that successive behavior contrast 
might be produced by the present experi- 
mental procedures. 


Method 

Overview 

Children were trained to discriminate among three 
songs in an experimental situation by reinforcing 
them for listening to one song and by nonrein~ 
forcing them for listening to the other two songs, 
The effect of training on subject's interest in the 
rewarded song was assessed in the child’s classroom 
48 hours following training. Posttests were con- 
ducted by the classroom teacher, who showed sub- 
jects all three recorders and instructed them to do 
whatever they wanted. 


Subjects and Experimenters 


The subjects, six girls and three boys randomly 
selected from a kindergarten class, were randomly 
assigned to one of three target song conditions, 
Discrimination training was provided by the first 
investigator, who was assisted by three fe J 
adults. Observations of the subjects’ behavior dur- 
ing posttests were made by two other female adults 
who were unaware of the purposes of the study. 


Experimental Materials and Setting 


The experimental training phase was conducted 
in a large room in the subjects’ school, The re- 
corders and songs were the same as those employed 
during the posttests of Experiment 1, as was the 
arrangement of materials in the room. The only 
additional materials were poker chips that served as 
tokens and attractive toys that served as rewards, 


1122 


Children who earned 10 tokens were permitted to 
“buy” a toy and take it home. 

Posttests were conducted in the subjects’ class- 
room. This classroom actually consisted of two 
rooms, A and B, that were joined by a doorway. 


Discrimination Training 


The investigator escorted subjects to and from 
the experimental training room in groups of three. 
Showing them toys and a poker chip, he said. “I'll 
give you a toy if you give me 10 chips like this 
one. Does anybody have any? [Pause.] I know 
what, you can win chips by listening to one of the 
songs.” The children were then shown the recorders 
and told, “Go win chips.” 

Each of the three children who received training 
together was reinforced for listening to a different 
song. Discrimination training consisted of 10 re- 
inforcement trials such that listening to the target 
song for 10 consecutive sec. earned the first token, 
but the subject was required to listen for an addi- 
tional 15, 20, 35, 20, 60, 30, 100, and 80 sec. to 
earn Tokens 2-9. When one of these responses 
occurred, the appropriate experimental assistant, 
who had been timing the subject involved, signaled 
to the investigator by raising her hand; the in- 
vestigator then exclaimed, “You win! You listened 
to [song’s name],” and gave the child a token. 
After the subject earned his ninth token, he was 
asked to sit in a corner of the room. When all sub- 
jects earned a ninth token, the following instruc- 
tions were given to permit an assessment of dis- 
crimination learning: “You need one more chip to 
win a toy, Go to the song that you listen to to 
win chips.” All children earned a tenth token by 
listening to their target song for 90 sec. and were 
then allowed to buy a toy. 


Posttests 


Forty-eight hours following training, children 
were posttested in groups of three, with new part- 
ners such that each child in each testing group had 
been reinforced for listening to a different song. 
On these occasions, all of the children in the class 
had been assembled in Room A, so that posttesting 
could be conducted in Room B. The recorders 
were placed on tables in Room B in a manner 
similar to that during experimental training. The 
child’s teacher introduced the observers as substitute 
teachers who had some work to finish and in- 
structed the children to do whatever they wanted 
to do. For the next 10 minutes, each of the two 
observers recorded when the subject began and 
stopped listening to a song, operationally defined 
as sitting or standing within 5 feet (1.524 m) of one 
of the recorders, Observer 1 recorded listening to 
Songs A and B, while observer 2 recorded listening 
to Songs B and C. This procedure permitted an 
assessment of interrater reliability on time of onset 
and termination of listening to Song B. Since re- 
corders were physically separated, near perfect inter- 
rater agreement was achieved (r= .99). 
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Results 
When the songs were first introduce ing 
experimental training, only two of n ub- 
jects listened first to the predetermi ex- 


perimental song, while seven went 
ately to the target song following || 9. 
These data indicate that discrimination © i 
ing was effective. 


The mean time spent listening to the t+ “et 
song during 600-sec. posttests was 350 3 
while a mean of only 145.6 sec. was it 
listening to the most preferred nonta: et 

) 


song. Moreover, six subjects had listenec 
the target song more than to either of 
other two songs; this preference for the tar; 
song was statistically reliable (p < .04, | 
nomial test). Thus, the preference for t 
target song established during experiment: 
training had transferred to the posttest situa 
tion. Clearly, this finding disconfirms the 
overjustification hypothesis’ prediction of de- 
creased interest in the target song, indicating 
instead an opposite effect. 


GENERAL DISCUSSION 


The present findings disconfirm the ov: 
justification hypothesis , since they dem 
strate that extrinsic reinforcement can i 
crease subsequent interest in a behavior: | 
activity. Experiment 1 demonstrated that 
decreased play effect can be produced for 
listening to a song when one trial of noncon- 
tingent, promised reward is provided; Experi 
ment 2 demonstrated that when multiple- 
trial, contingent reinforcement procedures 
were provided, the decreased play effect did 
not occur. Under the competing response hy- 
pothesis, the critical difference between these 
two studies was the manner in which expecta- 
tions of reward were elicited. In Experiment 1, 
the procedures had been designed to maximize 
the likelihood that subjects would not only 
expect reward but also think about reward at 
the expense of carefully listening to and en- 
joying a song. In Experiment 2, the proce- 
dures were designed so that subjects would 
expect reward for listening to a song and 
learn to prefer the rewarded song to non- 
rewarded songs. Thus, only when the pro- 
cedures maximized the probability that 
competing responses would be elicited was 
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su) '\s’ interest in the experimental activity 
shed. Moreover, the results of Experi- 
) simply disconfirm the overjustification 
hesis because subjects had listened to 
th song under conditions of expected reward. 
he conditions required for a proper test 
© ‘he overjustification hypothesis maximize 
t ° probability that reward procedures would 
© -it responses that compete with rewarded 
ravior. Specifically, a proper test of this 
pothesis requires that the activity be in- 
rently interesting and that reward contin- 
sencies be salient to the subject. The presence 
of one of these requirements could be con- 
sidered an operational definition of the ab- 
sence of the other. An anecdotal example is 
the person who becomes so absorbed in 
what he is doing that he is oblivious to his 
surroundings. When an individual is highly 
interested in what he is doing, a rewarding 
stimulus must be very salient (intense) to 
elicit the individual’s attention. The poten- 
tial for elicitation of strong competing re- 
sponses through distraction would seem to be 
maximal with very intense stimuli. 

Lepper et al. (1973) and Lepper et al. 
(Note 1) have suggested that the overjustifi- 
cation effect has important implications for 
tangible reward procedures employed by be- 
havior modifiers. These suggestions are highly 
theoretical and tenuous. The Lepper et al. 
findings provide no plausible basis for sug- 
gestions regarding the clinical practice of 
behavior modification, Rather, it seems im- 
pressive that, as yet, no reinforcement pro- 
cedure employing tangible rewards has pro- 
duced deleterious “side effects” beyond chance 
levels. 

In a broader context, the present results 
question the utility of distinguishing between 
intrinsic and extrinsic motivation. As formu- 
‘ated by the overjustification hypothesis, this 
distinction concerns the perceived locus of 
control for one’s own behavior. Bem (1972) 
has noted that these cognitions have been 
treated both as dependent and independent 
variables. When they are treated as depen- 
dent variables, the primary interest is under- 
standing the manner in which people inter- 
pret their own behavior (Bem, 1965, 1967). 
The present article does not address this 
issue. However, the overjustification hypoth- 
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esis treats self-perception as an independent 
variable affecting overt behavior. This posi- 
tion maintains that a critical determinant of 
the effects of reward procedures is whether 
or not the subject perceives his own behavior 
as motivated to obtain rewards. Thus, the 
overjustification hypothesis merits interest 
from behavior therapists and psychologists 
who are otherwise not interested in self- 
perception. 

However, the thrust of the present article 
is that a much more detailed and accurate 
account of reward procedures, including the 
effects of saliency and amount of reward, 
already exists. The behavior modification 
literature strongly suggests that perceived 
locus of control of self-behavior is a relatively 
unimportant variable for understanding the 
effects of reward procedures on overt behay- 
ior, Specifically, a critical determinant of 
overt behavior is not perceptions of behavior 
as a consequence of reward (e.g., “I study 
hard to obtain good grades”) but rather 
perceptions of reward as a consequence of 
behavior, ability, or luck (eg., “I obtained 
good grades because I am smart”), as well 
as the subject’s perception of the motives of 
the reinforcing agent (e.g., “He’s trying to 
bribe me” ys. “He’s trying to help me’’).* 
In token economies, attributions of behavior 
to reward would be associated with behavior 
contrast effects, while attributions of reward 
to behavior, ability, or luck would be asso- 
ciated with generalized behavior changes 
(either transfer or response generalization) 
including changes in self-image. There exists 
considerable evidence that behavior modifica- 
tion procedures rarely produce successive con- 
trast effects (cf. O'Leary & Drabman, 1972; 
Reiss, 1973) but instead often produce gen- 
eralized behavior changes including changes in 
self-image (e.g., Buell, Stoddard, Harris, & 
Baer, 1968; Discipio & Trudeau, 1972; Dweck 
& Reppucci, 1973; Hall, Lund, & Jackson, 
1968). Moreover, Reiss (1973) has recently 
provided data suggesting that when reward 


5 However, it is possible that an observer's overt 
behavior toward a person who was rewarded might 
often be determined by the observer’s attribution of 
behavior to reward, for example, “That so-and-so 
is always studying to get good grades” (cf. Skinner, 
1971). 
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is perceived as symbolizing success, behavior 
changes are likely to transfer, but when 
extinction procedures are perceived as unfair, 
transfer of behavior changes is unlikely. 

In the final analysis, the distinction be- 
tween intrinsic and extrinsic motivation is 
such that the overjustification hypothesis is 
too vague for scientific purposes, as the 
following comments indicate. 


1. The concept of intrinsic motivation de- 
mands operational definition. The vagueness 
inherent in this concept is clearly evident in 
the Lepper et al. (1973; Note 1) studies in 
which control subjects were asked to draw 
because an experimenter was interested in 
viewing their drawings. The investigators con- 
sidered drawing elicited by this request, even 
when accompanied by coaxing, drawing under 
conditions of “intrinsic” motivation, that is, 
under conditions that do not cause reattribu- 
tion of drawing to external factors. However, 
it could be maintained that in the Lepper 
et al. studies, the promised reward procedures 
elicited intrinsic motivation (mastery, the 
attainment of which was symbolized by the 
reward), while the control procedures elicited 
extrinsic motivation (compliance). Thus, the 
concept of intrinsic motivation is so vague 
that its presence or absence can be inferred 
arbitrarily, depending on the investigator’s 
theoretical bias. 

2. Under the operational definition of 
intrinsic motivation employed in the Lepper 
et al. studies—percentage of time a subject 
chose to engage in activity—all behaviors are 
intrinsically motivated. A child will engage in 
an activity to the extent that it is preferred 
to available alternatives. Accordingly, the 
Lepper et al. definition of intrinsic interest 
refers to behaviors characteristic of particular 
situations; that is, intrinsic motivation is 
defined as a property of a situation rather 
than as a property of behavior. Thus, draw- 
ing will appear intrinsically motivating under 
one set of alternatives but not under others; 
the same could be said about any other 
behavior including solving boring arithmetic 

problems. 
3. If the concept of intrinsic motivation is 
intended to define a class of behavior such as 
play, the overjustification hypothesis implies 
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SUSHINSKY 
that play behavior obeys some la not “ 
obeyed by other behaviors. Under this </ini- 
tion, the findings of Experiment 2 zest 
that play behavior transfers and sts 
across situations in a manner not unli! st 
other behaviors. If the concept of i sic 
motivation is defined as a source of nv va- 


tion such that any behavior is in part ai. ult 
of intrinsic and extrinsic motivation, univ: -al 
predictions of various degrees of succes. ve 
behavior contrast could be derived from (he 
hypothesis. Thus, if the overjustification  - 
pothesis is to be of further scientific use, e ® 
concept of intrinsic motivation should e 
clarified.® 
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Effect of Norms, Public Disclosure, and Need for Appro 
on Volunteering Behavior Consistent with Attitudes 


Janet Ward Schofield 
Harvard University 


Young women with positive attitudes toward sex education were asked to 
commit themselves to an action consistent with their attitudes (reviewing sex 
education pamphlets) under four experimental conditions. These conditions 
were created by crossing two levels of a norm favoring the action (present, 
absent) with two levels of disclosure of the act (public, private). The hy- 
potheses that the amount of consistent action pledged would be increased by 
(a) the presence of the norm and (b) disclosure of the act to a peer were 
supported ($ < .05). One month later, subjects received some pamphlets to 
review and the opportunity to revise their initial commitment in privacy. The 
number of pamphlets subjects pledged to read decreased (p< .01). However, 
the predicted effects of the norm and of disclosure on the level of commitment 
remained significant (p < .05). Hypotheses about the interaction of the norm, 
disclosure, and the need for approval were not supported. 


The relationship between attitudes and 
overt behavior has been of interest to Amer- 
ican social psychologists for well over a 
quarter of a century. However, in recent 
years the nature of the questions posed 
about the attitude-behavior relationship has 
changed. Whereas earlier research dealt pri- 
marily with whether attitudes influence be- 
havior (Kutner, Wilkins, & Yarrow, 1952; 
La Piere, 1934), more recent research has 
focused on under what conditions attitudes 
influence behavior (Frideres, Warner, & 
Albrecht, 1971; Warner & DeFleur, 1969). 

Two themes recur in the literature on 
conditions which may modify the expected 
impact of attitudes on behavior. The first 
theme is that the likelihood of consistent 
action is greatly lessened when the con- 
sistent action is counternormative (DeFleur 
& Westie, 1958; Green, 1972; Warner & 
DeFleur, 1969). The second theme is that the 
extent to which an act is public influences the 
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relationship between attitudes and behavior, 
especially when the consistent act is counter- 
normative (DeFleur & Westie, 1958; 
Frideres, 1971; Frideres et al., 1971; Green, 
1972; Warner & DeFleur, 1969). 

The studies dealing with these themes ar: 
quite similar to each other. Most of them 
(a) focus on interracial attitudes and dis 
criminatory behavior, (b) operationalize the 
public or high disclosure condition as dis 
closure of one’s behavior in the mass media 
(c) investigate the impact of deeply ingrained 
societal norms, (d) study the effect of norma- 
tive pressures to refrain from behaviors which 
are consistent with one’s attitudes, and (e) 
focus on actions such as signing petitions ot 
written commitments to perform an action a 
some unspecified future date. With a few 
notable exceptions such as Linn (1965), the 
studies have not determined whether subjects 
would actually fulfill the commitments they 
have made. Finally, except for Frideres 
(1971), very few investigators have explored 
Wicker’s (1969) suggestion that personality 
variables, acting either by themselves or in 
combination with situational variables, may 
exert a significant impact on the relation 
between attitudes and behavior. 

The study reported here expands existing 
literature about the factors influencing the 
relationship between attitudes and behavior 
in a number of ways. First, the attitudes 
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stu were not interracial attitudes, but 
ati des toward sex education. Second, the 
hi sclosure condition simulated everyday 
cc: ‘tions of action more closely than in pre- 
vi work, High disclosure was operational- 


iz: here as disclosure of the individual’s act 
{ one new acquaintance who was known to 
i making inferences about the actor’s 

‘sonality. Third, the norm whose effect was 
idied was experimentally created. In addi- 
n, it was purposely made rather weak. 
ourth, this experiment studied the impact 
»f social pressures for an action which was 
consistent with individuals’ attitudes on the 
amount of consistent behavior undertaken. 
Fifth, two behaviors were studied. The first 
was the signing of a commitment to perform 
a certain action. The second was a revised 
pledge obtained after subjects had been asked 
to perform some of the action to which they 
had committed themselves. Hence, this ex- 
periment was able to assess not only the im- 
pact of situational pressures on a decision 
made in their presence but also on a similar 
decision made sometime later under very 
different circumstances. Finally, this study 
investigated the interactive effects of a per- 
sonality variable, need for approval, and situ- 
ational variables which influence the amount 
of social approval gained for consistent 
_ behavior. 

Four hypotheses were tested: (a) when a 
norm favors an action which is consistent 
with an individual’s attitudes, more consistent 
action is undertaken than when no such norm 
exists; (b) when a peer who knows and 
shares one’s attitudes will learn of one’s 
action relevant to those attitudes, more con- 
sistent action is undertaken than when the 
action is private; (c) persons with a high 
need for approval undertake more action 
consistent with their attitudes than those with 
a low need for approval when their action 
will be disclosed to a peer who shares relevant 
attitudes; (d) a norm favoring the consistent 
action increases the amount of consistent be- 
havior undertaken by subjects high in need 
for approval more than it increases the 
amount of consistent behavior undertaken by 
subjects low in need for approval when the 
action will be disclosed to a peer who knows 
_ and shares one’s relevant attitudes. 
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METHOD 


Overview of Procedures 


The study had three phases. First, a questionnaire 
which measured need for approval and ego involve- 
ment with attitudes toward sex education was 
mailed to potential subjects, With the exceptions 
discussed below, questionnaire respondents were 
asked to participate in the second phase of the 
study, the experimental sessions. In these sessions, 
subjects were asked to commit themselves to read 
and review some sex education pamphlets. Subjects 
were led to believe that the request to read pam- 
phlets was not part of the study. Both disclosure 
of the subject’s responses and the presence of a norm 
relevant to the responses were manipulated creating 
a 222 factorial design with two levels of dis- 
closure (public and private), two levels of the 
norm (present and absent), and two levels of need 
for approval (high and low). In the third phase 
of the study, subjects were mailed the first pam- 
phlets for review and were given the opportunity 
to revise in privacy the number of pamphlets they 
were committed to read. 


The Attitude and the Action 


The attitude studied had to be of enough interest 
to the potential subjects to make completion of the 
questionnaire likely, Pretests with women similar to 
the potential subjects showed that attitudes toward 
sex education met this criterion in addition to the 
criterion of attitudinal homogeneity. Relatively 
homogeneous attitudes were desirable since the ex- 
periment was investigating the impact of nonatti- 
tudinal variables on behavior consistent with atti- 
tudes. Pretesting also suggested that a large ma- 
jority of the potential subjects would be sufficiently 
ego involved with positive attitudes toward sex 
education so that reading and commenting on some 
sex education pamphlets was consistent with their 
attitudes. 

In the experimental sessions, subjects were asked 
to sign a commitment to review briefly some sex 
education pamphlets designed for adolescents, The 
first dependent variable, initial commitment, was the 
number of pamphlets which subjects agreed to read 
in the experimental sessions. The second dependent 
variable, final commitment, was the number of 
pamphlets subjects agreed to read 4 weeks later after 
receiving the first pamphlets to review and being 
given the opportunity to change their earlier com- 
mitment in privacy. This second dependent mea- 
sure made it possible to see whether the hypothe- 
sized differences in behavior due to participation in 
the various experimental conditions would persist 
outside of the experimental sessions. 


Procedure for Part 1: Questionnaire 


Subjects. All female residents of a large married 
student apartment at an Eastern university were 
sent questionnaires (1 = 367), 
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Questionnaire. A letter accompanying the ques- 
tionnaire asked cooperation in a study of “certain 
aspects of women’s attitudes and personality.” The 
questionnaire had three sections, one to obtain rele- 
vant background information, one to assess attitudes 
toward sex education, and one to measure need for 
approval. Information gathered in the first two 
sections was used in deciding which respondents 
would be asked to participate in the rest of the 
study as is described below. 

Need for approval score. A shortened form of 
the Marlowe-Crowne Social Desirability Scale 
(Crowne & Marlowe, 1964) was used to assess need 
for approval. This short form consisted of items 
which a factor analytic study of the Marlowe- 
Crowne scale suggested were most closely related 
to the total scale score (Nuttall, Note 1). Scores 
were computed in the standard manner except that 
21 rather than the original 33 items were used. 
Scale reliability of this short form was .77 (Cron- 
bach’s alpha). Women scoring above the mean for 
those subjects who returned the questionnaire were 
categorized as high in need for approval, All others 
were categorized as low in need for approval. 

Ego involvement score. Each subject rated the 
importance of three goals on a 5-point scale, These 
goals, being happily married, achieving sexual satis- 
faction, and understanding her children, were se- 
lected for inclusion in the questionnaire from a 
larger set of goals because similar subjects in the 
pretest felt they were related to sex education. 
Each subject also rated the extent to which sex 
education for adolescents facilitated attainment of 
these goals. The importance of each goal was multi- 
plied by the degree to which sex education facili- 
tated its attainment. A subjects ego involvement 
score was the mean of these three products. 

The highest possible ego involvement score, 25, 
indicated that each goal received a rating of “ex- 
tremely important” and that sex education was 
always perceived as strongly linked to attainment 
of these goals. The lowest possible ego involvement 
score, 3, indicated that all the goals were both of 
no importance and not at all furthered by sex edu- 
cation. The mean ego involvement score was 18.5 
with a standard deviation of 4.1. Ego involvement 
scores correlated significantly with direct ratings of 
the personal importance of one’s attitudes to sex 
education (r=.35, p < .001, n = 217). 


Procedure for Part 2: Experimental Sessions 


Subjects. Two hundred seventeen women, 59% 
of those to whom questionnaires were sent, became 
potential subjects for the experimental sessions by 
returning completed questionnaires. Forty-one of 
these women were eliminated because of insufficient 
ego involvement (a score of less than 15) or in- 
sufficient rated importance of attitudes to sex edu- 
cation (less than “somewhat important”), Twenty- 
two anonymous respondents could not be contacted. 
Nineteen more were eliminated due to characteristics 
revealed in the questionnaire such as graduate study 
in psychology, difficulty with English, or children 
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over 6 years old. A significant number o men 


were eliminated from the subject pool as < ibed 
above. However, the criteria for inclusion v not 
likely to lead to any systematic biases in ub- 
ject pool relevant to the hypotheses being < 

Of the 135 respondents in the subject p or 
the experimental sessions, 15 could not be con d, 
leaving 120. Each of these women was tele od 
and told that she had been randomly selected m 


the questionnaire respondents for further 
Seventeen women refused to participate in ve 
second part of the study which was conducte: n 
a public room in the apartment complex where 
lived. Twenty-seven agreed to participate but 

not appear at the scheduled time. Hence, 76 won 
about two-thirds of those asked to participate, ac 
ally became the subjects for the experimental s 
sions. 

Group scheduling and composition. The 1 
women in the subject pool for the experimental s¢ 
sions were stratified according to their level of nec 
for approval. Then they were randomly assigne 
to 12 groups so that each group had approximatel) 
the same number of subjects, half low and hali 
high on need for approval. All subjects in any one 
group were in the same experimental condition. 
Three separate groups were assigned to each of 
the four experimental conditions. The total number 
of subjects actually participating in each condition 
varied from 15 to 18.1 

Background Questionnaire. Upon arrival at th 
experimental sessions, subjects completed a shc 
Background Questionnaire with questions abo 
demographic characteristics, occupational status, ar 
leisure activities. This questionnaire’s primary fun 
tion was to prevent subjects who arrived early fro: 
talking with each other or the experimenter, Pretes! 
ing suggested that such conversations might affec 
subjects’ responses to the manipulation of the dis- 
closure variable. 

Experimenter’s outline of procedures. Next, the 
experimenter outlined the rest of the study. Three 
tests were described and later administered to th 
subjects to reinforce the impression that the stuc 
concerned personality variables. No mention wos 
made of the upcoming request to read the scx 
education pamphlets. 

TAT slides. Subjects were given standard 7 
matic Apperception Test (TAT) instructions and 
shown three TAT slides. 

Self-Description Questionnaire. The second 
ported personality test, the Self-Description Ques- 
tionnaire, consisted of a few sentence completion 
items such as, “Sometimes I wonder if E 

Request to read sex education pamphlets. A 
stamped postcard addressed to the Sex Education 


1 Figures given here refer to subjects actually in- 
cluded in the data analysis. Nine subjects who par- 
ticipated in experimental sessions were eliminated 
for reasons including difficulty with English, de- 
parture from the state before the end of the study, 
and suspicion about the purpose of the request to 
read pamphlets, 
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Re h Foundation (SERC) was stapled to the 
Se escription Questionnaire. On the side of the 
po ard facing the subjects were printed the words, 
“l, a willing to read and to comment on — of the 


£ sex education pamphlets for 10-14 year old 
€ ren that you are now developing.” A space 
left for a signature under which was typed 
individual’s name and address. 
sefore distributing the Self-Description Ques- 
nnaires and the attached postcards, the experi- 
nter told the subjects that she had planned to 
plain about SERC and to ask for volunteers to 
eview pamphlets at the end of the study. How- 
ver, since her research assistant had mistakenly 
tapled the postcards to the Self-Description Ques- 
tionnaires, she felt she should explain immediately. 
The experimenter said that she occasionally served 
as a consultant for SERC, a nonprofit sex education 
organization, and that SERC desired feedback on 
35 pamphlets they were developing. The experi- 
menter then noted that for purposes connected 
with the personality study subjects had been grouped 
homogeneously according to their views on sex 
education, She stated that all subjects in the present 
group had positive attitudes toward sex education. 
Hence, she said she felt she could ask for volunteers 
to review some pamphlets. Finally, she reiterated 
that 35 pamphlets were being developed and asked 
subjects to sign up to review as many as they felt 
they could. 

Next, the experimenter glanced at a small note- 
card which indicated to which of the four ex- 
perimental conditions the group had been randomly 
assigned. She then added whatever statements were 
appropriate to the group’s condition before passing 
„out the Self-Description Questionnaires with the 
attached postcards. A brief written description of 
SERC was also distributed. This description was on 
a different kind of paper and duplicated with a 
different ink than were the other experimental ma- 
terials to intensify the impression that the request 
to review pamphlets was not part of the experiment. 

Norm-presence manipulation. In the norm-absent 
condition nothing was added to the procedures out- 
lined above. However, in the norm-present condi- 
iton, the experimenter remarked that she was pleased 
that most women in the previous groups had been 
willing to read some pamphlets. 

Disclosure manipulation. In the private condition, 
the experimenter told the subjects to detach the 
stamped postcards from the Self-Description Ques- 
tionnaire after filling them in. She pointed out that 
this would insure a high degree of privacy since 
only the SERC personnel to whom the cards were 
addressed would see how many pamphlets each sub- 
ject was willing to read. In the public condition, 
the experimenter asked subjects to leave the post- 
cards stapled to the questionnaires. She remarked 
that each subjects response would be an extra 
piece of information that her partner in the Person 
Perception Task could use if she so desired. 

Person Perception Task. Next, all subjects traded 
Self-Description Questionnaires (with or without 
the postcards depending on the experimental condi- 
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tion) with another subject. The Person Perception 
Task required subjects to make judgments about the 
personality of the individual with whom they 
had traded Self-Description Questionnaires, 

Poststudy questionnaire. Finally subjects com- 
pleted a questionnaire about their perceptions of 
the experiment which included manipulation and 
suspicion checks. 


Procedure for Part 3: Follow-Up 


Four weeks later, the 64 subjects who had agreed 
to read some pamphlets received by mail three 
pamphlets to review as well as the postcards they 
had signed in the experimental sessions. A cover 
letter signed by the experimenter instructed the 
subjects to change the number written on their 
postcards if they had changed their minds about 
the number of pamphlets they could read. The en- 
velope enclosed for the reviews and the postcard 
were addressed to SERC in a different state. 


Debriefing 


Nine weeks after the pamphlets had been mailed, 
subjects were sent a debriefing letter thanking them 
for their participation and explaining the experi- 
ment’s purpose. A summary of the results was in- 
cluded as was an assurance that the reviews of the 
pamphlets would be sent to the organizations which 
had produced them. 


RESULTS 


Manipulation Checks from the Poststudy 
Questionnaire 


Norm-presence manipulation. All subjects 
were asked whether the experimenter had 
made any comments about how other women 
reacted to the request for volunteers and, if 
so, what the experimenter had said. Subjects 
in the norm-present condition were signifi- 
cantly more likely to report having heard the 
normative statement than subjects in the 
norm-absent condition (£= 2.20, p< .04, 
n= 67). 

Disclosure manipulation, All subjects were 
asked if, when filling out their postcards, 
they had believed that another subject would 
see their postcards. Subjects in the public 
condition were significantly more likely to 
respond affirmatively than subjects in the pri- 
vate condition (= 10.9, p < .001, n = 67). 

Attitudinal consensus. Since hypothesis b 
concerned the impact of disclosure of an act 
to a peer sharing one’s attitudes, it was neces- 
sary that all subjects believe that the other 
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subjects in their group favored sex education. 
Approximately 85% of the subjects said that 
they thought the other subjects in their group 
were positive toward sex education, 10% did 
not know, and 5% said other subjects were 
negative. Patterns of response were similar in 
all experimental conditions. 


Suspicion Checks from the Poststudy 
Questionnaire 


Subjects were asked to “list . . . the things 
that happened since you entered the room for 
the study. Then state what you feel the pur- 
poses of each of these occurrences was.” 
Subjects were specifically instructed to write 
about “anything that seemed unclear or 
strange.” Sixty-nine percent of the subjects 
failed to mention the request to review the 
pamphlets. Similar results had been found in 
a pretest. Pretest subjects explained the omis- 
sion by saying that there was no reason to 
mention the request to review the pamphlets 
because it was not part of the study. Thus, 
the omission itself can be taken as probable 
evidence that no suspicion was aroused. 

Subjects were also asked, “At any time, did 

you wonder if the real reason for some occur- 
rence was different than the stated reason? 
If so, when? What ideas did you have about 
the real reason for the event or events?” 
Aprpoximately 65% of the subjects expressed 
no suspicions at all. About 15% of the sub- 
jects expressed mild suspicion of psycho- 
logical experiments in general but made no 
specific reference to suspicions about this 
particular experiment. Another 15% said they 
thought the Person Perception Task was a 
way of finding out about the individual 
making the judgments rather than about the 
person being judged. Such suspicions were 
not considered serious since they were un- 
likely to lead to spurious confirmation of the 
hypotheses. Two subjects expressed suspicion 
about the request to review the pamphlets 
and hence were eliminated from the analyses. 
Two other subejcts said that the questions on 
the poststudy questionnaire related to the 
request for volunteers raised suspicions about 
the purpose of the request, although no such 
suspicions had been present previously. These 
subjects were included in the analyses. 
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Data Analysis Procedures 


A 2 X 2 2 factorial analysis of v: ce 
was performed on initial commitmen 
number of pamphlets subjects agreed to 
at the time of the experimental sessions. <e 
same analysis was also performed on i 
commitment, the number of pamphlets 
jects agreed to read after they had actu 
been mailed some pamphlets to review. 
independent variables in these analyses wi 
need for approval, public disclosure, a 
norm presence. In addition, a repeated m« 
sures analysis of variance was performed ‘ 
see if the impact of these three independer 
variables on final commitment was signifi 
cantly different from their impact on initia! 
commitment. 

The fact that individuals were tested in 
groups raised the possibility that the scores 
of individual subjects within given groups 
were not completely independent. Extreme 
care was taken to avoid nonindependence of 
data from members of given groups. For ex- 
ample, all of the experimental procedures 
were strictly standardized and subjects wer 
seated so that they could not see the others 
postcards. Nonetheless, the analyses outline: 
above were repeated using group means in 
stead of individual scores as the dependent 
variables in case nonindependence of indi- 
vidual scores accounted for some of the find- 
ings. This second set of analyses led to pre- 
cisely the same conclusions as those reported 
below which are based on the first set of 
analyses. However, the levels of significance 
were somewhat lower due to the greatly 
reduced number of degrees of freedom. 
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Substantive Results 


Table 1 shows that hypotheses a and b 
were supported. In the experimental sessions, 
subjects in the norm-present condition volun- 
teered to read nearly one and one-half times 
as many pamphlets as did subjects in the 
norm-absent condition, F(1, 59) = 4.40, p < 
-03. Similarly, subjects in the public condition 
volunteered to read nearly twice as many 
pamphlets as subjects in the private condi- 
tion, F(1, 59) = 7.88, p < .01. 

However, the subjects either did not realize 
or did not care to admit that their initial 
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EFFECT OF NORMS AND DISCLOSURE on CONSISTENT BEHAVIOR 


TABLE 1 


EAN ĪNITIAL AND FINAL COMMITMENT IN THE 
Four EXPERIMENTAL CONDITIONS 


| Initial Final 
commitment commitment 
Need 
eee Disclosure condition 
Private | Public | Private | Public 
\bsent | Low 6.2 15.9 2:3. 13.0 
High 8.0 20.3 4.1 11.3 
Present | Low 18.1 20.8 12.0 20.0 
High 13.8 22.8 15.4 15.6 


commitment had been affected by the pres- 
ence of a norm or the fact of disclosure. 
On the poststudy questionnaire subjects were 
asked what factors they had weighed when 
deciding how many pamphlets to read. Not a 
single subject spontaneously mentioned think- 
ing about the fact that: (a) she knew other 
women had volunteered or (b) she knew a 
peer would see her postcard. 

Since subjects might hesitate to mention 
spontaneously being influenced by a norm or 
by the disclosure of their acts to a peer, the 
poststudy questionnaire also instructed sub- 
jects to indicate the impact of the following 
consideration on their decision to volunteer: 
“T believed that other women with similar 
opinions had volunteered and I thought I 
should do my share.” Approximately 65% of 
the subjects in the norm-present condition 
said this consideration was irrelevant. These 
subjects volunteered to read about the same 
number of pamphlets as subjects in their 
condition who admitted to being influenced 
by the norm (t= .17, p> .50, n = 34). 
Subjects also rated the importance of the 
consideration: “I believed that my partner 
would see whether or not I volunteered and 
that she might disapprove if I didn’t volun- 
teer to read at least one pamphlet.” Fewer 
than 25% of the subjects in the public condi- 
tion rated this consideration as relevant to 
their initial commitment. Their behavior did 
not differ significantly from that of those who 
said this consideration had not influenced 
their behavior (t = .21, p > .50, n = 28). 

The data showed no trace of the predicted 
interactions between need for approval and 
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disclosure, F(1, 59) = .56, p < .47, or need 
for approval, disclosure, and norm presence, 
F(1,59) = .09, p >.50. Nor was there a 
main effect of need for approval, F(1, 59) 
= .10, p > .50. 

The repeated measures analysis showed 
that the subjects initially committed them- 
selves to read significantly more pamphlets 
than they agreed to read after having the 
opportunity to revise their commitment, 
F(1, 59) = 10.21, p< .01. The drop, from 
a mean of 16,0 to 12.2 pamphlets, was al- 
most entirely due to the 22 subjects who 
initially agreed to read some pamphlets but 
failed to read any or to return their postcards. 

None of the two- or three-way interactions 
involving the repeated measures factor, time, 
was even marginally significant, suggesting 
that the impact of the independent variables 
on final commitment was similar to their 
impact on initial commitment. 

After having the opportunity to revise pri- 
vately the number of pamphlets they would 
read, subjects who had been in the norm- 
present condition volunteered to read a mean 
of 16.6 pamphlets compared to 8.3 for the 
subjects in the norm-absent condition, F 
(1,59) = 6.27, p< .03. The final commit- 
ment for subjects who had participated in 
the public condition averaged 15.6 pamphlets 
compared to 9.2 for subjects who had partici- 
pated in the private condition, F(1,59) = 
4.08, p < .05. Not surprisingly given the lack 
of relation between need for approval and 
initial commitment, there were no significant 
interactive effects of need for approval and 
disclosure, F(1, 59) = .78, p < .39, or need 
for approval, disclosure, and norm presence, 
F(1,59) = .11,p > .50, on final commitment. 


DISCUSSION 


The results suggest that situational vari- 
ables such as norms or disclosure which 
often account for behavior inconsistent with 
attitudes can also increase the amount of 
consistent behavior undertaken. In the experi- 
mental sessions, subjects in the private, norm- 
absent groups volunteered to read a mean of 
approximately 7 pamphlets. Introduction of 
either a norm or disclosure raised the mean 
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to about 17. The presence of both factors 
raised the mean even further to 22 pamphlets. 

Differences in behavior between subjects in 
the various conditions were rather remark- 
able given the mild experimental manipula- 
tions used. The normative statement was no 
stronger than statements one hears everyday 
made to encourage actions such as refraining 
from littering or giving to charity. The high 
disclosure condition also paralleled everyday 
life. Subjects were meeting new acquain- 
tances who they knew would be making 

inferences about them. However, the conse- 
quences of a positive evaluation by the other 
subjects were not nearly as great in the ex- 
periment as they often are in similar real-life 
situations such as job interviews. 

In light of the strong effects of the presence 
of a norm and disclosure, it is interesting 
that the great majority of the subjects denied 
that the presence of the norm or the anticipa- 
tion of disclosure influenced their behavior. 
This finding is similar to Sherif’s (1965) 
finding that even though most subjects were 
influenced by the judgments of other subjects 
in an autokinetic situation, “the majority of 
subjects reported not only that their minds 
were made up as to the judgments they were 
going to give before the others spoke, but 
that they were not influenced by the others 
in the group” (p. 470). The social influence 
in the autokinetic situation appears to be 
primarily informational social influence, in- 
fluence accepted because the individual is 
unsure about reality (Deutsch & Gerard, 
1955). In contrast, the influence most opera- 
tive in this experiment was probably norma- 
tive social influence which is accepted because 
of the interpersonal or practical rewards its 
acceptance brings. Hence, people may react 
to normative as well as informational social 
influence without being aware that they have 
done so. 

In sharp contrast to presence of a norm and 
disclosure, need for approval had no impact 
on the subjects’ behavior. Frideres’s (1971) 
work suggests that need for approval affects 
behavior more when an individual’s attitudes 
are different from those of others in a face-to- 
face group than when they are similar as they 
were in this experiment. Hence, it would 
be premature to conclude that additional 
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study of the effect of need for appr: 
behavior would be fruitless. 

Part 3 of the study permitted exam 
of the impact of 4 weeks elapsed time n- 
bined with a demand for action based he 
initial commitment, on subjects’ final 1- 
mitments. Although the subjects could ch- ge 
their commitments in privacy, nearly - 
thirds of the subjects who were sent -e 
pamphlets reviewed them and indicated t 
were willing to review at least as m 
pamphlets as they had previously said t 
would. 

The independent variables in this study h: 
essentially the same impact on the final con 
mitment as on the initial commitment. 
seems likely that in deciding on their fin: 
commitment the subjects felt bound by theii 
prior commitment. If this is so, the impact 
of the situational variables on final volunteer- 
ing was due in large part to their impact on 
initial volunteering. However, three factors 
may have increased the likelihood that sub- 
jects’ final commitments would be similar to 
their initial commitments. First, subjects ma; 
have felt that the experimenter would lear 
of their final commitments. Hence, to th 
extent they desired experimenter approv: 
they would be motivated to live up to the 
original commitment. Second, the manipula 
tion checks may have contributed to the con 
tinuing effects of the independent variables on 
subjects’ commitments. For example, the fact 
that subjects went through the process of 
recalling and recording what, if any, norma- 
tive statement was made may have lodge: 
that statement firmly enough in subjects’ 
minds so that it was salient when the final 
commitments were made. Third, and rather 
paradoxically, the very mildness of the experi- 
mental manipulations may help to account for 
their impact on behavior outside of the 
experimental situation. Studies have shown 
that when a communication is “accidentally 
overheard,” it produces more attitude change 
than when the same communication is pre- 
sented as an attempt to influence the audience 
(Festinger & Maccoby, 1964; Walster & 
Festinger, 1962). Similarly, if individuals do 
not perceive great pressure at the time of 
their initial commitment, they may interpret 
their commitment as relatively freely given, 
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and ence, as more binding than if it were the 
resi. | of coercion. 


REFERENCE NOTE 
tall, R. Unpublished study, Boston College, 
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The effect of language spoken and conversational phase on temporal aspects 
of speaking style were examined. Male Cuban bilinguals conversed once in 
Spanish and once in English; their conversations were monitored to determine 
frequency and duration of speech. Subjects spoke longer and more frequently 
per turn in English than in Spanish, which is interpreted as reflecting their 
identification with two different social groups. Results also showed that varia- 
tions in turn-taking behavior, as a function of conversational phase, were the 
same in English and Spanish, which is interpreted as a manifestation of larger 
cultural norms regulating the phases of conversation. In general terms, the 
results indicate that measures of turn-taking style are powerful tools for 
examining the social psychology of conversation. 


The temporal aspects of speaking style 
have long been of interest to social scientists, 
and a tradition of research in this area began 
with the development of the Interaction 
Chronograph (Chapple, 1939). Since that 
time, temporal variables have gained in im- 
portance in research, partly because they can 
be measured easily and reliably. Chapple’s 
and subsequent research has shown that the 
frequency and duration of an individual’s 
speech behavior in an interview is stable and 
can be related to other aspects of his person- 
ality (for a review of this literature, see 
Matarazzo, 1962; Matarazzo, Saslow, & 
Matarazzo, 1956). 

This early research was conducted mainly 
in an interview setting; however, recent 
studies have dealt with two-person conversa- 
tions, and various temporal measures of 
speaking style have been related to biosocial 
factors such as the sex of the interlocutors 
(Long, 1972; Prebor, 1972; Shaw & Sadler, 
1965), interpersonal distance (Long, 1972), 
and level of intimacy (Shaw & Sadler, 1965). 
Jaffe and Feldstein (1970), in their work on 
unconstrained conversation, studied changes 
in speaking style across time, partner, and 
conversational setting, and found that while 
certain measures, such as the total duration 
a subject spoke, were quite stable across 
experimental conditions, other measures, such 
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as simultaneous speech and switching pause , 
were much more sensitive to situation: 
changes. 

This last measure, the switching pause, is 
associated with the notion of turn-taking. The 
turn has been defined as beginning when on° 
speaker starts solo talking and ending wh: 
another speaker starts solo talking (Jaffe wx 
Feldstein, 1970; Markel, in press). The tun 
is the most natural unit into which a c:a- 
versation can be divided, and it is implici ly 
recognized by all interlocutors. Relative y 
few researchers, however, have studied con- 
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versational behavior within the framework cfe 


the turn. Among those who have are Duncaa 
(1972), who studied the paralinguistic and 
kinesic behavior associated with turn-taking, 
and Schegloff (1968), who studied the 
rules governing the sequencing of turns in 
conversational openings. 

If relatively few researchers have studied 
turn-taking behavior directly, more have 
recognized the episodic nature of conversa- 
tions. For example, Pittenger, Hockett, and 


Danehy (1960) noted paralinguistic “bound- ™ 


ary markers” delineating the “episodes” in a 
psychiatric interview. For Soskin and John 
(1963) the episode was the basic unit of the 
“ecological” analysis and was defined as a 
conversation about one major topic in one 
setting. Erickson (in press) found that 
changes in topic, or episode boundaries, were 
nearly always marked by gross postural shifts. 
There are practically no reports, however, of 
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SociaL PERSONALITY AND CONVERSATIONAL STYLE 


. the specific behavior occurring in the different 
phases of conversational interaction (we know 
of only two: Prebor, 1972; Schegloff, 1968). 
For this reason, the opening, middle, and end 
of conversations were examined in the present 
study to discover whether turn-taking and 


oiver temporal behaviors vary as a function 
©. the point at which they occur in the 
c nversation. 

“he measurement of temporal aspects of 

‘aking style seems particularly well-suited 

cross-cultural research. In fact, bilinguals 

re especially good subjects for cross- 

cultural studies of the relationship between 
personality and speaking style, as they 
speak two languages and participate in two 
different cultures. There is plentiful anecdotal 
evidence that bilinguals feel that they have 
two different personalities, one for each of 
the languages they speak. For example, Lowie 
is quoted as saying: “the popular impression 
that a man alters his personality when speak- 
ing another tongue is far from ill-grounded. 
When I speak German to Germans, I auto- 
matically shift my orientation as a social 
being” (Haugen, 1956). In the same vein, the 
writer Julian Green was unsuccessful in an 
attempt to translate one of his books from 
French into English, but had to write a com- 
pletely new book: “It was as if, writing in 
_ English, I had become another person” 
(Haugen, 1956). 

These anecdotes are supported by research 
into different personality factors and values 
in bilinguals. Ervin (1964) administered the 
Thematic Apperception Test (TAT) on sepa- 
rate occasions in French and English to a 
group of French—English bilinguals and found 
that the bilinguals’ responses showed different 
manifest and latent content depending on 
language used. Ervin hypothesized the exist- 
ence of two “social personalities” in these 
f PA bilinguals, which were influenced by the dif- 
ferent roles they played in their two cultures. 
In another study, Botha (1970) administered 
the Uses Test to a group of English-Afrikaans 
bilingual children and again found variation 
in responses depending on language used. The 
responses in each language were in the direc- 
tion of responses given by monolinguals. In 
addition, Collett (1972) administered Kelly’s 
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Repertory Grid to English-Arabic bilinguals 
and showed that they saw people they liked 
as more dissimilar to people they disliked 
when speaking Arabic than when speaking 
English. 

Strangely enough, although these studies 
have been concerned with the relation be- 
tween social personality and language, few of 
them have measured speech behavior directly. 
One exception to this trend is a study by 
Ervin-Tripp (1964), in which female Japa- 
nese-English bilinguals were found to change 
the topic of conversation when they changed 
language and to use briefer speech, with more 
intrusion of Japanese words and syntax, when 
speaking English to a Japanese person than 
when speaking to an American. These studies 
do indicate, however, that bilinguals should 
be good indicators of the power of temporal 
measures of turn-taking and other conversa- 
tional behavior to detect shifts in overt be- 
havior reflecting changes in role or social 
personality. The present study, therefore, at- 
tempted an investigation of the temporal 
aspects of speech as they are related to the 
different languages spoken by bilinguals. It 
was hypothesized that temporal aspects of 
speech behavior would vary according to lan- 
guage used and that they would change be- 
tween the opening, middle, and end of the 
conversations. This was an exploratory study, 
and thus no direction was predicted for 
differences. 


METHOD 
Subjects 


Thirteen male bilingual subjects were drawn from 
the student population at the University of Florida, 
Their ages ranged from 18 to 22 years, with a 
mean age of 20.8 years. Each subject was asked to 
recruit a Cuban friend with whom to converse in 
Spanish and an American friend with whom to 
converse in English. The subject was asked to choose 
only men with whom he was well-acquainted, The 
conversation partners selected by the subjects ranged 
in age from 18 to 24 years, with a mean age of 21.2 
years. 

Definitions of what constitutes a bilingual vary 
from a person speaking two languages, neither of 
which dominates the other (Lambert, 1955), to a 
person with one native language and at least pas- 
sive knowledge of another (Mackey, 1962). Given 
the comparative rarity of bilinguals in Lambert’s 
sense, and since this study was primarily concerned 
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with changes in speaking style as a function of 
participation in different linguistic and cultural 
groups, it seemed appropriate to use the criterion 
of several years’ participation in each language com- 
munity (Ervin, 1964). Therefore, all subjects were 
required to have spent at least three years in both 
Cuba and the United States. In fact, all the sub- 
jects were born in Cuba and had spent from 6.5 
to 11 years there (the mean was 9.3 years); they 
had lived in the United States from 11 to 12 years. 
In addition, most of the subjects were residents of 
Miami, Florida, and were still active in the Cuban 
community there, as well as in the primarily Ameri- 
can student community at the university. 

In all cases, the American partners spoke no more 
than a few words of Spanish. The Cuban partners, 
however, were bilingual, although they indicated 
that they spoke English rather less often than the 
subjects (34% vs. 48% of their time spent speaking 
English). Like the subjects, the Cuban partners were 
born in Cuba and had spent from 9 to 18 years 
there (the mean was 12.5); they had lived in the 
USS. from 3 to 11 years (the mean was 9.1). 


Procedures 


Subjects were contacted prior to the experiment 
in order to ascertain that they met the “bilingual” 
requirements specified above and to arrange an 
appointment for the experiment. The experiment 
was conducted in the subject’s home, in a room 
which could be closed off from the rest of the 
house and which contained at least two comfort- 
able chairs. Seven of the 13 subjects had their first 
conversation in English with the American partner, 
and 6 began in Spanish with the Cuban partner. 
At the beginning of each conversation, the subject 
and his partner filled out brief questionnaires giving 
background information. During this time a Nagra 
IV-S stereo tape recorder was set up in the experi- 
mental room in as unobtrusive a place as possible 
and connected to two small lavalier microphones, 
which had been shielded with foam rubber and 
plastic tubing to minimize cross-channel pickup. 
Tape recorder volume was set and checked, and 
conversation identification was recorded by the ex- 
perimenter, The subject and his partner then seated 
themselves in the experimental room, put on the 
microphones, and listened to a cassette recording of 
the following instructions (or their Spanish trans- 
lation): 


This experiment is concerned with the way people 
talk to each other. Your task is simply to engage 
in conversation with your partner for as long 
as you like and about whatever topic you like. 
Your conversation will be tape-recorded. If you 
have any questions, please ask them now. Please 
state your subject number and today’s date, in 
English [“in Spanish” in the Spanish version], 
into the microphone, first subject A, then sub- 
ject B. [Cassette stopped at this point.] You 
may begin the conversation as soon as the ex- 
perimenter leaves the room. When you have fin- 
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ished your conversation, come back o the 

room and the experimenter will meet wi ou, 

These instructions were recorded in Sp and 
English by a bilingual student. The exp: ater 
did not speak to the subjects unless it 
lutely necessary and then minimal respi vas 
made in English. 

The tape recorder was started as soon the 
subject and his partner were seated. In or to 
acquaint them with the recording procedu: he 
cassette was stopped at the point indicated he 


instructions above, and they recorded their id ñ- 
cation numbers and the date. At the end 0} 
conversation, they removed their microphones 
left the room, 


Sections and Half-Minutes of the 
Conversations 


A total of 15 minutes of each conversation, 
tained from the very first, the exact middle, a 
the very last 5 minutes, is the sample of conver: 
tion on which this study is based. These 5-minu! 
sections are referred to as the opening, middle, ani 
end, respectively. There were 8 conversations (out 
of 26) which lasted less than 15 minutes, and the 
opening, middle, and end sections of these con- 
versations were determined by dividing them inte 
three equal parts. Each section of the conversatic 
was then divided into 30-second segments, call 
“half-minutes.” These half-minutes were indica‘ d 
on the tape itself by a signal dot. Thus, those c 
versations with 5-minute sections had 10 hs- 
minutes per section; those conversations with < - 
tions of less than 5 minutes duration had, acco 
ingly, fewer half-minutes, but in no case was C e 
number of half-minutes less than 6 per section, 
and in no case was a segment of less than 30 sec 
onds duration included in the analysis. 


Monitoring the Conversations 


Listeners were trained to monitor the recordings. 
Each conversation was monitored three times — 
first by a team of two listeners and then twice 
one listener. During the first monitoring, the li: 
er’s task was to press a switch each time the 
locutor he was monitoring spoke and to m 
switch pressure until that interlocutor stopped 
speaking. Any phonation except sneezing, coughing, 
and laughter was counted as speech, and any per- 
ceptible pause was counted as a stop in spi . 
The switch activated a countertimer each time it 
was pressed, and thus a cumulative record was kept 
of the frequency and duration of speech. On the 
first monitoring, the two listeners used three inter- 
connected countertimers: one countertimer for each 
speaker in the dyad, activated by one monitor’s 
switch; and one countertimer, activated when both 
monitors pressed their switches at the same time. 
This third countertimer was measuring the fre- 
quency and duration of simultaneous speech. On the 
second and third passes, one monitor, using one 
countertimer, measured the frequency and duration 


es 


of hing pauses—those pauses between the 
one interlocutor and that of the other. 


In ition, the monitor counted the number of 
tur ‘ollowing simultaneous speech—that is, the 
nu r of times that one interlocutor gained the 
fic by starting to speak while the other was 


where having the floor (or taking a 
ti is defined as speaking alone for any length 
f; me. Finally the number of turns—those turns 
w ch followed switching pauses plus those following 
si \iltaneous speech—was determined. In all cases, 
t! tape and countertimers were stopped after each 
minute, and the scores for that half-minute 

re recorded. This segmentation provided an easy 
‘thod of correcting errors in monitoring. Inter- 
monitor reliability was determined by rescoring a 
random selection of 15% of the sections by a 
second team of monitors and correlating the two 
sets of measures, The reliability coefficients for 
the “raw” measures were as follows: (a) amount 
of speech, .98; (b) frequency of speech acts, .90; 
(c) duration of switching pauses, .94; (d) fre- 
quency of turns following switching pauses, .98; 
(e) duration of simultaneous speech, .95; (f) fre- 
quency of simultaneous speech, .80; and (g) fre- 
quency of turns following simultaneous speech, .80. 


speo sing, 


à Dependent Measures 


Eight dependent measures were used in the analy- 
sis of the data. Amount of speech per turn was ob- 
tained by dividing the duration of speech in a 
half-minute by the number of turns in that half- 
minute. Frequency of speech acts per turn was 
obtained by dividing the frequency of speech in a 
half-minute by the number of turns in that half- 
minute. Amount of switching pause time, number 
of turns, number of turns following switching 
pauses, and number of turns following simultaneous 

Speech were raw scores recorded for each half- 
minute by the process described above. Average 
duration of speech acts was obtained by dividing 
the duration of speech in a half-minute by the 
frequency of speech in that half-minute; and aver- 
age duration of switching pause time was obtained 
by dividing the amount of switching pause time in 
a half-minute by the number of turns following 
switching pauses in that half-minute. 

Since the turn is a natural measure of behavior 
in conversation, unlike the arbitrarily selected half- 
minute, derived turn measures replaced some of the 
raw measures in the analysis.t Finally, because this 
study was concerned with changes in speaking style 
cross linguistic situations, only the scores for the 
ngual subjects, and not those for their partners, 
were included in thé analysis. Simultaneous speech 
measures, which involve both partners in the con- 
versation, were also excluded for this reason, al- 
though these measures are clearly important in their 
own right (see Jaffe & Feldstein, 1970; Long, 1972). 


1 Amount of speech per half-minute and fre- 
quency of speech acts per half-minute were also 
analyzed, and the results were similar to those 
obtained for the per-turn measures. 


SOCIAL PERSONALITY AND CONVERSATIONAL STYLE 


1137 


RESULTS 


A 2X3 analysis of variance with two 
repeated measures examined whether turn- 
taking behavior varies as a function of lan- 
guage spoken or stage of the conversation. 
The two factors of the analysis of variance 
were language (Spanish or English) and sec- 
tion (opening, middle, or end). The scores 
entered into the analysis of variance were 
the means across half-minutes for each sec- 
tion. Where the analysis of variance indicated 
significance, follow-up tests were made using 
Tukey’s test for pair-wise comparisons. The 
following results were obtained: 


1. Amount of speech per turn was signifi- 
cantly greater in English, F(1,12) = 8.22, 
p < .05. The section factor was also signifi- 
cant, F (2,24) = 7.10, p < .01, with follow-up 
tests indicating that the difference was due to 
the end showing more speech per turn than 
the opening. 

2. Frequency of speech acts per turn 
was significantly greater in English, F(1, 12) 
= 15.48, p < .01. The section factor also 
showed a significant difference, /(2, 24) 
= 5.93, p < .01, which was due to the end 
showing more speech acts per turn than the 
opening. 

3. Amount of switching pause time showed 
a significant difference on the section fac- 
tor, F(2, 24) = 4.75, p < .01, with follow-up 
tests indicating that the opening contained 
a larger amount of switching pause time than 
the middle. 

4. Frequency of turns was significant on 
the section factor, F(2, 24) = 4.32, p < .05, 
with follow-up tests indicating the difference 
was due to the opening containing more turns 
than the end. 

5. Frequency of turns following switching 
pauses was also significant on the section 
factor, F(2,24) = 5.57, p< .01, with the 
difference due to the opening showing more 
turns following switching pauses than either 
the middle or the end. 

6. Frequency of turns following simul- 
taneous speech showed a significant differ- 
ence on the section factor, F(2,24) = 5.83, 
p < .01, with follow-up tests indicating the 
difference was due to the middle containing 
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more turns following simultaneous speech 
than the end. 

7. Average duration of speech acts showed 
no significant difference on either the lan- 
guage or section factors, nor did average 
duration of switching pauses although the 
language factor approached signiñcance, F 
(1,12) = 4.05, p < .08. 

8. None of the dependent variables showed 
a significant interaction between language 
and section. 


The mean scores for language and section 
on each measure are shown in Table 1. 


DISCUSSION 


The results of the present study confirm 
the hypothesis that temporal aspects of 
speech, including turn-taking behavior, vary 
as a function of language. In English, the 
subjects spoke longer and more frequently in 
each turn than when they were speaking 
Spanish. On the other hand, their Spanish 
conversations showed a slightly longer aver- 
age switching pause duration. The most plau- 
sible explanation for these differences is that 
temporal variables in face-to-face interaction 
are influenced by cultural variables, or at any 
rate, that the two cultures in which the bi- 
lingual has participated influence his reaction 
to the situation he is in. This is particularly 
true for the differences between English and 
Spanish for the variable “amount of speech 
per turn.” The fact that the amount of speech 
varied as a function of language spoken is 
important since a previous study reported 
that this variable is stable across partners 
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and situations (Jaffe & Feldstein, |970). = 
Amount of speech has been related t xtra- 
version” (Markel, Bein, Campbell, © shaw, 
in press) and it is possible that the di’ 1ces 
found in this study may point to the « nce 
of factors in either U.S. or Cuban ure 
which encourage or discourage a of 
“extraversion” or “dominance.” It is al S- 
sible that the bilinguals speak more in | sh 
because their roles as foreigners in th si 
put them in a more prominent position | in 
they enjoy in the Cuban community. kv- 
ever, these hypotheses about person: ty ¥ 
characteristics are highly speculative in he 


absence of any actual personality assessment, 
On the cultural, rather than the personal’ y, 
level, Laboy’s (1973) reasoning would lead to 
the hypothesis that these differences in tu: 1- 
taking behavior may in fact signal ti» 
identification of the bilingual with two dii 
ferent groups. It may be that each group con- 
siders certain rhythmic factors to be appro- 
priate or prestigious, in the same way thet 
certain pronunciations are considered prest’ 


ious in various dialects of English. The ext’ t 
to which the bilinguals conform to th e 
norms would then be an index of their ider i- 
fication with the two different groups. In + y 
case, “amount of speech” appears to bi a 


relatively stable element of social personal’ y, 
and further research on both monolingual «4 
bilingual speakers should be quite fruitfui as 
explicating the relationship between speaking 
patterns, linguistic structure, and personality 
attributes. 

The hypothesis that turn-taking behavior 
varies as a function of the phases of the 


TABLE 1 
LANGUAGE AND SECTION MEANS 
Language Section 
Variable — fio 
Spanish English Opening Middle End 
Amount of speech per turn* 3.1 47 3.0 3.7 F 4.9 
Frequency of speech acts per turn 2.8 4.3 2.9 3.4 4.2 
Amount of switching pause time*:» 2.3 2.0 BS 2.0 2.0 
Frequency of turns? 4.0 kyi 4.2 3.9 3.5 
Frequency of turns following switching pauses? 3.2 3.1 3.5 3.0 2.9 
Frequency of turns following simultaneous speech? 8 35 6 8 9 
Average duration of speech acts* 1.0 1.0 1.0 11 1.1 
Average duration of switching pauses* a 6 af 7 1 


a In seconds, 
b Per half-minute 


yp. cons tion was also confirmed by the results 
of th » study. When the subjects opened their 
con) sations, they produced less speech and 
few’ speech acts per turn than at the end 
an more switching pause time than in the 
micile, Together these variables indicate a 
gre ler amount of silence at the beginning 


of ne conversation than later on. In addition, 
s found that significantly more turns 
fc owing switching pauses occurred in the 
0) ning of the conversation, and that the 
tal number of turns was also higher in this 
yscction. This shows that more frequent ex- 
hanges of the floor were taking place in the 
pening. 

In the middle section of the conversations, 
less time was devoted to switching pauses, 
and the floor was more often gained through 
simultaneous speech. Simultaneous speech has 
been called both a measure of dominance and 
a measure of discomfort in a conversation 
(Long, 1972); however, when a speaker gains 
‘the floor via simultaneous speech in the 
middle section, it may actually be a measure 
of heightened involvement in the conversation. 

The conversational endings, finally, were 
narked by longer and more frequent speech 
er turn and fewer turns than earlier in the 
conversation. 

Thus, the picture which emerges is one of 

san interaction which begins with considerable 
xchange of the floor. As the conversation 

progresses there is more speech, and exchange 

Í of the floor is less, but more rapid. The 
fact that this conversational framework (or 
“history,” as Prebor, 1972, calls it) occurred 
in both Spanish and English is the most im- 
portant finding of this study. This finding 
leads us to believe that the bilinguals may 
have been responding to broader cultural 
norms which determine the overall struc- 
ture of conversational interaction. We do not 
iknow what culture these norms represent; 
however, our guess is that these turn-taking 
markers of conversational phases extend to all 
Western European cultures and their North 
American variants. 
In general terms, the results of this study 
indicate that the frequency and duration of 
speech, and measures of turn-taking style, 
are powerful tools for examining the social 
Psychology of conversational interaction. 
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These easily identified and objective measures 
of overt behavior reflect situational factors, 
traits involving role and social personality, 
and cultural norms in a relevant and meaning- 
ful social situation: the conversation. 
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The present study was an investigation of 
the role that the norm of equity plays in 
the formation of coalitions within triads. 
Past research on coalition formation (e.g., 
Kelley & Arrowood, 1960; Vinacke & Arkoff, 
1957) to a large extent has focused on the 
effect of differences in apparent power. Typi- 
cally, power is defined as resources that en- 
able their owner to achieve some goal; the 
greater the resources, the more easily the goal 
is reached. For example, in one case of 
Caplow’s (1956, 1968) Type 5 triad, re- 
sources are distributed in the ratios 4/3/2. 
Thus, the person with the greatest resources 
¢4) will reach the goal first unless two par- 
ticipants pool their resources, that is, form a 
coalition. 

In Type 5 triads, any coalition will tri- 
` umph, since any combination of two sets of 
resources (@.g., 2+3, 2+4, or 3+4) is 
greater than the resources of the isolate. In 
spite of this equality in potential effective- 
ness, subjects in past studies have tended to 
overchoose the 2-3 combination and form 
what Caplow (1968) has termed “revolution- 
ary” coalitions. This coalition is revolutionary 
because the two weaker parties join together 
to “overthrow” the person who originally 
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Equity and the Formation of Revolutionary and 
Conservative Coalitions iù Triads 


Lawrence A. Messé, Robin R. Vallacher, and James L. Phillips 
Michigan State University 


This study tested opposing predictions made by Gamson’s Minimum Resource 
Theory and equity theory concerning the type of coalitions that are formed 
when members of a triad expect that rewards will be correlated with amounts 
of individual resources. Results supported equity theory in that subjects, whose 
work on a pretask was used as the basis for assigning resources in a bargain- 
ing game, formed conservative (strong against weak) coalitions, When subjects 
either did not work or were paid prior to playing the game, results replicated 
past research in that there was a tendency to form revolutionary (weak against 
strong) coalitions. Findings were discussed in terms of their implications for 
the generality of existing theories of coalition formation. 


had the most resources. A number of explana- 
tions have been advanced in attempts to 
understand why subjects favor the revolu- 
tionary coalition (e.g., Caplow, 1956, 1968; 
Gamson, 1961, 1964; Komorita & Chertkoff, 
1973; Vinacke, 1969). Of most relevance to 
the present research is Gamson’s (1964) 
Minimum Resource Theory, which proposes 
that persons tend to form that winning coali- 
tion in which the sum of the resources is the 
least, since “any participant will expect others 
to demand from a coalition a share of the 
payoff proportional to the amount of resources 
which they contribute to a coalition” (p. 87). 
Thus, in the 4/3/2 Type 5 triad, persons 
tend to form revolutionary coalitions because 
the minimum sum of the resources (5) is also 
a winning coalitions (since 5 > 4). 

Gamson (1964, pp. 103-104) further pro- 
poses that Minimum Resource Theory should 
be more applicable—that is, the tendency 
to form revolutionary coalitions should be 
greater—when persons (a) are highly moti- 
vated for the reward, and (b) have some 
basis for the perception that the magnitude 
of their resources should determine the mag- 
nitude of their reward. This latter hypothesis 
is of particular interest because it directly 
contradicts a prediction that can be derived 
from equity theory. Stated briefly, this hy- 
pothesis asserts that when persons perceive 
that their rewards should be contingent upon 
resources, they tend to form “conservative” 
coalitions in which the two stronger parties 
ally against the weakest. 
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Equity theory—as developed by Adams 
(1965) and Homans (1961), and extended 
by Leventhal (e.g., Leventhal & Lane, 1970; 
Leventhal & Michaels, 1969) and Messé 
(eg., Lane, Messé, & Phillips, 1971; Messé, 
1971)—assumes that persons are satisfied 
with a distribution of rewards to the extent 
that it provides them with payments that are 
congruent with their inputs (i.e., perceived 
contributions). Thus, if persons are awarded 
resources as a function of inputs, then coali- 
tions would form to the extent that they were 
necessary to allocate rewards equitably. In 
this case, the two stronger parties would 
coalesce to obtain the greater reward that 
they “deserved.” 

Equity theory, however, generates no 
predictions about coalition formation when 
resources are not associated with inputs—or 
any other basis that would legitimize their 
correlation with rewards. In this instance, it 
does seem reasonable that persons would 
act to protect themselves from the possibility 
that other members of the triad, especially 
the stronger members, might perceive that 
the correlation between resources and rewards 
was legitimate. This is not true, though, when 
differential inputs are used as the basis of 
resource allocation. In this latter situation, 
equity theory asserts that all parties should 
agree that such a correlation is appropriate, 
and, therefore—contrary to Gamson’s hypoth- 
esis—they should form conservative coalitions 
to distribute rewards fairly. 

The present research tested the validity of 
these two contrasting positions by examining 
the types of coalitions that are formed when 
work inputs are used as the basis for assign- 
ing resources. Also, to insure that the situa- 
tion which was used in this examination ap- 
proximated the relevant conditions that were 
present in past studies, coalition behavior 
also was observed when, as in prior research, 
the correlation between resources and inputs 
was absent. 


METHOD 


Subjects and Recruitment 


Subjects were 54 male undergraduates who were 
selected by chance from over 200 respondents to an 
advertisement in the Michigan State University 
student newspaper. The advertisement solicited per- 
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sons who were interested in earning mone par- = 
ticipating in “motivational research.” Sinc jond- 
ents knew that money was to be their jay- 
ment, it was expected that this procedi juld 


provide subjects who were reasonably moti by 
this reward. 


Pretask and Inputs 


A pretask was used to provide subjects he 
appropriate conditions (see below) with difi 
inputs. The pretask consisted of two ins 


a “Test of General Knowledge,” in mimec h, 
which subjects had to proofread; and a ser of 
lines, curves, and circles, which subjects h o 


complete into meaningful pictures. This set of n 
complete pictures” appeared at different point- in 
the test booklet, depending upon the input co i- 
tion of the subjects; the drawing task was alv :ys 
placed so that it appeared about midway in he 
subject’s work period. 

Inputs were manipulated through the dura >n 
of time for which subjects worked on the pret: 
Some subjects worked for 2 hours, others for 
hours or 1 hour; the remainder did not work at a !. 


Bargaining Game 


Triads played a game for monetary rewards that “ 
was similar to those used in past studies of coalition 
formation (e.g. Vinacke & Arkoff. 1957), Ea i 
member of a triad was given a small peg which 
was to move along a cribbage board. There wa a 
small cardboard flag with a numeral painted or it 
attached to each peg. One subject was given a -eg 
marked with a 4, another was given a peg mar ed 
with a 3, and the third one that was marked wi a 
2. 

Subjects were told to place their pegs at the s; 
end of the board and move them when th 
perimenter said to do so. Each subject could, ©: a 
single move, advance his peg along the crib age 
board the number of spaces that corresponde to 
the number on his flag (i.e. his resources), ‘i hus, 
the subject with the peg marked 4 moved it four 
spaces each time, the subject with a resource weight 
of 3 moved his marker 3 spaces, and the remaining 
subject moved his peg 2 spaces. 

Subjects were told that the party who won the 
game—that is, who reached the other end of the 
board first—would be awarded $7, while the two 
losers would have to agree on how to split the 
second prize of $2. It was obvious that the person,< 
with 4 resources would always win, so, “to make ¢ 
the game more interesting,” two players could com- 
bine forces by trading in their pegs for a single 
peg whose number was the sum of their individual 
resources. In order to do this, the two players had 
to agree on how they would divide the winner's 
share ($7), knowing that the excluded person would 
receive the second prize ($2). 


Design 


Subjects were randomly assigned to triads in one 
of three conditions. In the inputs relevant to re- 


sow condition, members of a triad were given 
reso for the bargaining game as a function of 
hoy they had worked on the pretask. Thus, 
the ject who had worked for 2 hours was given 
4 1 urce points, the subject who had worked for 
1è curs was given 3 points, and the subject who 
ha orked for 1 hour was awarded 2 points. In 
the random resource condition, subjects, who had 


nci worked on a pretask, were awarded 4, 3, or 2 
po.sts by chance. This condition is similar to the 
p! cedure used in many past studies of coalition 
ic vation, In the prepaid inputs condition, subjects 
v re also assigned resources (4, 3, or 2 points) as 

function of their work inputs (2, 13, or 1 hour), 
~out they were paid a standard amount of money 
for their work before they played the game for 
monetary rewards. This last condition was in- 
cluded to insure that any differences in behavior 
that occurred in the first two conditions were not 
due to “demand characteristics” that might have 
been generated by the experimenter in the inputs 
condition assigning resources as a function of time 
worked. 


Procedure 


In the input conditions, subjects in a triad were 
scheduled so that they finished their work at the 
same time, Thus, the 14 hour subject came to the 
experiment about 30 minutes after the 2 hour sub- 
ject, while the 1 hour subject was scheduled to 
arrive 1 hour later. All subjects in a triad worked 
on their pretask in separate rooms. 
^ When the time to work on the pretask had 
elapsed—or, in the no-inputs condition, when the 
three subjects had arrived—the members of a triad 
were taken to a small room and seated around a 
rectangular table. At this point, in the prepaid- 
inputs condition, the subject who had worked 2 
hours was given $4, the 14 hour subject was given 
$3, and the 1 hour subject was given $2.1 

At this point, in all triads, the experimenter ex- 
plained the bargaining game as outlined above. In 
both input conditions, he gave the person who had 
worked 2 hours the peg marked 4, the person who 
worked 14 hours the peg marked 3, and the peg 
marked 2 to the person who worked 1 hour. He 
made sure that the subjects understood that re- 
sources were being distributed according to the 
amount of time that each person had worked (at 
a rate of 1 resource point per 4 hour of work). 
In the no-input condition, subjects selected among 
Sealed envelopes which contained the pegs, so that 

‘ignment of resources obviously was determined 
y chance. 

In all triads, the actual winner’s share ($7) and 
the loser’s share ($2) were always placed on the 
table before the game began so that subjects could 
see that the experimenter was serious about re- 
warding them in this manner. After the game was 
explained, including the possibility of forming “com- 


Prior research (Lane & Messé, 1972) has estab- 
lished that undergraduates perceive $2 an hour to 
be reasonably equitable pay. 
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binations,” the experimenter answered any ques- 
tions. Then he told the players to discuss among 
themselves, before the first “move,” whether they 
would form a combination, if so, with whom and 
how the winner’s share was going to be divided. 
He then remained silent until two of the players 
formed a coalition and agreed on a division of their 
reward. Thus, he did not interact with the subjects 
until, as he had explained to them, two of them 
handed in their pegs in exchange for a peg whose 
number was the sum of their individual resources; 
this action signified that a binding coalition had 
been formed. 

When a coalition was formed, the experimenter— 
with the subjects’ permission—declared its members 
tu be the winners without having the players go 
through the motions of moving their pegs along 
the board. He then made sure that the members 
of the coalition divided the $7 as they had agreed 
and that the isolate received his $2. Finally, sub- 
jects were asked to write down individually why 
they behaved as they did in the game, When they 
completed this task, the experimenter thanked them 
for participating and allowed them to leave. 


RESULTS 


There were six triads assigned to each 
condition of the design. Table 1 presents the 
frequencies of each type of coalition that was 
formed in the three conditions. This array 
suggests strongly that the prediction derived 
from equity theory was more accurate than 
was the hypothesis that Gamson (1964) 
generated from Minimum Resource Theory, 
since conservative coalitions were formed in 
all six triads assigned to the inputs condition. 
Table 1 also reveals that the procedure used 
in the present study generated results that 
were equivalent to the usual findings of prior 
research in the two control conditions (i.e., 
in the no-inputs and the inputs-prepaid con- 
ditions) since subjects in these treatments 
tended to form the more typical, revolution- 
ary coalition. 


TABLE 1 
COALITIONS FORMED IN EACH INPUT CONDITION 


Coalition type 


Condition T 
2-38 2-40 3-40 
Inputs 0 0 6 
Inputs-prepaid 3 2 1 
No-inputs 4 1 1 


^ This designates the individual resources of the members of 
the coalition that was formed. 
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Moreover, statistical analysis strongly sup- 
ported these conclusions. An analysis of vari- 
ance, which was performed on the sum of the 
resources of the coalition members, revealed 
an overall treatment effect, F(2,15) = 8.90, 
p < .005. Further analysis indicated that 
there was essentially no difference in coalition 
formation between the no-inputs and the 
inputs-prepaid conditions (F=.18, df= 
1,15); however, as predicted by equity 
theory—but contrary to Gamson’s assertion— 
the coalitions that were formed in the inputs 
condition were considerably more conservative 
than those in the other two treatments, 
F (2,15) = 17.62, p < .01, Scheffé test.2 

Analysis of the coalitions formed in the 
two control conditions through a binomial 
test indicated that, as in past research, revo- 
lutionary (2-3) coalitions tended to be 
formed more often than would be expected 
by chance (p < .064). Further, the exact 
distribution of coalitions in these two condi- 
tions approximates well that predicted by 
Walker (1973) in his refinement of Caplow’s 

(1956) theory; Table 2 presents these pre- 
dicted and observed frequencies, 

Finally, the subjects’ postgame responses 
were examined ‘to determine the extent to 
which they reported being concerned with 
equity, fairness, etc. when playing the game. 
This examination revealed that only 1 sub- 
ject of the 36 assigned to the control condi- 
tions mentioned being concerned with equity, 
while 14 of the 18 subjects in the inputs 
condition did so, a highly significant differ- 
ence, x*(1) = 29.25, p < .0001.° These re- 
sults are particularly striking in their support 
for equity theory, given that the question— 
Why did you behave the way you did in 
the game?—was completely open-ended and, 


“It should be noted that examination of the 
data through Fisher’s Exact Tests—which seemed 
to be a less efficient but somewhat more conserva- 
tive procedure than analysis of variance—yielded 
the same findings; that is, counting only 3-2 coali- 
tions as revolutionary, there was little difference 
between the no-input and the prepaid-input condi- 
tions, but these treatments, combined, were signifi- 
cantly different from the inputs condition, 

*Tt should be noted as well that all six coalitions 
in the inputs condition divided the winner’s share 
equitably (ie. $4-$3). On the other hand, coali- 
tions in the other conditions tended to divide the 
reward equally, 
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TABLE 2 


OBSERVED AND PREDICTED COALITION Fo JN IN 
THE No-Inputs AND ĪNPUTS-PREPAID Ci NS 


| Coalition type 


Formation [c — 


Observed 7 | 3 
Predicted 7 | 


thus, subjects could have responded wi 
wide variety of reasons. 


DISCUSSION 


Results of the present research provided 
strong support for the position that the norm 
of equity can serve as the basis for coalition 
formation. These findings, and those of past 
research (e.g., Messé, 1971), indicate that 
persons use equity to guide their behavior 
in bargaining situations. On the other hand, 
other variables—for example, concerns with { 
competition—become more salient in circum- 
stances such as the two control conditions in 
the present study, in which the norm of 
equity does not appear applicable. g 

The results clearly refute the hypothesis ; 
that Gamson (1964) derived from Minimum 
Resource Theory. This perspective was espe- | 
cially relevant to the present research, Sioe d 
it proposes explicitly that there exists a rela- “ 
tionship between resource salience and coali- 
tion formation. It should be noted, however, : 
that no theory of coalition formation (cf. 
Caplow, 1968; Komorita & Chertkoff, 1973; | 
Walker, 1973) predicts or explains ade- 
quately the formation of conservative coali- 4 
tions that occurred in the input condition of 
the present research. 

The failure of these theories seems to be a 
result of an assumption that persons, espe-_.4 
cially males, tend to approach situations M 
potential conflict with competitive orienta- 
tion. Past research (e.g., Benton, 1971; 
Messé, 1971; Messé, Dawson, & Lane, 1973; 
Pepitone, 1971), however, has demonstrated 
that this assumption may not be valid for 
many interpersonal situations, especially those 
in which it is possible to distribute rewards 
in a manner that is congruent with perceived 
inputs. This was the case in the inputs condi- 


tior of the present study, so, as results indi- 

theories that assume a competitive 

atation cannot predict accurately subjects’ 
ponses. 

vloreover, the results cannot be explained 

equately in terms of demand character- 

‘ics. While it was possible that subjects 
ı the inputs condition would deduce from 
3 nhe resource assignment that they were 
č expected” to form conservative coalitions, 
‘wo important considerations render such an 
+ explanation implausible. First, subjects were 
recruited through the promise of money as 
their sole reward. Given this, it seems un- 
likely that they would behave in ways that 
could prove costly to them just to please the 
experimenter. Second, resources were assigned 
in the same manner in the inputs-prepaid 
condition, but subjects’ responses differed 
substantially from those in the inputs condi- 
tion. Thus, any demand characteristics that 
might have been present should have been 
esentially the same in the two inputs condi- 
tions, but, since coalition formation behavior 
differed, it is unlikely that such character- 
istics—if they were present at all—affected 
responses. 

On the other hand, previous theories of coali- 
tion formation—especially Walker (1973)— 
were predictive of the behavior that was 
emitted in both the inputs-prepaid and the 
no-inputs conditions. The purpose of the pres- 
¿went research was not to differentiate among 
them, but rather to demonstrate that, as a 
class of theories, they are most applicable in 
situations in which persons should have a 
competitive orientation. The present study 
and prior research lead to the conjecture that 
a competitive orientation arises primarily in 
two situations. First, it appears that this 
orientation is generated when explicit rewards 
are trivial and/or not motivating in them- 
selves (e.g., Vinacke & Arkoff, 1957). In this 
case, subjects—perhaps to relieve boredom— 
may convert the situation to a zero-sum situ- 
ation with little real cost. Second, competitive 
tendencies also can occur when the structures 
of inputs and rewards are such that an equi- 
table solution is impossible. Lane and Messé 
(1972), for example, have demonstrated that 
persons become self-interested when they dis- 
fribute an amount of reward that is either 
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insufficient or oversufficient to provide every- 
one with a fair share. 

It is likely that the two control conditions 
of the present research generated a competi- ` 
tive orientation in subjects, since both were 
situations of oversufficient rewards. The no- 
input condition provided subjects with $9, a 
rather large amount of money for about 
10 minutes of work. The inputs-prepaid condi- 
tion gave subjects $9 in excess of the equita- 
ble amounts that were awarded to them 
previously. 

An important issue that remains to be con- 
sidered fully is the extent to which conditions 
in the “real-world” typically favor an equi- 
table or a competitive orientation. While a 
definitive answer to this question must await 
extensive empirical exploration, it appears 
that a reasonable estimate is that the norm 
of equity is an important concern in most 
interpersonal situations. This speculation 
follows from the argument, developed by 
Thibaut and Kelley (1959, pp. 132-135) and 
others, that norms are widely used since they 
facilitate productive social behavior. Also, it 
seems reasonable that persons, in general, 
would believe in a just world in which, typi- 
cally, differences in resources have some 
appropriate and rational basis. 

Thus, for example, the contemporary Amer- 
ican family—viewed as a father-mother-child 
triad—often may be characterized by a con- 
servative, parental coalition. In fact, Fleck 
(1972), argues that revolutionary parent- 
child coalitions are associated with behavior 
pathologies in the family—an assertion with 
which Caplow (1968, pp. 66-67) appears to 
disagree, however. 

This, and other, points of contention, of 
course, cannot be resolved without relevant 
data. It is clear, however, from the findings 
of this and past research on coalition forma- 
tion that there are factors which influence the 
type of coalition that characteristically is 
formed. Thus, it now appears to be appropri- 
ate to broaden the scope of the empirical and 
theoretical work in this field—from its pres- 
ent rather narrow focus on why revolutionary 
coalitions tend to be formed under conditions 
of equal pivotal power—to include the more 
general issue of when conservative or revolu- 
tionary coalitions are more likely to occur. 
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Perceptions of and Reactions to Stressful Situations 


David Magnusson and Bo Ekehammar 
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University of Stockholm 


Two approaches to the study of the psychological significance of situations 
were compared, one using data for individuals’ perceptions of situations and 
the other using data for individuals’ reactions to situations, Perception and 
reaction data for 40 subjects were used for multidimensional classification of 
12 situations, which were selected so as to cover four different types of stressful 
situations. For three of the a priori groupings of situations the congruence in 
outcomes between the two methods was high. The implications of the con- 
gruence and the discrepancies in outcomes for the two approaches are dis- 
cussed. The main conclusion is that one should distinguish clearly between 
situation perception factors and situation reaction factors in further research 
on the psychological significance of situations. 


There is an implicit need for systematic 
analyses of situations in some well-known 
psychological theories, for example, in Mur- 
ray’s (1938) need-press theory and in Lewin’s 
(e.g., 1951) field theory. The importance of 
. situational analyses was clearly expressed on 
different grounds during the early 60s by 
Abelson (1962), Miller (1963), and Sells 
(1963a) among others. The emphasis on situ- 
ational analyses increased in connection with 
the rapidly developing theoretical and empiri- 
cal interest in the Person X Situation inter- 
action (see Argyle & Little, 1972; Bowers, 
1973; Endler, 1973; Endler & Hunt, 1968, 
1969; Magnusson, Gerzén, & Nyman, 1968; 
Magnusson & Heffler, 1969; Pervin, 1968). 
From different standpoints researchers have 
recently advocated that the systematic study 
of situations is urgent (e.g., Endler, in press 
b; Magnusson, 1971, 1974; Mischel, 1973; 
Schneider, 1973). 

As was early stated by Kantor (1924), 
Koffka (1935), and Murray (1938), among 
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others, a situation can be described in two 
main ways: by describing in objective terms 
the physical and social characteristics of the 
situation “as it is,” and by describing it in 
terms of the psychological significance for 
the individual. The first of these approaches 
for empirically classifying situations has been 
advocated by Sells (1963b) and Sherif and 
Sherif (1969), However, most of the re- 
searchers mentioned above underline the im- 
portance of using the second approach, that 
is, studying the psychological significance of 
situations. 

Systematic analyses of situations with re- 
gard to their psychological significance for 
individuals have two purposes: (a) the clas- 
sification of situations (see Frederiksen, 1972; 
Moos, 1973; Sells, 1963b), and (b) the use 
of the Person X Situation interaction in the 
description and prediction of behavior (see 
Magnusson, 1971). According to the inter- 
actional view of behavior “individuals differ 
not mainly with regard to certain stable as- 
pects of behavior but particularly regarding 
their specific, characteristic ways of adjusting 
to the varying characteristics of different situ- 
ations” (Magnusson, 1971, p. 851). Thus 
the behavioral pattern of different reactions 
of individuals across situations of varying 
significance should be an important object 
for psychological research, Knowledge about 
the significance of the situations for the indi- 
vidual is a necessary condition for under- 
standing and predicting individual behavior. 
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Two main approaches can be used for the 
empirical study of the psychological signifi- 
cance of situations, one based on individuals’ 
reactions to the situations, the other based 
on individuals’ perceptions of the situations. 

In the reaction approach the individual 
might be instructed to rate his reaction, say 
the experienced degree of apprehensiveness 
(Hodges & Felling, 1970) or his degree of 
unpleasantness (Schalling, 1971) to a set of 
situations which are verbally presented to 
him. The data give individual reaction pat- 
terns across situations for the variable rated. 
For the classification of the situations, the 
correlations between situations are calculated 
and the correlation matrix can be treated 
with factor or cluster analysis. 

A more subtle variation of this approach 
was first proposed by Endler, Hunt, and 
Rosenstein (1962) and was used in the con- 
struction of their S-R Inventory of Anxious- 
ness. According to that technique the subjects 
rate their own reactions on a number of 
scales for each of a number of situations, 
thus yielding a three-dimensional data matrix. 
In addition to main effects these data con- 
tain information about the Person X Reaction 
X Situation interaction, as well as simple 
interactions between two main variables at a 
time. For the classification of situations, the 
correlations between situations can be calcu- 
lated on the basis of the summed individual 
reaction scores for each situation and the 
matrix can be treated with factor or cluster 
analysis. This technique for classification has 
been used in studies on the Person X Situa- 
tion interaction (see e.g., Ekehammar, Mag- 
nusson, & Ricklander, 1974; Endler et al., 
1962). It was also employed in the present 
study. 

The perception approach, using measures 
of individuals’ perceptions of situations as 
raw data, was discussed by Magnusson 
(1971) who also proposed a method for the 
empirical study of situation perception. Ac- 
cording to this method subjects are presented 
short, concrete descriptions of situations two 
at a time and instructed to judge the per- 
ceived similarity between the situations. For 
classifications of situations the obtained simi- 
larity matrix can be treated with multidimen- 
sional scaling techniques or cluster analytic 
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methods, which are linked to a dimensi al 
or a categorical model for cognitive func’) n- 
ing, respectively (see Bieri et al., 196 A 
few studies which support the potentiali f 
the method have been made (Ekehamme: & 


Magnusson, 1973; Magnusson, 1971; - 
nusson & Ekehammar, 1973). It was ) 
used in the present study. A study on ve 
same problem, using different groups of 
jects for perception and reaction data 
reported by Ekehammar, Schalling, and ! 
nusson (in press). 

According to the interactional model act 
behavior of an individual in a situation is i 
result of an interaction between the individ 
and the situation. It is reasonable to a 
that it is the meaning or significance w’ 
the individual gives to the situation wnich 
is of importance for his behavior (se> 
Raush, Dittmann, & Taylor, 1959). Fur 
more, individuals differ with regard to the 
meaning they give to the same situation 
(Magnusson, 1974; Mischel, 1973). Dat: 
for situational meaning are treated in the 
perception approach. The reaction approa: h 
treats reactions to situations, that is, di a 
refer to situational behavior. According 
the assumptions made above, the individu 
reaction in a certain situation is depend 
upon his perception of the situation, Thi- 
a systematic relation can be expected betw n 
data for individuals’ perceptions of a 
of situations and data for their reactions 
the same situations. Such a prediction -s 
empirically tested in the present study. Wii 


the reasoning above in mind, perception dita 
should be regarded as independent variahies 
and reaction data as dependent variables in 


studies of the relation between the two kinds 
of data. 


The Present Study 


The main purpose of the present study was 
to investigate the relationship between situ- 
ation perception data and situation reaction 
data. This was attained by comparing the 
outcomes of a classification of the same 
stressful situations using the two approaches 
discussed above. 

For perception data the classification of 
situations was made by dimensional analysis 
of the mean similarity matrix, and for reac- 
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tion ‘a by factor analysis of the matrix for 
the relations among situations. According 
to © assumptions made about the depen- 
dev > of individual behavior on individual 


pe eption of the situations, the individual 
nn reaction scores should be more similar 
fo situations which are perceived as similar 

n for situations which are perceived as less 

nilar or not at all similar. Consequently, 

e matrix for the correlations between situa- 
ions, based on individual mean reaction 
scores, should have the same main structure 
as the similarity matrix for perception data. 
Thus, the reaction and perception data were 
expected to give the same main grouping of 
the situations. 


METHOD 
Experimental Procedure 


Inventory ratings and similarity estimates were 
collected during 2 hours of group testing. Two 
groups of subjects made their ratings separately 
but simultaneously. One group completed the in- 
ventory ratings (reactions) before giving the simi- 
larity ratings (perceptions), with an unrelated task 
(distractor) performed between these two ratings. 
For the second group the order of presentation was 
reversed. The time interval between the two types 
of ratings was about 30 minutes for both groups. 


The Reaction Approach 


Subjects. The subjects were 40 adolescents, 25 
girls and 15 boys, from two ninth-grade classes of 
the compulsory unselected school system. The sub- 
jects had a mean age of about 16 years and can be 
regarded as fairly representative of their age group. 

Inventory. A modified version of an “S-R in- 
ventory of anxiety” constructed by Magnusson and 
Ricklander and presented by Ekehammar et al. 
(1974) was employed. The present version com- 
prised 12 situations and 10 reaction scales, selected 
on the basis of previous analyses of the inventory. 
The situations used here were chosen so as to cover 
four types of stressful situations, tentatively denoted 
“Threat of punishment,” “Ego threat,” “Threat of 
pain,” and “Inanimate threat” in the earlier report. 
The reaction scales were selected so as to cover two 
iypes of anxiety reactions, tentatively denoted “Psy- 
chic anxiety” and “Somatic anxiety.” In addition 
to the instruction page, the inventory consisted of 
12 pages. At the top of each page a situation was 
formulated, and below were printed the 10 reac- 
tions, with their 5-point intensity scales. 

Situations, The 12 verbally described situations of 
the inventory, grouped according to the a priori 
classification, are presented in Table 1. The order 
of presentation of situations was randomized among 
subjects, 
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TABLE 1 


SITUATIONS GROUPED ACCORDING TO A 
PRIORI CLASSIFICATION 


“Threat of punishment” 


1. Have just been caught pilfering 

2. Called to the headmaster for doing something 
forbidden 

3. Discovered by a teacher when playing truant 


“Threat of pain” 


4. Going to have an injection that will hurt 
5. Going to have a wound sewed up at the hospital 
6. Wait at the dentist’s to have a tooth pulled 


“Inanimate threat” 


7. Alone at home hearing someone try to get in 

8. Got lost in the woods at nightfall 

9. Alone in a desert cottage during a violent 
thunderstorm 


“Ego threat” 


10. Give an oral report before the class 
11. Starting a summer job for the first time 
12. Entering an athletic contest 


Ratings. For each situation, the subjects were in- 
structed to rate the intensity of their reactions on 
each of the 10 reaction scales. The ratings were 
given on a numerical 5-point scale ranging from 
“1—not at all” to “5—very much.” The order of 
presentation of reaction scales was the same for 
each situation. The reactions assumed to indicate 
“Psychic anxiety” were: ‘1—Become nervous,” “4— 
Get worried,” “6—Get feelings of insecurity,” “8— 
Get depressed,” “10—Get feelings of panic.” The 
reactions assumed to indicate “Somatic anxiety” were 
“2—Hands begin to shake,” “3—Swallowing diffi- 
culties,” “5—Heart beats faster,’ “7—Get in a 
sweat,” “9—Get stomach trouble.” 

Method of analysis. Data were collapsed across 
reaction scales, and the Person X Situation matrix 
was analyzed. Product moment correlation coeffi- 
cients were computed among the situation scales 
across subjects, and the correlation matrix was 
treated with principal components analysis. The 
factor matrix was orthogonally rotated to simple 
structure according to the varimax procedure. The 
number of factors to be rotated was chosen ac- 
cording to two criteria: (a) factors with eigen- 
values greater than unity, and (b) the number of 
factors (four) expected from the a priori grouping 
were rotated. The eigenvalue criterion gave three 
factors which accounted for 72% of the total vari- 
ance. The four-factor solution accounted for 78% 
of the total variance. 
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The Perception Approach 


Subjects. Similarity estimates were obtained from 
the same 40 subjects who took the reaction inven- 
tory. 

Situations. The same 12 stressful situations used 
in the inventory approach were employed (see 
Table 1). 

Ratings. The subjects were instructed to rate the 
degree of perceived similarity for each pair of situ- 
ations on a numerical 5-point rating scale, ranging 
from “O—not at all similar’ to “4—completely 
similar” (see Magnusson, 1971). The order of 
presentation of stimulus pairs was randomized 
among subjects. 

Methods of analysis. Mean similarity estimates 
among situations across subjects were computed as 
arithmetic means of the individual similarity ma- 

trices. The mean similarity matrix was analyzed 
according to a multidimensional scaling methodology 
proposed by Ekman (e.g., 1965). The simplest vari- 
ant of this method (cf, Ekehammar, 1972) was 
used here, by which the similarity estimates after 
linear transformation to a scale ranging from 0 to 1 
are treated, in principle, as correlation coefficients. 
The same factorial procedure as described under 
the inventory approach was employed. The eigen- 
value criterion (greater than 1) gave four factors in 
accordance with the expectations based on the a 
priori grouping (see Table 1). The four factors 
accounted for 67% of the total variance. 


Overall Comparison of the Approaches 


The main comparison between the two approaches 
was made by estimating the degree of congruence 
among corresponding situational factors obtained by 
the two kinds of data, This was accomplished by 
rotating the two factor structures to a common 
space according to a method proposed by Veldman 
(1967), which may be regarded as a target rotation 
procedure. In this case the four-factor solution 
based on the similarity estimation approach was 
used as the target in accordance with the view that 
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the perception data is regarded as the indepen: 
variable in relation to the reaction data, as exp 
in the introduction. After rotation, two diffe: 
measures were computed for the agreement betw 
the loadings of corresponding factors in the 
matrices: (a) product moment correlation coeffic 3 
and (b) coefficients of congruence (cf. a 
1967). 


Comparison oj Single Situations 


More detailed comparisons between perceptior 
and reaction data were made for one situation at 
a time. Two different indices were employed for 
indicating the agreement of each situation’s position 
in the perception and reaction spaces, respective 
The first index was based on the mean similar: 
matrix for perception data and the correlation ma 
trix for reaction data, For each situation a correla- 
tion was calculated between the values of its row 
in the similarity matrix and the values of its row 
in the correlation matrix. Such a coefficient, for a 
certain situation, gives a rough indication of the 
similarity between the perception and the reaction 
matrices with respect to the target situation’s dis- 
tance from the other situations. Second, product 
moment correlation coefficients were calculated be 
tween corresponding rows (situations) of the facto; 
matrices for perception and reaction data, respec- 
tively. The last index was computed on the factors 
after rotation to a common space. 


RESULTS 


The mean similarity matrix for perception 
data and the correlation matrix for reaction 
data are shown in Table 2. The similarity 
estimates are presented above the main diago- 
nal and the correlation coefficients below. 

Since the perception data were regarded as 
the independent variable, the factor matrix 
for the similarity data is reported first. The 


TABLE 2 
MEAN SIMILARITY ESTIMATES AND CORRELATION COEFFICIENTS AMONG SITUATIONS 

Situation 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | it | 12 
1 57 46 21 24 20 24 21 19 26 24 27 
2 70 62 24 23 23 26 26 19 30 26 29 
3 62 70 22 23 20 22 19 19 30 21 26 
4 40 38 31 58 59 26 24 26 32 18 26 
5 62 56 46 172 63 29 25 23 30 23 31 
6 38 46 49 46 50 25 23 16 28 19 25 
7 66 52 44 47. 62 25 43 41 19 16 19 
8 56 55 35 32 51 18 62 53 21 14 24 
9 42 42 08 31 54 11 49 61 23 15 24 
10 42 50 44 40 51 47 36 33 12 41 34 
11 67 18 57 54 64 40 50 52 50 43 33 

12 58 45 49 41 65 36 58 42 41 36 58 


Nolte. Zeros and decimal points omitted. Mean similarity estimates are above the diagonal and correlation coefficients are 


below the diagonal. 


PERCEPTIONS OF AND REACTIONS TO STRESSFUL SITUATIONS 


TABLE 3 
Si INAL FACTORS BASED ON THE PERCEPTION 
KOACH ROTATED TO SIMPLE STRUCTURE 
Factor 

Situation | k 

I IV 
ught pilfering 76 15 | 63 
to the headmaster 85 16 | 77 
vered playing truant 81 14 | .69 
ng an injection .12 14 | 71 
ving a wound sewed up | .12 19 | <74 
At the dentist's aol 12 | .76 
Alone at home 19 04 | 55 
Got lost in the woods -11 11 | .69 
1. Alone in a desert cottage -06 16 | .70 
Give an oral report 219 71 | 59 
11. Starting a new job 13 82 | 68 
12, Athletics contest 19 61 | 48 


Note, Highest loading for each situation is italicized. 


four situational perception factors, chosen on 
the basis of the eigenvalue criterion and 
rotated to varimax simple structure, are 
shown in Table 3. Using similarity data, the 
situations were distributed on factors accord- 
ing to the expectations based on the a priori 
grouping of the situations (see Table 1). 
The three factors for reaction data chosen 
on the basis of the eigenvalue criterion and 
rotated to simple structure are reported in 
Cable 4. The highest loadings of the situa- 
ions for the first three a priori groupings 
vere distributed on factors according to the 
xpectations. However, the situations for the 
urth a priori grouping (Situations 10, 11, 
nd 12) did not form a factor of their own. 
situation 10 had its highest loading on Fac- 
tor II, Situation 11 on Factor I, and Situa- 
tion 12 on Factor III, and all these three 
situations had two loadings higher than .40. 


TABLE 4 


SITUATIONAL FACTORS BASED ON THE REACTION 
APPROACH ROTATED TO SIMPLE STRUCTURE 


Factor 
Situation |__| hi? 
I I iit 

, Caught pilfering 64 55 

. Called to the headmaster 77 43 

. Discovered playing truant 88 12 

. Having an injection «05 32 

. Having a wound sewed up -24 55 
At the dentist's 46 —.08 

7, Alone at home 32 70 
8. Got lost in the woods -30 78 
9, Alone in a desert cottage —.05 87 
10. Give an oral report 50 06 
11. Starting a new job 55 53 
12. Athletics contest 36 51 


Note. Highest loading for each situation is italicized, 
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TABLE 5 


SITUATIONAL Factors BASED ON THE REACTION 
APPROACH ROTATED TO MAXIMUM AGREEMENT 
WITH THE FACTORS FROM THE 
PERCEPTION APPROACH 


Factor 
Situation — y 
I rie Eee AE aia 
1. Caught pilfering a A ae EE, 
2. Called to the 
headmaster 60] 08} .29| 65 | .87 
3. Discovered playing 
truant „8&4 »21 || —.04 34 | .87 
4. Having an injection | —01 | ‘80 | 26] 131 | ‘80 
5. Having a wound 
sewed up 23| 68} 46] .35 | .85 
6. At the dentist's 20 e a ET ET E 
7. Alone at home 44] 33| <62] 112 | 270 
8. Got lost in the 
woods 24] —02]| 72] 44] 07 
9. Alone in a desert 
cottage —.05 08 8S 82 
10. Give an oral report 21 35 | —.05 69 
11. Starting a new job 47 | 20] 42 72 
12. Athletics contest 55| 54| 142 a7 
Note. Highest loading for each situation is italicized. 
In order to examine the congruence be- 


tween the factor structures of perception and 
reaction data, the four-factor solutions were 
rotated to a common space with the similar- 
ity matrix as the target (see the Overall Com- 
parison of the Approaches section above). The 
four-factor solution for reaction data rotated 
in this way is reported in Table 5. 

Two different quantitative measures of the 
agreement between the two factor matrices 
rotated to maximum possible agreement were 
calculated as described in the Overall Com- 
parison of the Approaches section above. The 
obtained indices are presented in Table 6. 
These results plus an inspection of Tables 4 
and 5 indicate that the congruence between 
the two matrices was good for the first three 
factors, “Threat of punishment,” “Threat of 
pain,” and “Inanimate threat.” For the fourth 


TABLE 6 


INDICES OF FACTORIAL AGREEMENT BETWEEN 
CORRESPONDING FACTORS OF THE PERCEPTION 
AND REACTION DATA, RESPECTIVELY 


Agreement index 


Factor 
Coefficient of Product moment 
congruence coefficient 
pi 90 18 
I 92 84 
it 89 -16 
IV 69 AS 
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TABLE 7 


Propuct MoMENT COEFFICIENTS FOR THE 
AGREEMENT BETWEEN EACH SITUATION’S 
POSITION IN THE PERCEPTION AND 
REACTION SPACE, RESPECTIVELY 


Factor matrix for 
Similarity matrix perception data 
Situation vs. va 
correlation matrix factor matrix 
for reaction data 
1 -56 91 
2 709) 81 
3 07 -96 
4 259. 97 
5 -20 91 
6 46 SIR 
7 ovat -93 
8 ai -90 
9 -50 -98 
10 AB 97 
tt -06 69 
12 06 AL 


factor, containing situations which were sup- 
posed to be ego threatening in the a priori 
grouping, the congruence was lower. The cor- 
relation coefficient, mainly expressing the 
agreement between the two rank orders of 
factor loadings disregarding magnitude, was 
very low for the fourth factor. The two 
indices gave the same rank order of the 
factors with respect to agreement between the 
two types of data. 

Two indices for the agreement of each situ- 
ation’s position in the perception and reac- 
tion space, respectively, were computed as 
described previously. The indices are pre- 
sented in Table 7. For most of the situations 
the measures indicate a high agreement in 
Positions in the two situational spaces. There 
are, howeyer, some situations for which this 
does not hold, For Situations 11 (“Entering 
an athletic contest”) and 12 (“Starting a 
summer job for the first time”), both mea- 
sures show the poorest agreement. These two 
situations belong to the fourth a priori group- 
ing of situations, supposed to be ego threat- 
ening, and they contribute heavily to the low 
agreement for the fourth situational factor 
(see Table 6) in the two kinds of data. 


Discussion 


The expectations were that the situations 
should be distributed on factors in the same 
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main way for situation reaction data as ! 

situation perception data. For the first thr: 
factors, tentatively denoted “Threat of pu 
ishment,” “Threat of pain,’ and “Inanima 

threat,” the congruence was very clear ap 
the grouping of situations was as expecte 
from the a priori groupings. These result 
support the use of the first three groupings « 
stressful situations in further research on anx 
iety. Especially in research on the Perso: 
X Situation interaction, empirically verifiec 
situational factors are of great importance 

The clear distribution of stressful situa- 
tions on different factors both for situation 
perception data and for situation reaction 
data supports the conclusion that anxiety 
cannot be considered as one general factor 
but should be studied in a multidimensional 
space (Endler, in press-a). 

For one of the a priori groupings, contain- 
ing situations which were supposed to be ego 
threatening, the two kinds of data did not 
yield the same result. In the analysis of 
situation perception data these situations 
formed a separate factor as expected. When 
the eigenvalue criterion was applied in the 
analysis of situation reaction data only three 
factors were obtained, corresponding to the 
first three a priori groupings. The situations 
in the fourth a priori group were distributed 
on different factors, with rather high loadings 
on more than one factor. It is interesting to 
note that a similar difference in structure 
between situation perception and situation 
reaction data was obtained in the earlier 
study from which the situations for this study 
were selected (see Ekehammar et al., 1974). 
The situations for that study were initially 
selected on the basis of an analysis of situa- 
tion perception data, which gave four main, 
expected situation factors. In the main study, 
which yielded only situation reaction data, 
three factors were obtained corresponding 
rather well to the ones in the present study 
for the same kind of data. 

What then is the character of the situa- 
tions in the fourth a priori grouping of situ- 
ations, which were obtained in the analysis of 
the perception data but not in the analysis 
of the reaction data? A reasonable inter- 
pretation is that they all involve demand of 
achievement from the individual. In this re- 


ké 
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spect t! liffer from all the other situations. 
If this . che most obvious similarity between 


these tions, and a characteristic feature 
not ed with the other ones, it seems 
reas ble that they form a homogeneous 
grou» in the analysis of situation perception 
dat». This situation perception factor could 


be | abeled “Demands of Achievement.” 
the introduction of this article reasons 
wW given why we might expect a system- 
alic relation between situation perception 
«ata and situation reaction data. Such a sys- 
ematic relation was also found for the first 
three a priori groupings of situations. The 
less clear factor structure for reaction data 
and the discrepancy in outcome for the 
fourth a priori grouping of situations lead, 
however, to a more nuanced view on the 
relation between the two kinds of data. 
The discrepancy in outcomes for the fourth 
a priori grouping of situations may serve as 
a starting point for a tentative discussion on 
he relation between situation perception and 
ituation reaction. From the, reasons given 
bove, it seems natural that the situations 
hich have the common feature of demanding 
hievement, as distinguished from the other 
s uations, formed one separate factor when 
> analysis was based on situation percep- 
ton data. However, individuals obviously 
c ‘fer in their reactions to achievement de- 
ands. To some individuals such demands 
eve challenging, to others anxiety provoking. 
‘Vhe first kind of individual would probably 
seek achievement demanding situations and 
react actively and perhaps aggressively to 
them, while the second kind would react with 
avoidance and withdrawal. Thus, individuals’ 
reactions may be different to a situation 
which they perceive similarly. This reasoning 
can explain the fact that we obtained a clear 
situation factor for perception data, which 
was not obtained for reaction data. It can 
also explain the generally less clear factor 
structure for reaction data than for percep- 
tion data in the present study. The point is 
clearly seen for the achievement demanding 
situations but the same kind of reasoning also 
holds for other types of situations, to an 
extent which is dependent upon the character 
of the situations. 
The basic conclusion of this discussion is 
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that there is no necessary general systematic 
relationship between situation perception data 
and situation reaction data which is valid 
across individuals and across situations of 
different character. The consequence of this 
is that we have to proceed with research on 
both kinds of data. This conclusion has impli- 
cations for the two kinds of problems which 
are crucial in the study of the psychological 
significance of situations: for the classification 
of situations and for the understanding and 
prediction of individual behavior in an inter- 
actionist frame of reference. (a) Even if there 
is a substantial agreement in factor structure 
between the two kinds of data, we have to 
distinguish clearly between situation percep- 
tion factors and situation reaction factors in 
further research. Each of these factor struc- 
tures needs to be investigated. The relation- 
ship between the two kinds of situational 
factors may be a fruitful subject for further 
research. (b) Obviously it is not always pos- 
sible to predict an individual’s reaction to a 
situation only from knowledge about his per- 
ception of it. Individuals differ in reactions 
to a situation which they perceive similarly. 
These interindividual differences per se are of 
interest for further research. According to the 
interactionist view, what is characteristic for 
an individual is the specific multivariate pat- 
tern of reactions across situations that have 
different meanings for him. From the stand- 
point that the meaning of situations deter- 
mines an individual’s behavior, the specific, 
individual relationship between perception of 
and reaction to different situations should be 
an important subject for further research. A 
better knowledge about that relationship 
would contribute to a better understanding 
and a better prediction of individual behavior. 
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Two experiments tested whether anticipatory belief change represents move- 
ment toward the speaker’s position or moderation (movement toward a neutral 
position), Forty-four college undergraduates participated in the first experi- 
ment, One condition replicated the procedure of earlier studies by informing 
subjects that they would hear a message advocating a position on the opposite 
side of the issue from their own. In a second condition, subjects were told the 
topic of the impending message but not the position the communicator would 
take. Subjects in the third condition anticipated hearing a message advocating 
a position on the same side of neutrality but much more extreme than their 
own, putting the toward-a-position and moderation explanations in direct 
conflict. In a fourth, control condition subjects did not anticipate hearing a 
communication. The results of the first experiment offered some support for 
the moderation explanation, but ambiguities in the data made other explana- 
tions possible. Forty-eight college undergraduates participated in a second 
experiment that replicated the findings of Experiment 1 and gathered data that 
ruled out the previous alternative explanations. The results of the two experi- 
ments suggest that pressures both to moderate one’s view and to adopt the 


speaker's position operate to produce precommunication belief change. 


When members of an audience are com- 
itted to their current viewpoint, forewarning 
əm that an impending persuasive com- 
nication will advocate a position contrary 
their own may entrench the individuals in 
ir precommunication beliefs and reduce 

speaker’s persuasive impact on them. On 

other hand, a similar forewarning pre- 
sented to subjects who are minimally com- 
mitted to their existing position may produce 
significant attitude change in the same direc- 
tion the communicator will recommend, 
change that occurs prior to presentation of the 
message (Kiesler & Jones, 1971). 

Initially, McGuire and Millman (1965) 
proposed an explanation for precommunica- 
tion belief change toward the speaker's posi- 
tion based on what they called self-esteem 
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theory. Seeing oneself as susceptible to influ- 
ence or gullible was damaging to self-esteem, 
they reasoned. So, to preserve self-esteem and 
avoid the appearance to oneself or others of 
having been persuaded, an individual should 
adopt the viewpoint the communicator will 
recommend before the message is actually 
presented: postwarning but precommunica- 
tion. Then, when the attack comes, the illu- 
sion of no change in attitude will be estab- 
lished, self-esteem preserved, and/or “face 
saved.” 

Another explanation suggested by Mc- 
Guire and Millman (1965) and preferred by 
Papageorgis (1967) relies on notions of cog- 
nitive consistency. Forewarning informs an 
individual that someone disagrees with his or 
her own position on the issue. This knowledge 
creates a state of inconsistency among the 
person’s cognitions and evokes pressures to 
reduce the inconsistency. Changing one’s be- 
liefs to coincide more nearly with those of 
the communicator, even before the message is 
delivered, would be one way of restoring cog- 
nitive consistency. 

Examination of procedures used to produce 
anticipatory attitude change in earlier experi- 
ments suggests a third explanation. Studies 
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in which forewarning has produced greater 
resistance to persuasion have used issues on 
which the subjects were homogeneous and 
extreme in their opposition to the speaker’s 
position (Papageorgis, 1968). Experiments 
producing anticipatory change have, by con- 
trast, used issues on which subjects’ atti- 
tudes were much less extreme and were prob- 
ably based on little initial information. For 
example, McGuire and Millman (1965) used 
issues such as the likelihood of further com- 
munist takeovers in Latin America and the 
difficulties of developing a cure for cancer. 

So, anticipatory belief change has been 
found with issues on which subjects’ initial 
attitudes were moderate, topic knowledge 
and commitment were low, and subjects were 
probably not closed to new information on 
the issue. When open to and anticipating 
new information on an issue, one may be 
quite willing to withdraw to a neutral, open- 
minded posture, expecting an updating of his 
or her knowledge of the topic. When one is 
about to be confronted by an expert on an 
issue on which one possesses only rudimentary 
knowledge, a moderation of opinion or “wait 
and see” orientation may prevail. Under such 
conditions a shift in opinion toward the mid- 
dle of an attitude scale would not be surpris- 
ing. Intriguingly, despite the relative neutral- 
ity of initial opinion on the issues used in the 
anticipatory attitude change studies reviewed, 
in every instance the mean opinion measured 
after warning was closer to the midpoint of 
the scale used than the prescores or control 
group (Cooper & Jones, 1970; Deaux, 1968, 
1972; Dinner, Lewkowicz, & Cooper, 1972; 
McGuire & Millman, 1965; Papageorgis, 
1967). In other words, the opinion change 
in the direction of the position that was ex- 
pected to be advocated by the communicator 
in the studies may have been merely inci- 
dental to a shift toward neutrality. 

When one is about to be exposed to new 
information, a neutral position would be as- 
sociated with broadmindedness and lack of 


1 Kiesler and Jones (1971) did not report their 
data in a manner that makes this retrospective 
analysis possible. However, it seems likely from the 
data reported that their conditions showing antici- 
patory attitude change were also closer to midscale 
after warning than before. 
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bias. Neutrality could result from a self- 
admission of ignorance on the topic or i om 
a dispassionate contemplation of known a: u- 
ments on both sides of the issue, argum: ts 
the speaker may propose as well as some ‘ at 
may oppose the speaker’s position. A new: al 
position is defensible and difficult for ot’ rs 
to criticize when one’s own knowledge of <e 
topic is low or the issue is controversial. o 
moderation may be either logical or strateg c, 
representing a synthesis of one’s knowlec ʻe 
on a topic on which one is poorly inform 1 
and uncommitted or a maneuver to a sae 
position to protect the image one presents 
others. 

Interestingly, preexposure opinion chang 
was first termed anticipatory belief lowerin; 
by McGuire and Millman (1965). An indi 
vidual anticipating a message “will moderat 
his belief away from his initial position . . . 
when he gets the warning” (p. 471, italics 
added). 

Of course, in the studies conducted to date 
pressures in the direction proposed by t 
source and toward neutrality have been co 
founded. The present research was design: 
to separate and test these alternative exp! 
nations in two ways. First, if anticipato: 
effects are the result of adopting a neutr: 
position while awaiting new information, for: 
warning of the speaker’s topic but omittin 
the position to be taken should not diminis! 
the preexposure results. Assuming a neutra 
stance while awaiting new information on a 
unfamiliar topic should not depend upon 
knowledge of the speaker’s position. Without 
knowledge of the position the source will 
recommend, psychological effects of self- 
esteem or consistency that require awareness 
of a discrepancy between one’s own position 
and that of the speaker should be eliminated. 
However, effects due to anticipation of new 
information should be left intact: j 
communication changes that occurrèd without 
knowledge of the speaker's position could not 
be construed as a shift toward the source 
either to protect self-esteem or to provide 
consistency. Still, the predictions made by the 
“toward-a-position” explanations when one is 
ignorant of the speaker’s position may not 
be so unequivocal. For example, the self- 
esteem notions might argue that when the 
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positi bout to be advocated is unknown, 
adop more neutral position gives one the 
great protection against being confronted 
wit! ge discrepancy between one’s own 
posi on and the communicator’s and then 
app cring gullible if persuaded. Moving 


tov wd the speaker’s position is preferred, but 
wih that information is unavailable, neu- 
tr- ity is then seen as the safest position from 
wich to protect one’s self-esteem. As a result, 
ie second test of the moderation explana- 
ion was considered more stringent. 

If a shift toward neutrality is the sole pro- 
ducer of anticipatory belief change, notifica- 
tion that the communicator will recommend 
a position on the same side of the issue but 
more extreme than the moderate viewpoint 
initially held by subjects should also produce 
a change toward neutrality. In this case the 
change would be away from that to be pro- 
posed by the communicator, Consequently, 
the two explanations were put in direct con- 
flict: the toward-a-position explanations pre- 
licting anticipatory change toward a more 
xtreme position and the moderation explana- 

on predicting anticipatory lowering of belief 

an even more neutral position. 


EXPERIMENT 1 
ethod 


Design overview. The experiment was an after- 

y design with three experimental conditions and 
ese control condition. In one condition the pro- 
cedure of earlier studies was replicated. Subjects 
were led to anticipate a message advocating a posi- 
tion on the opposite side of the issue from the 
position held by most of them. In a second condi- 
tion subjects were informed of the topic of the 
impending message but not of the position the 
communicator would take. Subjects in the third 
condition expected to hear a communicator recom- 
mend adoption of a position much more extreme 
but on the same side of the issue as the existing 
opinions of most of the subjects. Finally, a control 
group of subjects simply indicated their attitude on 
the experimental issue without anticipating exposure 
to a persuasive message. 

Subjects. Forty-seven undergraduates participated 
in the experiment. Subjects were recruited from 
introductory psychology classes and were paid $1 
for participating. Although 1-5 subjects participated 
simultaneously during an experimental session, sub- 
jects were seated apart in order to minimize inter- 

n between them. Only one condition was run 
ach session, and the sessions were randomly 
assigned to one of the four conditions before the 
Subjects arrived. In order to equate the sample sizes 
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in all conditions, the data from three randomly 
selected subjects in the control condition were 
omitted from analysis. The remaining 44 subjects 
were evenly distributed among the four conditions 
of the experiment. 

Procedure. The procedure was a modified version 
of that used by Cooper and Jones (1970). When 
subjects assigned to the three experimental condi- 
tions arrived, they were told that the research was 
investigating what makes a successful persuasive 
communication. The experimenter explained that he 
had obtained tape recordings of several messages 
that other audiences had found extremely persuasive 
and that he was interested in determining just what 
made them so persuasive. He suggested that while 
the subjects were listening to the message they 
might want to focus on such things as the nature of 
the arguments used, the way they were presented, 
and the speaker’s style. The tape was to be played 
on a tape recorder that was prominently displayed 
in the room. The experimenter told the subjects 
that the particular tape they would hear was en- 
titled, “The coming economic situation” (for sub- 
jects in the topic-only condition), “The coming 
economic depression” (same-side condition), or “The 
coming economic prosperity” (opposite-side condi- 
tion), The experimenter continued in all three con- 
ditions by saying, 


This is a recording of a speech made by Dr, John 
M. Bauman. Dr. Bauman is a well-known au- 
thority on economics, a member of the President’s 
Council of Economic Advisors, and a professor of 
economics at Princeton University. This particular 
talk is a recording of Dr. Bauman’s presentation 
over the National Educational Television Net- 
work in May of this year, At that time the talk 
was acclaimed as an extremely persuasive and 
interesting analysis of current economic trends, 


The completion of the description of the anticipated 
massage differed for each condition as follows: 
opposite-side condition—‘In his talk Dr. Bauman 
takes the position that the U.S. economy is defi- 
nitely moving into an era of unparalleled prosperity, 
full employment and abundance for Americans at 
every economic level”; same-side condition—‘In his 
talk Dr. Bauman takes the position that a serious 
economic depression will definitely affect the U.S. 
economy in the immediate future”; topic-only con- 
dition—‘In his talk Dr. Bauman gives his views on 
where the U.S. economy is heading in the near 
future.” 

All subjects were then asked to indicate their own 
opinions on the topic before hearing the tape- 
recorded message. Subjects were asked to respond to 
the statement “There will soon be an economic 
depression in the United States” by indicating their 
opinion on a 15-point scale with the endpoints 
labeled “Definitely False” and “Definitely True” 
and the midpoint labeled “Uncertain.” 

To ensure that the subjects had been paying 
attention and could recall what they had been told 
about the upcoming message, after indicating their 
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opinions subjects were asked to write the topic of 
the communication and then the specific position the 
speaker would take, They were instructed that if 
they did not know the answer to one or both of 
the questions, they were to write “Don’t know.” Of 
course, subjects in the topic-only condition could 
not know the position to be taken in the message. 
Finally, subjects were asked to return to any ques- 
tion they had marked “Don’t know” and try to 
guess what the answer might be. 

‘Control group subjects, after they arrived, were 
asked simply to indicate their opinion on the same 
attitude scale. No mention was made of an up- 
coming communication, 

No message was actually presented to any sub- 
jects, and the experiment was terminated following 
completion of the questionnaires. The true nature 
of the experiment was then revealed to the subjects 
and the hypotheses and necessity for deception 
explained. Before leaving, subjects’ cooperation in 
not revealing the true purpose of the experiment to 
other potential subjects was solicited, and all 
readily agreed, 


Results and Discussion 


All subjects in the same-side condition and 
opposite-side condition correctly recalled the 
speaker’s topic and the position they had 
been told he would recommend. All subjects 
in the topic-only condition correctly remem- 
bered the speaker’s topic and indicated that 
they did not yet know the position he would 
take. Therefore, the manipulations were per- 
ceived by the subjects as they had been in- 
tended. In the topic-only condition, when 
asked to guess the position the speaker would 
take, 9 out of 11 subjects thought that the 
speaker would foresee a depression for the 
United States in the immediate future, the 
position that they weakly held (x? = 4.46, 
p< .05). 

The results of the attitude measure are 
summarized in Table 1. A preliminary single- 
factor analysis of variance performed on these 
data yielded a significant result, F(3,40) = 
7.58, p< .01. Additional comparisons be- 
tween individual cell means were conducted 

where appropriate using an orthogonal con- 
trast and Dunnett’s method of comparing all 
treatment conditions with a control (Winer, 
1962).2 
Hypothesis 1. The opposite-side condition 
conceptually replicated the procedure and be- 
lief change results of previous research. When 


? All statistical tests are two-tailed, 
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compared with the control condition, subjects 
in the opposite-side condition showed evidence 


of anticipatory attitude change, Dunnett's 
t(40) = 3.66, p < 01. 

The first hypothesis tested the leration 
explanation of precommunicati belief 
change by omitting information on speak- 
er’s position during the warning h the 
self-esteem and consistency explan pre- 
dict change toward a communicator ition. 
Consequently, if subjects are igno f the 
position to be recommended, as in pic- 
only condition, a moderation expla: still 
predicts attitude change; the tow jsi- 
tion explantions may not. The r: 1p- 
ported the moderation explanation. ning 
subjects in the topic-only conditi he 
speaker’s topic but not the position he would 


take also produced significant precommuni:i- 
tion shift in opinion when compared \ 2 
control condition, Dunnett’s #(40) = 2.66, 
< .05. Furthermore, the amount re 
produced in the topic-only condition did not 
differ from that produced by the proce)" of 
earlier studies (opposite-side condit 

As the moderation explanatio: 
elimination of subjects’ knowledge < 
municator’s position failed to redu 
cantly the precommunication belie Ige. 
However, as mentioned earlier, th vard- 
a-position explanations may also b le to 
account for these findings, if in the nce 


cts, 
m- 
ifi- 


of knowledge of the speaker’s position, sub- 
jects view neutrality as the safest ition 
from which to guard one’s self-esteem o: pro- 


tect oneself against threats of cognitive incon- 
sistency. The data on guesses made ' b- 
jects in the topic-only condition pros sme 
slight evidence against the alternat expla- 
nation. In the absence of other information, 
guesses about the speaker’s position were not 


TABLE 1 


MEAN FINAL ATTITUDE OF Future U.S. Economic 
SITUATION, EXPERIMENT 1 


Topic only Opposite side Same side Control 
8.55 7.55 11.45 11.18 
(1.69) (2.88) (2.46) (2.14) 

Note. Measurements are given on a 15-point scale. The 


higher the number, the greater is the belief that there will 
Soon be an economic depression in the United States, Standard 
deviations are shown in parentheses. n = 11 per cell. 


wf 


La 


PERSUASION OR MODERATION? 


made udom, There was consensus among 
the s: s as to the position the speaker 
woul bably take. They did not anticipate 
that communicator would advocate a 
posi on the opposite side of the issue from 
the vn viewpoint. Therefore, their belief 
ch was directed away from their best 
gi of the speaker’s position. Nevertheless, 


th. evidence is of the weakest sort, since the 
jects could have been protecting them- 
ves against what they considered even an 
likely eventuality. 

Hypothesis 2, Because of the possible alter- 
native explanation for precommunication be- 
lief change in the topic-only condition, the 
second hypothesis was considered a more 
rigorous test of the moderation explanation. 
When subjects are forewarned that a com- 
municator will advocate a position on the 
same side of the issue but much more ex- 
treme, the toward-a-position explanations 
predict precommunication attitude change in 
the direction of greater extremity, while the 
noderation explanation predicts a lessening of 
xtremity of opinion. As may be seen in 

able 1, when the toward-a-position and 

\deration explanations were put in direct 

aflict in the same-side condition, no evi- 

nee of anticipatory change in either direc- 
ton was found (Dunnett’s ¢ < 1, ms). 

By itself, a finding of no difference is am- 
| iguous at best. But in the present situation 
the no-change results of the same-side condi- 
tion must be considered relative to the 
substantial precommunication attitude change 
found in the opposite-side condition. What- 
ever produced the belief change among sub- 
jects in the opposite-side condition was elimi- 
nated or counteracted simply by suggesting 
that the speaker would take an equally ex- 
treme position on the other side of the issue. 
Since the only difference between the proce- 
dure of the two conditions was the position 
the communicator was expected to take, the 
null finding suggests the possibility of off- 
setting, opposing psychological forces. In the 
Opposite-side condition and all previous in- 
gations of anticipatory attitude change, 

s toward the speaker’s position and 
toward moderation have been in the same 
direction and reinforced one another in 
Producing precommunication effects. Putting 
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them in opposition apparently eliminated the 
phenomenon. 

Unfortunately, other explanations may be 
possible, too. For example, the psychological 
force opposing the toward-a-position precom- 
munication effects may simply have been a 
ceiling effect. A ceiling effect seems unlikely, 
since the data were not highly skewed, nor 
did any subject use either endpoint of the 
attitude scale, but end-of-scale effects should 
be considered a possibility. 

A somewhat more troublesome alternative 
explanation is the possibility that subjects in 
the same-side condition did not change their 
belief because they did not perceive any dif- 
ference between their own position and that 
of the speaker. Most subjects only weakly or 
moderately anticipated an upcoming depres- 
sion, while the communicator believed “that 
a serious economic depression will definitely 
affect the U.S. economy in the immediate 
future.” Nevertheless, if the subjects did not 
perceive a difference between the speaker’s 
position and their own as intended, no belief 
change would be predicted by the toward-a- 
position explanations. 


EXPERIMENT 2 


A second experiment replicated the first 
with only a few changes that were designed to 
test the alternative explanations for Experi- 
ment 1. First, although the hypotheses and 
outcome of the topic-only condition suggested 
the plausibility of the moderation explana- 
tion, because the condition could not con- 
clusively distinguish between the competing 
explanations, it was not included in Experi- 
ment 2. Second, a general deterioration in 
confidence in the economy had occurred 
during the time between the experiments, 
which made a possible ceiling effect an even 
greater problem. So the topic of the antici- 
pated message was changed to the likelihood 
of discovering a cure for cancer. Pretesting 
indicated that the undergraduate population 
from which the subjects would be drawn was 
mildly optimistic that a cure for cancer will 
be found soon. Few of the students believed 
that a cancer cure was unlikely, and at the 
same time very few were extreme in their 
view that cancer will be cured quickly. 
Finally, in the second experiment subjects 


—— 
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were asked not only to give their own opinion 
but also to make a precise estimate of the 
upcoming speaker’s opinion in order to deter- 
mine whether they perceived the two positions 
as the same. 


Method 


Subjects. Subjects were 48 undergraduate volun- 
teers recruited from introductory psychology classes. 
As in Experiment 1, from 1 to 5 subjects partici- 
pated simultaneously with minimum interaction per- 
mitted between them. Only one condition was run 
each session, and the sessions were randomly assigned 
to one of the three conditions before the subjects 
arrived. 

Procedure. The experiment was an after-only de- 
sign with two experimental conditions and one con- 
trol condition. The conditions were procedurally 
similar to the same-side, opposite-side, and control 
conditions of Experiment 1, except that the topic 
of the anticipated communication was changed from 
the economic future of the U.S. to “A cure for 
cancer is in sight” (same-side condition) or “No 
quick cure for cancer” (opposite-side condition), In 
each case the introduction of the message to the 
subjects described in the procedure of Experiment 1 
was adapted to the topic of finding a cure for 
cancer, 

All subjects were asked to indicate their opinion 
on the topic before hearing the tape recording. 
Subjects responded to the statement “Medical re- 
search will soon discover a cure for cancer” by 
indicating their opinion on a 43-point scale labeled 
at intervals, 1 (completely agree), 7 (strongly agree), 
13 (moderately agree), 19 (slightly agree), 25 
slightly disagree), 31 (moderately disagree), 37 
(strongly disagree), and 43 (completely disagree). 

After subjects had indicated their own opinion, 
they were asked to indicate on an identical scale 
the position they believed the speaker would recom- 
mend in the message they were about to hear. 
Finally, they were asked to indicate on a 31-point 
scale how similar or different they thought their 
position would be in relation to that of the speaker. 

Control group subjects merely completed the first 
attitude item. No mention was made to them of 
a communication, 

No message was actually presented to any of the 
subjects, and the session was terminated in the same 
manner as in Experiment 1, 


Results 


The results of the second experiment are 
summarized in Table 2. The pattern of re- 
sults on the attitude measure replicated the 


* Transcripts of the descriptions of the message 
used in Experiment 2 may be obtained by writing 
to the author. 
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results of Experiment 1. An overall single “2c 
tor analysis of variance on the data of «ll 
three conditions yielded a significant res ‘i, 
F(2,45) = 4.29, p < .05. The opposite-s. ie 
condition replicated the procedure of ear! +r 
research and again produced significant : +- 
ticipatory belief change. The final attitude -f 
subjects in the opposite-side condition diffe 
from the attitude of control group subjec 
Dunnett’s #(45) = 2.87, p < .02. Reversi 
the position the subjects believed the coi 
municator would take again eliminated pr 
communication belief change, as found i 
Experiment 1. The final attitude of subject 
in the same-side condition did not differ sig 
nificantly from the attitude of subjects in the 
control group (Dunnett’s ¢ < 1, ms). 

Thus, the results of the attitude measure 
are essentially the same in the two experi- 
ments. In order to test the alternative expla- 
nations posed to account for the results of 
Experiment 1, the subjects’ perception of the 
speaker’s position must be examined (see 
bottom row of Table 2). First of all, th: 
subjects in the two conditions perceived th: 
position of the speakers differently, as wa 
intended, #(30) = 21.93, p< .001. But 
more important question with regard to thi 
alternative explanations is how the subject: 
perceived the speaker’s position relative t 
their own, particularly those subjects in the 
same-side condition. 

The subjects’ attitude and perception o! 
the speaker’s position were compared usin; 
a £ test for correlated samples (Ferguson 
1966). Not surprisingly, the attitude ex 
pressed by subjects in the opposite-side condi 
tion differed from the attitude they attrib- 


TABLE 2 
MEAN FINAL ATTITUDES AND PERCEPTIONS OF 
SPEAKER’S POSITION ON PROSPECTS POR A 
CANCER CURE, EXPERIMENT 2 


Opposite 


Dependent measure side Same side Control 
Subject’s own attitude | 18.13 14.06 12.19 
; (8.06) (4.12) (459) 
Perception of speaker’s | 33.06 4.44 
position (4.73) (2.22) 
Note. Measurements are given on a 43-point scale, The 


lower the number, the greater is the belief that a cure for 
cancer will be discovered soon. Standard deviations are shown 
in parentheses, n = 16 per column. 
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utec the anticipated communicator, ¢(15) 
E= p < .001. More importantly, subjects 
in same-side condition perceived the 
sp “s position as on the same side of the 
is: but significantly more extreme than 


th- own, £(15) = 7.73, p < .001. In fact, all 
si jects in the same-side condition attrib- 

d a more extreme position to the speaker 

in they held. In the second experiment at 

ist, the absence of anticipatory belief 
hange in the same-side condition did not 
occur because the subjects failed to perceive 
a difference between their position and the 
one the speaker was expected to take. They 
perceived the speaker’s view as different from 
their own but did not move toward it. 

Similarly, the fact that the subjects con- 

sistently attributed more extreme positions to 
the expected speaker than they had previ- 
ously attributed to themselves argues against 
a ceiling effect depressing subjects’ attitude 
scores following forewarning in the same-side 
condition. Subjects were not reluctant to use 
more extreme parts of the scale when they 
considered them appropriate. Of course, in 
his case they were describing someone other 
han themselves, but in pretesting sessions, 
ther subjects from the same population, 
hen describing their own attitudes on other 
sues, had also been willing to adopt more 
treme positions on similarly constructed 
titude scales. In fact, the topic of a cancer 
ire was chosen for the experiment because 
he subjects’ attitudes were homogeneously 
1oderate. 

Finally, when asked to indicate how similar 
ov different they felt their position and the 
speaker’s were, subjects in the opposite-side 
condition (X = 17.13) did perceive a greater 
difference between the two positions than 
did subjects in the same-side condition, 
X = 10.00; ¢(30) = 3.46, p < .01. But even 
subjects in the same-side condition did not 
report feeling a very high correspondence with 
the speaker’s position; a mean of 10.00 on 
he 31-point scale suggests feelings of only a 
mild similarity between the two positions. 


Discussion 


Taken together, the results of the two ex- 
periments offer considerable support for the 
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moderation explanation of precommunication 
belief change. However, it is important to em- 
phasize that the toward-a-position explana- 
tions are in no way invalidated. The absence 
of anticipatory change in the same-side condi- 
tions of both experiments, accompanied by 
attitude change in the opposite-side condi- 
tions that differed only in the position the 
communicator was expected to take, suggests 
the existence of both types of psychological 
forces. In addition, differences in amounts of 
anticipatory change produced between condi- 
tions of previous experiments are probably 
best explained by other mechanisms. 

Different perceptions of moderateness be- 
tween conditions might also have affected the 
results of the present experiments. For ex- 
ample, subjects in the same-side conditions 
may have felt little inclination for precom- 
munication belief change because, when com- 
paring their own position to the extreme 
stance to be taken by the communicator, they 
may have felt relatively moderate to begin 
with. Of course, this explanation, too, is not 
based solely on pressures to establish a 
moderate viewpoint. If forces to moderate 
one’s position were the only factors affecting 
precommunication belief change, the position 
of the speaker should have no effect on the 
amount of change produced, a prediction 
clearly not supported by the data. Further 
research should examine whether different 
feelings of what constitutes “moderateness” 
mediate the amount of anticipatory belief 
change produced. But even if they do, these 
feelings apparently represent a consolidation 
by the subject of an absolutely neutral posi- 
tion and the speaker’s attitude. Both the com- 
municator’s attitude and the subject’s desire 
to hold a moderate view combine to influ- 
ence the amount of precommunication belief 
change produced. 

Recently, Cialdini, Levy, Herman, and 
Evenbeck (1973) have also obtained evidence 
for precommunication moderation of ex- 
pressed opinion when one anticipates engaging 
in face-to-face discussion with a peer, Al- 
though they found no evidence of precom- 
munication moderation when their subjects 
anticipated merely listening to a peer state 
his opinion of an issue, this result conflicts 
only slightly with the current and earlier 
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studies. When one anticipates a message from 
a peer who is no better informed than oneself 
on an issue, neither the logical nor self- 
presentational aspects of moderation are en- 
gaged to any great degree, nor are the self- 
esteem or consistency pressures aroused. There 
is little expectation of new information or 
updating. One’s own position is fairly secure, 
and the prospect for persuasion is slight. 
Finally, the subjects did not expect their atti- 
tudes to be closely monitored and examined 
for change, a concern that at the very least 
strongly enhances precommunication change 
(Cooper & Jones, 1970). Cialdini et al. agreed 
that their results are compatible with 
earlier research and argued persuasively for 
an explanation based on tactics of self- 
presentation. 

However, regardless of the specific mecha- 
nism producing anticipatory belief modera- 
tion, the results of the present experiments 
strongly urge that future research avoid con- 
founding moderation and toward-a-position 
effects, perhaps by the use of a counter- 
balanced design. Counterbalancing the com- 
municator’s direction of advocacy not only 
provides an indication of the relative magni- 
tude of the effects, but also reveals any inter- 
action between psychological forces toward 
moderation and the manipulated variables. 
Such interactions would otherwise be, and 
may have already been, concealed. 


R. Gren Hass 
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vfluence of Causal Beliefs on Affect and Expectancy 
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Michigan State University 


This article investigated the influence of one’s causal attributions for success» 
on subsequent achievement behavior. Two of Weiner's hypotheses were testéd: 
(a) that the locus of control of the attribution would influence the subject’s 
affective response and (b) that the stability of the attribution would influence 
the subject’s expectancy of success. One hundred twenty seven men and women 
who believed that they were participating in a piano practicum received in- 
structions describing success at piano playing as determined by either ability, 
effort, task simplicity, or chance. All subjects received feedback that they were 
successful. As predicted, subjects given the internal attribution instructions 
(ability and effort) reported more positive affect than those receiving external 
attribution instructions (task and chance). This pattern held for both males 
and females and for all four experimenters. The results of the stability dimen- 
sion were not as clear as those for the locus of control. 


Work and achievements are powerful deter- 
minants of a person’s feelings of self-worth. 
The special task force to the Secretary of 
Health, Education, and Welfare (U.S. De- 
vartment of Health, Education, and Welfare, 
972) reported that workers who engage in 

ssembly line work experience depression and 

vathy. Various feminist writings (Friedan, 

63) expose the detrimental effects of per- 
f rming “women’s” work, that is, housework 
« q childcare to the exclusion of other per- 
s nal achievements, on women’s self-concepts 
a. | self-esteem. 

Nhat characteristics do “women’s” work 
aid assembly line work have in common? 
Cne is the simplicity of the work, a result 
oi technological development. Perhaps this 
perception that success is predetermined by 
the task simplicity contributes to this depres- 
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sion and apathy. Intuitively, individuals’ 
perceptions of the causes of their success 
influence their subsequent reactions. 

Weiner (1972) and others of his research 
group (Cook, 1970; Meyer, 1970) have 
incorporated the individual’s perception of the 
causes of success as an important variable in 
the study of achievement behavior. Weiner’s 
theory of achievement behavior locates the 
individual’s perception of the causes of suc- 
cess and failure on two dimensions, locus of 
control and stability. He uses the corners of 
this two-dimensional task space, that is, abil- 
ity, effort, task difficulty, and chance, as the 
standard cognitions for use in discovering the 
relationship between attributional processes 
and achievement behavior. Ability is located 
on the internal-stable point; effort on the 
internal-unstable point; task difficulty on the 
external-stable point; and chance on the 
external-unstable point. Weiner hypothesized 
that these two causal dimensions indepen- 
dently influence achievement behavior and 
more specifically, they influence different re- 
Sponses. Weiner then sets up a parallel proc- 
ess model of achievement motivation. The 
locus of control is hypothesized to influence 
people’s affective response to success and 
failure. Thus pride in accomplishment would 
be contingent upon locating the cause of 
success in the person (making a dispositional 
attribution in lieu of situational attribution). 
If this hypothesized relationship is correct, 
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then an individual who succeeds and perceives 
the cause of that success to be his or her 
effort or ability should experience more pride 
than the individual who believes that success 
was due to luck or the simplicity of the task. 
Theoretically, the stability dimension influ- 
ences the individual’s expectancy of future 
success and failure. Weiner hypothesized that 
stable attributions should lead individuals to 
believe that their future performance will be 
the same as past performances, while unstable 
attributions should lead individuals to expect 
inconsistencies between past and future per- 
formance. Thus, if this proposition is valid, 
an individual who succeeds because of per- 
ceived ability or task simplicity should be 
more confident that he or she would perform 
the same on the same task than an individual 
who succeeds because of effort or luck. 

Weiner has not yet fully demonstrated the 
hypothesized relationships between causal be- 
liefs and the resulting affect and expectancy. 
Cook (1970, as discussed in Weiner, 1972) 
studied the relationship between causal beliefs 
for success and affect and demonstrated that 
subjects who were predisposed to attribute 
success to effort (as measured by the Crandall 
Intellectual Achievement Scale) exhibited the 
greatest resultant reward (reward for success 
minus punishment for failure). Meyer (1970, 
as cited in Weiner, 1972) found positive 
correlations between positive effect for suc- 
cess, negative affect for failure, and causal 
attributions to luck and ability. Affects were 
more intense when attributions were internal 
rather than external. 

Phares (1957), investigating the rela- 
tionship between attribution and expectancy, 
found more frequent and greater magnitude 
expectancy shifts in skill than in chance at- 
tributional set conditions. Individuals who 
were succeeding in the task (defined as a 
skill) bet more money following success than 
those succeeding in the same task defined as 
determined by chance. Meyer (1970) found 
that expectancy decrements were the most 
evident when individual’s ascriptions for fail- 
ure to lack of effort and bad luck were low. 

Although lending support to the hypothe- 
sized relationships between locus of control 
and affect, and stability and expectancy, 
these studies above did not systematically 
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investigate the direct relationship of att Su- 


tion and affect and expectancy. Ali ut 
Phares’s study used assessment of the ca al 
attributions; therefore one would be abi to 


discuss the relationship of causal infere: es 
and responses only as a correlation. Seco d, 
the studies did not use all four causal >- 
ments, but focused on one or two of thi a. 
With Meyer (1970) for example, the relati: :- 
ship of affect to locus of control is confus 1 
because he used ability and chance which : 
confounded for the stability dimension. 

The present study differs from previo 
studies in that subjects’ causal cognitio: 
about success were manipulated as indi 
pendent variables rather than assessed a 
dependent measures. The present study i 
somewhat restricted because it deals only wit! 
attributions for success... The hypothese: 
tested were taken directly from Weiner’s 
model: 


1. The locus of control of the causal attri- 
bution should influence the affective response 

2. The stability of the causal attributio 
should determine the expectancy response. 


METHOD 
Overview 


The experimenters recruited subjects who had r 
previous experience with piano for a “music pra 
ticum.” The subjects individually received 15 mir 
utes of instruction on piano with the “correspon 
ence of notes technique.” When finished all subjec 
were given a written evaluation form on whici: 
the experimenters marked the pass column on t 
three evaluated items of consistency, evenness, and 
coordination and wrote in the feedback “you were 
successful.” In order to manipulate the subjects’ 
causal attributions of this success, subjects were 
handed one of four instruction sheets before they 
began playing. One-fourth of the subjects were 
instructed that their successful performance was en- 
tirely due to their ability, one-fourth were told that 
their success was entirely due to their effort, ct 
fourth were told that their success was « 
to the simplicity of the task, and one-for 
told that their success was entirely due to chance. 
These four elements were classified on two dimen- 
sions, locus of control and stability, thus yielding 
a 2X2 design with two levels of locus of control, 
internal and external, and two levels of stability, 
stable and unstable. To measure the subjects’ re- 


1 Because debriefing was 3 weeks after the ex- 
periment, it was decided to use only success and not 
failure. 
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completed evaluation forms which 


sponses 
includec eli-report dependent measures. The 
experim sed subjects’ choice of subsequent 
pieces Each subject received a thorough 
letter rieñng when the study was completed. 
Subj 

Ti jects for this study were students at the 
Univ sity of Wisconsin, 64 males and 64 females. 
No. the subjects had ever had formal music 
les: . The experimenters recruited subjects in the 
do: -itory. Proceeding door by door, the experi- 


m. ater asked the prospective subjects first if they 
h d ever had formal music training. If the person 
sid no, then the experimenter asked the subject if 
ne or she would take part in a music practicum 
in which he or she would receive a free lesson. The 
experimenter described the practicum as a require- 
ment for a doctoral degree in music. If the subject 
agreed, a time was arranged at both the experi- 
menter’s and the subject’s convenience. The meet- 
ings were arranged at the dormitory piano rooms, 


Experimenters 


To enhance experimenter generality, two male 
«nd two female experimenters were employed, This 
vecaution was also taken because of possible, but 
npredicted interactions between sex of subject and 
x of experimenter. Each experimenter completed 
¿ least four replications of the 2 X 2 design, yield- 
iv a 32 cell design (Locus of Control X Stability x 
S -< of Subject X Sex of Experimenter). 


P+ cedure 

ie experimenter met the subject in the piano 
ro n at the designated time and told the subject 
th.. he or she would be given a short lesson and 
the an evaluation of that performance. The ex- 
persnenter introduced himself as a “doctoral can- 
didate” in the music school who “worked in con- 
Juvciion with the music faculty on teaching tech- 


niques for piano.” 

Next, in order to avoid sex typing of piano 
Playing the experimenter described the task as in- 
volving many different variables, “eye-hand coordi- 
nation, looseness of hands, length of the phrase 
you can read at one time.” The experimenter further 
attempted to dispel stereotyping by stating that 
“from research studies and from my own practical 
experience men and women have equal ability,” 


Manipulation of Causal Attributions for 
Success 


The experimenter next handed the subject a de- 
scr. )tion of the “correspondence of notes technique” 
Which included the manipulation of causal attribu- 
tions, All subjects received the first paragraph 
which <escribed the procedure which would be used 


- for this technique. The subjects then read a second 
paragrap! which stated that success or failure at 
Plano playing in the beginning stages was due to 


either ability, effort, task simplicity, or chance. 
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The instructions informed the subjects that success 
or failure is a result of “possessing skill and ability 
[effort, task simplicity, chance].” The subjects were 
told that “research studies” supported these ob- 
servations. The paragraph concluded with a repeti- 
tion of the manipulation: “So these beginning les- 
sons reflect your ability [efforts, task simplicity, 
chance]. If you’re successful at this elementary 
stage it is because you have ability [effort, task 
simplicity, chance].” 

After the subject read the introduction the ex- 
perimenter explained the procedure of the corre- 
spondence of notes technique.2 To hold performance 
constant, subjects were asked to play to the beat 
of a metronome set very slow. The subjects played 
the piece through once, then the experimenter set the 
metronome and watched the subject play. This pro- 
cedure was followed for the three pieces the sub- 
ject played, The subjects were told that the pieces 
were at the same level of difficulty. The experi- 
menter then gave the subject an evaluation form on 
which the experimenter marked pass on the three 
items listed (consistency, evenness, and coordination) 
and wrote in “you were successful.” The manipula- 
tion was also repeated in the feedback by way of 
a note to the instructor on the sheet the subject 
received telling him or her to remember that “piano 
playing reflected” either ability, effort, task simpli- 
city, or chance. 


Dependent Measures 


After the subject read the feedback the experi- 
menter handed him or her an evaluation form for 
completion. This form asked six questions which 
were all to be answered on 7-point scales: 


1. How proud are you of your performance? 

2. Would you like to participate in a similar 
music practicum again? 

3. How rewarding do you feel the feedback you 
received from the instructor was? 

4. How interested are you in studying piano? 

5. If you were to do the first three pieces again, 
how confident would you be that you would per- 
form the same? 

6. How well do you think you would do on a 
subsequent more difficult set of pieces? 


The last question was the manipulation check: 
Which of the following elements primarily deter- 
mined your success (rank order): ability, effort, 
simplicity of task, chance. 

The experimenter then assessed subject’s choice 
of subsequent pieces to play. The experimenter 
asked the subject to choose one of a set of pieces. 
They were told that “the pieces are rated in dif- 


? This is an actual piano-teaching technique. The 
assigned piano pieces had the letters of the notes 
written in. A cardboard piano keyboard with the 
notes written in was also used. This keyboard was 
placed on the piano so that the subject could read 
the letter off the piano score and then find the key 
with the use of the cardboard, 
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ficulty and you can choose which one you want to 
practice on for the rest of the period. This is the 
most difficult in this series of beginner pieces, This 
is the easiest, This is the level of difficulty you have 
been playing.” The experimenter recorded the level 
of difficulty that the subject chose. 


Debriefing 


At the end of 3 weeks, each subject received a 
letter explaining in detail the purpose of the ex- 
perimenter, the reason for secrecy, and an address 
and telephone number for further questions he or 
she might have.® 


RESULTS 
Manipulation Check 


The subjects rank ordered ability, effort, 
task, and chance as which element “primarily 
determined your success.” The percentage of 
individuals ranking the manipulated element 
first was 47% in the ability condition, 56% 
in effort, 69% in task, and 47% in chance. 


Affective Measures 


The four affect measures (questions 1-4 
in the dependent measures section) were 
summed. This affect index, F(1, 125) = 12.09, 
p < .001, verified the prediction that the 
locus of control of the attribution would in- 
fluence the subject’s affective response, The 
generality of this pattern of results is sup- 
ported by the absence of interactions with 
the experimenters and sex of subjects (see 
Table 1). 


Expectancy Measures 


Two expectancy measures (questions 5 and 
6 in the dependent measures section) were 
summed, Contrary to prediction, the stability 
index did not have the predicted effect on ex- 
pectancy of success. There were no significant 
effects for the combined index. 


Level of Aspiration 


The measure of an individual’s level of 
aspiration was the level of difficulty of the 
piece he or she chose to play for the rest 


This was decided in lieu of verbal debriefing 
because students were recruited in dormitories over 
a 3-week period and the experimenter was con- 
cerned that individuals not find out the purpose of 
the study or that the practicum was a psychology 
experiment. 
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TABLE 1 


MEAN INDEX SCORES FOR EACH ATTRIBUT 
or SuccEss 


Attributional set 


Dependent measure it 
Task | Cance 


Ability | Effort 
(I-S)_| (I—U) | (E—S) -U), 
Affect index 
(range: 4-28) 20.19 | 2044 | 18.03 28 
Expectancy index 
Gases 2-14) 9.53 | 910] 9.35 6 
Level of aspiration | 
(range: 1-34) 21,69 | 17.06 | 17.00 | 19.50 
u z 
Note. internal locus of control, E = external loc of 


control, stable, U = unstable. 


of the period. The possible levels were 1- 
Subjects were told that levels 1-10 wee 
easier than the pieces they played, leve s 
11-22 were the same difficulty, and 23-2! 
were more difficult. The higher the numbe: 
the more difficult the piece was proposed tc 
be. The prediction was that individuals be- 
lieving their success was due to fixed ele- 
ments would choose higher levels of aspir «- 
tion than those who believed their succ 
was due to variable elements. The largest s 
nificant effect was sex of the experiment <, 
Female experimenters’ subjects chose higi r 
levels of aspiration (M = 22.91) than t-e 
male experimenters’ subjects, M = 14.71, X 
(1,96) = 26.29, p < .001. There was also a 
significant interaction of stability and loc.s 
of control, F(1, 96) = 5.52, p < .05. When 
given an external attributional set, those per- 
sons who have the unstable attribution chose 
higher levels of aspiration. 


Discussion 


This study provides partial support for 
Weiner’s analysis relating individuals’ beliefs 
about the causes of their success to their af- 
fect and expectancy. This evidence s! 
supports that individuals’ affective 5 
are influenced by Weiner’s locus of control 
dimension of causality but does not support 
the hypothesized relationship between the 
stability dimension and expectancy. The lack 
of verification of this hypothesis seems to in- 
dicate a weakness in Weiner’s formulation. 
The first question to consider in this regard 
is whether the four elements do in fact fall 
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into the : <o dimensions of locus of control 
and st «vy as Weiner hypothesized. Weiner 
does © resent conclusive evidence that ef- 
fort, xample, is considered to be less 
stab! can task characteristics. This question 
can ipproached in various ways through a 
faci: analysis or by presenting subjects with 
des: ‘ptive situations where they rank the 


cai al elements on the stability dimension. 
T s question cannot be answered in this 
s udy and thus one cannot sufficiently evalu- 
te the theoretical issue of whether the sta- 
bility dimension has an influence on expecta- 
tions of future success. 

The results of the level of aspiration mea- 
sure also did not confirm Weiner’s prediction 
that individuals believing their success was 
due to fixed elements would choose higher 
levels of aspiration than those who believed 
their success was due to variable elements. 
‘Chere was a significant interaction of locus 
‘i control and stability, Weiner’s reasoning 
ssumes that level of aspiration is a measure 
f expectancy. However, one can conceive of 
l vel of aspiration as an achievement behavior 
ad therefore expect both stability and locus 
©: control to influence the choice. 
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The other strong influence on level of as- 
piration was the sex of the experimenter. 
Just why both men and women chose higher 
levels of aspiration for the female experi- 
menters is not apparent. Possibly the female 
experimenters were more encouraging or were 
seen as less judgmental and evaluative than 
the male experimenters. 
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ERRATUM 


In the article, “Effect of Cognitive Appraisal on d’ 


mental Stress,” by Richard W. 


and Response Bias to Experi- 


J. Neufeld, which appeared in the April issue, 


Volume 31, Number 4, the following changes should be made in Table 1, which 
appeared on page 738: Specifically, in Group HV-I-D, Stimulus Rank 5, entries that 


erroneously appeared under Rating 
Rating Categories 3, 4, 5, and 6, so 


Categories 4, 5, 6, and 7 should appear under 
that they read as follows: Rating Category 
3= 13, Rating Category 4 = .26, Rating Category 5 = 


.23, and Rating Category 


6 = .38, In addition, the entry under Group SD-SH, Stimulus Rank 1, Rating Cate- 


gory 1, which erroneously appeared as 79, 


should appear as .74. 


Journal of Personality and Social Psychology 
1975, Vol. 31, No. 6, 1168-1176 


When Beauty May Fail 


Marshall Dermer and Darrel L. Thiel 
University of Wisconsin—Milwaukee 


The Dion, Berscheid, and Walster person perception study was quasi repli- 
cated in order to assess the generality of the “what is beautiful is good” stereo- 
type. In Experiment 1, female participants who were either unattractive, 
average, or physically attractive made a variety of attributions about female 
target persons of varying attractiveness levels. Attribution favorability was 
found to be contingent upon the physical attractiveness of the participant as 
well as the dimensions along which the attributions were made. While many 
of the attributions were congruent with the postulated stereotype, others were 
not, Socially undesirable attributions regarding vanity, egotism, likelihood of 
marital disaster (requesting a divorce/having an extramarital affair), and 
likelihood of being bourgeois (materialistic/snobbish/unsympathetic to op- 
pressed peoples) were reliably increasing monotonic functions of target persons’ 
attractiveness levels. Plausible explanations for these divergencies were explored 


in Experiment 2, 


The work of Berscheid, Walster, and col- 
leagues (e.g., Berscheid & Walster, 1974) 
suggests that physically attractive as com- 
pared to unattractive individuals generally 
have a considerable social advantage. To 
account for why beautiful people might be 
preferred, Dion, Berscheid, and Walster 
(1972) postulated the existence of a physical 
attractiveness stereotype—“‘what is beautiful 
is good.” In the Dion et al. person perception 
study, participants attempted to estimate 
accurately, on the basis of black and white 
photographs, the characteristics of men and 
women who varied in physical attractiveness. 
The results were interpreted as being com- 
patible with the stereotype since participants 
were reported to have rated the physically 
attractive target persons as having more so- 


The first experiment is based, in part, upon a 
doctoral dissertation submitted to the Graduate 
Faculty of the University of Minnesota, Special 
thanks are due Ellen Berscheid and Paul C. Rosen- 
blatt for their constructive reviews. Professors Rob- 
ert S. Baron, Milton Rosenbaum, and Penny Baron 
of the University of Iowa graciously provided ex- 
cellent logistical support. The second experiment is 
based upon an independent study project conducted 
by the second author. This research was supported 
by National Science Foundation Grant GS-35157 to 
Ellen Berscheid as well as by University of Wiscon- 
sin—Milwaukee Graduate School Grant 136-7960. 

Requests for reprints should be sent to Marshall 
Dermer, Department of Psychology, University of 
Wisconsin, Milwaukee, Wisconsin 53201. 


cially desirable personalities and being mo.e 
likely to be professionally and maritally suc- 
cessful than target persons of lesser attrac- 
tiveness. 

Dion et al., however, noted that jealc sy 
might attenuate the positivity of attributions 
for physically attractive target persons © nd 
speculated that participants might most |i ely 
be jealous of attractive persons of their wn 
sex. This possibility was explored in that | oth 
male and female participants and targets v ere 
represented in the experiment. The appro ri- 
ate tests for the jealousy effect—the Sev of 3 
Participant X Sex of Target Person inti 
tions—were uniformly nonreliable. B 
London, and Reeves (1968) were simila; 
unsuccessful in testing the identical hypothe- 
sis, although they also detected reliable main 
effects generally favoring attractive over un- 
attractive target persons. 

Although Dion et al. (1972) reasoned that 
jealousy might be operative within “or 
study, in retrospect it is not evi a 
majority of participants should have ied 
this way to attractive same-sex target persons. 
Consider the case of women participants. To 
the extent that women of both high and aver- 
age physical attractiveness can readily per- 
ceive themselves as attractive, they can be- 
lieve that, as the attractive target persons, 
they are destined for the best. Consequently, 
they should have little to be jealous about. 
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In cont iysically unattractive women 
may, for umber of interrelated reasons, 
most like rry well-honed hatchets for at- 
tractive ie targets. Most important, men 
curren r to control a preponderance 


in our society and most likely 
tive over unattractive women in 
cistribution. Indeed, Berscheid and 
Wal *r (1974) review a number of studies 
ind) ¿ting that women’s physical attractive- 
ness levels covary positively with their up- 
wird social mobility as well as their self- 
eports of dating popularity. As a conse- 
juence, unattractive women may be least 
capable of distorting their attractiveness far 
upward and most likely to view themselves as 
unsuccessful competitors with attractive 
yomen for the rewards males control. Unat- 
tractive women, therefore, might perceive 
tractive women neither as “good” (Dion et 
1972) nor as “expected sources of rein- 
ement” (Byrne et al., 1968, p. 261) but 
'..ther as frustrating. 
The purpose of the current study, then, 
ws to delimit the beauty implies goodness 
_ st otype, On the basis of the considerations 
ab ve, it was hypothesized that the physical 
ati, «tiveness of female target persons would 
int: act with the attractiveness of participants 
ermining attributions. 
is important to note that neither Dion 
nor Byrne et al. included photographs 
ionally attractive persons in their 
, while such persons were included in 
: resent replication. These researchers 
FE uggested that their theoretical positions 
q be more strongly corroborated when 
[Se high levels of physical attrac- 


Tews 


stu: 
the 


might 


i 
tiveness are represented. It is precisely women 
of exceptional attractiveness, however, who 
are likely to be perceived as the proper ob- 
ject of every man’s affection. The present 
“analysis Suggests that unattractive women 
ey, as a consequence, not be so favorably 
di posed.* This analysis also suggests that 
att: ctive women may be stereotyped as hold- 
ing s: cially undesirable bourgeois orientations 


aS a consequence of their ability to attract 
high-sta‘::s males.2 Stated more generally, 
there a miting conditions beyond which 
Deauty may fail, 


H 
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EXPERIMENT | 


Method 
Participants 


Forty female students, who were enrolled in an 
introductory psychology course at the University of 
Towa, participated in the experiment in exchange 
for $2. Five weeks earlier they had attended a 
survey session during which five concealed female 
observers had independently rated each of 108 par- 
ticipants on a physical attractiveness scale for which 
the response categories “very unattractive,” “unat- 
tractive,” “just below average in attractiveness,” 
“just above average in attractiveness,” “attractive,” 
and “very attractive” were respectively scored with 
the integers 1-6.8 The 10 least attractive respondents 
(HH 725; s=.05), 20 average respondents (X = 
3.50, s=.10), and 10 most attractive respondents 
(X =5.02, s=.34) were selected for inclusion in 
the experiment. 


Procedure 


The cover story almost identically followed that 
of Dion et al., 1972, The participants were informed 
that their person perception accuracy was being 
assessed and compared with the accuracy of trained 
professionals. They were further instructed that 
standard sets of photographs would be used as the 
basis for the assesment, The photographed indi- 
viduals were described as college students enrolled 
at a variety of universities and junior colleges 
throughout the midwestern United States who were 
participating in a longitudinal study of personality 
and value development scheduled to continue into 
adulthood. It would therefore be possible to assess 
the accuracy of each participant’s attributions against 
currently available and forthcoming information on 
the photographed persons, 

Participants recorded their attributions of each 
target person in different booklets. When a partici- 


1Mass media role-casting practices may also con- 
tribute to this effect. One might conjecture that the 
fate of unattractive women in the 1960s and 1970s 
is not unlike that of Blacks in earlier years, with 
physically unattractive women cast as servants to 
the beautiful rich. 

2 Exceptionally attractive women might also be 
stereotyped as bourgeois as a consequence of their 
adorning advertisements appearing in the mass media. 
In a pilot study exploring covariations between the 
attractiveness of females appearing in magazine 
advertisements and the luxuriousness of the product 
they were paired with, only 1 of some 150 females 
was rated as near average in attractiveness, (This 
woman appeared in a mop advertisement.) 

The average interrater correlation equals .52. 
The reliability of the five-judge panel as estimated 
via the Kuder-Richardson formula is 84, 
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pant completed rating a target, the photograph and 
booklet were removed before the next set of ma- 
terials was presented. 


Target Materials 


One hundred sixteen women, mostly dormitory 
residents and sorority members at the University of 
Minnesota, served as photographic subjects in ex- 
change either for $2.50 or a self-portrait. All sub- 
jects were asked to wear either white blouses or 
sweaters, no pins, jewelry, etc., in an attempt to 
reduce potential socioeconomic cues. No subjects 
were allowed to wear glasses, and all were posed 
as if a graduation picture were being taken. 

The resulting waist-up, professional quality,* color 
portraits were sorted by 48 coeds at Iowa State 
University into “extremely unattractive,” “unat- 
tractive,” “attractive,” and “extremely attractive” 
categories. These categories were respectively scored 
With the integers 1-4.5 On the basis of these judg- 
ments two target sets were assembled. Each set con- 
tained three target persons, who reliably were judged 
to be either of low, medium, or high physical at- 
tractiveness, The two high and two low targets were 
the photographs respectively judged most (X = 
3.57) and least attractive (X = 1.67). The two 
medium targets were the photographs assigned the 
most intermediate attractiveness values (X = 2.52). 

Half of the participants at each level of partici- 
pant physical attractiveness were randomly assigned 
to one of the target sets; the target persons within 
a set were presented in random order, Thus, a two- 
between (Target Set X Participant Physical Attrac- 
tiveness) X one-within (Target Physical Attractive- 
ness) factorial design was generated, 

It is important to note that subsequent careful 
examination of the target sets revealed one of the 
unattractive target persons to be without a smile, 
while the attractive target persons wore smiles as 
well as cosmetics. The question arises as to whether 
these characteristics should be considered confounded 
with target attractiveness or aspects of attractiveness, 
Systematic reasons for deeming one as a confound 
characteristic and another as an aspect of attractive- 
ness presuppose a theory of beauty. In the absence 
of such a theory, much less a definition of beauty, 
researchers investigating physical attractiveness have 
employed normative judgments of attractiveness, If 
the absence of a smile or evidence of discrete cos- 
metic usage are considered confounded with beauty 
rather than aspects of beauty and part of the 
attractiveness manipulation, it must at least be 
recognized that these are frequent naturally occur- 
ring confounds, 


4 The photographic skills of Michael R. Cunning- 
ham and Mark Rice were essential to the success of 
this phase of the study. 

5 The average interrater correlation equals .46. 
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Attribution Instructions and Depende» 
Variables 


As in Dion et al., the first page of each ~-oklet 
stressed that person perception accuracy was being 
studied, and that it was of the utmost imp tance 
that participants rate the stimulus persons akly. 

The first set of 27 items (see Dion et a pped 
personality attributions and were presente n a 
semantic differential 6-point rating scale form: The 
items within this set were randomly ordered -ross 
participants and target persons. The 14 items | hich 
constitute the Dion et al. Social Desirability dex 
and which were balanced for directionality re- 
sponse follow: poised/awkward, modest/ vain, 
strong/weak, interesting/boring, self-assertive, ub- 
missive, sociable/unsociable, independent/deper «nt, 


warm/cold, genuine/artificial, kind/cruel, exci: .ng/ 
dull, sexually warm/sexually cold, sincere/insin re, 
and sensitive/insensitive. Dion et al. report th’ in 
a preliminary study two-thirds of their samp! of 
male and female respondents agreed on the im 
poles which characterized a socially desirable per on 
(of either sex). 

The next set of items required the participant to 
estimate the likelihood that the target person (:) 
sympathized with oppressed peoples (the poor, the 
disadvantaged, etc.), (b) was a materialist (.e., 
believed that money and wealth are primary 
gredients for a happy life), (c) was a (so 
status-seeking snob, (d) would lead an exciting 
(e) would experience self-fulfillment, and (f) w 
be successful in her chosen occupation. The 
three items of this set tapped perceptions of b 
geois orientation, while the latter three items ta; 
expectations of social and professional happiness. 
response categories for these likelihood estimat 
were the following: “extremely unlikely,” “very .n- 
likely,” “somewhat unlikely,” “somewhat like. y,” 
“very likely,” and “extremely likely,” which were 
respectively scored with the integers 1-6. 

For the remaining items, the participant was asked 
to assume that the target person would get married 
someday. Likelihood estimations were made for the 
target person’s (a) subsequently requesting a di- 
vorce, (b) being an understanding spouse, (c) being 
a responsive sexual partner, (d) having an extra- 
marital affair, (e) being a good parent, and (f) 
raising her children well. The final items in the 
booklet required the participant to prognosticate the 
occupation the target person’s husband would likely 
hold at age 40 and the occupation the target person 
would hold at age 40, if employed. The 60 occupa- 
tions were selected from Hollingshead’s Occupational- 
Status Scale (see Bonjean, Hill, & McLemore, 1967, 
p. 442) and represented seven levels of status. 

After the participants had completed rating all of 
the target persons, the experimenter explained that 
it was necessary to measure factors which may have 
influenced their ratings so that these factors could 
be controlled in the data analyses, He indicated that 
participants’ perception of the physical attractiveness 


rsons may have affected their attri- 
sked them to rate the target persons. 
-ysons were judged in the original order 
were presented, on the aforementioned 
ctiveness scale. The session was con- 
h the rationale behind the experiment 
4 few participants were suspicious of the 
y, though there was absolutely no indica- 
icipants were aware of the hypotheses 
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Results 


X :mipulation Check and Target Set Effects 


Participants’ judgments of the physical 
attractiveness of the target persons were ana- 
lyzed as a function of the between-groups 
factors: target set and participant physical 
attractiveness (low, medium, high); as well 
as the within-groups factor: target physical 
attractiveness (low, medium, high).° All 
three- and two-way interactions were not sta- 
tistically reliable.” There were, however, reli- 
ible main effects for target physical attrac- 
iveness, Fm(2,33) = 145.32, p < .000001, 
l ind set, F(1,34) = 4.30, p= .05; X set, < 
2 setg, As anticipated the two unattractive 
togets (X = 2.4) were judged reliably less 
a ractive than the two intermediate targets 
(€ = 3.7), who were in turn judged reliably 
lc » attractive than the two attractive targets 
(. = 5.5). It should be noted that a special 
ef was made to include exceptionally at- 
tr tive targets in the original photograph 
pol, and that the attractive targets were 
accordingly judged to be very attractive. 

Vhile these findings corroborate the manip- 
ulation, they must be cautiously interpreted 
because the variance/covariance dispersion 
assumptions for the within-group tests were 
severely violated as revealed by Box’s (1950) 
test, F(1,1307) = 1.87, p= .003. Fortu- 
nately, the test did not reveal assumption 
violations for the remaining analyses. 

Set interaction effects were occasionally 
detected in the remaining analyses; they were, 
jowever, invariably “monotonic” (see Camp- 
bel & Stanley, 1966, p. 29; Glass & Stanley, 
1970, p. 429). In view of the nature of these 
interactions and the implied generality of 

effects across such interactions, the analyses 
were revised with the set factor disregarded. 


a 
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Personality Attributions 


The items which constitute the Dion et al. 
Social Desirability Index were first inter- 
correlated across experimental cells, While 
Dion et al. never claimed their index to be 
unidimensional, there were substantial nega- 
tive correlations between the modest/vain 
item and other scale items. In view of this 
finding, the item was eliminated from the 
index and analyzed separately. Analyses of 
the revised scale revealed a target physical 
attractiveness main effect as indicated in 
Table 1, row 1, as well as a suggestion of a 
Target X Participant Attractiveness interac- 
tion as revealed in rows 2 through 4. 

The means associated with the main effect 
corroborate previous findings (e.g., Dion et 
al.; Miller, 1970) revealing the favorability 
of personality attributions to increase mono- 
tonically with the physical attractiveness of 
targets. It should be noted, however, that 
attributions for attractive targets were not 
reliably more favorable than those for tar- 
gets of intermediate attractiveness, F(1, 37) 
= 2.83, p= .10, while attributions for un- 
attractive targets were reliably less favorable 
as compared with those for the intermediate 
targets, 7(1,37) = 20.04, p = .00007. This 
finding suggests that for female participants, 
unattractive female targets are disadvan- 
taged—‘what is ugly is bad”; rather than 
attractive female targets being advantaged— 
“what is beautiful is good” (cf. Dion et al., 
Table 1). 

The means associated with the interaction, 
in Table 1, suggest that unattractive women 
do not make personality attributions in strict 
accord with the beauty implies goodness the- 


6 All analyses reported in Experiment 1 are based 
on weighted means; all p values are two-tailed. 

7 Researchers who suspect that the physical at- 
tractiveness of the target persons and participants 
would have an interactive effect may be interested 
in the associated statistics: Fm(4,66) = .94, p= 44. 
Fm denotes the likelihood ratio criterion for testing 
departures from a multivariate hypothesis (e.g., see 
Bock & Haggard, 1968, p. 110). 

8 This does not appear to be a random event 
since an independent researcher has found a similar 
effect using the Dion et al. stimulus materials and 
scales, 
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TABLE 1 


ATTRIBUTIONS YIELDING RELIABLE TARGET ATTRACTIVENESS MAIN AND/OR 
INTERACTION EFFECTS IN EXPERIMENT 1 


Target's attractiveness 


Measure 
Low Medium High 

Social desirability of target’s personality 48.9 56.5 59.2 23.588 0000513 
Attractiveness $ 

Low 50.4 57.8 55.1 

Medium 47.8 57.4 61.2 1.96» 11 

High 49.6 S533 59.2 Y 
Target’s vanity 24 3.1 3.8 14.15* .0000. 
Target's egotism 3.1 3.1 3.9 6.34" 004 
‘Target’s being an understanding spouse 2 

and responsive sexual partner 8.1 91 9.4 6.228 2005, 
‘Target's requesting a divorce and having 

an extramarital affair 5.2 5.4 6.4 4.308 02 
Target’s parental competence 
Attractiveness 

Low 8.7 92 7.2 

Medium 8.9 9.2 9.4 2.62 04 

High 8.4 9.4 9.2 
Spouse’s occupational status 5.2 5.8 6.6 14,46" -00002 
Target's social and professional happiness 11.8 12.7 13.4 8.24" -001 
Target’s bourgeois orientation 8.2 9.4 9.9 6.078 005, 


Note. The larger the mean, the more socially desirable the personality, the greater the vanity, etc. 


* Main effect df = 2, 36. 
b Interaction df = 4, 72, 


sis. In terms of a linear trend, unattractive 
women did not reliably rate the attractive tar- 
gets as having more socially desirable person- 
alities than the unattractive targets, F(1, 37) 
= 2.46, p = .12, while the average, F(1,37) 
= 39.94, p = 0000002, and attractive partici- 
pants, F(1,37) = 10.37, p = .003, did. Un- 
attractive participants instead favored the 
intermediate targets over the unattractive tar- 
gets, F(1,37) = 4.80, p= .03, though the 
intermediate targets were not reliably favored 
over the attractive targets, F(1,37) = WER 
p= AO. 

Analyses of the modest/vain and altruistic/ 
egotistic® items yielded results in conflict 
with the beauty-implies-goodness thesis, As is 
evident in Table 1, attributions of vanity and 
egotism were highly reliable, monotonically 
increasing functions of targets’ attractiveness 
levels. Female participants apparently felt 
that along these dimensions, all that glitters 
may not be good. 


Marital and Parental Attributions 


Correlational analyses of the items analo- 
gous to the Dion et al. marital happiness 


scale revealed the items to form two cluste. . 
The likelihood estimates that the target wou d 
be an understanding spouse and responsive 
sexual partner were therefore summed ard 
analyzed separately from estimates regardi? g 
the target’s requesting a divorce 2 and havi g 
an extramarital affair. As presented in Tabie 
1, the spouse/sex cluster yielded results coz- 
sistent with the beauty implies goodness the- 
sis. The latter (marital disaster) cluster, how- 
ever, yielded results in conflict with the the- 
sis, for attractive women—when married— 
apparently are most expected to share their 
heterosexual charms across households. 


9 This item constituted 1 of the 27 personality 
attributions. The remaining personality items, not 
discussed in the body of this article, are consistent 
with participants assuming extroversion (e.g, bold- 
ness, outgoingness) covaries positively with at- 
tractiveness, 

10 This item does not correspond well with the 
Dion et al. item, A more appropriate item would 
have been: How likely is this woman to be divorced? 

11A participant attractiveness main effect was 
detected on this variable, F(2,37) = 3.30, p= 05, 
such that attribution of marital disaster was a 
monotonically decreasing function of participants’ 
attractiveness level, 


Wuen Beauty May Fait 


Sum 1 of the items analogous to the 
Dion « parental competence scale (likeli- 
hood ing a good parent and raising chil- 
dren revealed a Target X Participant 
Attra: veness interaction as presented in 
Tab! The unattractive participants so 
dow sted the parental competence of at- 
tra > targets, that the intermediate, F(1, 
37 9.55, p = .004, and unattractive tar- 


ge’, F(1,37) = 4.93, p = .03, were, in com- 
p“ ison, expected to be better parents. There 
\ cre, however, no reliable target attractive- 
ess simple effects for the remaining partici- 
pants. Parental competency attributions made 
by unattractive women do not appear to be 
described adequately by the beauty implies 
goodness thesis. 


Occupational Status of Target Person and 
Spouse; Social and Professional Happiness 
of Target Person 


Participants estimated the occupations the 
target and her spouse would likely hold at age 
10. No reliable effects were detected for tar- 

ts occupation, however, spouse’s occupa- 

nal status was a reliable monotonically in- 
sasing function of the targets level of 
ractiveness. 

The items which constitute the social and 
| ofessional happiness scale (likelihood of an 
e-citing life, self-fulfillment, and occupational 
s ccess)—which are analogous to the Dion 
e. al, items—were found to intercorrelate 
well and were therefore summed. Analyses 
revealed only a target attractiveness main 
efect, As indicated in Table 1, hapiness at- 
tributions increased monotonically with tar- 
vet’s level of attractiveness. These findings 
are consistent, of course, with the beauty im- 
plies goodness stereotype. 


Bourgeois Orientation 


Finally, the items which constitute this 
scale—likelihood of sympathizing with op- 
»ressed peoples (reversed for inclusion in the 
scale), belief in money and wealth as primary 
ingredients for a happy life, and likelihood of 


being a status-seeking snob—were found to 
intercorrelate well and were summed. As indi- 
catec Table 1, these attributions were not 
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in accord with the stereotype, for attributions 
monotonically increased with participant’s 
level of attractiveness. 


Discussion 


Although the jealousy effect was not as 
robust as initially anticipated, it nevertheless 
appears to have some validity. Unattractive 
participants did not judge the beautiful tar- 
gets as having reliably more desirable per- 
sonalities than unattractive targets, while the 
remaining participants did make such attri- 
butions. In this instance, the beauty implies 
goodness stereotype appears to be attenuated 
for unattractive participants. The parental 
competency attributions, however, reflect not 
just an attenuation but a reversal of the 
stereotype. Unattractive participants reliably 
expected the most beautiful targets to be the 
least competent parents, while the remaining 
participants’ expectations were not reliably 
affected by the target persons’ physical at- 
tractiveness levels. Thus the beauty implies 
goodness stereotype may best describe attri- 
butions made by women of average and high 
physical attractiveness. 

Many of the general effects associated with 
the attractiveness of the target, however, are 
consistent with previous findings. Thus, at- 
tractive targets were reliably expected to (a) 
have more socially desirable personalities, (b) 
be better spouses and sexual partners, (c) 
marry men of higher occupational status, and 
(d) experience greater social and professional 
happiness in comparison to the unattractive 
targets. Attractive targets, nevertheless, were 
also expected to be more vain, egotistical, 
likely to request a divorce and have an extra- 
marital affair, as well as be unsympathetic 
with oppressed peoples, materialistic, and 
snobbish in comparison to unattractive tar- 
gets. Beauty does not appear then to imply 
goodness, in terms of traditional Judeo- 
Christian mores. It instead appears as if 
women believe in accord with Don Marquis’s 
(1930) poem “Unjust” that “beauty gets the 
best of it... [at least] ... in this world.” 
Unfortunately, the current study does not 
provide information on whether men also 
associate certain liabilities with feminine 
pulchritude, 


1174 


There are, of course, a number of dif- 
ferences between the current constructive 
replication and the original Dion et al. ex- 
periment. The differential findings may, there- 
fore, be attributed to the fact that only 
female participants were employed in the 
current study or that the distribution of 
participant attractiveness levels did not cor- 
respond well with the distribution in the 
original study. Furthermore, a variety of 
variables were assessed across the two studies, 

The most intriguing possibility, however, is 
that attribution favorability may be a curvi- 
linear funciton of target attractiveness such 
that favorability increases as attractiveness 
increases from low to moderately high levels, 
and then decreases at exceptionally high 
physical attractiveness levels. Thus, in con- 
trast to the tentative hypotheses of Ber- 
scheid and Walster (1972) as well as Byrne 
et al., exceptionally attractive women may 
evoke images more congruent with Pandora 
than with goodness, 


EXPERIMENT 2 


In order to investigate the curvilinear hy- 
pothesis as well as whether males, in addi- 
tion to females, associate liabilities with 
feminine pulchritude, the first experiment was 
quasi-replicated, Consequently, a fourth 
“medium +” level of target attractiveness 
was included in the replication as well as the 
participation of both males and females. The 
robustness of the beauty stereotype was ad- 
ditionally investigated by requiring half of 
the participants to indicate their confidence 
(Jones & Davis, 1965, p. 224) about each 
attribution, as they proceeded through the 
questionnaire. It was assumed that if the 
Stereotype was weak, attribution effects would 
be attenuated when these additional ratings 
were required. 


Method 
Participants 


Of 354 randomly sampled students at the student 
union and library of the University of Wisconsin— 
Milwaukee, 155 males and 164 females agreed to 
participate. 


Procedure 


The questionnaire was presented as a survey of 
Person perception accuracy, The instructions were 
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detailed on the first page and were essential] ose 
used in the first experiment. 


Target Materials 


The low, medium, and high attractiveness to- 
graphs were those used earlier, On the basis he 
normative ratings collected at Iowa State | y= 
sity, a fourth pair of photographs were se! ied 
which were between (X = 3.04) the mediur d 


high photographs in attractiveness, 


Dependent Variables and Confidence Rat. +s 


Only a subset of the variables in the first si dy 
were assessed in the present survey: the Dion et | 
Social Desirability Index, the spouse/sex scale, e 
marital disaster scale, the parental competen <e 
scale, the social and professional happiness sc 
the bourgeois scale, and the physical attractiven 
manipulation check, Participants also estimated t 
occupations and annual salaries of the target a 
her spouse at age 40. Occupations were scaled < 
the basis of the 1960 U. S. Census Socioeconomi 
Scale (see Bonjean et al, 1967, p. 178), Measure 
were arranged as in the earlier questionnaire, with 
half of the participants making a confidence rating 
after each attribution, 

Thus, a 4X22 (Target Attractiveness X Par. 
ticipant Sex X Confidence) factorial experiment was 
generated. The manipulable factors were presente 
within a between-subjects design, in view of th 
restrictive variance/covariance dispersion assumy 
tions of within-subjects designs. Participants we; 
randomly assigned, of Course, to treatments, 


Results 


Confidence Factor and Physical Attractivenes 
Manipulation Check 


Unweighted means analyses of variance 
revealed virtually no confidence effects. The 
factor, consequently, was eliminated from the 
analyses. 

The manipulation check yielded a highly 
reliable target attractiveness effect, F(3, 311) 
= 73.55, p< .0000001. As expected, the 
attractiveness ratings (3.4, 3.8, 4.8, 5.3) were 
an increasing monotonic function of the 
normative ratings. No other effects were 
detected on this measure. 


Attributions 


As is evident in Table 2, the sample means 
were generally a monotonic function of tar- 
gets’ attractiveness, Departures from mono- 
tonicity, as is evident in attributions about 
the social desirability of the target’s person- 
ality, were not reliable. Internal analyses 
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TABLE 2 
(TIONS YIELDING RELIABLE TARGET ATTRACTIVENESS MAIN EFFECTS IN EXPERIMENT 2 


Target's attractiveness 


Measure Fa 
Low Medium Medium + High 
(n = 79) (n = 78) (n = 82) (n = 80) 
Socia ability of target’s personality? 49.6 53.5 55.9 55.6 10.85 
Targ vanity = 24 3.0 3.5 3.4 14.20 
Tar; requesting a divorce and having ae = 
av. <tramarital affair 4.9 5.5 5.8 6.0 7.07 
Spx s occupational status® 81.0 83.2 90.1 91.6 8.54 
Sp: -se's annual salary (in thousands of 
iilars) 16.8 17.8 21.3 23.1 7.00 
T- »get’s social and professional happiness 11.7 12.4 12.6 13.0 4.61 
“\arget’s bourgeois orientation 8.1 9.6 10.3 9.9 15.59 


» Revised index. 
© Based on U.S, Census Scale. 


based upon participants’ ratings of the at- 
tractiveness of the targets also failed to reveal 
‘epartures from monotonicity. 

Although there were reliable participant sex 
un effects such that females evaluated the 
t. rgets more favorably than male participants 
(. terms of spouse’s salary, target’s socio- 
ec nomic status, the spouse/sex scale, the 
m ital disaster scale, and the bourgeois 
sc: ¢), no Participant Sex x Target Attrac- 
tiv sess effects were detected. 


GENERAL Discussion 


‘e postulated curvilinear relationship be- 
tw- u feminine target attractiveness and at- 
tril. ition favorability was not corroborated, 
for only monotonic relationships were de- 
tecicd. Furthermore, males do not appear to 
be bewitched by feminine beauty, for they 
as well as female participants reliably associ- 
ated certain liabilities with increasing levels 
of attractiveness. Finally,  attributions— 
whether favorable or unfavorable—appeared 
sufficiently robust so as to withstand over 
two dozen queries regarding attributional con- 
idence. 

“onsidered together, the current experi- 
mes. ‘s corroborate previous research indicating 
that =ttractive women are expected to be 
More :>ciable, heterosexually alluring, pro- 
fession iy successful, and personally happy 
M comparison to unattractive women. The 
Current findings, however, also differ from 


Note. The larger the mean, the more socially desirable the personality, the greater the vanity, etc. 
a All F ratios are significant at least at the .004 level (df = 3, 311), 


previous reports in that attractive women are 
expected to be more conceited, likely to en- 
gage in adultery, and be bourgeois as com- 
pared to unattractive women. The divergence 
between these and previous findings can most 
readily be attributed to the fact that the 
variables assessed in the current experiments 
have been more diverse than those assessed 
in earlier studies. 

This divergence sugests that attractive as 
compared to less attractive women may not 
always have a social advantage. Advantage 
may be contingent upon which dimensions 
are stressed within a particular social set- 
ting. A woman stereotyped as sociable, hetero- 
sexually alluring, and likely to be profession- 
ally competent may have, for example, the 
best chance of getting a job in sales or pub- 
lic relations. If she, however, is additionally 
stereotyped as conceited, adulterous, and 
bourgeois, she may have the greatest diffi- 
culty in acquitting herself of a prostitution 
charge or prosecuting a rapist. A physical 
attractiveness stereotype exists; its content, 
however, does not appear to be perfectly 
compatible with the “what is beautiful is 
good” thesis. 
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‘ereotype Attribution Revisited: The Role of Stimulus 
Characteristics, Racial Attitude, and 
Cognitive Differentiation 


Jack M. Feldman and Robert J. Hilterman 
Department of Management, University of Florida 


Race, occupation, and social background information were manipulated to 
create eight realistic stimulus persons. The attribution to these stimuli of 
racial and occupational stereotypes, as well as the empirically defined trait 
dimensions of “conservatism,” “upward mobility,” and “successfulness” was 
found to depend primarily on factors implying long-term stimulus-person 
behavior, such as occupation or social mobility, as attribution theory predicts, 
Race was primary only for attribution of “conservatism.” Cognitive differen- 
tiation, racial attitudes, sex, and awareness of the purpose of the study inter- 
acted with the stimulus person factors, providing insight into the relationship 
between prejudice and stereotyping. Results are discussed in terms of cognitive 
differentiation, attribution theory, limitations of the stereotype concept, and 


methodological issues in stereotype research, 


Feldman (1972) has demonstrated that ‘the 
(tribution of racially and occupationally 
ased stereotypes to multiply categorized 
imulus persons is predictable from a version 
|" attribution theory (Jones & Davis, 1965). 
cial stereotype attribution was also found 
be consistent with the idea that other- 


\ 
y ‘e unidentified blacks are assumed to be 
v ‘king-class or lower-class people (see also 
E yton, McAllister, & Hamer, 1965). How- 
ev ~, contrary to expectations, Feldman found 
nc effect of racial prejudice on stereotyping. 

‘he purpose of this article is to replicate 
a: extend previous findings, using a geo- 
graphically and temporally distinct subject 
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+See Feldman (1972) for a discussion of the simi- 


larity of implicit personality theories and stereotypes, 
“Centrziity” as used here is identical with the con- 
ceptions sch (1946) and Wishner (1960) and 
“central 


on” as discussed by Scott (1969). 


population, an improved methodology, and an 
additional individual difference variable, cog- 
nitive differentiation, in order to establish 
more fully the link between stereotyping and 
other attribution processes and demonstrate 
the relevance of attitudinal and cognitive- 
structure variables to the inference process. 

Stereotypes are attributed on the basis 
of category membership (see Brigham, 
1971a; Secord, 1959; Secord, Bevan, & Katz, 
1956). If an individual has strong positive or 
negative feelings about an attribute defining 
a particular category (like race), it seems 
reasonable to expect that a stimulus person’s 
category membership would constitute a 
“central trait” for that individual, and there- 
fore be dominant in the attribution of 
traits correlated with membership (i.e., stereo- 
types).* In other words, extreme affect 
toward a category of people directs attention 
to attributes which define membership in that 
category (like skin color) and promotes the 
use of that categorization as a basis for trait 
inference about individual stimulus persons 
(Brigham, 1971a; Secord, 1959; Secord & 
Backman, 1964). As Tajfel (1969, p. 327) 
has pointed out, “the accentuation of differ- 
ences between classes of stimuli [which have 
emotional significance] may be expected to be 
comparatively sharper.” Jones and Davis's 
(1965) idea of the hedonic relevance of be- 
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havior is similar, though based on a different 
conceptualization of the underlying process. 
Feldman’s (1972) finding that prejudiced 
subjects did not stereotype black stimulus 
persons more than the nonprejudiced when 
occupation and social background were varied 
simultaneously with race may be explained 
both theoretically and methodologically. 

The theoretical explanation involves a sec- 
ond individual difference variable, cognitive 
differentiation, which is defined by Streufert 
(Note 2, p. 8) as: “The process of dividing 
cognitive semantic space (and the degree to 
which this division has been achieved relevant 
to a specific stimulus configuration) into two 
or more . . . bipolar semantic dimensions.” 
This definition agrees with Kelley’s (1955) 
“hierarchy of constructs,” Bieri’s (1966) and 
Zajonc’s (1960) “cognitive differentiation,” 
and Scott’s (1969) concept of “dimensional- 
ity” or his earlier (1962, 1963) definition of 
“complexity.” 

Tf an individual habitually uses a large 
number of dimensions to conceptualize the 
interpersonal domain, we would expect a 
lesser degree of stereotyping than if only a 
few dimensions were used, simply because the 
more differentiated individual would attend to 
more attributes of any given stimulus person 
(Crockett, 1965, pp. 64-70, 88; Gollin & 
Rosenberg, 1956; Scott, 1963, 1969; Steiner, 
1954; Steiner & Johnson, 1963). He would 
also tend to make more tentative and less 
extreme judgments about others ( Kelley, 
1955) and make judgments which are less in- 
fluenced by evaluative biases (Scott, 1963). 
However, we cannot predict, from knowledge 
of cognitive differentiation alone, which par- 
ticular attribute an individual would choose as 
a basis for stereotype attribution. One indi- 
vidual may use social class, another political 
beliefs, another sex, and so on. The choice 
of an attribute, or attributes, from which to 

draw inferences may be a function of the af- 
fective value of the attribute to the person. 
Thus, the beliefs expressed would be a func- 
tion of the stimulus person’s position on the 
“central” attribute rather than other, equally 
available information. Feldman’s (1972) fail- 
ure to discover an effect of racial prejudice on 
stereotype attribution may have been due to 
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neglect of cognitive differentiation as a : 
ator of the prejudice-stereotyping relatio ‘ip. 

Though measures of cognitive rigidit 
authoritarianism often covary with pre 
(Bettelheim & Janowitz, 1964; Fre el- 
Brunswik, 1949; Triandis & Triandis, Ns 
attitude and cognitive structure are 
ceptually independent, and their influence on 
stereotyping processes must be assessed = 
pendently. 

Methodological considerations are also: +- 
vant to the interpretation of previous stud s. 
Within-subject designs, commonly used in t ‘s 
area, may make the purpose or hypothesis £ 
the study salient to subjects. Norms of soc: ‘1 
desirability may be engaged or demand ch: 
acteristics communicated (Feldman, Note 
reducing the degree of racially based stere: 
typing. Range effects in within-subject di 
signs are another undesirable feature (Poul 
ton, 1973). Between-subject designs are 
more appropriate, in spite of their relative 
inefficiency. Also, since people generally re- 
spond to a single stimulus person, rather th: n 
a series of systematically different stimuli, t 
between-subjects design seems to offer greai 
external validity. 

It is also desirable to assess subjects’ awa 
ness of the study’s purpose, since the kno 
edge that stereotypes are being investigat- | 
may easily produced biased responses. Mi- 
mally, subjects should be partitioned in o 
“aware” and “unaware” groups, so that main 
effects and interactions can be assessed, 

Finally, sex differences in stereotyping 
should also be investigated. Since women are 
socialized differently than men, it is possible 
that they may have different attribution 
strategies, or at least a differential willingness 
to use stereotypes. Although no predictions 
about sex differences can be made, considera- 
tions of external validity favor the inclusion 
of females in the subject population. 

Following the theoretical and methodologi- 
cal arguments above, the following predictions 
can be made: 


1. Individuals who are low in cognitive 
differentiation and express extreme affect 
(positive or negative) toward blacks will re- 
spond more to the race of a stimulus person 


than individuals not so characterized. 
This ie reflected in interactions involving 
race :itive differentiation, and attitude. 

2 plicating earlier findings, factors im- 
ply long-term behavior will control more 
va e than race in both racial and occupa- 
tic stereotype attribution. This will occur 
es + when awareness of the study’s purpose 


© hypotheses is controlled and will be re- 
l ected in occupation and social background 

{ain effects and interactions of these factors 
with each other and with race. 

3. Individuals high in cognitive differentia- 
tion will see greater differences between 
occupational levels than those low in differen- 
tiation. This will be reflected in interactions 
between differentiation, occupation, and social 
background. 

4. Individuals judged to be aware of the 
vurpose or hypothesis of the study will re- 
pond to black and white stimuli more simi- 
arly than those judged to be unaware. This 
will be reflected in interactions of awareness 
with race. 


METHOD 
ib jects 


subjects were 48 white male and 48 white female 

id volunteers, recruited through posted sign-up 

ets and advertisements in the campus newspaper. 

were undergraduate students. They were run 

i small, mixed-sex groups for 1 hour each and were 
i $2.00 for their participation. 


¿struments 


ittitude. A disguised attitude measure was devel- 
d on a pretest sample. It consisted of five 
emantic differential evaluative scale ratings of six 
current racial issues and racially relevant individuals 
and groups, such as “Shirley Chisholm” and the 
“Ku Klux Klan.” Ratings were embedded in a 
longer questionnaire, including ratings of concepts 
such as “X-rated movies” and “capital punishment.” 
The five evaluative scales were summed to give one 
score for each item. All but one of the racially rele- 
vant items had correlated between .20-.49 with eval- 
uation of “black people” in the pretest sample. A 
ubject’s attitude score was the average of his scores 
ior the six racially relevant concepts. Scores could 
re se from +15 to —15. 

Co snitive differentiation. The same pretest sample 
Was -d to develop a measure of cognitive differen- 
atterned after Scott’s (1962, 1963) instru- 

est subjects were asked to write down as 
nt “types of people” as they could think 
n any manner they wished (i.e, nation- 


STEREOTYPE ATTRIBUTION REVISITED 


1179 


ality, occupation, personality, etc.). They then placed 
these “types” into as many categories as they could, 
using a single type as frequently (or infrequently) as 
they thought appropriate. Scott’s (1962) R measure ? 
was then computed for each pretest subject. After 
examination of the original free responses, a list of 
20 types was constructed. Six categories were pro- 
vided, and a second pretest sample was asked to 
place each of the 20 types in as many (or few) 
categories as they could, and name each category 
appropriately. R was also computed for each of 
the pretest subjects in the second sample. Compari- 
sons of the distribution of R in the two samples 
showed both to be approximately normal. The list 
of 20 types and six categories was retained as the 
measure of differentiation, and a computer program 
was written to compute R for each subject. fi 

Stereotype attribution. The Probability Judgment 
Scale (Feldman, 1972) was again employed in this 
study. This requires subjects to rate, from 0 to 100 
in steps of 5, the probability that a given stimulus 
person has a given personality trait. 

In order to enhance the realism of the task, stimu- 
lus persons were referred to as “Mr.” followed by 
two initials and given an age between 35 and 37. 
Typical working-class and/or professional occupa- 
tions were listed for the stimulus person; his “father’s 
occupation” was also listed. This part of the study 
was a partial replication of Feldman (1972). 

A response-set measure, also taken from the pre- 
vious study, was included as well. This required 
subjects to rate a “Mr. R. S.” on the same scale 
with no other information given. Subjects were told 
this rating would serve as a control.’ 

The 29 trait adjectives which composed the scales 
themselves were taken from Feldman (1972), A pre- 
test replication of the original stereotype develop- 
ment study showed that the racial and occupational 
stereotypes were valid for the new population. 
Additional trait adjectives, which had discriminated 
on both racial and occupational bases in the earlier 
study, were also included. 

Five dependent variables were derived factor ana- 
lytically from this list. Two of these factors corre- 
spond to the earlier racial and occupational stereo- 
types. Other trait dimensions were also revealed. The 
complete adjective list and factor analysis solution 
is presented under Results. 

Awareness. Subjects’ awareness of the purpose and 
hypotheses of the study was assessed through a 


? R is defined by Scott as where: 


logs n’ 


H = FP; log: = = loga n — 1 Eni loge n; 
Lwhere n is the total number of objects; n is the number 
that appears in a particular combination of categories; 
and P; = n,/n]. 

* Analysis of response-set measures revealed no sig- 
nificant, interpretable differences. They are not con- 
sidered further, 
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two-page free response questionnaire patterned after 
Dulany’s (1961). Subjects were asked what they 
thought was the purpose of each questionnaire, if 
they thought there was anything the experimenter 
wanted them to do (or not to do), how much they 
wanted to comply with the experimenter’s wishes, 
and whether they had tried to follow some consistent 
strategy throughout the task. Both experimenters 
read each subject’s questionnaire independently and 
judged each to be aware or unaware. Disagreements 
were resolved by discussion until an unequivocal 
judgment could be made. 


Design and Procedure 


The design is equivalent to a 7-factor between- 
subjects analysis of variance. Manipulated character- 
istics of the stimulus person were race (black or 
white), occupation (professional or working class), 
and social background (“father’s occupation”). Sub- 
ject characteristics were the following: affect toward 
blacks, cognitive differentiation, sex, and awareness. 
In order to retain attitude and cognitive differentia- 
tion as continuous variables, multiple regression 
analysis (Cohen, 1968) was used instead of the more 
familiar analysis of variance techniques. Details are 
presented later. 

Subjects either were contacted by telephone or 
called the experimenter in response to the newspaper 
advertisements. Small sessions were arranged for 
weekday evenings. Upon arrival, subjects were 
randomly assigned to one of the eight manipulated 
stimulus person conditions, subject only to the con- 
straint that at least six males and six females be 
represented in each cell of the design. 

Subjects were then told that the purpose of the 
study was to assess the accuracy of the “average 
person’s” personality judgment skills and to deter- 
mine the effect of different amounts and kinds of 
information on the accuracy of such judgments. They 
were told that their job was to rate a real person 
on the basis of a limited amount of information, in 
order to compare their judgments with person- 
ality inventories supposedly completed by the stimu- 
lus persons. They were urged to be as accurate as 
possible. 

The attitude and differentiation measures were 
explained as attempts to discover the factors under- 
lying individual differences in accuracy; the aware- 
ness questionnaire was presented as an attempt to 
gain feedback on subject's understanding of the in- 
structions and the experimental procedures, The 
instructions for each section of the judgment task 
were read by the experimenter immediately prior to 
that section, The order of the tasks was: stereo- 
type questionnaire, response set measure, cognitive 
differentiation measure, and attitude scale. 

Each subject’s consent was obtained before expo- 
Sure to any experimental materials. After analysis 
of the data, the subjects were sent letters explaining 
the need for deception and outlining the results and 
their interpretation, 
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RESULTS 
Dependent Variable Definition 


Since analysis of each of the 29 trials would 
have been too cumbersome, and would pose 
problems of interpretability, the adjective 
ratings were subjected to principal-axis factor 
analysis with unit communalities, Five fac- 
tors, accounting for 32% of the common vari- 
ance, were extracted and rotated using the 
varimax criterion. The traits and factor 
loadings are listed in Table 1. 

Dependent variables were defined by sum- 
ming, with appropriate negative or positive 
weights, all traits which loaded more than .40 
(marked with an asterisk in Table !) in 


absolute value on each factor. Factor © was 
labeled “Occupational (middle class) eo- 
type”; II, “Racial (black) stereotype”; I, 
“Conservatism”; IV, “Upward mobility”: od 


V, “Successful people.” Factors I and if 
were labeled on the basis of the genera! 
larity to the racial and occupational ster20- 
types discussed in Feldman (1972), The other 
labels represent judgments of the authors. 

It should be noted that no claim of gener- 
ality is made for these factors. It is quite 
likely that, as in the case of the Behavior Dif- 
ferential (Triandis, 1964), the factor struc- 
ture is dependent on the characteristics of the 
stimulus persons for whom the judgments 
are made. 


iil 


Analysis 


Stepwise multiple regression analysis (Cohen, 
1968) was used to test the substantive hy- 
potheses in preference to more typical analy- 
sis of variance techniques for several reasons. 
The interpretation of a regression analysis is 
identical to an analysis of variance when 
dummy-variable coding with orthogo«=) 
weighting is used, but regression analy 
lows the retention of attitude and cogniive 
differentiation scores as continuous variables, 
Blocking on these variables would have re- 
duced cell sizes below acceptable limits. Also, 
due to the complexity of the design and rela- 
tively specific hypotheses to be tested, many 
higher order interactions were ignored. The 
regression analysis seemed to be the more 
efficient tool under these circumstances. 


TABLE 1 


TOR MATRIX DEFINING 
DENT VARIABLE SCORES 


| Factor Loadings 
| 


ultured 
Nationalistic Len? 
Foresighted 
Thrifty 
Conservative 
Slow-moving i! 
Athletic at 
Persistent 
Political 
Self-satistied 
Resourceful 
Pea 

meds 


I loadings roun 
rajts were include 
pendent variable sci 


d to two decimals. As indicated, 
in two factors, Items included in 
e are denoted with an asterisk (*). 


hen tests of specific mean differences were 
ved, the attitude and cognitive differentia- 
distributions were partitioned into thirds, 
Scheffé tests (Hays, 1963) carried out, 
ig the standard error of estimate (Sy-x) 
‘he multiple regression equation as the 
or term. This results in a conservative test, 
ince all possible interactions were not re- 
ved from the error variance. The .05 level 
was used for this reason. In some cases, dif- 
ferences in cell means could not be tested 
directly, due to small cell sizes. In these cases, 
tests of significance were made on the slopes 
of the regression lines (Hays, 1963) for the 
conditions involved in the interaction, using 
all available data points. A significant result 
for such a test means that the slopes of 
ve regression lines of two variables differ 
c: vending on the level of the third. 
the analysis, all manipulated or discrete 
ex and awareness) independent vari- 
re coded orthogonally, as suggested 
by Co:n (1968). Attitude and cognitive 
differen i:‘ion scores were standardized about 
the gran: mean of each variable, and these 


(i.e 
ables 
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standard scores were used in the regression 
equation. Interaction terms were generated by 
using the cross-products of the variables in 
question. 

The following variables were included in 
the stepwise regression analysis: main effects 
of race, occupation, social background, sex, 
awareness, cognitive differentiation, and atti- 
tude; the two- and three-way interactions of 
race, occupation, and background; the inter- 
actions of each of the stimulus character- 
istics (and their interactions) with sex, atti- 
tude, cognitive differentiation, and awareness 
(and their interactions) and the 3-, 4-, and 
5-way interactions of sex, awareness, and 
their interactions with race, occupation, back- 
ground, attitude, and cognitive differentiation. 
A total of 75 predictor variables (main and 
interaction effects) were used in all. This set 
allowed tests of the hypotheses, replication of 
earlier findings, and assessment and elimina- 
tion of any potential biases due to awareness 
or sex differences. Order of entry into the 
regression analysis was determined by the 
percentage of variance controlled by each 
effect. 

Table 2 presents the regression analyses of 
the five dependent variables. To conserve 
space, only significant effects are included. Re- 
sults are discussed in terms of each hypothesis. 


Tests of Hypotheses 


Hypothesis 1, which predicted that indi- 
viduals low in cognitive differentiation and 
extreme in racial attitude would respond more 
in terms of race, received little support. As 
Table 2 shows, the predicted Race x Cogni- 
tive Differentiation X Attitude interaction oc- 
curred on the upward mobility factor and, 
moderated by awareness, on the racial stereo- 
type as well. However, inspection of these 
interactions reveals that the predicted effects 
did not occur. (Discussion of awareness effects 
on the racial stereotype are presented below.) 

Analysis of the simple interaction of Cogni- 
tive Differentiation X Attitude (within the 
three-way interaction with Awareness on the 
upward mobility factor, and the four-way 
interaction with Sex and Awareness on the 
successful people factor) indicates that the 
effect of attitude is greatest at high and 
moderate cognitive differentiation levels. 
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TABLE 2 
STEPWISE REGRESSION ANALYSIS OF STEREOTYPE SCORES 
Standardized 
Dependent variable Source Increment in R? regression Fi 
Occupational stereotype oO 62 —.74 151 
(R= .89)*** : RXB -03 —.22 7 
R X Att X Aw 02 23 6 
S X Aw :02 -20 5.1 
RXO .02 15 6.9 
OxXBXCD .01 paca! 4.5¢ 
R X B X CD X Aw -02 23 6.14 
CD -02 17 8.42 
ial s t: = .63)** RXO 09 8.89" 
Racial stereotype (R = .63) 3 x oe Biers 
R -08 9.18** 
OXB .05 6.50* 
OX CD X Aw -04 5.13* 
R X CD X Att X Aw 04 6.30* 
CD X Att X Aw 03 4.03" 
Conservatism (R = .57)*** R 16 37 17.39%** 
R X Aw 05 z9 5.70* 
RXOXB 05 — 22. 5.88* 
RXOXBXS .04 —.19 4.60* 
B -03 18 4.31* 
Upward mobility (R = .63)*** oO 19 —.49 22.06**”: 
Oxs 04 24 5.21* 
RXOXS -03 —.26 4.23" 
CD X Aw 04 —.27 4.81" 
CD X Att X Aw 03 24 4,19* 
RX CD X Att 03 = 21 4.71* 
Successful people (R = 200) ee oO 14 —.38 15.40** 
SX CD X Att X Aw 09 26 11,03** 
RXO 07 30 8.65** 
O X Att X Aw 07 32 10.56** 
OXB 04 —.20 5.76* 
RXB 03 =17 4.63* 


Note, R = race; O = occupation; B = background (father's occupation); S = subject’ 


ded to two decimal places. The 
order in which they were added to the regression equation by the program, “Increment in R2" 


of-variance estimate for each effect, 
*p <05. 
kp <01. 
wr p < 001. 


The unaware subject’s responses to the 
racial stereotype and upward mobility factors 
also show that the largest effects of attitude 
are found among the high cognitive differenti- 
ation subjects. Interestingly, upward mobility 
ratings of whites differ as a function of atti- 
tude toward black people, while ratings of 
blacks are unaffected. Details of these effects, 
omitted to conserve space, may be obtained 
from the authors. 

A Race x Attitude effect, found within the 
three-way interaction of race, attitude, and 
awareness on the occupational stereotype, 
Supports Feldman’s (1972) conjecture that 


CD = cognitive differentiation; 
order in which effects appear is le 
can be interpreted as a percenta: 


highly anti-black individuals tend to assume 
that blacks are working class to a greater 
extent than the nonprejudiced. The reverse is 
apparently also true if pro-black attitudes 
are considered. Within the unaware group, 
those with unfavorable attitudes toward 
blacks rate whites higher than blacks on traits 
such as intelligent, cultured, well-dressed, etc. 
(see Table 7). No significant differences are 
present at the moderate attitude level, but at 
the favorable end the relationship is signifi- 
cantly reversed. (Discussion of awareness ef- 
fects are deferred for the present.) It seems 
as though attitude and cognitive differentia- 
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tion do ence trait attribution, but not in 
the hy: ized manner. 

Hy; ıs 2, which predicted that occupa- 
tion vackground factors would control 
more ınce than race, and that race, occu- 
patio end background effects replicating 
earl’ findings would occur, was supported. 


As sle 2 shows, the only large race main 
eff, occurred on the conservatism factor 
(V aites, Æ = .670; Blacks, X = .558). The 
o upation main effect on the racial stereo- 
i pe was as strong as the race main effect, 
‘ough the race effect’s significance validates 
he conception of the factor as a racial stereo- 
‘ype (Black, X = .540; White X = .468). 

The fact that these factors did not gener- 
ally interact with awareness suggests that 
social desirability and demand characteristics 
cannot account for these data. It also provides 
further support for the concept of stereo- 
is a subcategory of the general 

ution process. 
ignificant occupation main effects for all 
‘pendent variables, omitted to conserve 
‘ce, May be obtained from the authors. 
le 3 presents the significant Race X Occu- 

p ‘jon interactions. 
he “upward mobility effect” (Feldman, 
1 2) is evident on the successful people fac- 
t, since black professionals are higher than 
a others on this dimension. This effect is 
r arginal on the occupational stereotype, 
sice racial differences within occupational 
l vels only approach significance (.05 < p 
-10), The Race X Background interactions 
occurring on the occupational stereotype and 


TABLE 3 
Race X Occupation INTERACTION EFFECTS 


Dependent variable 


Stimulus dimensions Occupa Racial’ “Successful 
stereotype Stereotype people 
ratings ratings ratings 
Slack, working class 15? 4 —.02> 
L =ck, professional 46" 48 +163. 
Wite, working class 19> -54 02 
White, professional Alb 395 07% 


és. All figures rounded to two decimals. Within a column, 
script indicates a significant (p < .05) race differ- 
ence wiihia the occupation level; a b superscript indicates a 
" significan: occupational level difference within the race level, 
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TABLE 4 
Race X Soctat BACKGROUND INTERACTION 


Dependent variable 


Stimulus dimensions Doeiii E 
stereotype people 
ratings ratings 
Black, working class 098" 
Black, professional O04 
White, working class 0150 
White, professional -08% 


an a superscript indicates a significant (p < .05) race differ- 
ence within the social background level; a b superscript indi- 
cates a significant social background difference within the race 
level. 


successful people factors both demonstrate 
that social background does not significantly 
affect the ratings of blacks. Rather, the 
ratings of whites change as a function of 
social class background. Table 4 shows this 
effect clearly, as background effects occur 
within the white, but not the black, race level. 
A similar effect is found in Table 3 for the 
racial stereotype, but it is reversed for the 
successful people factor. 

The Occupation X Background interactions 
on the racial stereotype and successful people 
factors replicate previous findings. The previ- 
ously observed “downward mobility effect” 
again occurs on the racial stereotype. Stimulus 
persons of working-class occupation and pro- 
fessional background are higher on the racial 
stereotype than any others (p < .05 for all 
comparisons). Similarly, those of working- 
class background and professional occupation 
are highest on the successful people factor, 
though not significantly higher than those of 
professional occupation and background. The 
Race X Occupation X Background interaction 
on conservatism, presented in Table 5, shows 
the strong influence of both race and social 
background. The direction of the interaction 
is reversed depending on the race of the 
stimulus person. Downwardly mobile blacks 
are seen as the least conservative, while up- 
wardly mobile and consistently working-class 
whites are the highest on this factor, Profes- 
sional whites’ and working-class blacks’ per- 
ceived conservatism is significantly lower 
with a professional than with a working-class 
background, while no significant background 
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TABLE 5 


RACE X Occupation X BACKGROUND INTERACTION: 
RATINGS ON CONSERVATISM 


Race 
Occupation/background 
Black White 

Professional 

Professional -60 -60 

Working class 56 72 
Working class 

Professional AT 07 

Working class 60 70 


Note, All figures rounded to two decimals. Any difference 
Jarger than .102 is significant at the .05 level by Scheffé test. 


effects are found for professional blacks or 
working-class whites. 

These results, as presented in Tables 3-5, 
indicate the generality of the findings of 
Feldman (1972) over a period of several 
years, changes in the geographic location of 
the subject population, the task, and in the 
method of obtaining subjects. With some ex- 
ceptions, mentioned below, the results also 
replicate across sex. 

Hypothesis 3 predicted that subjects high 
in cognitive differentiation would see greater 
differences between occupational and social 
background levels than those with low dif- 
ferentiation scores. The Occupation x Back- 
ground X Cognitive Differentiation interac- 
tion on the occupational stereotype, presented 
in Table 6, gives some support to the hypothe- 
sis. Moderate and high differentiation sub- 
jects rate stimulus persons of professional 
background and working-class occupation sig- 
nificantly higher on this factor than stimuli 
of working-class occupation and background. 
Low differentiation subjects show a nonsignifi- 
cant reversal of this pattern. No significant 
differences between social background levels 
occur for stimuli of professional occupation, 
suggesting that achievement of professional 

occupation is an extremely strong cue regard- 
less of differentiation level. 

A similar effect is found within the Occupa- 
tion X Cognitive Differentiation x Awareness 
interaction on the racial stereotype. In the 
unaware group, low differentiation subjects 
rated professional and working-class stimuli 
similarly (professional, ¥ = .511; working- 
class, X = 493). Moderately differentiated 
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subjects rated working-class higher an 
professional (professional, X = .420; we: ng- 
class, X = .556; p < .05), as did the © ly 
differentiated (professional, X = .448; k- 


ing class, ¥ = .545; .05 < p < .10). 

An unexpected finding, discussed in co c- 
tion with the tests of Hypothesis 1, is t 
the effects of attitude are more prono 1 
at high cognitive differentiation levels, e 
highly differentiated subjects apparently ı 3 
greater use of all information sources, :- 
cluding their affect toward the stimulus ņ +- 
son, than do the less differentiated. This ‘s 
consistent with the measurement technic 2 
used, which assesses the degree to which sti 
uli can be categorized on multiple dimensio: 
For attribution purposes, affective respon 
seems to be one of the dimensions used by tl 
highly differentiated. The cognitive different 
ation main effect on the occupational stereo 
type is likely to be a chance finding, since 
post hoc tests of a three-way partition of 
cognitive differentiation reveal no significant 
differences. 

Hypothesis 4, which predicted that awa 
subjects would rate blacks and whites nx 
similarly than unaware subjects, is mo: 
strongly supported by the Race X Awarene 
interaction on conservatism. Aware subjec 
give significantly higher ratings to blac 
than the unaware (aware, X= .60 
unaware, X= 516; p< 05). The opposi'e 
trend is observed for whites, but is not sig- 
nificant (aware, X = .644; unaware, X - 
-686). The black-white comparison for un 


TABLE 6 


OCCUPATION X BACKGROUND X COGNITIVE 
DIFFERENTIATION INTERACTION: RATINGS 
ON THE OCCUPATIONAL STEREOTYPE 


Cognitive differentiation 
Occupation/background 


Low Moderate High 
Professional 
Professional .42 AT 46 
Working class 40 45 4 
Working class 
Professional AS c22S .20° 
Working class 20 108 v1 


, Note. All figures rounded to two decimals, An a superscript 
indicates a significant difference between background levels 


within the working-class occupational category. 


aware sı is also significant (p < .05), 


while tl the aware subjects is not. 

The < Attitude X Awareness interac- 
tion on sccupational stereotype (presented 
in Ta!) |) further supports the hypothesis, 
as we’ - revealing that awareness leads to a 
rever of the Race X Attitude effect. 


A scussed previously, in the unaware 
grov an unfavorable attitude toward blacks 
leao {o a significantly higher rating for whites 
th: blacks; those with favorable attitudes 
re erse this relationship, Although the black- 
v ite differences are not significant for 
“evare subjects, the slope of the “aware— 

nite” regression line is significantly different 
(P< .05) from both the “unaware—black” 
and the “cnaware-white” lines. Awareness 
thus. leads to a reduction in racially based 
responses, as predicted. 

A ness seems to reduce the tendency to 

ail categories of stimuli. 
ground X Cognitive Differ- 
ness interaction on the oc- 


ci nal ype illustrates this. Unaware 
hi wid low differentiation subjects show a 
sig. cant difference between whites of pro- 
fe 1) background and whites of working- 
cla. background. These differences disappear 


for lie aware subjects. In addition, the 
dif ence between high differentiation aware 
unaware subject’s ratings of white pro- 
fes- onal is significant (p < .05). 

„wareness also suppresses the significant 
Occupation-level differences noted previously 
_ in ihe Occupation X Cognitive Differentiation 
X Awareness interaction on the racial stereo- 
type, though only at the moderate differenti- 
_ ation level. The interaction of cognitive dif- 
ferentiation, attitude, and awareness on the 
racial stereotype suggests the operation of so- 
cial desirability bias. On the racial stereotype, 
a significant (p < .01) difference in the slopes 
of the regression ‘lines of cognitive differentia- 
tion on stereotype attribution in aware and 
aware groups indicates that high differenti- 


‘aware, low differentiation: professional, X 
= cvs, working-class, X = .250, p < .05. Unaware, 
i crentiation: professional, ¥ = -513, working- 
210, p < .05. Aware, low differentiation: 

-193, working-class, ¥ = .239. Aware, 
tiation: professional, X = .306, working- 


303. 


class, x é 
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TABLE 7 


Race X ATTITUDE X AWARENESS INTERACTION: 
RATINGS ON THE OCCUPATIONAL STEREOTYPE 


Awareness 
Race/attitude 
Aware Unaware 

Black 

Negative 23 «29b 

Neutral 32 24 

Positive .29 42» 
White 

Negative 128 AO > 

Neutral +32 27 

Positive 34 129% 


Nole, All figures rounded to two decimals. An a superscript 
indicates a significant awareness difference within race and 
attitude levels; a b superscript indicates a significant race 
difference within awareness and attitude levels. 


ation reduces stereotyping only when subjects 
are aware of the study’s purpose. 

Turning to the successful people factor, 
the Occupation x Attitude x Awareness in- 
teraction is a function of differences in groups 
both favorable and unfavorable in attitude 
toward blacks. Aware subjects of unfavor- 
able attitude rate professionals significantly 
(p< .05) higher than working-class people, 
while unaware subjects show no difference. 
Among subjects with pro-black attitudes, the 
unaware rate professional significantly higher 
than working-class people (p < 05), while 
the aware subjects show no differences. Inter- 
preting this difference in terms of social desir- 
ability leads to the conclusion that those with 
favorable attitudes toward blacks believe 
working-class people are lower than profes- 
sionals on traits such as resourcefulness and 
higher in traits such as coarseness, but do not 
think they should express their beliefs; those 
with anti-black attitudes do not believe that 
an occupational difference exists, but appar- 
ently think they should say that it does. This 
might be a function of demand character- 
istics and “liberal” norms existing on college 
campuses. 

Interactions with sex, not discussed in 
detail for reasons of Space, suggest that 
women differentiate among stimuli less than 
men for some kinds of judgments, such as 
conservatism, and more than men for others, 
such as upward mobility. Norms of social 
desirability may also differ between sexes. A 
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more complete discussion of all of the points 
above may be obtained from the authors. 


Discussion 


The major findings of this study, briefly, 
are fourfold. First of all, stereotyping has 
been established as a subcategory of a gen- 
eral attribution process, since stereotype attri- 

bution was shown to follow the same prin- 
ciples as other types of trait attribution. This 
process was shown to be general over time 
and subject population and is not due to 
social desirability biases. Second, cognitive 
differentiation was shown to be related to the 
amount of information used in the attribution 
process and to the extremity of trait attribu- 
tion to discriminably different stimulus per- 
sons. Third, it has been demonstrated that 
awareness effects must be controlled, since 
subjects who are aware do respond with what 
they believe are socially desirable responses. 

Finally, racial attitudes do influence stereo- 

type attribution, but they do so indirectly, 

through the assumptions made about social 
class relevant attributes by those of both 
positive and negative attitude. 

The following discussion considers the data 

bearing on each hypothesis. Hypotheses 1 
and 3 can be discussed together, since they 
both refer to cognitive differentiation as an 
influence on attribution processes, Hypothesis 
1 was not supported, while Hypothesis 3 
received moderate confirmation. The data 
bearing on both hypotheses, however, lead to 
the conclusion that individuals who categorize 
stimuli on a larger number of dimensions also 
use a larger number of stimulus person char- 
acteristics as cues in the attribution process, 
regardless of the nature of those cues. While 
not a new or especially startling finding, the 
data do point out that differentiation is essen- 
tially content-free. That is, individuals high 
in differentiation make use of more informa- 
tion than low differentiation people, including 
their affect toward the stimulus as well as 
stimulus characteristics such as occupation or 
background. 

Apparently, those low in differentiation do 
not discriminate strongly on the basis of race 
regardless of their racial attitudes. They 
apparently disregard variations in occupation 
as well. This seems to contradict previous 
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findings of a relationship between 


tarianism, cognitive rigidity, and the liks and 
prejudice. However, these previous fi: ‘ngs 
are not necessarily due to perceived er- 
ences between whites and blacks on thi art 
of prejudiced individuals. Prejudice is a tie 
dimensional phenomenon (Triandis, 1) 


and may very well be a function of al 


norms (Mezei, 1971) regarding approp te 
behaviors toward blacks. If an individu is 
taught strict norms of discrimination ag: st 
blacks and to express negative sentim its 


about blacks, this may produce “prejudii 
responses on an attitude scale or social 
tance measure, as well as reducing the pers« :\’s 
cognitive differentiation in the interperso al 
domain. All blacks, regardless of occupati 1, 
would be treated alike. At the same time, 0 
difference in beliefs about black’s and whit: s 
personality traits would be generated. THe 
individual could believe that blacks are mor = 
often lower-class, but faced with a profes- 
sional-status black he might well respone in 
terms of occupation to questions about ‘he 
personality of the stimulus person, leaving 
norms of behavior and affect toward bla 
unchanged. At the same time, the general st 
ulus “Negro” or “Black person” might el 
the “standard” stereotype, which is attribu: 4 
more as a function of occupation than r: 
This is consistent with the findings of Sec..d 
(1959), Secord and Backman (1964), E ig- 
ham (1971a), Feldman (1972), and with the 
finding of a Race X Attitude occupatio: ıl 
stereotype interaction in the present data. 
The above is also consistent with Fishbein < 
(1967) attitude model, if it is assumed that 
the evaluation of a belief about a stimulus 
person might vary as a function of the stimu- 
lus person as well as the person holding it. In 
other words, both a prejudiced and nonpreju- 
diced person might believe that blacks wear 
extremely stylish clothing, but the former 
would evaluate this trait negatively, while the 
latter evaluates it positively. It might also he 
that a “concept-scale interaction” exists, in 
that a trait such as dress style is evaluated 
negatively if the stimulus person is black, or 
if the situation makes it clear that race is a 
relevant issue, but in a neutral or positive way 
if the stimulus person is white. To a person 
with negative racial attitudes, “stylish dress” 
may be a very different concept for black and 
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white s i, In either case, however, there 
would 10 necessary distinction between 
pro- à ti-black individuals in the belief or 
proba v aspect of their attitude, the focus 
of si iype research. Only those of high 
cogni- differentiation would be expected to 
exp! beliefs (probabilities) consistent with 
thei -ititudes. 


F > results pertaining to Hypothesis 2 are 
str. zhtforward and need little discussion per 
se \hey are important, though, in that they 
p vide additional support for Brigham’s 
(,J71a, p. 33) contention that stereotyping 
_ © ould be defined in terms of differences be- 

{ween stimulus categories. The present authors 
would go farther and maintain that stereo- 
typing should be defined in terms of perceived 
differences between otherwise identical stim- 
uli, following Runkel and McGrath’s (1972, 
p. 51) dictate that “knowledge is knowledge 
” If, for example, only “blacks” 


“whites” are compared, any observed 
ences in trait attribution would be con- 
5 ‘d evidence for a “racial” stereotype, 
v in fact the process is much more com- 


neral group stereotypes are probably not 


re od to beliefs about particular members of 
a up, just as general attitude measures do 
n predict attitude or behavior toward any 


pe icular object (Ajzen & Fishbein, 1973). 
G- acral group stereotypes might well be re- 
le ed to attitudes toward social policies (i.e., 
W.ifare) or foreign relations (i.e., trade with 
Russia), but probably would not be predictive 
of beliefs about, or attitudes toward, specific 
individuals (Brigham, 1971b). 

These data suggest that it would be profit- 
able, scientifically and socially, to expand 
attribution theories to include individual dif- 
ferences in attitude, cognitive structure, and 
other information-processing variables; to in- 
clude stimulus person characteristics such as 
race, sex, etc. in addition to behavioral and 
situational variables; and to include consid- 
crations taken from the study of implicit per- 
scnality theory to specify relationships be- 
tween subject and stimulus characteristics, 
situational factors, stimulus person behavior, 
and inultiple-trait attributions. The study of 

individual stereotype attribution would prop- 

—étly be included here, while studies of group 
Stereotypes (Campbell, 1967; Tanaka, 1972: 
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Vassiliou, Triandis, Vassiliou, & McGuire, 
1972; Triandis, Feldman, & Harvey, Note 3) 
might best be done within a cross-cultural or 
developmental frame of reference. 

The support found for Hypothesis 4 indi- 
cates methodological problems in this area. 
Social desirability biases, evaluation appre- 
hension, and demand characteristic effects are 
likely to remain sources of alternative hy- 
potheses unless they are assessed and con- 
trolled. The use of between-subjects designs 
may reduce the extent of awareness, but it 
will probably always remain a problem. The 
occupational stereotype Race X Attitude x 
Awareness interaction, supporting the idea 
that blacks tend to be regarded as lower class 
by subjects with unfavorable attitudes, could 
have been disguised without an awareness 
measure. Similarly, the Occupation X Atti- 
tude X Awareness interaction on the success- 
ful people factor reveals something about 
the beliefs and norms of both pro- and anti- 
black subjects which would not have been 
found otherwise. 

A last methodological point concerns the 
use of different belief dimensions. The study 
of “stereotypes,” however defined, appears to 
be misleading. Many different dimensions of 
belief are attributed in a “stereotyped” man- 
ner, Investigators should be concerned with 
multiple dimensions of beliefs. A single set of 
correlated traits, however complex the stimu- 
lus object, will not give complete information 
about the sources of variation in beliefs out- 
side of the laboratory. Different stimulus char- 
acteristics are important for different trait 
inferences. Trait dimensions, whether called 
stereotypes or not, should be thought of as 
constructs to be validated and incorporated 
into ongoing research and theory on attribu- 
tion processes. 
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